Web Hosting Geeks published a very extensive research into domain names and web hosting provider options. It includes the analysis of domain name trends by TLD, as well as over 24,000 hosting companies and how they are doing.
Complete with reviews, and detailed stats about each and every company, I think, this is one of the most complete and in-depth data I’ve seen for a long time.
vpnMentor blog runs a post with a lengthy infographic ranking online censorship in different countries. There’s plenty of interesting data regarding torrents, social media, political media, pornography, and other types of online censorship targets.
The Internet in real time provides a visual insight into how much activity is happening on the web every second. Counts for things like Facebook likes, tweets, and YouTube video views are updated every second, all on one page.
It fascinates me every time to see stuff like this, because, apart from the human activity in itself, I have a glimpse of an understanding of how much technology work is happening behind the scenes.
The New York Times has this awesome chart of highest paid Chief Executive Officers (CEOs) in 2016. You can sort and filter the data in a variety of ways.
Most of these guys and gals make more a year than the rest of us will ever make in our lifetime. I guess, they totally deserve it. I’m sure all of them work really hard to get these money.
Datagraver has a few charts related to the number of victims of terrorism in Western Europe in the last almost 50 years. Given how much hype terrorism gets and how many changes we see in the day-to-day life related to it, the stats are quite interesting.
OK, this one is socially funny and statistically cool – Stack Overflow question on how to exit Vim editor was viewed over a million times in the last few years. Now, there’s a breakdown of all sorts of statistics about who gets stuck in Vim the most. It’s pretty amazing the kind of questions and answers one can ponder at when having access to a lot of statistical data.
Here’s an interactive collection of the world’s biggest data breaches. It goes back to 2004, where about 92,000,000 email addresses and screen names were stolen by an AOL employee, and covers most of the major events up until and including 2016. There are a few ways to filter the data and change the representation.
Overall, should give you a pretty good idea of how safe and secure your online data is. Oh, and how private it is too.
Any git repository contains a tonne of information about commits, contributors, and files. Extracting this information is not always trivial, mostly because of a gadzillion options to a gadzillion git commands – I don’t think there is a single person alive who knows them all. Probably not even Linus Torvalds himself.
git-quick-stats is a tool that simplifies access to some of that information and makes reports and statistics quick and easy to extract. It also works across UNIX-like operating systems, Mac OS X, and Windows.
Here’s a recent infographic for 2017 with plenty of Instagram statistics. The two bits that I found interesting were:
- Russia is the second largest country by the number of Instagram visitors (after the US).
- The difference in the number of followers between entertainment celebrities and politicians. Dwayne Johnson, aka The Rock, is at the bottom of the celebrity top 10, with 73 million followers. Barack Obama is the first in top 10 politicians, with 11.5 million followers.
GitHub to MySQL is a handy little app in PHP that pulls labels, milestones and issues from GitHub into your local MySQL database. This is useful for analysis and backup purposes.
There are a few example queries provided that show issues vs. pull requests, average number of days to merge a pull request over the past weeks, average number of pull requests open every day, and total number of issues.
I think this tool can be easily extended to pull other information from GitHub, such as release notes, projects, web hooks. Also, if you are using multiple version control services, such as BitBucket and GitLab, extending this tool can help with merging data from multiple sources and cross-referencing it with the company internal tools (bug trackers, support ticketing systems, CRM, etc).
This is not something I’ll be doing now, but I’m sure the future is not too far away.