GitHub’s quick review and summary of 2013. There are some pretty nitty links from there.
Tag archives for statistics
Around 55 per cent of companies in Cyprus who took part in a public tender over the last three years claim that corruption prevented them from winning the contract, the highest percentage in the EU, according to the Commission’s anti-corruption report published on Monday.
Conflicts of interest in bid evaluation were reported in 76 per cent of cases, collusive bidding in 68 per cent, abuse of negotiated procedures in 62 per cent, unclear selection or evaluation criteria in 61 per cent, and amendment of contract terms after the contract is concluded stood at 55 per cent.
The concert hall at the Sydney Opera House holds 2,700 people. This blog was viewed about 58,000 times in 2013. If it were a concert at Sydney Opera House, it would take about 21 sold-out performances for that many people to see it.
Until now, it was particularly difficult to obtain reliable figures on the results of the Android operating system in China. Indeed, there is no “centralized app store” and most smartphones sold in the country do not use Google services, including activation. In fact, it is very difficult to know the actual results. The search engine Baidu has corrected this by publishing a report on trends in the mobile internet for the 3rd quarter 2013. It appears that there would be now 270 million active users of the Google platform in the country (more than 20% of the total population). Growth would, however, decrease with a small 13% against 55% for the same period last year but up 10% compared to Q2 2013.
BayesDB, a Bayesian database, lets users query the probable implications of their data as easily as a SQL database lets them query the data itself. Using the built-in Bayesian Query Language (BQL), users with no statistics training can solve basic data science problems, such as detecting predictive relationships between variables, inferring missing values, simulating probable observations, and identifying statistically similar database entries.
BayesDB is suitable for analyzing complex, heterogeneous data tables with up to tens of thousands of rows and hundreds of variables. No preprocessing or parameter adjustment is required, though experts can override BayesDB’s default assumptions when appropriate.
BayesDB’s inferences are based in part on CrossCat, a new, nonparametric Bayesian machine learning method, that automatically estimates the full joint distribution behind arbitrary data tables.
- The average top 1,000 web page is 1575 KB.
- More than half of this page size is due to images.
- Flash is on the decrease. Custom fonts are on the increase.
By Leonid Mamchenkov
Fast Company shares the 10 surprising social media statistics that will make you rethink your social strategy. I wouldn’t go as far as saying that all of them are really all that surprising, but they are mostly interesting. Here they are in a nutshell:
- The fastest growing demographic on Twitter is the 55-64 year age bracket.
- 189 million of Facebook’s users are “mobile only”.
- YouTube reaches more US adults aged 18-34 than any cable network.
- Every second two new members join LinkedIn.
- Social media has overtaken port as #1 activity on the web.
- LinkedIn has a lower percentage of active users than Pinterest, Google+, Twitter and Facebook.
- 93% of marketers use social media for business.
- 25% of smartphone owners ages 18-44 say they can’t recall the last time their smartphone wasn’t next to them.
- Even though 62% of marketers blog or plan to blog in 2013, only 9% of US marketing companies employ a full-time blogger.
- 25% of Facebook users don’t bother with privacy settings.
Read the whole thing for more details, links, stats, visualization, and ideas on how to utilize this information.
On the average Web page, users have time to read at most 28% of the words during an average visit; 20% is more likely.