BayesDB, a Bayesian database, lets users query the probable implications of their data as easily as a SQL database lets them query the data itself. Using the built-in Bayesian Query Language (BQL), users with no statistics training can solve basic data science problems, such as detecting predictive relationships between variables, inferring missing values, simulating probable observations, and identifying statistically similar database entries.
BayesDB is suitable for analyzing complex, heterogeneous data tables with up to tens of thousands of rows and hundreds of variables. No preprocessing or parameter adjustment is required, though experts can override BayesDB’s default assumptions when appropriate.
BayesDB’s inferences are based in part on CrossCat, a new, nonparametric Bayesian machine learning method, that automatically estimates the full joint distribution behind arbitrary data tables.
I work in technology sector. And I do round a clock, not only from 9 to 5. It is my bread and butter, it is my hobby, it is the fascination of my life. And with the current rate of change particular in information technology (IT), there is always something new to learn, to try, to talk about. I often post news, thoughts, and reviews. And when I do, this is the category I use.
By Leonid Mamchenkov
I am a well known Google fan. But even those who call it an Evil Corporation and a Global Spy, can’t argue with the awesomeness of these news:
Starting today we’re rolling out the ability to export a copy of your Gmail and Google Calendar data, making it easy to back up your data or move to another service.
You can download all of your mail and calendars or choose a subset of labels and calendars. You can also download a single archive file for multiple products with a copy of your Gmail, Calendar, Google+, YouTube, Drive, and other Google data.
Most of the 20 GB of data I store on Google Drive is actually my email archive. I’ve imported email into my Gmail from as early as 1998 – much, much earlier than Gmail was even born. Having a way to export them all out in one go, without using clunky POP or IMAP is much appreciated.
By Leonid Mamchenkov
Here is a excellent Quora question: how many lines of code, if any, from Linux 1.0 are still in the modern Linux kernel, and what is it? And an evenly excellent answer:
According to git diff, 21228 of the 176250 lines from Linux 1.0 (1994-03-14) are still present in Linux 3.12 (2013-11-03):
$ empty_tree="$(git mktree < /dev/null)" $ git diff --shortstat "$empty_tree" 1.0 561 files changed, 176250 insertions(+) $ git diff --shortstat -M -l99999 1.0 v3.12 44905 files changed, 17702349 insertions(+), 155022 deletions(-)
Over half of these lines are blank or consist entirely of punctuation; only 10419 of them have at least one letter or number.
We can go back even farther to the very first public release, Linux 0.01 (1991-09-17). Of the 10239 lines in Linux 0.01, 954 survive in Linux 3.12, of which just 242 have at least one letter or number. 123 of them were structs and constants in include/a.out.h (now include/uapi/linux/a.out.h), and 26 of them were the S_* macros in include/sys/stat.h (now include/uapi/linux/stat.h). The rest were scattered through 24 other files with at most 9 lines each.
It might appear that not much. But it is in fact impressive. How much code have you written that survived for over 20 years in a project that has changed so much – from a hobby experiment to a dominant operating system across servers, mobiles, and embedded devices?
From the Geek & Poke of course.
- The average top 1,000 web page is 1575 KB.
- More than half of this page size is due to images.
- Flash is on the decrease. Custom fonts are on the increase.
By Leonid Mamchenkov
While reading Ben Ramsey’s blog post “The Fall of PEAR and the Rise of Composer“, I caught myself thinking of entirely different subject – technical conferences, in general. Two bits in particular pushed my train of thought.
The first one is Ben’s experience as a first time speaker at a technical conference. We’ve all probably been to one or seen one on YouTube, but that’s from the audience perspective. Here is how it looks from the other side:
Palms sweaty, stomach aflutter with butterflies, I stood before my first audience as a technical speaker. It was a time of many firsts for me—my first PHP conference, my first time in Europe, my first technical presentation. I had been accepted to speak at the 2005 International PHP Conference Spring Edition in Amsterdam. I was nervous, jet-lagged, and tired from an all nighter working on slides.
After my presentation, I found myself downstairs during the break, discussing the topic of frameworks with a German gentleman. I don’t recall much of our conversation, but after a while, I made the biggest mistake I have ever made in my entire career as a speaker. I asked the question “how did it go?”
“Oh, I think we both know how it went,” responded the gentleman drily.
“No. No, I don’t think I do. Can you elaborate?” My second mistake.
“It was horrible!” he exclaimed.
Wow. I was floored. I had to get out of the conversation quickly. I thanked him for his frankness and excused myself. I found a quiet room and sat down for a while, taking deep breaths, trying to calm myself and recover before giving a second talk that day. If I was nervous before, I was frantic now. I didn’t think I could muster up the courage to give another presentation, but I pressed on, and I’m better for it.
That day, I learned two valuable lessons: 1) after giving a presentation never, ever, ever, ever ask someone how they thought it went; let them volunteer that on their own, and 2) unless you want a brutally honest response, don’t ask a German for their opinion.
I haven’t spoken at any major events yet – just a couple of local Linux user group meetings – but I can attest to the very humbling and crushingly nervous experience. It gets easier, I guess, but who would want to listen to you when your first few times suck so badly?
The second bit that got me thinking was the this one:
At php|tek 2009, a group of leaders from a few of the frameworks and libraries communities got together to, as David Coallier described, “develop a set of common standards which PHP projects can strive towards adopting.” This moment represented a turning point in the PHP community. All modern PHP development hinges on this moment. As the Doctor might say, it is a fixed point in time.
This group produced what would become known as PSR-0, an autoloading standard for PHP userland frameworks and libraries, and the group’s name would eventually become the PHP Framework Interop Group (PHP-FIG). It was this autoloading standard that I think changed the course of all PHP userland development.
Quite a few people I know are under the impression that technical conferences are a waste of time. People just go there to escape a day or two from their work. And maybe learn a thing or two. But it’s not just that. It’s not only about the sessions, workshops and learning.
A great deal of networking and cross-industry communication happens at these events. Some of these just forge friendships and beer sessions. Yet others develop into powerful partnership and great accomplishments. Like the one mentioned above.
Like it or not, but even the techiest of techiest among us need an occasional face-to-face session to fix “lost in translation” things, to discuss and argue about everything, and agree on something.
By Leonid Mamchenkov
Here is a nice collection of old websites that were abandoned, but still work. Most of these were last updated in mid-90′s. And, though I haven’t seen most of them back in the day, the overall atmosphere and common elements of those time bring a nostalgic tear to my eye.
Off the whole selection, my obvious favorite one is Internet Explorer is EVIL! of course. Back in 1998, 15 years ago, way before any of the modern Web 2.0 or whatever version we are running now, some people knew the truth and were not afraid to say it. In the most expressing matter – including satanic stars, fires of Hell, and a face image of Bill Gates.
Some things never change …