Tag Archives: Hadoop

There is a growing body of evidence, at least in text processing, that of … data, features, [and] algorithms … data probably matters the most. Superficial word-level features coupled with simple models in most cases trump sophisticated models over deeper … Continue reading

Quote | Posted on by | Tagged , , , , , | Leave a comment

Infinite Data

Since people liked my last opinion piece on #big data, here’s another one. Imagine there was a technology that allowed me to record the position of every atom in a small room, thereby generating some ridiculous amount of data (Avogadro’s … Continue reading

Posted in Uncategorized | Tagged , , , , , , , , , , , , , , | Leave a comment

Big Data vs Quality Data

theLoneFuturist: I’m not certain why learning Hadoop isn’t more attractive to you. If you are fine with R, doesn’t having lots of data interest you? theLoneFuturist: Don’t get me wrong, there are probably unexciting tasks associated with big data, but … Continue reading

Posted in Uncategorized | Tagged , , , , , , , , , , , , , , , , , , , , , , , , , , , | Leave a comment

Word use frequencies in Shakespeare’s works follow a power law: 13452, 4277, 2340, 1503, 1111, 872, 658, 598, 474, 381. (Minute 8:21) video link: http://vimeo.com/3598672 (Source: http://player.vimeo.com/)

Video | Posted on by | Tagged , , , , , , , , , | Leave a comment