About Me
A physicist by education, who now works in machine learning. The opinions expressed here are mine alone.
Blog licensed under the Creative Commons Attribution-ShareAlike 3.0 License
Archives
- January 2018 (1)
- December 2017 (1)
- May 2017 (3)
- January 2016 (1)
- July 2015 (1)
- May 2015 (1)
- March 2015 (1)
- December 2014 (1)
- November 2014 (1)
- May 2014 (1)
- December 2013 (1)
- April 2013 (1)
- December 2012 (4)
- October 2012 (1)
- August 2012 (2)
- July 2012 (1)
- June 2012 (3)
- May 2012 (1)
- March 2012 (1)
Category Archives: Text Mining
Classifying and visualizing with fastText and tSNE
Previously I wrote a three-part series on classifying text, in which I walked through the creation of a text classifier from the bottom up. It was interesting, but it was purely an academic exercise. Here I’m going to use methods …
Posted in Machine Learning, Statistics, Text Mining, Uncategorized, Visualizations
Subreddit Map
Reddit describes itself as the “front page of the internet”, and given how many users it has, that’s not too far off. It’s divided into subreddits, which can have either broad or narrow topics. These subreddits are (mostly) user-created, with …
Posted in reddit, Social Media, Text Mining
Properties of angry speech
Note: This post contains profanity. Sit down if you’re standing: There’s a lot of angry speech on the internet. There’s a lot of regular speech too. For exact meaning, the order and context of words are critical, but for general tone …
Posted in Social Media, Text Mining
I before e
Despite the fact that English is an absolutely terrible language and nobody should speak it, people still do. So to cope with the many irregularities and near impossibility of getting anything right, people try to come up with catchy rhymes, …
Posted in Text Mining, Uncategorized