-
Blog under the Creative Commons Attribution-ShareAlike 3.0 License
Category Archives: Text Mining
Classifying and visualizing with fastText and tSNE
Previously I wrote a three-part series on classifying text, in which I walked through the creation of a text classifier from the bottom up. It was interesting but it was purely an academic exercise. Here I’m going to use methods … Continue reading
Posted in Machine Learning, Statistics, Text Mining, Uncategorized, Visualizations
1 Comment
Subreddit Map
Reddit describes itself as the “front page of the internet”, and given how many users it has, that’s not too far off. It’s divided into subreddits, which can have either broad or narrow topics. These subreddits are (mostly) user-created, with … Continue reading
Posted in reddit, Social Media, Text Mining
4 Comments
Properties of angry speech
Note: This post contains profanity Sit down if you’re standing: There’s a lot of angry speech on the internet. There’s a lot of regular speech too. For exact meaning, the order and context of words is critical, but for general tone … Continue reading
Posted in Social Media, Text Mining
Leave a comment
I before e
Despite the fact that English is an absolutely terrible language and nobody should speak it, people still do. So to cope with the many irregularities and near impossibility of getting anything right, people try to come up with catchy rhymes, … Continue reading
Posted in Text Mining, Uncategorized
Leave a comment