Text Mining, Analytics & More

The basics, the not so basics and the nitty-gritty of text mining, retrieval and summarization and other related topics.

  • Home
  • Tutorials
  • Text Mining Resources
  • Author

Text Mining Resources


  • Porter Stemmer in Java 
  • Wrapper over Stanford's POS Tagger
  • Stanford POS Tagger
  • Web-NGram API
  • API for Sentiment Polarity
  • API for Sentence Clustering
  • API for N-Gram and Word Counting
  • API for Text Similarity using Jaccard, Dice and Cosine
  • API for Topic Extractions with Sentence Support
  • API for Core NLP - POS Tagging, Chunking and Stemming
  • API for Extracting Text from HTML Pages or directly from URL's
  • API for Textual Summarization of Reviews
  • ROUGE Tool for Evaluating Textual Summaries (Platform independent)
  • ROUGE Tool in Perl for Evaluating Summaries
  • SentiWordNet - Get sentiment probability of words
  • Word2Vec in Python
  • Word2Vec in Java/Scala Spark
  • Stop Words in Various Languages
  • Topic Modeling in Java




Email ThisBlogThis!Share to TwitterShare to FacebookShare to Pinterest
Home
Subscribe to: Posts (Atom)

Popular Posts

  • What are N-Grams?
    N-grams of texts are extensively used in text mining and natural language processing tasks. They are basically a set of co-occuring words ...
  • How to install MySQL on an Amazon EC2 Server Instance?
    Being a newbie to server administration especially with Linux, I found myself looking all over the internet on how to install a mysql serv...
  • Computing Precision and Recall for Multi-Class Classification Problems
    In evaluating multi-class classification problems, we often think that the only way to evaluate performance is by computing the accurac...
  • All About Stop Words for Text Mining and Information Retrieval
    What are Stop Words? When working with text mining applications, we often hear of the term “stop words” or "stop word list" o...
  • User Review Data Set for Sentiment Analysis, Opinion Mining and Summarization
    If you are looking for user review data sets for opinion analysis / sentiment analysis tasks, there are quite a few out there. These datase...

Read Articles by Email

Search This Blog

  • Natural Language Processing (10)
  • text mining (7)
  • Sentiment Analysis (5)
  • Programming (4)
  • Server Administration (2)
  • Text Summarization (2)
  • UMLS Metathesaurus (2)
  • Clinical Text Mining (1)
  • Counting N-Grams (1)
  • Hadoop (1)
  • Information Retrieval Tools (1)
  • Mysql (1)
  • ROUGE (1)
  • Spark (1)
  • Text Similarity (1)
  • opinion mining (1)

Google+ Followers

Blog Followers

Awesome Inc. theme. Theme images by Jason Morrow. Powered by Blogger.