Sematext Blog Introduction

Here it is – Sematext’s new and shiny blog.

We’ll be writing about topics that are dear and important to us – search (both web search and enterprise search), text analytics, natural language processing (sentiment detection, named entity recognition…), machine learning, information gathering (e.g. web crawling), information extraction, e-discovery, recommendation engines, etc. We’ll also talk a lot about the tools we use regularly – Lucene, Solr, Nutch, Mahout and Taste, Hadoop, HBase and friends, and more.

To subscribe, use the orange feed icon. If you are a Twitter user, you can also follow @sematext.
