Data Science
www.complexityintelligence.com - Natural Language Processing API: tagging and entity recognition in Java
www.digicol.de - SE is an indexing system that categorises texts automatically
www.mozenda.com/Mining+Text - Looking for Mining Text? Harvest anything from the Internet.
OPUS is a growing collection of translated texts from the web
corpora, size, queries = better resources, more insight