Automatic summarization is the process of shortening a set of data computationally, to create a subset that represents the most important or relevant information within the original content
Sentiment analysis is the use of natural language processing, text analysis, computational linguistics, and biometrics to systematically identify, extract, quantify, and study affective states and subjective information
Named-entity recognition is a subtask of information extraction that seeks to locate and classify named entities mentioned in unstructured text into pre-defined categories such as person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages, etc.
Language detection is a technique which identifies the language of a text and the parts of that text in which the language changes, all the way down to the word level