Linguistic registers are known to have a strong influence on the expressivity conveyed by utterances. However, their study in natural language processing (NLP) is still marginal. To compensate for this deficiency, the TREMoLo project focuses on their analysis and automatic manipulation, with a particular attention on French. Beside its originality, this research work will be complementary with the wide-spread activities in textual information extraction in NLP.
The project is part of the growing interest towards stylistics in NLP, domain for which the number of potential applications increases. For instance, stylistics can take part in authorship authentication, access to information, human-machine dialogue or interaction, and language learning. Societal consequences of the project are thus naturally in these domains by opening possibilities for automatic text modulation according to a specific goal or audience. Scientific advances mainly stand in the joint use of data mining and statistical NLP approaches, along with the discovery of new linguistic and sociolinguistic findings. All these aspects provide a high industrial valorisation potential to the project.