Tools for Normalisation | CLARIN ERIC

What it is

CLARIN resource family listing NLP tools for text normalisation and preprocessing.

Gabriel’s notes

The CLARIN infrastructure offers 15 tools for text normalisation. Twelve are aimed at normalising texts within a single language (1 Czech, 3 Dutch, 1 English, 3 German, 1 Hungarian, 1 Icelandic, 1 Slovenian, 1 Turkish), while the remaining three have a very broad multilingual scope. About half of the tools are dedicated normalisers, while the others provide additional functionalities such as PoS tagging, lemmatisation and named entity recognition.

Good fit if you want to:

  • Find a practical starting point for exploring text normalisation tools.

Note: pricing and policy details can change; verify on the official site before making decisions.
