This documentation is still a work in progress. If you have any issues or questions, please ask on the unitex-devel mailing list or file a bug in our issue tracker.

Unitex Library - User’s Guide

Unitex/GramLab is an open-source, cross-platform, multilingual, lexicon- and grammar-based corpus processing suite. Unitex tools are designed to perform several Natural Language Processing (NLP) tasks on a textual corpus, relying on linguistic resources such as electronic dictionaries and grammars represented as finite-state transducers (FSTs), recursive transition networks (RTNs) and lexicon-grammars.

This document is a guide to using the Unitex C/C++ library and the Java Native Interface (JNI) bindings, including method descriptions and example snippets.




We welcome contributions from everyone to help improve these guidelines. Below are some of the things that you can do to contribute: