Note

This documentation is still a work in progress. If you have any issues or questions, please ask on the unitex-devel mailing list or file a bug in our issue tracker.

Unitex Library - User’s Guide

Unitex/GramLab is an open source, cross-platform, multilingual, lexicon- and grammar-based corpus processing suite. Unitex tools are designed to perform several Natural Language Processing (NLP) tasks on a textual corpus relying on linguistic resources such as electronic dictionaries and grammars represented as finite state transducteurs (FSTs), recursive transition networks (RTNs) and lexicon-grammars.

This document is a guide to using the Unitex C/C++ library and and the Java Native Interface (JNI), including methods descriptions and example snippets.

Contents:

Indices and tables

Contributing

We welcome everyone to contribute to improve these guidelines. Below are some of the things that you can do to contribute: