This documentation is still a work in progress. If you have any issues or questions, please ask on the unitex-devel mailing list or file a bug in our issue tracker.
Unitex Library - User’s Guide¶
Unitex/GramLab is an open source, cross-platform, multilingual, lexicon- and grammar-based corpus processing suite. Unitex tools are designed to perform several Natural Language Processing (NLP) tasks on a textual corpus relying on linguistic resources such as electronic dictionaries and grammars represented as finite state transducteurs (FSTs), recursive transition networks (RTNs) and lexicon-grammars.
This document is a guide to using the Unitex C/C++ library and and the Java Native Interface (JNI), including methods descriptions and example snippets.
We welcome everyone to contribute to improve these guidelines. Below are some of the things that you can do to contribute:
- Learn a bit about the RST (reStructuredText) markup language and Sphinx for creating documentation. A Restructured Text (reST) and Sphinx CheatSheet is available here
- Fork us and request a pull to the develop branch.
- Alternately, if you don’t want to learn RST and Sphinx, submit a bug report or a feature request to our GitHub.