Lemmatizer.org

Lemmatizer.org related to morphological analysis and lemmatize of european languages. This time we have dictionaries for two languages: russian and english, but we hope to see more dictionaries as time goes by.

The main goal of lemmatization is enumerating all forms of the word and explain their morphological characteristics: gender, case, tense, etc. Why do we need another one lemmatizer software? First, most of similar software products are not free. Others were written really poorly. By the way, our team thanks effusively the AOT team and Alexey Sokirko personally, 'cause particular their product we decided to develop to a new level.

Features:

  • Crossplatform and easy to compile on any architecture. Сmake is a great instrument to provide this feature.
  • Multithread support without perfomance waste.
  • Working with dictionary is independent from codepage using in your application, you don't need no additional conversions ("yes, you need, you've just used double negative!" (c)).
  • Adding new language is most of all a linguistic work and doesn't require high programming skills. Using lemmatizer for different languages is similar.
  • Dictionaries are easy to edit and recompile.
  • Simple operations (e.g. get initial forms of the word) are really simple to implement.
  • High performance at all (even in predictions for not lexicographical words).

    All libraries are licensed under terms of the GNU LGPL.

    11 September 2007, Lemmatizer.org was successfully started.

    Both versions — russian and english.


    © 2007, Lemmatizer Team.
    Contact e-mail: lemmatizer@mail.ru
    (spammers are welcome, I always order your products nowhere. :) )
    Рейтинг@Mail.ru