Prefix matching

Using prefix matching

When this option is selected, CafeTran analyzes only the beginnings of words (here called prefixes) and discards the endings responsible for inflection.

This option significantly increases the number of hits for highly inflected languages. The prefix length is set as a percentage of the word length: the higher the percentage, the longer the prefix that the program analyzes.
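The effect of the percentage can be illustrated with a minimal Python sketch. It is not CafeTran's actual code: the function name prefix_by_percent is hypothetical, and rounding the length up is an assumption, since the page does not say how the program rounds.

```python
def prefix_by_percent(word: str, percent: int) -> str:
    """Return the leading part of `word` covering `percent` of its length."""
    # Integer ceiling of len(word) * percent / 100.
    # Rounding up is an assumption made for this illustration.
    length = (len(word) * percent + 99) // 100
    return word[:length]


words = ["translate", "translated", "translates"]

# At 60 percent, all three inflected forms reduce to the same prefix.
print({prefix_by_percent(w, 60) for w in words})  # {'transl'}

# A higher percentage keeps more of each word.
print(prefix_by_percent("translated", 90))  # translate
```

Because the three forms share the prefix "transl", a lookup based on prefixes would treat them as the same word, which is where the extra hits come from.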

The Minimal prefix length option (menu Edit > Options > Memory > Minimal prefix length) sets the shortest prefix allowed. When the "fixed" option is selected, the prefix length is fixed instead of being calculated as a percentage: every word is then analyzed using the minimal prefix length, regardless of the word's actual length.
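How the minimal length and the "fixed" option might interact with the percentage is sketched below in Python. This is only an illustration under stated assumptions, not CafeTran's implementation: the function analyzed_prefix and its parameter names are hypothetical, the percentage is rounded up, and the minimal length is assumed to act as a lower bound on the percentage-based length.

```python
def analyzed_prefix(word: str, percent: int, minimal: int, fixed: bool) -> str:
    """Return the part of `word` that would be analyzed (hypothetical helper)."""
    if fixed:
        # "fixed" selected: every word uses the minimal prefix length.
        length = minimal
    else:
        # Percentage of the word length, rounded up (assumption),
        # but never shorter than the minimal prefix length (assumption).
        by_percent = (len(word) * percent + 99) // 100
        length = max(minimal, by_percent)
    # Slicing never runs past the end of a word shorter than `length`.
    return word[:length]


print(analyzed_prefix("cats", 50, 3, False))         # cat
print(analyzed_prefix("translation", 50, 3, False))  # transl
print(analyzed_prefix("translation", 50, 4, True))   # tran
```

In the first call the 50 percent prefix would be only two letters, so the minimal length of three takes over. In the last call the percentage is ignored and the word is cut to four letters.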

See also: Stemming
