[libvoikko] Status of experimental backends
Harri Pitkänen
hatapitk at iki.fi
Sun May 2 19:51:48 EEST 2010
On Saturday 24 April 2010, Harri Pitkänen wrote:
> To help me make the right decision I'd like to ask everyone who has ever
> tested libvoikko with some language other than Finnish to answer these
> questions:
>
> 1) Which languages you tested?
> 2) Which backends and morphologies you used?
> 3) If there are Hunspell dictionaries available for these languages how
> did libvoikko compare to Hunspell in terms of quality of spell checking,
> speed or memory use?
>
> I'll publish a summary of all answers I get in a week or so.
And the results are:
- Icelandic with Lttoolbox backend:
Tested by me. I don't know Icelandic but testing with some text I found from
the net suggests that there are lots of valid words that the speller does not
accept. Spelling suggestions have not been tailored for Icelandic. Lttoolbox
library has namespace issues. I did not do any performance tests.
I'd say that the overall quality is not good enough to be considered for a
stable release yet.
- Sámi with HFST backend:
Not really tested, but should be close. Linguistic quality is good, but there
are licensing issues. There are no known technical problems. Comparison with
Hunspell has not been done yet.
Depending on whether the licensing issues can be solved it seems that
Sámi/HFST combination has a change of becoming a supported language for
libvoikko 3.1. Please report any progress and test results related to this.
Harri
More information about the Libvoikko
mailing list