[libvoikko] [Apertium-stuff] Lttoolbox (Apertium) morphology backend

Harri Pitkänen hatapitk at iki.fi
Sun Feb 28 11:00:44 EET 2010


On Sunday 28 February 2010, Kevin Brubeck Unhammer wrote:
> I think this should be
> http://apertium.svn.sourceforge.net/svnroot/apertium/trunk/apertium-br-fr/apertium-br-fr.br.dix
> for Breton (and not of the translational dictionary) ;-)

There appear to be some differences between these dictionaries. This Breton 
dictionary required one line to be removed from its source before it compiled, 
but it still did not work at all with Voikko. Probably analysis results for 
unknown words were represented differently than in the example dictionary.

One English dictionary I tested worked partially. Voikko accepted any word 
that started with a valid English word but ignored any garbage at the end.
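
The symptom can be shown with a toy sketch. This is not the actual Voikko or
lttoolbox code, and the cause of the behaviour has not been investigated; the
lexicon and the function names below are hypothetical. The sketch only shows
the difference between accepting a word because some prefix of it is valid and
accepting it only when the whole word is valid.

```python
# Hypothetical toy lexicon; not taken from any Apertium dictionary.
LEXICON = {"cat", "dog", "house"}


def longest_valid_prefix(word):
    """Return the length of the longest prefix of word found in LEXICON, or 0."""
    for end in range(len(word), 0, -1):
        if word[:end] in LEXICON:
            return end
    return 0


def accepts_prefix_only(word):
    # Mimics the observed symptom: a valid prefix is enough,
    # trailing garbage is ignored.
    return longest_valid_prefix(word) > 0


def accepts_whole_word(word):
    # Expected spell checker behaviour: the match must cover the whole word.
    return longest_valid_prefix(word) == len(word)


if __name__ == "__main__":
    for w in ("cat", "catxyz", "xyz"):
        print(w, accepts_prefix_only(w), accepts_whole_word(w))
```

With this toy lexicon "catxyz" passes the prefix-only check but fails the
whole-word check, which matches what was seen with the English dictionary.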

However, the Icelandic dictionary seems to work quite well. Here is a 
screenshot from OpenOffice.org showing the first few paragraphs from 
Landnámabók (http://www.snerpa.is/net/snorri/landnama.htm)

  http://www.puimula.org/htp/tmp/ooo-is.png

I don't know Icelandic at all and therefore can't tell whether some of the 
words are accepted or rejected incorrectly. A few notes on the screenshot:

- Checking initial capitalization does not work, but that was expected since I 
have not implemented such checking yet.
- Perhaps the text contains some old words that are no longer in use and are 
therefore missing from the dictionary.
- The text language has been configured as Finnish, which causes OOo to treat 
some Icelandic letters as word separators. This leads to some words being only 
partially underlined. Assuming that the breakiterator configuration for 
Icelandic is correct in OOo, this should not happen once we allow 
openoffice.org-voikko to act as a spell checker for Icelandic text.
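
The partial underlining in the last point can be illustrated with a small
sketch. It does not reproduce the real OOo breakiterator rules. The letter
sets, the `tokens` helper and the sample word are assumptions chosen only to
show how a word is cut into pieces when some of its letters are treated as
separators.

```python
import re

# Assumed letter sets, for illustration only.
FINNISH_LETTERS = "abcdefghijklmnopqrstuvwxyzåäö"
ICELANDIC_EXTRA = "áðéíóúýþæ"


def tokens(text, letters):
    """Return maximal runs of characters from letters; all else separates words."""
    pattern = "[" + re.escape(letters) + "]+"
    return re.findall(pattern, text.lower())


if __name__ == "__main__":
    word = "maður"
    # With the assumed Finnish letter set, "ð" acts as a separator.
    print(tokens(word, FINNISH_LETTERS))
    # With the Icelandic letters added, the word stays in one piece.
    print(tokens(word, FINNISH_LETTERS + ICELANDIC_EXTRA))
```

In the first case the spell checker would be handed the fragments separately,
so only part of the word could end up underlined.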

Harri


