[libvoikko] Small patch to HFST2 backend stuff

Flammie Pirinen flammie at iki.fi
Thu Apr 8 06:53:05 EEST 2010


2010-04-06, Harri Pitkänen said:

> I did not test this yet at all. It seems that the suggestion
> generator was not added to SuggestionGeneratorFactory.cpp so it would
> not work out of the box. I can fix this once I have time to test it,
> probably at the end of this week. Same problem with hyphenator, I'll
> fix that too.

Yes. In fact, I was using versions of the factories in which the HFST
backends were selected unconditionally under #if HAVE_HFST, so I left
them out of the previous patch, as that surely isn't wanted in the
development version.
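
For illustration only, a minimal sketch of a factory that compiles the
HFST backend in conditionally but still selects it by name at run time,
rather than unconditionally. All class and function names here are
hypothetical and are not libvoikko's actual API; only the HAVE_HFST
macro comes from the discussion above.

```cpp
#include <memory>
#include <string>

// Hypothetical interface; not libvoikko's real class layout.
class SuggestionGenerator {
public:
    virtual ~SuggestionGenerator() = default;
    virtual std::string backendName() const = 0;
};

// Stand-in for the existing, always available backend.
class DefaultSuggestionGenerator : public SuggestionGenerator {
public:
    std::string backendName() const override { return "default"; }
};

#ifdef HAVE_HFST
// Only compiled when the build system defines HAVE_HFST.
class HfstSuggestionGenerator : public SuggestionGenerator {
public:
    std::string backendName() const override { return "hfst"; }
};
#endif

// Returns a generator for the requested backend name, or nullptr if the
// name is unknown or the backend was not compiled in.
std::unique_ptr<SuggestionGenerator> makeSuggestionGenerator(const std::string& backend) {
#ifdef HAVE_HFST
    if (backend == "hfst") {
        return std::unique_ptr<SuggestionGenerator>(new HfstSuggestionGenerator());
    }
#endif
    if (backend == "default") {
        return std::unique_ptr<SuggestionGenerator>(new DefaultSuggestionGenerator());
    }
    return nullptr;
}
```

With this shape, a build without HFST support simply returns nullptr for
"hfst", and a build with it leaves the default backend untouched.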

> > I've also compiled TeX hyphenation patterns from hyph-utf8
> > distribution as HFST transducers that may be used for testing, 
> 
> The original patterns appear to be from Kauko Saarinen. I believe
> they are in public domain although the history of these patterns
> (especially the hyph-utf8 version) is really confusing:
> 
>   http://www.openoffice.org/issues/show_bug.cgi?id=74298

Yeah, and since these patterns have been around forever, I'm sure there
may well be complex issues. In fact, the hyph-fi patterns aren't all
that interesting: the schoolbook algorithm I implemented in a couple of
minutes performs nearly as well for Finnish. For testing other
languages, the TeX patterns should be more relevant.
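
As a rough illustration of what a schoolbook rule can look like, here is
a minimal sketch of the consonant-vowel rule commonly taught for
Finnish: a syllable break goes before a consonant that is followed by a
vowel. This is an assumption about the general technique, not the code
from the patch; the function names are hypothetical, the input is
assumed to be a single word consisting of letters, and breaks between
adjacent vowels and compound boundaries are not handled.

```cpp
#include <cstddef>
#include <string>

// Finnish vowels in lower and upper case (a e i o u y ä ö å).
static bool isVowel(char32_t c) {
    static const std::u32string vowels =
        U"aeiouy\u00E4\u00F6\u00E5AEIOUY\u00C4\u00D6\u00C5";
    return vowels.find(c) != std::u32string::npos;
}

// Inserts '-' before every consonant that is directly followed by a
// vowel, provided at least one vowel has already been seen, so that
// word-initial consonant clusters are never split off.
std::u32string hyphenateSchoolbook(const std::u32string& word) {
    std::u32string out;
    bool seenVowel = false;
    for (std::size_t i = 0; i < word.size(); ++i) {
        const bool consonant = !isVowel(word[i]);
        const bool beforeVowel = i + 1 < word.size() && isVowel(word[i + 1]);
        if (consonant && beforeVowel && seenVowel) {
            out += U'-';
        }
        out += word[i];
        if (!consonant) {
            seenVowel = true;
        }
    }
    return out;
}
```

For example, hyphenateSchoolbook(U"kissa") gives U"kis-sa" and
hyphenateSchoolbook(U"koulu") gives U"kou-lu".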

-- 
Flammie, computer scientist bachelor, linguist master, free software
Finnish localiser, and more! <http://www.iki.fi/flammie/>


