[libvoikko] voikkospell segmentation fault + Sámi+hfst questions

Flammie Pirinen flammie at iki.fi
Tue Sep 6 05:38:14 EEST 2011

2011-09-05, Sjur Moshagen wrote:

> On 5 Sep 2011 at 17:21, Harri Pitkänen wrote:

> > You will need the following three files under ~/.voikko/2/mor-se :
> > 
> > - voikko-fi_FI.pro with the following contents:
> > 
> > info: Voikko-Dictionary-Format: 2
> > info: Language-Code: se
> > info: Language-Variant: standard
> > info: Description: Kokeellinen pohjoissaamen morfologia
> > info: Morphology-Backend: null
> > info: Speller-Backend: hfst
> > info: Suggestion-Backend: null
> > 
> > - alphabet.hfstol and spl.hfstol, which should contain the alphabet
> > and the acceptor in the latest HFST optimized lookup format. At
> > least that's what I assume: the actual files I use are from Tommi,
> > I did not build them myself. I don't have Sámi transducers either,
> > so I'm testing with English ones instead.
> > 
> > After those are in place, spelling should be testable with
> > "voikkospell -d se". In this configuration you won't get any
> > spelling suggestions. I have not tested if those would work with
> > current code.

It should work; it just needs Suggestion-Backend: hfst and the files
sug.hfstol and err.hfstol for the suggestion dictionary and the error
model respectively. The former can be ln -s'd from the speller if
necessary. For my own purposes I've used suggestions from a dictionary
that does not contain productive compounding or derivation, which
works nicely for me for Finnish.
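
As a rough sketch of the setup described above (not a tested recipe):
the script below writes the profile quoted earlier with
Suggestion-Backend changed to hfst and links sug.hfstol to the speller.
The DICT_DIR override variable is my own addition for convenience, not
something voikko reads. The alphabet.hfstol, spl.hfstol and err.hfstol
files are assumed to come from elsewhere, so the script only reports
which of them are still missing.

```shell
#!/bin/sh
# Sketch only: prepares the se dictionary directory with suggestions enabled.
set -eu

# Directory named in the thread. DICT_DIR is a hypothetical override for
# trying this out somewhere else; voikko itself does not use it.
DICT_DIR="${DICT_DIR:-$HOME/.voikko/2/mor-se}"
mkdir -p "$DICT_DIR"

# Profile from the quoted message, with Suggestion-Backend set to hfst.
cat > "$DICT_DIR/voikko-fi_FI.pro" <<'EOF'
info: Voikko-Dictionary-Format: 2
info: Language-Code: se
info: Language-Variant: standard
info: Description: Kokeellinen pohjoissaamen morfologia
info: Morphology-Backend: null
info: Speller-Backend: hfst
info: Suggestion-Backend: hfst
EOF

# Reuse the speller acceptor as the suggestion dictionary (relative symlink).
ln -sf spl.hfstol "$DICT_DIR/sug.hfstol"

# The transducers are not created here; report the ones still to be copied in.
for f in alphabet.hfstol spl.hfstol err.hfstol; do
    [ -e "$DICT_DIR/$f" ] || echo "still missing: $DICT_DIR/$f" >&2
done
```

Once the transducer files are in place, testing is the same
"voikkospell -d se" as in the quoted message.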

> * the alphabet.hfstol file is bad for SE (but I don't know what it is
> expected to be) - Tommi, can you answer that?

It's not terribly important, as long as it contains a reasonable subset
of the alphabet used in the dictionary.

> * also, I'm using HFST3 files now, it might be that the hfst backend
> code doesn't yet support that - Tommi, what is the status?

That should depend entirely on the installed libhfstospell: if ospell
works, there is relatively little chance of the voikko part failing.

> > Application level support is currently only available for
> > OOo/LibreOffice, the rest (Enchant, Firefox) will probably follow
> > after we have a stable release of libvoikko supporting more than
> > one language.

> Since I'm on MacOS X, I would really like to see the
> VoikkoSpellService plugin updated to include proper support for hfst
> languages as well :)

Actually, me too. I just noticed that after the update to Snow Leopard,
building OOo plugins is noticeably more annoying, but apparently the
newest LibreOffice versions should use the spell service if it is
available.

In practice I have the attached zhfst patch working. It supports both
the legacy-style quadruplet of hfstol files and the new zhfst one,
with the same malaga configuration file as before, and speller.zhfst as

However, my primary target before polishing all the zhfst stuff is to
get the Greenlandic oxts available for all platforms. If I start working
on the next version of hfst-ospell instead, I fear the already far too
long postponed Greenlandic spellers will never reach end users.

Flammie, computer scientist bachelor, linguist master, free software
Finnish localiser, and more! <http://www.iki.fi/flammie/>
