[libvoikko] Request for info: languages and spell checking in openoffice.org-voikko
Harri Pitkänen
hatapitk at iki.fi
Wed Feb 2 22:21:35 EET 2011
On Wednesday 02 February 2011, Flammie Pirinen wrote:
> kl(-GL) for Greenlandic is one.
I believe that to support this one I need to submit locale data to
OOo/LibO; it does not seem to be supported there yet.
By the way, I think we should drop the country (region subtag) from all
languages that do not need it. For example, Swedish as spoken in Sweden should
be advertised as "sv" and Swedish as spoken in Finland as "sv-FI". That way we
can always provide a reasonable default variant unless there is a more
specific one available. I think libvoikko may not yet work that way, but it
should.
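
A minimal sketch of the fallback described above, assuming dictionaries are
advertised as plain language tags. It is a simplified truncation lookup, not
libvoikko's actual code; the function name pick_dictionary and the list of
available tags are made up for illustration.

```python
def pick_dictionary(requested, available):
    """Return the most specific tag in `available` matching `requested`, or None."""
    # Language tags are compared case-insensitively.
    by_lower = {tag.lower(): tag for tag in available}
    # Accept "sv_FI" as well as "sv-FI".
    parts = requested.replace("_", "-").lower().split("-")
    # Try the full tag first, then drop trailing subtags one at a time.
    while parts:
        candidate = "-".join(parts)
        if candidate in by_lower:
            return by_lower[candidate]
        parts.pop()
    return None


# Hypothetical set of advertised dictionaries.
available = ["sv", "sv-FI", "fi", "se"]
print(pick_dictionary("sv-FI", available))  # specific variant exists
print(pick_dictionary("sv-SE", available))  # falls back to plain "sv"
print(pick_dictionary("kl-GL", available))  # nothing suitable
```

With this scheme a request for "sv-SE" is served by the "sv" default, while
"sv-FI" still gets its own variant.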
> I could assume other finite-state
> morphologies in <https://victorio.uit.no/langtech/trunk/> should be
> usable, such as Northern Sámi (se), but maybe others can comment on
> that.
Northern Sámi should in fact work already; I used it as one of the test cases
when I implemented the feature.
> The hunspell-dictionaries I've worked on converting should be usable.
> In current working dir I have:
Here the language codes are in OOo internal format. We would have to go
through them to see if some of these have another representation in BCP 47 and
where it is reasonable to drop the regional part (for example "et-EE" should
probably just be "et" within Voikko).
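
As a rough illustration of that review step, the sketch below shortens a
language-region code when the region is the one we treat as the default for
that language. The DEFAULT_REGION table and the name to_voikko_tag are
assumptions for the example only; the real table would have to be decided
language by language, and tags with script or other extra subtags are not
handled here.

```python
# Hypothetical table: the region we would treat as the default per language.
DEFAULT_REGION = {"et": "EE", "sv": "SE"}


def to_voikko_tag(tag, default_region=DEFAULT_REGION):
    """Drop the region from a language-region tag when it is the default one."""
    # Only simple "language" or "language-region" codes are handled.
    lang, _, region = tag.replace("_", "-").partition("-")
    lang = lang.lower()
    region = region.upper()
    if not region or default_region.get(lang) == region:
        return lang
    return f"{lang}-{region}"


for code in ["et-EE", "sv-SE", "sv-FI", "kl"]:
    print(code, "->", to_voikko_tag(code))
```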
Harri