[libvoikko] Voikko, cyrillic and case handling

Sjur Moshagen sjurnm at mac.com
Wed Jan 25 18:19:36 EET 2012


Den 25. jan. 2012 kl. 15.18 skrev Sjur Moshagen:

> Den 24. jan. 2012 kl. 22.23 skrev Harri Pitkänen:
> 
>> Extending current mappings and sending patches is fine. Case mappings are
>> defined in the first two functions in src/character/SimpleChar.cpp. There
>> is a TODO comment at the end of both functions, you can add your ranges
>> just before that comment.
>> 
>> Additionally you should make sure that get_char_type in
>> src/character/charset.cpp returns CHAR_LETTER for cyrillic letters, I
>> don't think it does that yet.
> 
> Both done. 

Patch attached.

One issue remains, which is specific to the ooovoikko extension: I do not get any suggestions. The following word produces the following output in voikkospell:

W: Кальӧлöн
S: Кальӧлӧн
S: Вальӧлӧн
S: Дальӧлӧн
S: Кальӧн
S: Бальӧлӧн

(the first suggestion is the expected correction)

The same word produces no suggestions in LibreOffice, but is flagged as a misspelling. The same words are accepted & rejected in voikkospell and ooovoikko/LibreOffice. The strange thing is that earlier versions of the speller + ooovoikko extension (based on voikko code from last fall and subsequently did not accept upper-cased words) did offer suggestions for (lexical cased) misspellings.

Is there anything more that needs to be done to make sure we get suggestions?

The speller is a very early version of a Komi speller. In OOo it is disguised as Ukrainian, since Komi presently is not supported by OOo/LibreO.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: cyrillic-support.patch
Type: application/octet-stream
Size: 1739 bytes
Desc: not available
URL: <http://lists.puimula.org/pipermail/libvoikko/attachments/20120125/06b590a7/attachment.obj>


More information about the Libvoikko mailing list