[libvoikko] Relation between spelling and grammar checkers

Francis Tyers ftyers at prompsit.com
Sun Sep 22 22:33:16 EEST 2013


On Sun, 22 Sep 2013 at 22:20 +0300, Sjur Moshagen wrote:
> On 22 Sep 2013, at 19:00, Francis Tyers <ftyers at prompsit.com> wrote:
> 
> >>> The question then is, should the spell
> >>> checker be the morphological analyser used as input to the grammar
> >>> checker, or should it be another file? So far we have:
> >>> 
> >>> 1) Descriptive morphological analyser
> >>> 2) Disambiguation file
> >>> 3) Grammar checker rule file
> >>> 4) Suggestion file
> >>> 5) Some kind of index/manifest
> >>> 
> >>> Should we add a 'normative acceptor' to that?
> >> 
> >> I have no real opinion on this. Might be reasonable unless the total size of 
> >> the package grows too large.
> > 
> > I think the transducers are around 6-7M each, so it probably wouldn't be
> > too much, but let's see what Sjur says…
> 
> Even though a spell checker fst included in a grammar checker package can be a 
> bit sloppier than a regular spell checker fst (leaving some error detection 
> cases to the CG rules), it still needs to be normative in a broad sense. This 
> is in contrast to the morphological analyser used as input to the grammar 
> checker - it needs to be *descriptive* in the broadest possible sense, including 
> all sorts of out-of-norm dialectal forms and common misspellings, such that 
> one can arrive at a correct morphological (and thus syntactic) analysis even 
> when a word is misspelled.
> 
> The exact interaction between the two (speller and GC) is still something to be 
> worked on, but for now I would say we need both fsts.

Ok, agree :)
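
To make the distinction concrete, here is a toy sketch in Python. Plain
sets and dicts stand in for the two compiled transducers, and every word,
tag and function name in it (spell_ok, analyse, +Err/Orth) is hypothetical,
chosen only for illustration. It is not how either fst would actually be
built or queried.

```python
# Toy stand-ins for the two transducers discussed above.
# Real ones would be compiled FSTs; all names and tags are hypothetical.

# Normative acceptor: accepts only forms that the norm allows.
NORMATIVE = {"receive", "received"}

# Descriptive analyser: covers norm forms plus known misspellings.
# Out-of-norm forms still get an analysis, with an extra error tag.
DESCRIPTIVE = {
    "receive": "receive+V+Inf",
    "received": "receive+V+Past",
    "recieve": "receive+V+Inf+Err/Orth",
    "recieved": "receive+V+Past+Err/Orth",
}


def spell_ok(word):
    """Return True if the normative acceptor accepts the word."""
    return word in NORMATIVE


def analyse(word):
    """Return the descriptive analysis, or None if the word is unknown."""
    return DESCRIPTIVE.get(word)


if __name__ == "__main__":
    for w in ["received", "recieved", "rcvd"]:
        print(w, spell_ok(w), analyse(w))
```

The point of the sketch is that "recieved" is rejected by the acceptor but
still gets a usable analysis, so the CG rules downstream have something to
work with, while a string unknown to both simply has no analysis.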

F.



