[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [edict-jmdict] A few questions
On Oct 20, 2013, at 10:37 PM, Olivier Binda wrote:
> I have no idea how the different japanese dictionaries compare in
> quality/accuracy (Ipadic, juman, naist, unidic, ...)
I went with Unidic for my novel-parser; it's got much better
coverage than Ipadic, and is a lot more consistent about what
constitutes a single morpheme (with some odd exceptions like
蹌踉き出る). You need to do a little more work assembling
morphemes into words and expressions you can look up in JMdict,
but you need to do less work getting verbs and adjectives
into dictionary form.
Note that if you download a source distribution, you want
unidic-mecab_kana-accent; the smaller source zipfile is
missing several fields that are included in the binary
distribution.
-j