[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] [rare] tag for obscure kanji?



On Wed, 20 Mar 2019 at 03:00, eiennohito@gmail.com [edict-jmdict]
<edict-jmdict@yahoogroups.com> wrote:
> Here are morpheme unigrams for 3B of sentences cut at 10 for the feel what is there at the bottom.
> https://tulip.kuee.kyoto-u.ac.jp/ngrams/3B/unigrams.gz

Looking at that file at around the 1000 mark, I see 1-grams which IMO
are really 2-grams, e.g.
食べて - 食べ + て
確かに - 確か + に
共に - 共 + に

This is probably an outcome of using Juumandic. As I've mentioned, I'm
used to Unidic, which is
strict with the morpheme definition.

Jim

-- 
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
http://www.jimbreen.org/
http://nihongo.monash.edu/