[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] Selected example sentences by sense



Excellent idea.  Suggest focusing on words in order of their
frequency of use, as it is probably impossible for you to do this for
every word in EDICT.

Actually I've already done it for all words* in Edict that have 'senses'
indexed among the example sentences.

There remain many words in Edict that should probably be split into
different senses but aren't.  There are also many words that are split
into different senses in Edict but don't have all those senses represented
_and_ indexed in the example sentences.

Lastly there are those words that aren't, and won't be, split into
different senses.  Probably one sentence should be picked out for
each of those but there is the practical problem of telling the difference
between them and words that should have more than one sense but don't
yet.

Just hope this series of improvements doesn't lead to unnecessary
pruning away of otherwise valuable example sentences.  No?

No sentences are being pruned (apart from those near duplicates and
such that would be deleted anyway).  I'm just marking a subset, and how
people choose to use that subject is up to them.

* Except for sense '5' which I appear to have missed. ;-)