[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] Non-ASCII characters in glosses



On 5 May 2010 00:52, Jim Breen <jimbreen@gmail.com> wrote:
> To handle enquête = enquete, I'd have to introduce a token-level
> canonicalization. It could be done, but I turn 63 next week, and I'm sure
> there are better things to do with what's left of my life.

It's something that can easily be dealt with by third party dictionary
software using JMDICT, so it's not something that should be high on
the priority list for WWWJDIC (if at all) if it isn't a simple
adjustment in the queries used (which apparently it isn't).

> Bear in mind, the glosses are supposed to be in English. I'm not
> really sure Føroyar is an English word.

EDICT won't see much action on that front, but ENAMDICT will. I take
it ENAMDICT isn't ready for that kind of action yet?

~ Jeroen