
Re: [edict-jmdict] Non-JIS characters



On 28 October 2012 10:28, Jean-Luc Léger <reiga@dspnet.fr.eu.org> wrote:

> ok, care to try again with the updated tools
> (http://dspnet.fr/~reiga/converter.tar.gz) ?

Very nice. I can't fault them. (There are a few entries that
come up differently, but your versions are better.)

> For Edict2, any keb containing non-JIS208/212 characters is ignored. The
> same goes for xref/restrictions.
> In glosses, lsources and s_inf, I remove diacritics. Other
> characters (Arabic, Sanskrit, ...) are simply removed.
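
For readers following along, the two rules quoted above could be sketched
roughly as below. This is not the code from converter.tar.gz. The function
names are hypothetical, Python's euc_jp codec is used only as an approximate
stand-in for "JIS X 0208/0212" (it also accepts ASCII and half-width
katakana), and reducing glosses to plain ASCII is an assumption about what
"removed" means.

```python
import unicodedata


def is_jis_encodable(text):
    """Return True if every character of text can be encoded as EUC-JP.

    Assumption: Python's euc_jp codec approximates the JIS X 0208/0212
    repertoire; it also accepts ASCII and half-width katakana.
    """
    try:
        text.encode("euc_jp")
    except UnicodeEncodeError:
        return False
    return True


def simplify_gloss(text):
    """Strip diacritics, then drop whatever is still non-ASCII.

    Assumption: the gloss is Latin-script text. NFD splits an accented
    letter into its base letter plus combining marks; the marks and any
    other non-ASCII characters (Arabic, Devanagari, ...) are discarded.
    """
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if ch.isascii())


def keep_keb(keb):
    """Hypothetical filter: a keb with any unencodable character is ignored."""
    return is_jis_encodable(keb)
```

With this sketch, simplify_gloss("café") gives "cafe", and a keb containing
a character outside the codec's repertoire (for example U+20B9F) is
rejected by keep_keb.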

There are a couple of technical and formatting issues I need to
discuss, but I'll do it off-line.

Thanks for those routines. I'd like to get them running on
arakawa, which means I can bring them into the Monash site
complete instead of building them there.

Jim

-- 
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University