[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] Kanjidic2.xml <variant> tag



[Ben Bullock (Re: [edict-jmdict] Kanjidic2.xml <variant> tag) writes:]
>> I'm not just interested in the Nelson numbers. When I started this
>> examination of kanjidic, I was hoping to get information such as about
>> shinjitai versus kyuujitai pairs and so on, but the "variant" thing
>> seems to jumble everything together, including typographical variants
>> and new/old pairs and so on. Is there any order or meaning at the
>> moment? It would be nice to have some way to find a new kanji given an
>> old one or an old one given the new form inside kanjidic, but at the
>> moment the "variant" stuff makes this ambiguous.

About 12 years ago I put a heap of 新字体/旧字体 references in
KANJIDIC; not so much stating which kanji was which, but putting
in refs to the related glyph. In KANJIDIC these are the XJnnnn tags,
so 国 and 國 point at each other for example.

Since then I have come across a number of sources of this data. There
is (according to Ken Lunde) an "official" list from 文部, but many
people do their own, and no two agree.

>> There are lots of lists of kyuu/shin pairs on the net, for example
>> Dylan Sung has one
>> 
>> http://www.sungwh.freeserve.co.uk/hanzi/j-s.htm
>> 
>> and there is also one at a site called "cyber librarian":
>> 
>> http://www.asahi-net.or.jp/~ax2s-kmtn/ref/old_chara.html
>> 
>> I thought this information must be in kanjidic already but it doesn't
>> seem to be.

Check out the XJ... or in kanjidic2.xml the jis208/jis212/jis213
variants.

Jim

-- 
Jim Breen                                http://www.csse.monash.edu.au/~jwb/
Clayton School of Information Technology,               Tel: +61 3 9905 9554
Monash University, VIC 3800, Australia                  Fax: +61 3 9905 5146
(Monash Provider No. 00008C)                ジム・ブリーン@モナシュ大学