[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] Duplicate hangul for same kanji in kanjidic2.xml



Hi all,
 
With now more than 1,500 very intense hours in Edict, I have noticed more redundant entries than I care to admit and certainly more than I can purge on an ad hoc basis. The following is my email to Jim, multiples the likes of which have preceded:
 
Dear Jim,
 
Not to be a continuing thorn in your side but please enter 荒 in Edict and just count the number of redundant entries. This 荒  entry is by no means representative but just one of today's happenstances, many of which I have noted before.
 
Again, trying to be positive. With a proper database, such would be impossible.
 
Regards,
 
Dennis



 
On 9/10/06, Ben Bullock <benkasminbullock@*********> wrote:

Some of the hangul in kanjidic2.xml are repeated twice for the same
character. There seem to be two romanizations in "korean_r" but the
same hangul. I thought that hangul was phonetic and thus would not
result in two different romanizations (at least using the same
romanization system). Here are the duplicates I found in
kanjidic2.xml:

Duplicate hangul 알 for 斡
Duplicate hangul 압 for 押
Duplicate hangul 각 for 角
Duplicate hangul 감 for 柑
Duplicate hangul 쇄 for 鎖
Duplicate hangul 롱 for 滝
Duplicate hangul 롱 for 瀧
Duplicate hangul 분 for 分
Duplicate hangul 물 for 勿
Duplicate hangul 훙 for 薨