
Re: [edict-jmdict] RADKFILE, KRADFILE (was Re: Project Proposals)



Some of this seems like previous postings where Jim wanted to keep the line between KANJIDIC and RADKFILE from being blurred...



On Jul 26, 2006, at 8:33 PM, Alpha Ranger wrote:


* Radical database
 
This is a small project.  Basically it would be a relatively small database of radical-related information.  I have the framework written down.
 
This would be nice for electronic dictionary creators to link to if it does not exist already.
 
 
I went back last night and compared my notes to the radkfile/kradfile formats.  I have a lot of ideas/suggestions:
 
KRADFILE
 
* Eliminate spaces and colons from entries to reduce file size
  This would also very slightly decrease loading and parsing times
  To maintain backwards compatibility, this probably won't be implemented
 
* List traditional radical first
  This has many benefits AND maintains backward compatibility
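
To make the two KRADFILE ideas concrete, here is a rough Python sketch. It assumes the existing layout of one entry per line in the form "kanji : component component ...", and it assumes every component is a single character, which is what makes the space-free form parseable. The function names and the sample line are made up for illustration; they are not part of any existing tool.

```python
def parse_krad_line(line):
    """Parse a 'kanji : c1 c2 ...' line into (kanji, [components])."""
    kanji, _, rest = line.partition(":")
    return kanji.strip(), rest.split()


def compact(kanji, components):
    """Proposed compact form: kanji followed directly by its components."""
    return kanji + "".join(components)


def parse_compact(entry):
    """Inverse of compact(); relies on each component being one character."""
    return entry[0], list(entry[1:])


def traditional_first(components, traditional):
    """Move the traditional radical to the front, keeping the rest in order."""
    if traditional not in components:
        return list(components)
    return [traditional] + [c for c in components if c != traditional]


# Hypothetical sample line, not copied from the real file.
kanji, parts = parse_krad_line("明 : 月 日")
reordered = traditional_first(parts, "日")   # 日 is the traditional radical of 明
print(kanji, reordered, compact(kanji, reordered))
```

A reader that treats the component list as an unordered set is unaffected by the reordering, which is why that change keeps backward compatibility while the compact form does not.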
 
RADKFILE
 
* List 常用漢字 first (if not so already)
  Further rankings are possible as stated below
 
RADFILE (sic)
 
* This is a proposed new file
 
* This file would be focused around the 214 traditional radicals with information
  to assist in cross-referencing, using simplified systems, and frequently
  mistaken radicals
 
* Kanji entries for each radical would be grouped by
  . 教育漢字
  . Remaining 常用漢字
  . 人名用漢字
  . Remaining JIS 1 漢字
  . JIS 2 漢字
  . Anything not listed above
 
* Japanese gloss/meaning
 
* English gloss
 
* 画数
 
* Location (one of the seven standard locations, e.g. 偏, 旁, 冠)
 
* Japanese name combined with location (if necessary) (e.g. ウ冠, 人偏, 之繞)
 
* Frequency of occurrence in:
  . 教育漢字
  . 常用漢字
  . JIS 1
  . JIS 2
  . newspaper
  . Tanaka Corpus
  . (P) words
  . JMdict
 
* Variants
 
* other things I haven't thought of yet
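
As a rough illustration of what one record in the proposed file might hold, here is a Python sketch. Everything in it is hypothetical: the class and field names, the tier labels, and the sample values are only one way to model the bullets above (frequency counts are left out for brevity), not a settled format.

```python
from dataclasses import dataclass, field
from enum import IntEnum
from typing import Dict, List


class Tier(IntEnum):
    """Grouping order for kanji under a radical, as listed above."""
    KYOIKU = 0      # 教育漢字
    JOYO = 1        # remaining 常用漢字
    JINMEIYO = 2    # 人名用漢字
    JIS1 = 3        # remaining JIS 1 漢字
    JIS2 = 4        # JIS 2 漢字
    OTHER = 5       # anything not listed above


@dataclass
class RadicalEntry:
    number: int                 # 1..214, traditional radical number
    radical: str
    japanese_name: str          # name combined with location, e.g. ウ冠
    english_gloss: str
    strokes: int                # 画数
    location: str               # e.g. 偏, 旁, 冠
    variants: List[str] = field(default_factory=list)
    kanji: Dict[str, Tier] = field(default_factory=dict)

    def ordered_kanji(self) -> List[str]:
        """Kanji sorted by tier; order within a tier is kept as entered."""
        return sorted(self.kanji, key=lambda k: self.kanji[k])


# Sample values for illustration only.
roof = RadicalEntry(
    number=40, radical="宀", japanese_name="ウ冠", english_gloss="roof",
    strokes=3, location="冠",
    kanji={"宏": Tier.JINMEIYO, "宴": Tier.JOYO,
           "家": Tier.KYOIKU, "安": Tier.KYOIKU},
)
print(roof.ordered_kanji())
```

Keeping the tier as a sortable value means the same record could produce either the grouped listing or a flat ranked list without storing the kanji twice.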
 
UNICODE
 
* Petition UNICODE to include glyphs for radicals/elements that are not currently included
  If an initiative is already underway, join that initiative
 
Well, that is most of what I was thinking.
 
Feedback?????????
 
Cheers!
Todd