[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] jmdict language tags



G'day,

[I have a longer response in this thread coming but
meanwhile...]

I too have something to say about gloss tags, but am still reworking
in it.  Meanwhile, ...

I just reviewed the language tags currently in use in
jmdict with a view to cleaning up the tags in my database
tables and had some questions...

I am falling in love with the database.  While we are at it, if we are
cleaning up the language codes, should we take the plunge and go to
ISO-639-3 which uses three letter codes and includes both ain: Ainu
(Japan) and ryu (Central Okinawan)?

The standard is described at: http://www.sil.org/iso639-3/default.asp

They have a mapping from the  two letter codes:
http://www.sil.org/iso639-3/iso-639-3_20070323.tab

I think it is one-to-many, but I would be happy to arbitrarily choose
defaults (^_^).



--
Francis Bond <http://www2.nict.go.jp/x/x161/en/member/bond/>
NICT Computational Linguistics Group