[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] Re: Regarding the ENT_SEQ field in JMDICT



> Indeed, for intra-JMdict xrefs. I regard the Tanaka corpus as a
> freestanding project. There are people out there who use it
> and who'd be rather upset if the current indices were  replaced by
> codes that only applied to JMdict. (Supplemented is probably OK.)

I could probably live with a set up where a supplemented alternative
version is generated on an infrequent basis.  Hmm ... might be fiddly
but doable.

> Also within JMdict we can keep track of things. We can arrange so that
> when an entry is deleted, any xrefs pointing to it will be updated to
> point at a successor. Doing that with an external file being maintained
> elsewhere is a problem.

Maintaining it as an external file isn't intended as a permanent policy
decision, it's merely that I don't see how I would be able to work as
well with it on some Unix server with a database program I'm not
familiar with.

The Tanaka Corpus is still very much 'in development'.  When it has
settled down to a more mature state it would probably be more practical
to move it to a different environment.  For example there are at
present 11,133 records of the 154,731 that are still only partially
indexed.