Re: [edict-jmdict] Re: Regarding the ENT_SEQ field in JMDICT

To: edict-jmdict@***************
Subject: Re: [edict-jmdict] Re: Regarding the ENT_SEQ field in JMDICT
From: Jim Rose <jim@*************>
Date: Fri, 17 Aug 2007 08:13:18 -0400


On Aug 17, 2007, at 3:02 AM, Paul Blay wrote:

> Indeed, for intra-JMdict xrefs. I regard the Tanaka corpus as a
> freestanding project. There are people out there who use it
> and who'd be rather upset if the current indices were replaced by
> codes that only applied to JMdict. (Supplemented is probably OK.)

I could probably live with a set up where a supplemented alternative
version is generated on an infrequent basis. Hmm ... might be fiddly
but doable.

Let's not forget the complications introduced when you try to improve the glosses - and a number isn't of much use to the human intervener who will be correcting by hand. How often have you run into two glosses that were parsed by ChaSen way back when, that would better serve the user as a single, longer entry in JMDICT/EDICT? Machine logic can't discover that at the moment... though it could if you wanted to pair up every adjacent B gloss and see if the pair had an entry.
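To make the pairing idea concrete, here is a minimal sketch in Python. It assumes the B line has already been split into a list of plain headword strings and that the dictionary headwords are available as a set; real B lines carry extra markup that would have to be stripped first. The function name and the sample data are hypothetical, for illustration only.

```python
def find_mergeable_pairs(b_tokens, headwords):
    """Return (index, left, right, merged) for each adjacent pair of
    B-line tokens whose concatenation is itself a dictionary headword."""
    candidates = []
    for i in range(len(b_tokens) - 1):
        merged = b_tokens[i] + b_tokens[i + 1]
        if merged in headwords:
            candidates.append((i, b_tokens[i], b_tokens[i + 1], merged))
    return candidates


# Hypothetical sample data: a tokenised B line and a tiny headword set.
sample_tokens = ["日本", "語", "を", "勉強", "する"]
sample_headwords = {"日本", "語", "日本語", "勉強", "勉強する"}

for index, left, right, merged in find_mergeable_pairs(sample_tokens, sample_headwords):
    print(f"tokens {index} and {index + 1}: {left} + {right} -> {merged}")
```

A hit only flags a candidate. Someone would still have to confirm by hand that the longer entry is the right reading in that sentence.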

Also, if you're not careful, you could easily start introducing the wrong pseudo gloss number into the B line because you're not starting out with the number but have to discover it. Many a time you find the wrong word when you're trying to find a B line gloss's corollary in the dictionary with machine logic alone.

And I suspect the level of work "still to be done" on the TC is quite vast. There are always words in the A line that simply do not appear on the B line, and that could take years to flesh out.
