
Re: [edict-jmdict] JMdict changes - RFC



Overall this looks very good to me as an app developer. Using sequence numbers for references is a great change. Pitch accent info is a common request from my users, so that will be awesome to have.

One other change I’d like to propose is adding a simple integer identifier to each sense within an entry. This identifier would stay with the sense as it is edited or moved around within the entry.

Currently, it’s hard to hold onto a reference to a particular sense (to display on a flashcard for instance). If you save it by its index in the list of senses, the reference is broken if the order changes or a new sense is added above it. If you save it by its text, the reference is broken if the text of the sense is edited.
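
To make the failure mode concrete, here is a small Python sketch. The sense data, the "id" field, and the find_by_id helper are all made up for illustration; nothing like them exists in JMdict today.

```python
# Hypothetical in-memory senses for one entry.
# "id" stands in for the proposed stable identifier.
senses = [
    {"id": 1, "gloss": "to eat"},
    {"id": 2, "gloss": "to live on (e.g. a salary)"},
]

saved_index = 1  # flashcard reference saved by position in the list
saved_id = 2     # the same reference saved by the proposed identifier

# An editor inserts a new sense at the top of the list.
senses.insert(0, {"id": 3, "gloss": "newly added sense"})


def find_by_id(senses, sense_id):
    """Return the sense with the given identifier, or None if it is gone."""
    return next((s for s in senses if s["id"] == sense_id), None)


by_index = senses[saved_index]        # now points at a different sense
by_id = find_by_id(senses, saved_id)  # still points at the original sense
```

After the insertion, the position-based reference silently resolves to "to eat", while the identifier-based one still resolves to the sense the flashcard was made from.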

Here’s a proposed change to the DTD:

Current:
<!ELEMENT sense (stagk*, stagr*, pos*, xref*, ant*, field*, misc*, s_inf*, lsource*, dial*, gloss*)>

New:
<!ELEMENT sense (s_id, stagk*, stagr*, pos*, xref*, ant*, field*, misc*, s_inf*, lsource*, dial*, gloss*)>
<!ELEMENT s_id (#PCDATA)>
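
As a rough sketch of how an app might consume this, the Python below parses a made-up entry fragment that follows the proposed DTD and builds a lookup keyed by s_id. The XML is not real JMdict data, it is trimmed to s_id and gloss only, and senses_by_id is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Hypothetical entry fragment following the proposed DTD; not real JMdict data.
# The sense with s_id 3 was added later and placed second in the list.
ENTRY_XML = """
<entry>
  <sense><s_id>1</s_id><gloss>first meaning</gloss></sense>
  <sense><s_id>3</s_id><gloss>meaning added later</gloss></sense>
  <sense><s_id>2</s_id><gloss>second meaning</gloss></sense>
</entry>
"""


def senses_by_id(entry):
    """Map each sense's s_id (as an int) to its list of gloss strings."""
    return {
        int(sense.findtext("s_id")): [g.text for g in sense.findall("gloss")]
        for sense in entry.findall("sense")
    }


entry = ET.fromstring(ENTRY_XML)
lookup = senses_by_id(entry)
```

With this, a saved reference such as (entry sequence number, s_id 2) keeps resolving to "second meaning" regardless of where that sense sits in the list.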

As a starting point, these identifiers could just be initialized to the current index in the current list of senses for an entry. But as new senses are added, they would get a new, as-yet-unused identifier, not tied to the order in the list of senses.
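
One possible way to implement that allocation rule is sketched below in Python. Two things here are assumptions rather than part of the proposal: the initial identifiers are taken as 1-based positions, and the entry keeps a record of every identifier ever issued so that a deleted sense's identifier is never handed out again. The function names are hypothetical.

```python
def initial_ids(sense_count):
    """Seed identifiers from the current 1-based position of each sense."""
    return list(range(1, sense_count + 1))


def next_sense_id(used_ids):
    """Return an identifier that has never been issued for this entry."""
    return max(used_ids, default=0) + 1


ids = initial_ids(3)          # identifiers in list order: [1, 2, 3]
used = set(ids)               # every id ever issued, including deleted ones

ids.remove(3)                 # sense 3 is deleted, but 3 stays in `used`
new_id = next_sense_id(used)  # 4, so the old id 3 is not reused
used.add(new_id)
ids.insert(0, new_id)         # new sense goes first; the others keep their ids
```

Tracking issued identifiers separately from the live list means a stale reference to a deleted sense fails to resolve instead of quietly pointing at a newer, unrelated sense.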

Apologies if you’re not looking for additional changes at this time, but it struck me when I read about the cross-entry reference changes that this could fit in nicely with them. Using this identifier instead of the index in the list of senses could make those references even more stable and less error-prone when they point to a specific sense within an entry.

Chris
On Feb 17, 2020, 11:37 PM -0500, Jim Breen jimbreen@gmail.com [edict-jmdict] <edict-jmdict@yahoogroups.com>, wrote:
> Some changes are proposed to the JMdict structure, both
> to add some new information fields to entries, and to change
> the ways some of the fields are structured. The proposed changes
> are outlined at: https://www.edrdg.org/wiki/index.php/JMdict:_Next_Generation
>
> The changes have been discussed within the editorial group and now
> we are seeking comments from the wider community. Feedback from
> people using JMdict in apps and websites will be especially welcome.
>
> Jim
>
> --
> Jim Breen
> Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
> http://www.jimbreen.org/
> http://nihongo.monash.edu/
>