[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] Re: Unresolvable cross-references in JMdict



2008/9/10 Philip Ronan <mail@japanesetranslator.co.uk>:
>
> Sorry about that. I only just discovered you can send email directly
> to these newsgroups. I posted the original message via the Yahoo
> website, which obviously made a dog's dinner of things.

The Yahoo WWW interface is famous for its mojibake tendencies.

> According to the DTD comments, the contents of a <xref> and <ant>
> elements "must exactly match that of a keb or reb element in another
> entry". But a lot of the xref elements are of the form "<xref>彼・
> あれ</xref>" (1000420) with the reb and keb data separated by a
> nakaguro character. I can rig my software to ignore these nakaguros
> and everything after them, but should this really be necessary?

I have amended the comments to:

        <!-- This element is used to indicate a cross-reference to another
        entry with a similar or related meaning or sense. The content of
        this element is typically a keb or reb element in another entry. In some
        cases a keb will be followed by a reb and/or a sense number to provide
        a precise target for the cross-reference. Where this happens, a JIS
        "centre-dot" (0x2126) is placed between the components of the
        cross-reference. -->

Cheers

Jim

-- 
Jim Breen
Honorary Senior Research Fellow
Clayton School of Information Technology,
Monash University, VIC 3800, Australia
http://www.csse.monash.edu.au/~jwb/