[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] The use of the pipe (|) character in Tatoeba's wwwjdic.csv file



Le 2013-03-15 17:06, iammatthewmiller a écrit :
I'm working on splitting up the Tatoeba example sentences CSV file
into a few SQL databases. In many of the sentence indices, I see the
use of either "ã¯|1" or "<some word>|2" to
indicate...something...about that particle or word. The page on
EDRDG.org that describes the format of each entry
(http://www.edrdg.org/wiki/index.php/Sentence-Dictionary_Linking [1] )
lists the meaning of all of the special characters except for the
pipe-followed-by-a-single-digit. What is the significance of the
number after the pipe, and why is it only either 1 or 2?

Hi,

"<some headword>|<an integer>" is used when a headword appears in more than an entry. In that case, the integer indicates which entry is refered (1 means the entry with the lowest seq_id, 2 means the second lowest ...)

JL