[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [edict-jmdict] Adding old kanji forms with a script
Thank you for the explanation Jim, well understood.
>> Out of curiosity, once you find the "simplified" spellings, what do you
use them for?
>They go into the entry as additional kanji form, as above.
I do think Jean-Luc's findings are being used for another end; after all,
there is little benefit in searching Edict for duplicates for the purpose of
putting them back into Edict? :) In my case, I look for them while
batch-processing a new list of words that go into Anki. Sometimes that finds
nice hidden gems such as 注ぐ 潅ぐ 漑ぐ 灌ぐ 濺ぐ.
This was just a minor curiosity though, I appreciate the informative answer.
Alex
2010年9月14日4:00 Jim Breen <jimbreen@gmail.com>:
>
>
> On 14 September 2010 08:35, Alexandru Pojoga <apojoga@gmail.com<apojoga%40gmail.com>>
> wrote:
> > I'm a lurker but your post caught my eye. Do you use "same kana && same
> eigo" as the criterion for two entries being equivalent (and thus one being
> a "simplified" version of the other)?
>
> I think JL is travelling, and may not be able to reply.
>
> In general the "two-out-of-three" rule is used for combining entries. Two
> out
> of kanji-form, kana-form and meaning must be the same. The merged kanji
> form
> is not necessarily simplified - it may just be an alternative form (異体字).
>
>
> > This worked for me when I was looking for alternative spellings, say 受け取る
> and 受けとる.
> > But that also gives me 請け取る (different kanji) -- that may not be
> something you want.
>
> Well for the うけとる entry we have:
>
> 受け取る(P); 受けとる; 受取る; 請け取る; 請取る; うけ取る
>
>
> > Out of curiosity, once you find the "simplified" spellings, what do you
> use them for?
>
> They go into the entry as additional kanji form, as above. Some are marked
> as "iK" indicating that it's actually incorrect, but still used. This may
> result
> from a common 変換ミス. Others are marked "oK" for old/obsolete kanji. This
> usually happens with 旧漢字.
>
> We put these in the dictionary because people encounter them in old texts
> and want to know what the word actually means. It's good to be able to
> relate them to the common form.
>
> HTH
>
> Jim
>
> --
>
> Jim Breen
> Adjunct Snr Research Fellow, Clayton School of IT, Monash University
> A/g Treasurer: Hawthorn Rowing Club, Japanese Studies Centre
> Graduate student: Language Technology Group, University of Melbourne
>
>