[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] Sorting out adj* PoS tags



On 01/04/2008, Jean-Luc Leger <reiga@dspnet.fr.eu.org> wrote:

>  On Tue, Apr 01, 2008 at 11:38:12AM +1100, Jim Breen wrote:
>  >
>  > (a) the current "adj-na", "adj-no" and "adj-t"markers be left unchanged;
>
>  maybe "adj-no" should be reviewd too.
>  "n" AND "adj-no" seems redundant to me ..

Well, I don't know about that.

AFAICT not all nouns can or do take a の, so I think it's useful to
mark those that commonly do.

Also, I keep encountering words which seem only to be XXXのYYY. There
are 300 or so entries at present where the POS is only "adj-no".
Some are gairaigo like ホームレス, but others are like 育ち盛り, which
you can't really imagine being in "育ち盛りが..." situations. (Bad
example, as I see about 0.5% of uses *are* 育ち盛りが...)

>  > (d) "adj-pn" be restricted to terms recognized as 連体詞 (有らゆる, etc.)
>
>  meaning : invariable word _always_ used in a prenominal position ?

Dunno. The strict ones like その, この, etc. are, but I'd hat to be dogmatic.

>  > (e) everything else either use the catch-all "adj" tag, or maybe replace
>  > it with "adj-f" (adjectival function/form) to stop people who are not used
>  > to the funny adjectival practices in Japanese from thinking they are "real"
>  > adjectives.
>
>  I'd rather have "adj-f" than a mere "adj".

I think I would too.

Jim

-- 
Jim Breen
Honorary Senior Research Fellow
Clayton School of Information Technology,
Monash University, VIC 3800, Australia
http://www.csse.monash.edu.au/~jwb/