13. |
A 2022-12-12 21:54:28 Jim Breen <...address hidden...>
|
12. |
A* 2022-12-12 19:20:42 Stephen Kraus <...address hidden...>
|
|
Refs: |
Google N-gram Corpus Counts
╭─ーーーーーー─┬─────────┬───────╮
│ 頭蓋骨 │ 206,179 │ 95.4% │
│ 頭がい骨 │ 4,698 │ 2.2% │ 🡠 sK
│ 頭骸骨 │ 3,298 │ 1.5% │ 🡠 iK to sK
│ ずがいこつ │ 1,826 │ 0.8% │
│ とうがいこつ │ 229 │ 0.1% │
╰─ーーーーーー─┴─────────┴───────╯ |
|
Diff: |
@@ -8,0 +9 @@
+<ke_inf>&sK;</ke_inf>
@@ -12 +13 @@
-<ke_inf>&iK;</ke_inf>
+<ke_inf>&sK;</ke_inf> |
11. |
A 2022-05-07 06:55:54 Jim Breen <...address hidden...>
|
|
Comments: |
I think one of the strengths of JMdict/EDICT is its coverage of non-standard and erroneous forms. If they are "out there" in sufficient numbers, as in this case with over 3k in the n-grams, it's worth keeping them. It really helps text-glossing systems, and it's of benefit to users, especially learners. If an app builder doesn't want to show them, they are free to leave them out - that's their choice. |
10. |
A* 2022-05-07 05:20:05 Marcus Richert <...address hidden...>
|
|
Comments: |
If it's iK and rK and not even mentioned in any other dictionary then I don't think we need it either. I think it is clutter that has a negative impact on the entry. |
9. |
A 2022-05-07 01:15:55 Jim Breen <...address hidden...>
|
|
Comments: |
I agree; keep the form with an "iK". |
(show/hide 8 older log entries)
|
8. |
A* 2022-05-07 00:05:50 Stephen Kraus <...address hidden...>
|
|
Refs: |
We discussed this a bit today on the entry for 気概. I agree rK
doesn't work well here.
Personally, I think applying the rK criteria to drop irregular
forms would be a little too strict. Kanji irregularities like
this one don't seem exceptionally common, nor do they clutter the
dictionary much, so I don't think it's a huge burden to document
them. In cases like "断末間" which receive little-to-no hits,
however, I don't see as much value in keeping them. |
|
Diff: |
@@ -13 +12,0 @@
-<ke_inf>&rK;</ke_inf> |
7. |
A* 2022-05-02 23:10:35 Robin Scott <...address hidden...>
|
|
Comments: |
So far we haven't used iK and rK together (on the same form). Typically we don't include irregular forms that aren't common. I propose dropping any irregular form that meets the threshold for rK. I think it looks odd having a form that is tagged as both irregular AND rare. |
6. |
A 2022-05-02 22:45:07 Jim Breen <...address hidden...>
|
5. |
A* 2022-05-02 22:09:15 Stephen Kraus <...address hidden...>
|
|
Refs: |
Google N-gram Corpus Counts
206,179 95.4% 頭蓋骨
4,698 2.2% 頭がい骨
3,298 1.5% 頭骸骨
1,826 0.8% ずがいこつ
229 0.1% とうがいこつ |
|
Diff: |
@@ -12,0 +13 @@
+<ke_inf>&rK;</ke_inf> |
4. |
A 2018-09-17 17:03:50 Rene Malenfant <...address hidden...>
|
|
Comments: |
骸骨 is a word (がいこつ), as is 頭蓋 (ずがい/とうがい), so it likely results from confusion between 頭蓋+骨 vs. 頭+骸骨 |
3. |
A* 2018-09-17 14:02:35 Robin Scott <...address hidden...>
|
|
Refs: |
G n-grams:
頭蓋骨 206179
頭がい骨 4698
頭骸骨 3298 |
|
Comments: |
I think 頭骸骨 is a 変換ミス. It's not in any of my refs. |
|
Diff: |
@@ -7,0 +8,3 @@
+<keb>頭がい骨</keb>
+</k_ele>
+<k_ele>
@@ -8,0 +12 @@
+<ke_inf>&iK;</ke_inf> |
2. |
A 2013-06-18 02:46:36 Jim Breen <...address hidden...>
|
1. |
A* 2013-06-18 01:08:09 winnie <...address hidden...>
|
|
Refs: |
http://www.docoja.com:8080/wkanji/ikansear.jsp?dbname=kokug&sword=���[��&encode=SHIFT-JIS
http://www.geocities.co.jp/AnimalPark-Tama/1234/pun002.html
http://www.ipv6.org.au/10ipv6summit/talks/Hiroshi_Esaki.pdf |
|
Diff: |
@@ -6,0 +6,3 @@
+</k_ele>
+<k_ele>
+<keb>頭骸骨</keb> |