| 12. |
A 2024-05-12 23:34:13 Stephen Kraus <...address hidden...>
|
| |
Comments: |
Marcus recently amended both entries and dropped the tags in ありがとう but not コーヒー, so I was just wondering if there was a reason for that difference.
Sorry if I sound annoyed; I need to update my dictionary build scripts to handle this sort of setup, and I don't understand the point of it all if the same information is already provided by the [uk] tag. But it sounds like that ship has sailed, so I'll stop griping. |
| 11. |
A 2024-05-12 21:58:13 Jim Breen <...address hidden...>
|
| |
Comments: |
Sorry if my response was unclear. I think it's fine to drop the tags from the kanji fields in these uk entries. |
| 10. |
A* 2024-05-12 21:54:31 Stephen Kraus <...address hidden...>
|
| |
Refs: |
Google N-gram Corpus Counts
╭─ーーーーー─┬────────────┬───────╮
│ 珈琲 │ 2,390,900 │ 15.4% │
│ コーヒー │ 13,171,397 │ 84.6% │
├─ーーーーー─┼────────────┼───────┤
│ 有難う │ 8,616,052 │ 8.1% │
│ 有り難う │ 2,274,621 │ 2.1% │
│ 有りがとう │ 8,914 │ 0.0% │
│ ありがとう │ 95,539,750 │ 89.8% │
╰─ーーーーー─┴────────────┴───────╯ |
| |
Comments: |
Didn't receive an answer to my question on ありがとう (1586820). If we're going to drop the priority tags on 有難う, seems like they should be dropped on 珈琲 as well. |
| |
Diff: |
@@ -7,2 +6,0 @@
-<ke_pri>gai1</ke_pri>
-<ke_pri>ichi1</ke_pri> |
| 9. |
A 2024-05-09 00:03:19 Marcus Richert <...address hidden...>
|
| |
Diff: |
@@ -17 +16,0 @@
-<pos>&adj-no;</pos> |
| 8. |
A 2017-04-30 06:44:57 Jim Breen <...address hidden...>
|
| |
Comments: |
OK. |
|
(show/hide 7 older log entries)
|
| 7. |
A* 2017-04-26 20:48:02 Johan Råde <...address hidden...>
|
| |
Comments: |
let's skip こーひー then |
| |
Diff: |
@@ -15,3 +14,0 @@
-<r_ele>
-<reb>こーひー</reb>
-</r_ele> |
| 6. |
A* 2017-04-26 18:35:45 Robin Scott <...address hidden...>
|
| |
Comments: |
Presumably by clicking through to the last page of results. (For me it drops from 473k to 121).
But this is the number of results Google will show you, not the total number of matches.
Google never retrieves more than a few hundreds results (I get 239 for "cat").
The difficulty is in knowing which number to trust. The hit count shown on the first page is only an estimate and often wildly inaccurate.
In the case of "cat", 2.3 billion is obviously much closer to the real number than 239, but try an exact match search on something really obscure.
I searched for the clearly ungrammatical "りんごを落ちる" and got "38,300 results".
But click on page 2 and it ends there. 15 results.
Both these numbers are probably wrong, but it's anyone's guess as to how wrong.
Something to keep in mind when looking at Google hits. |
| 5. |
A* 2017-04-26 12:08:32 Johan Råde <...address hidden...>
|
| |
Comments: |
Jim, How did you get the number 117?
What search string did you use? |
| 4. |
A 2017-04-26 11:51:57 Jim Breen <...address hidden...>
|
| |
Refs: |
117 unique Googits, not that they mean anything these days. |
| 3. |
A* 2017-04-26 04:36:14 Johan Råde <...address hidden...>
|
| |
Refs: |
googits
コーヒー 227 M
こーひー 468 K |
| |
Comments: |
0.2%, but still fairly common in absolute numbers |
| |
Diff: |
@@ -14,0 +15,3 @@
+<r_ele>
+<reb>こーひー</reb>
+</r_ele> |
| 2. |
A* 2017-04-25 23:41:54 Jim Breen <...address hidden...>
|
| |
Comments: |
In that case the rather artificial こーひー should go. |
| |
Diff: |
@@ -9,2 +8,0 @@
-<ke_pri>news2</ke_pri>
-<ke_pri>nf35</ke_pri>
@@ -16,5 +13,0 @@
-</r_ele>
-<r_ele>
-<reb>こーひー</reb>
-<re_pri>news2</re_pri>
-<re_pri>nf35</re_pri> |
| 1. |
A* 2017-04-21 09:02:22 Johan Råde <...address hidden...>
|
| |
Comments: |
I think we can view コーヒー as a kana form of 珈琲 |
| |
Diff: |
@@ -14 +13,0 @@
-<re_nokanji/> |