27. |
A 2023-02-24 22:39:04 Jim Breen <...address hidden...>
|
|
Comments: |
I feel it's a pity to lose some etymological info, but no big deal. |
26. |
A* 2023-02-24 22:28:21 Stephen Kraus <...address hidden...>
|
|
Comments: |
We can remove ateji and nokanji tags from hidden forms |
|
Diff: |
@@ -15 +14,0 @@
-<ke_inf>&ateji;</ke_inf>
@@ -20 +18,0 @@
-<ke_inf>&ateji;</ke_inf>
@@ -25 +22,0 @@
-<ke_inf>&ateji;</ke_inf>
@@ -30 +26,0 @@
-<ke_inf>&ateji;</ke_inf>
@@ -35 +30,0 @@
-<ke_inf>&ateji;</ke_inf>
@@ -49 +43,0 @@
-<re_nokanji/>
@@ -54 +48 @@
-<re_nokanji/>
+<re_inf>&sk;</re_inf>
@@ -58 +51,0 @@
-<re_nokanji/>
@@ -63 +55,0 @@
-<re_nokanji/> |
25. |
A 2023-02-24 22:20:44 Jim Breen <...address hidden...>
|
24. |
A* 2023-02-24 20:25:35 dom <...address hidden...>
|
|
Refs: |
Google N-gram Corpus Counts
ああ 18788680 65.1%
あー 9061462 31.4%
アー 239306 0.8%
アア 79681 0.3%
アァ 27445 0.1%
嗚呼 665469 2.3%
噫 4726 0.0%
嗟 4403 0.0%
於乎 116 0.0%
於戯 53 0.0%
嗟乎 134 0.0%
吁 593 0.0%
daijr: 嗚呼, 噫
twitter: 嗚呼, 噫 (嗟 appears mainly in names, 吁 in Chinese tweets) |
|
Diff: |
@@ -16 +16 @@
-<ke_inf>&rK;</ke_inf>
+<ke_inf>&sK;</ke_inf>
@@ -21 +21 @@
-<ke_inf>&rK;</ke_inf>
+<ke_inf>&sK;</ke_inf>
@@ -26 +26 @@
-<ke_inf>&rK;</ke_inf>
+<ke_inf>&sK;</ke_inf>
@@ -31 +31 @@
-<ke_inf>&rK;</ke_inf>
+<ke_inf>&sK;</ke_inf>
@@ -36 +36 @@
-<ke_inf>&rK;</ke_inf>
+<ke_inf>&sK;</ke_inf>
@@ -49,0 +50 @@
+<re_inf>&sk;</re_inf>
@@ -57,0 +59 @@
+<re_inf>&sk;</re_inf>
@@ -61,0 +64 @@
+<re_inf>&sk;</re_inf> |
23. |
A 2021-12-01 23:20:15 Robin Scott <...address hidden...>
|
(show/hide 22 older log entries)
|
22. |
A* 2021-11-30 14:17:32 Marcus Richert <...address hidden...>
|
|
Refs: |
嗚呼 665469 not quite rK
ああ 18788680
あー 9061462
嗚呼 665469
噫 4726
嗟 4403
於乎 116
於戯 53
嗟乎 134
嗟夫 No matches removing
吁 593 |
|
Diff: |
@@ -7 +6,0 @@
-<ke_pri>spec1</ke_pri>
@@ -11,0 +11 @@
+<ke_inf>&rK;</ke_inf>
@@ -15,0 +16,21 @@
+<ke_inf>&rK;</ke_inf>
+</k_ele>
+<k_ele>
+<keb>於乎</keb>
+<ke_inf>&ateji;</ke_inf>
+<ke_inf>&rK;</ke_inf>
+</k_ele>
+<k_ele>
+<keb>於戯</keb>
+<ke_inf>&ateji;</ke_inf>
+<ke_inf>&rK;</ke_inf>
+</k_ele>
+<k_ele>
+<keb>嗟乎</keb>
+<ke_inf>&ateji;</ke_inf>
+<ke_inf>&rK;</ke_inf>
+</k_ele>
+<k_ele>
+<keb>吁</keb>
+<ke_inf>&ateji;</ke_inf>
+<ke_inf>&rK;</ke_inf>
@@ -45 +65,0 @@
-<s_inf>also written as 於乎, 於戯, 嗟乎, 嗟夫, 吁, etc.</s_inf> |
21. |
A 2020-04-28 03:50:15 Jim Breen <...address hidden...>
|
20. |
A* 2020-04-27 06:11:26 dine <...address hidden...>
|
|
Refs: |
https://furigana.info/w/嗚呼#2b088b95
https://furigana.info/w/噫
https://furigana.info/w/嗟 |
|
Diff: |
@@ -22,0 +23 @@
+<re_nokanji/>
@@ -26,0 +28 @@
+<re_nokanji/> |
19. |
A 2018-05-23 06:03:19 Jim Breen <...address hidden...>
|
|
Comments: |
I've moved it to 3, to limit reindexing of sentences. |
|
Diff: |
@@ -57,0 +58,8 @@
+<s_inf>in exasperation</s_inf>
+<gloss>aah</gloss>
+<gloss>gah</gloss>
+<gloss>argh</gloss>
+</sense>
+<sense>
+<pos>∫</pos>
+<misc>&uk;</misc>
@@ -69,8 +76,0 @@
-<sense>
-<pos>∫</pos>
-<misc>&uk;</misc>
-<s_inf>in exasperation</s_inf>
-<gloss>aah</gloss>
-<gloss>gah</gloss>
-<gloss>argh</gloss>
-</sense> |
18. |
A* 2018-05-23 02:47:59 Marcus Richert <...address hidden...>
|
|
Refs: |
https://www.daily.co.jp/gossip/2018/05/23/0011283626.shtml
"「涙が出てきた。彼に対する不憫な想いなのか。それとも我が母校への腹立たしさなのか。あー。くそ」" |
|
Comments: |
Should probably be higher up (possibly in 2nd place)? |
|
Diff: |
@@ -68,0 +69,8 @@
+<sense>
+<pos>∫</pos>
+<misc>&uk;</misc>
+<s_inf>in exasperation</s_inf>
+<gloss>aah</gloss>
+<gloss>gah</gloss>
+<gloss>argh</gloss>
+</sense> |
17. |
A 2018-04-30 08:28:27 Johan Råde <...address hidden...>
|
|
Refs: |
plenty of exampes on the web |
|
Comments: |
covering all possibilites |
|
Diff: |
@@ -35,0 +36,4 @@
+<r_ele>
+<reb>アァ</reb>
+<re_nokanji/>
+</r_ele> |
16. |
A 2018-04-28 23:51:06 Marcus Richert <...address hidden...>
|
|
Diff: |
@@ -40,3 +40,3 @@
-<gloss>Ah!</gloss>
-<gloss>Oh!</gloss>
-<gloss>Alas!</gloss>
+<gloss>ah!</gloss>
+<gloss>oh!</gloss>
+<gloss>alas!</gloss>
@@ -47,3 +47,3 @@
-<gloss>Yes</gloss>
-<gloss>Indeed</gloss>
-<gloss>That is correct</gloss>
+<gloss>yes</gloss>
+<gloss>indeed</gloss>
+<gloss>that is correct</gloss>
@@ -54,2 +54,2 @@
-<gloss>Hey!</gloss>
-<gloss>Yo!</gloss>
+<gloss>hey!</gloss>
+<gloss>yo!</gloss>
@@ -60,4 +60,4 @@
-<gloss>Uh huh</gloss>
-<gloss>Yeah yeah</gloss>
-<gloss>Right</gloss>
-<gloss>Gotcha</gloss>
+<gloss>uh huh</gloss>
+<gloss>yeah yeah</gloss>
+<gloss>right</gloss>
+<gloss>gotcha</gloss> |
15. |
A* 2018-04-27 09:07:05 Johan Råde <...address hidden...>
|
|
Refs: |
6 examples in Tatoeba |
|
Comments: |
no G n-grams hits for あぁ; I suspect the n-grams cannot handle ぁ |
|
Diff: |
@@ -23,0 +24,3 @@
+</r_ele>
+<r_ele>
+<reb>あぁ</reb> |
14. |
A 2016-09-14 23:07:35 Jim Breen <...address hidden...>
|
13. |
A* 2016-09-14 21:14:32 Johan Råde <...address hidden...>
|
|
Diff: |
@@ -6,0 +7 @@
+<ke_pri>spec1</ke_pri>
@@ -17,0 +19 @@
+<re_pri>spec1</re_pri>
@@ -20,0 +23 @@
+<re_pri>spec1</re_pri> |
12. |
A 2016-09-14 11:07:31 Jim Breen <...address hidden...>
|
11. |
A* 2016-09-14 07:40:17 Johan Råde <...address hidden...>
|
|
Refs: |
ああ 18788680
あー 9061462
アー 239306
アア 79681 |
|
Diff: |
@@ -19,0 +20,3 @@
+<reb>あー</reb>
+</r_ele>
+<r_ele>
@@ -20,0 +24,4 @@
+<re_nokanji/>
+</r_ele>
+<r_ele>
+<reb>アア</reb> |
10. |
A 2013-02-17 03:39:05 Jim Breen <...address hidden...>
|
|
Comments: |
OK. You win. I wish there was a way they could be searchable, though. |
9. |
A* 2013-02-16 04:56:21 Rene Malenfant <...address hidden...>
|
|
Comments: |
as indicated by nikkoku and the kanwa dictionaries, there are a bazillion writings for this, none of which anyone is ever going to come across in real life. that's when we normally whip out the 'also written as' |
|
Diff: |
@@ -6,0 +6,1 @@
+<ke_inf>&ateji;</ke_inf>
@@ -9,0 +10,1 @@
+<ke_inf>&ateji;</ke_inf>
@@ -12,18 +14,1 @@
-</k_ele>
-<k_ele>
-<keb>於乎</keb>
-</k_ele>
-<k_ele>
-<keb>於戯</keb>
-</k_ele>
-<k_ele>
-<keb>嗟乎</keb>
-</k_ele>
-<k_ele>
-<keb>嗟夫</keb>
-</k_ele>
-<k_ele>
-<keb>吁</keb>
-</k_ele>
-<k_ele>
-<keb>鳴呼</keb>
+<ke_inf>&ateji;</ke_inf>
@@ -41,1 +26,1 @@
-<s_inf>all kanji forms are ateji</s_inf>
+<s_inf>also written as 於乎, 於戯, 嗟乎, 嗟夫, 吁, etc.</s_inf> |
8. |
A 2013-02-16 04:53:24 Rene Malenfant <...address hidden...>
|
|
Refs: |
Meikyo: 「〈▼嗚呼〉」「▼嗟」「▼噫」などと当てる。
Nikkoku:
【嗚呼】文明・天正・饅頭・黒本・易林・書言・ヘボン・言海
【咨】和玉・文明・易林
【於戯】伊京・易林・書言
【吁】和玉・書言
【噫・嗚・戯】和玉
【嗟呼】文明
【呼】黒本
【悪・都・烏〓・於皇・嗚〓】書言
Kanjigen:
〈於呼〉[於戯オギ・ああ]ああ、と感嘆したときの声をあらわすことば。
【吁】...《訓読み》ああ
❶{感動詞}ああ。うわっという嘆声をあらわす擬声語。▽驚き・怪しみ・悲しみなど、文脈に応じてさまざまの感じを含む。[類義語]嗚呼(ああ)。「吁嗟クサ」
==
嗟夫 and 嗟乎 are confirmed in kojien's kanwa entry for 嗟 |
|
Comments: |
the only one i don't have a ref for is 鳴呼
temporarily approving |
|
Diff: |
@@ -6,0 +6,6 @@
+</k_ele>
+<k_ele>
+<keb>噫</keb>
+</k_ele>
+<k_ele>
+<keb>嗟</keb>
@@ -21,6 +27,0 @@
-</k_ele>
-<k_ele>
-<keb>嗟</keb>
-</k_ele>
-<k_ele>
-<keb>噫</keb> |
7. |
A* 2013-02-16 01:43:29 Marcus Richert
|
|
Comments: |
What's the source for them though? "嗟夫" for example gets 0
hits on bing (set to Japanese,there's some Chinese hits
though). there's some hits on google books but they seem to
mean something else. |
6. |
A* 2013-02-15 23:20:27 Jim Breen <...address hidden...>
|
|
Comments: |
Marcus wrote: "can't we just remove a bunch of them? except for 嗚呼 and 噫, none of them are in daij/nikk. (吁 is, but as おの)", and proposed an amendment trimming it back to just 嗚呼 and 噫.
I really don't agree - all the kanji forms are rare, but I think it's useful to have them recorded and searchable. I'm dropping Marcus' version and added a note about ateji, but I'll keep the thread open. |
|
Diff: |
@@ -41,0 +41,1 @@
+<s_inf>all kanji forms are ateji</s_inf>
@@ -47,0 +48,1 @@
+<misc>&uk;</misc>
@@ -53,0 +55,1 @@
+<misc>&uk;</misc>
@@ -58,0 +61,1 @@
+<misc>&uk;</misc> |
5. |
A* 2013-02-13 22:01:21 Jim Breen <...address hidden...>
|
|
Comments: |
Yes, and it would be a horrible clutter to mark them all. Perhaps a note. |
4. |
A* 2013-02-13 07:30:25 Marcus Richert <...address hidden...>
|
|
Comments: |
aren't all these kanji ateji? |
3. |
A 2010-09-22 04:20:08 Rene Malenfant <...address hidden...>
|
|
Comments: |
i don't think there's anything palatable that can be done with it. any entry for a species name that has another non-biological sense suffers from the same problem. a shortcoming of the swapping system |
2. |
A* 2010-09-17 09:07:35 Jim Breen <...address hidden...>
|
|
Comments: |
Yes, and I can't see an easy fix. The heuristic for "uk" cases is to pull the katakana to the front if there is one, otherwise pull the hiragana to the front.
A possible fix is to break アー out into its own extry. |
1. |
A* 2010-09-17 07:50:54 Paul Blay <...address hidden...>
|
|
Comments: |
I think this fails the 'nokanji' display fix. It displays like the following:
アー 《嗚呼; 於乎; 於戯; 嗟乎; 嗟夫; 吁; 嗟; 噫; 鳴呼》 【ああ】 (int) (1) (uk) Ah!; Oh!; Alas!; (2) Yes; Indeed; That is correct; (3) Hey!; Yo!; (4) Uh huh; Yeah yeah; Right; Gotcha
But ああ is actually the most commonly seen version. |