[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[edict-jmdict] English n-gram counts
Earlier today I was mentioning the Google English n-gram corpus
in the context of finding the frequency of certain phrases. I realised
that I'd implemented a system for searching that corpus years ago
for my gairaigo segmenter at:
http://nlp.cis.unimelb.edu.au/jwb/gairaigo.html
but I'd never actually made it more generally available. Here it is:
http://nlp.cis.unimelb.edu.au/jwb/engngrams.html
Someone may find it useful. (FWIW the actual corpus is about 55Gb.)
Jim
--
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
http://www.jimbreen.org/
http://nihongo.monash.edu/
------------------------------------
Posted by: Jim Breen <jimbreen@gmail.com>
------------------------------------
------------------------------------
Yahoo Groups Links
<*> To visit your group on the web, go to:
http://groups.yahoo.com/group/edict-jmdict/
<*> Your email settings:
Individual Email | Traditional
<*> To change settings online go to:
http://groups.yahoo.com/group/edict-jmdict/join
(Yahoo! ID required)
<*> To change settings via email:
edict-jmdict-digest@yahoogroups.com
edict-jmdict-fullfeatured@yahoogroups.com
<*> To unsubscribe from this group, send an email to:
edict-jmdict-unsubscribe@yahoogroups.com
<*> Your use of Yahoo Groups is subject to:
https://info.yahoo.com/legal/us/yahoo/utos/terms/