[edict-jmdict] English n-gram counts



Earlier today I mentioned the Google English n-gram corpus
in the context of finding the frequency of certain phrases. I realised
that I implemented a system for searching that corpus years ago
for my gairaigo segmenter at:
http://nlp.cis.unimelb.edu.au/jwb/gairaigo.html
but never made it more generally available. Here it is:

http://nlp.cis.unimelb.edu.au/jwb/engngrams.html

Someone may find it useful. (FWIW the actual corpus is about 55GB.)

Jim

-- 
Jim Breen
Adjunct Snr Research Fellow, Japanese Studies Centre, Monash University
http://www.jimbreen.org/
http://nihongo.monash.edu/


------------------------------------
Posted by: Jim Breen <jimbreen@gmail.com>
------------------------------------

