[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [edict-jmdict] Re: Inclusion of gikun readings in JMdict/Edict



Hi, Matt.

I did no result filtering there.  The number of Google hits is notoriously unreliable (usually vastly overestimated), which is why we use now mostly use the n-gram count instead.  If you want to get the number of “real” Google hits as opposed to the mere number that Google is reporting on the first page, just keep clicking through the pages of results until it says all results have been displayed.


Rene


On Nov 13, 2014, at 2:56 PM, matt_bloedel@********* [edict-jmdict] <edict-jmdict@***************> wrote:

Rene, what are you doing to determine how many of the search results you are getting are "real"? I just realized that with one of my searches it had removed the quotes so it gave me thousands of unreal hits that way, but the search you say yielded 60 real hits gave me hundreds of results, so you must be doing something to filter the search results.