
Project Proposals



I have several projects (of varying sizes) I have been kicking around.  I thought I would throw out a line and see if I can get any help (help comes in many forms!).
 
Some of these projects could have already been embarked upon and I just don't know they exist.  If that's the case, please let me know!
 
* Verb parser
 
I have never talked to Jim about this, but I would like to work on it just for the sake of doing it.  And if I don't have anywhere to implement it, I will be glad to throw the logic/code out into the public domain.
 
This is inspired by Jim's relatively simple, but eminently useful (!), "Translate Words" function in WWWJDIC.  Since he has already "guessed" that a verb exists, the next step is to analyze the barrage of kana (and others) that follows it, so that the translation of the verb includes the verb ending(s).  This means that if multiple modifiers exist, they will all be included in the final translation.  I am sure linguists can work this better, but the point is pretty straightforward, I think.
 
Initially this will entail analyzing what could follow the root and then creating a simplistic logic chart of things to follow. It will progressively become more detailed after that.
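
To make the idea concrete, here is a rough Python sketch of what a first, simplistic "logic chart" could look like.  Everything in it is an assumption for illustration only: the rule table, the labels, and the function name analyze are hypothetical, it only handles a handful of endings on ichidan (-る) verbs, and it is not how WWWJDIC works.

```python
# Hypothetical rule table: (ending, what to put back, label).
# Assumption: ichidan verbs only; godan verbs (e.g. 話した) would be
# handled incorrectly and need their own rules.
RULES = [
    ("ました", "ます", "past"),
    ("られる", "る", "passive/potential"),
    ("させる", "る", "causative"),
    ("ます", "る", "polite"),
    ("ない", "る", "negative"),
    ("た", "る", "past"),
]
# Try longer endings first so that "ました" wins over "た".
RULES.sort(key=lambda rule: len(rule[0]), reverse=True)


def analyze(word):
    """Peel endings off the right-hand side of an inflected verb.

    Returns (dictionary_form_guess, labels), with labels ordered from
    the modifier closest to the root to the outermost one.
    """
    labels = []
    changed = True
    while changed:
        changed = False
        for ending, replacement, label in RULES:
            stem = word[: -len(ending)]
            # Require a non-empty stem so a bare ending is not a "verb".
            if word.endswith(ending) and stem:
                word = stem + replacement
                labels.append(label)
                changed = True
                break
    labels.reverse()  # endings were collected outermost first
    return word, labels


print(analyze("食べさせられました"))
# ('食べる', ['causative', 'passive/potential', 'polite', 'past'])
```

The point of the table-driven layout is that the "progressively more detailed" part becomes a matter of adding rows (and, later, per-verb-class tables) rather than rewriting the logic.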
 
I am sure this work already exists in a scholarly context, but it will need to be either re-interpreted or re-thought from the ground up.
 
----------------------------------------------------------------------------------------------------------
* History of the 当用漢字 and 常用漢字
 
Again, I have NOT mentioned this to Jim.
 
The current contents of the 常用漢字-related entries in jmdic could be expanded to include the dates when items were added and the information from the 当用漢字.  For most people, this information would not be very useful.  But it might be extremely useful to a few researchers (people that like that kind of thing).
 
Initially, I would just like to compile a list that would be easy to manipulate for whatever purpose.
 
----------------------------------------------------------------------------------------------------------
* Photo database
 
I debated whether to mention this one or not, but I figure it is better to have someone to spur me along.
 
Essentially this would be a database of pictures which would be linked to the WWWJDIC servers (initially).  Jim has given a green light on this one.
 
Things that need to be done:
 
- determine picture format (I am currently leaning toward medium-resolution .png files with a thumbnail, and the full-resolution image also stored but not accessible (initially))
- collect pictures (mainly for words related to Japan initially)
- determine naming method (something like a 5-digit, zero-padded serial number followed by a brief descriptor (typically the main word that points to the picture))
- a database of picture information (including the file names, words pointing to this entry, date added, source, etc.)
- find a host (no bandwidth limits, stable, free, lots of storage space, relatively fast, going to be around in 5 years, etc.)
 
That is it initially.
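
As a starting point for discussion, the naming method and the picture-information record above could be sketched in Python roughly as follows.  The underscore separator, the "_thumb" and "_full" suffixes, the field names, and the example values are all assumptions of mine, not decisions that have been made.

```python
from dataclasses import dataclass
from datetime import date
from typing import Dict, List


def picture_filename(serial, descriptor, suffix="", ext="png"):
    """Build a name like 00042_torii.png (5-digit, zero-padded serial)."""
    if not 0 <= serial <= 99999:
        raise ValueError("serial must fit in 5 digits")
    return f"{serial:05d}_{descriptor}{suffix}.{ext}"


@dataclass
class PictureRecord:
    serial: int          # running number, zero-padded in file names
    descriptor: str      # typically the main word pointing to the picture
    words: List[str]     # dictionary words that link to this picture
    source: str          # where the picture came from
    date_added: date

    def filenames(self) -> Dict[str, str]:
        # Three stored variants: the medium image that is served,
        # a thumbnail, and the full-resolution original.
        return {
            "medium": picture_filename(self.serial, self.descriptor),
            "thumb": picture_filename(self.serial, self.descriptor, "_thumb"),
            "full": picture_filename(self.serial, self.descriptor, "_full"),
        }


# Hypothetical example entry.
record = PictureRecord(
    serial=42,
    descriptor="torii",
    words=["鳥居"],
    source="example contributor",
    date_added=date(2000, 1, 1),
)
print(record.filenames()["medium"])  # 00042_torii.png
```

Keeping the serial number as the real key and the descriptor as a convenience means a picture can be pointed to by several words without being renamed.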
 
----------------------------------------------------------------------------------------------------------
* Radical database
 
This is a small project.  Basically it would be a relatively small database of radical-related information.  I have the framework written down.
 
This would be nice for electronic dictionary creators to link to, if something like it does not exist already.
 
----------------------------------------------------------------------------------------------------------
 
That's it for now.
 
--Todd