January 11, 2007

Looking for someone with Chinese knowledge

Posted by peter

We’re looking to implement CJK support in Sphinx, the open source full-text search engine.
Initially we plan to base search on bi-gram indexing to keep it simple, especially since, according to research papers, it offers decent quality in most cases. This is not that complex to implement; however, there is no way for us to test it, as we have zero knowledge of Chinese or Japanese.
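To make the bi-gram idea concrete, here is a minimal Python sketch of how a run of CJK text could be cut into overlapping two-character tokens before indexing. It is only an illustration and not Sphinx code: the function name `bigram_tokens` and the character ranges chosen are assumptions made for this example.

```python
import re

# Assumed character ranges for this sketch: CJK Unified Ideographs,
# Hiragana/Katakana, and Hangul syllables. Real coverage would be wider.
CJK = "\u4e00-\u9fff\u3040-\u30ff\uac00-\ud7a3"

# Group 1 matches a run of CJK characters, group 2 a run of ASCII letters/digits.
TOKEN_RE = re.compile("([" + CJK + "]+)|([A-Za-z0-9]+)")


def bigram_tokens(text):
    """Split text into index tokens (hypothetical helper, not a Sphinx API).

    CJK runs become overlapping bi-grams; ASCII words are kept whole
    and lowercased.
    """
    tokens = []
    for match in TOKEN_RE.finditer(text):
        cjk_run, word = match.groups()
        if word is not None:
            tokens.append(word.lower())
        elif len(cjk_run) == 1:
            # A lone CJK character cannot form a bi-gram; keep it as-is.
            tokens.append(cjk_run)
        else:
            # Slide a two-character window over the run.
            tokens.extend(cjk_run[i:i + 2] for i in range(len(cjk_run) - 1))
    return tokens


if __name__ == "__main__":
    print(bigram_tokens("Sphinx 我的汽车"))
```

A query would be tokenized the same way and matched as a phrase of adjacent bi-grams, which is why no dictionary or word segmenter is needed for this approach.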

If you know Chinese, Japanese or Korean and would like to help us test Sphinx support for these languages, let us know. No special development skills are required. If you’re reading this blog you should be technical enough.


23 Comments »

  1. I’m a native Chinese speaker and know a little Japanese. I’d like to do some help.

    Comment :: January 11, 2007 @ 6:10 pm

  2. Hi, I’m Kim and I’m Korean
    I’m living in Seoul, Korea and now working for ‘Daum Communications’ as a DBA (Oracle, MySQL)
    I wanna test Sphinx CJK support.

    -YW Kim

    Comment :: January 11, 2007 @ 6:51 pm

  3. jedy

    I’m Chinese, also have some Japanese knowledge. And I’d like to help to test.

    Comment :: January 11, 2007 @ 7:00 pm

  4. Hao

    Hi there, I’d like to help your testing of Chinese, write to fu2009@gmail.com if I can join :-P

    Comment :: January 11, 2007 @ 8:00 pm

  5. Sun

    I am from china,and I Would like to join this test.
    Is that OK?

    Comment :: January 11, 2007 @ 8:19 pm

  6. Nick Zhao

    Hi Peter, I’m a Chinese guy living in Dalian, China. I’m a big fan of LAMP though I only have a little
    knowledge of them. But if you just want someone who knows Chinese much better than you and also
    desires to help, please feel free to contact me via email or MSN.

    P.S., please prepare to bear my poor English and I’d better let you know that I just began to learn LAMP
    for a couple of days. :)

    Best wishes.

    Nick

    Comment :: January 11, 2007 @ 10:08 pm

  7. I am Chinese. I have 3 years of C++ programming experience. I like open source projects. Please contact me; I’d like to test Sphinx.

    Comment :: January 11, 2007 @ 10:42 pm

  8. Dale

    Peter, just as an FYI, I’ve actually implemented this in Sphinx for edgeio.com. You can see it in action at:

    http://www.edgeio.com/ss/%E6%88%91%E7%9A%84%E6%B1%BD%E8%BD%A6?location=0

    However, I don’t think we’re contributing the code back to Sphinx. We used bigrams along with proximity relevance scoring. Based on what I’ve seen, the relevance ranking is pretty good. So far we’re just doing Chinese UTF-8. We have some folks in China who have done some testing with it.

    My knowledge of Chinese was just good enough to get by here, but I’d be interested in seeing how your effort goes, and helping out a bit if I can.

    Comment :: January 12, 2007 @ 1:45 am

  9. mshk

    Hi, I’m a Japanese web programmer and interested in testing Sphinx.
    How can I help you?

    Comment :: January 12, 2007 @ 2:10 am

  10. Peter, I’m a Chinese programmer and I’d like to help. I have good Python/C skills and enough knowledge of CJK character encoding, just FYI.

    Comment :: January 12, 2007 @ 2:57 am

  11. Thank you guys,

    I did not expect so many people to respond so quickly. We’ll now look into how best to organize this, contact those who provided emails, and post some information here.

    Comment :: January 12, 2007 @ 3:06 am

  12. You should collaborate with the Namazu developers (http://www.namazu.org/index.html.en). Namazu is a search engine made primarily for CJK languages, but also works with English. The engine is written in C, and the indexer is written in perl. I’ve found their code fairly easy to read and follow (and I do not know any of CJK), and submitted a few patches in the past. The developers are quite helpful.

    Comment :: January 13, 2007 @ 8:39 am

  13. 5 Sun:

    Your email does not seem to be working. Please contact us if you see this.

    Comment :: January 14, 2007 @ 8:22 am

  14. frank

    I am Chinese. I like your products and I always use them. I want to help you.

    Comment :: January 14, 2007 @ 8:56 am

  15. Gu Lei

    Hi Peter,

    I’m Chinese. I also want to join that test. Contact me if needed.

    Comment :: January 14, 2007 @ 6:59 pm

  16. Bill

    Hi

    I am interested in this testing. Chinese and Japanese are OK for me.

    bill.neo@gmail.com

    Regards
    Bill

    Comment :: January 14, 2007 @ 8:40 pm

  17. Eric

    i can test chinese using osx. eric18 @ gmail . com

    Comment :: January 15, 2007 @ 9:24 pm

  18. Hi Peter, I’m the owner of http://imysql.cn. I’m Chinese, I’m a DBA, and I’m skilled with MySQL optimization. I would like to join you :)

    Comment :: January 16, 2007 @ 6:29 am

  19. Louis

    Hi, I’m Chinese. I hope to join the testing. Please contact me: liukaixuan@gmail.com

    Comment :: January 16, 2007 @ 8:13 pm

  20. Josh

    Hi Peter, I am a Chinese web programmer with 3 years of PHP experience. If you want to test Sphinx CJK support on Debian AMD64, please contact me.

    epaulin AT gmail dot com

    Comment :: January 16, 2007 @ 10:03 pm

  21. I am a PHP/MySQL web application programmer.
    Have been a MotherBoard Tester.

    Comment :: January 16, 2007 @ 11:08 pm

  22. Lisa Lan

    I am interested in this testing. I am an Oracle and MySQL DBA, and I’m Chinese.
    Thanks

    Comment :: January 17, 2007 @ 6:41 pm

  23. I am Chinese, with 3 years of LAMP experience, and interested in search technology. Please contact me if you need help.
    anakinsun AT gmail.com

    Comment :: November 16, 2007 @ 12:11 am

 
