Archive for March, 2009

Playing with the Stanford Log-linear Part-Of-Speech Tagger

Wednesday, March 4th, 2009

I would like to create a part-of-speech tagger for Paraguayan Guarani. Initially I thought I would use the Brill part of speech tagger, but it seems to have vanished from the web. In my search, I ran across the Stanford Log-Linear Part-Of-Speech Tagger. It was developed by Chris Manning’s group and I figured anything developed by Chris Manning is probably exceptional.  I downloaded it and ran the included English part-of-speech tagger on a 250k text (a public domain Tom Swift book). (more…)

User choice as an evaluation metric for cross language IM

Monday, March 2nd, 2009

A method for evaluating MT performance embedded in Cross-Language Instant Messaging (CLIM) systems is presented. A web interface that provided concurrent real- time translation for instant messaging from multiple MT services was developed and used by paid participants to collaborate on a photo identification task. The method showed a task performance benefit due to the availability of multiple translation alternatives.

Ogden, William, Ron Zacharski, Sieun An and Yuki Ishikawa. 2009. User choice as an evaluation metric for web translation services in cross language instant messaging applications. Proceedings of the Machine Translation Summit XII. Ottawa, Canada. (pdf)