Research
Interest and Reading
My research interest lies in the field of Natural Language Processing and
Machine Learning. Most current information on my research can be found here.
I have worked in the field of Natural Language Processing at Symantec
India, concentrating on the Instant Messenger Security using Natural
Language Processing techniques. I hold to my credit a paper presented on this
work at Symantec. Details of my previous work can be found here. Also, I
designed and implemented the code for transliteration of Indian languages for
the Sun Microsystems' Star Office during my internship with Modular Infotech, Pune, India.
My past readings include.
- Computational Approaches to Measuring the Similarity of
Short Contexts: A Review of Applications and Methods. (Pedersen) To appear
in the
- The Design,
Implementation, and Use of the Ngram Statistics
Package (Banerjee and Pedersen)
- Appears in the Proceedings of the Fourth International Conference
on Intelligent Text Processing and Computational Linguistics, pp. 370-381,
February 17-21, 2003, Mexico City.
- Measures of
Semantic Similarity and Relatedness in the Biomedical Domain (Pedersen,
Pakhomov, Patwardhan,
and Chute), Journal
of Biomedical Informatics, 40(3), 288-299, June 2007. [Journal
Citation Reports Index Factor 2006: 2.346]
- Unsupervised
Corpus Based Methods for WSD (Pedersen), In Agirre,
E. and Edmonds, P. (Editors), Word Sense Disambiguation : Algorithms and Applications, June
2006, pp. 133-166, Springer.
- Name
Discrimination by Clustering Similar Contexts (Pedersen, Purandare, and Kulkarni) - Appears in the Proceedings of the Sixth
International Conference on Intelligent Text Processing and Computational
Linguistics, pp. 220-231, February 13-19, 2005, Mexico City. [acceptance
rate 37%] Download the data used in this
paper.
- Extended Gloss
Overlaps as a Measure of Semantic Relatedness (Banerjee
and Pedersen) - Appears in the Proceedings of the Eighteenth
International Joint Conference on Artificial Intelligence, pp. 805-810,
August 9-15, 2003, Acapulco, Mexico.
- Writing
About Research, Or The Art of WAR (Pedersen) - unpublished manuscript,
September 2003.
- A
Plagiarism Case Study (Pedersen) - unpublished manuscript, April 2001.