Research
Interest and Reading
My research interest lies in the field of Natural Language Processing and
Machine Learning.
I worked on the Netflix Prize problem as my researc project under the guidance of Dr. Richard Maclin.
We proposed an algorithm to solve this problem by clustering the users
in the movie domain based on the similarity between them in the genre
of the movies they have rated. Our method achieved accuracy better than
the current Netflix implementation and is faster than the Naive k-NN
impplementation. This project provided my with an opportunity to
research the domain of Collaborative Filtering and its application to
various day to day problems.
Here is the link to my defense presentation.
Some of my recent readings include
-
Herlocker, J, Konstan, J., Terveen, L., and Riedl, J. Evaluating
Collaborative Filtering Recommender Systems. ACM Transactions on
Information Systems 22 (2004), ACM Press, 5-53.
- Gábor
Takács, István Pilászy, Bottyán Németh, Domonkos Tikk Scalable
Collaborative Filtering Approaches for Large Recommender Systems. JMLR
Volume 10 :623--656, 2009.
- Arkadiusz
Paterek, Improving regularized singular value decomposition for
collaborative filtering - Proceedings of KDD Cup and Workshop, 2007.
- http://sifter.org/~simon/journal/20061211.html
- http://www.igvita.com/2006/10/29/dissecting-the-netflix-dataset/
- G.
Gorrell and B. Webb. Generalized hebbian algorithm for incremental
latent semantic analysis. Proceedings of Interspeech, 2006.
My previous readings include.
- Computational Approaches to Measuring the Similarity of
Short Contexts: A Review of Applications and Methods. (Pedersen) To appear
in the
- The Design,
Implementation, and Use of the Ngram Statistics
Package (Banerjee and Pedersen)
- Measures of
Semantic Similarity and Relatedness in the Biomedical Domain (Pedersen,
Pakhomov, Patwardhan,
and Chute)
- Unsupervised
Corpus Based Methods for WSD
- Name
Discrimination by Clustering Similar Contexts (Pedersen, Purandare, and Kulkarni)
- Extended Gloss
Overlaps as a Measure of Semantic Relatedness (Banerjee
and Pedersen)
- Writing
About Research, Or The Art of WAR (Pedersen) - unpublished manuscript,
September 2003.
- A
Plagiarism Case Study (Pedersen) - unpublished manuscript, April 2001.