Selected Publications 
These eight publications are my answer to the following question : 
"What should I read to find out about your current research 
interests, and what you are likely to be working on in the next
few years?" I made these selections in 2008, so more recent work isn't 
included, and it's possible my perspective has changed a bit since then. 
If you are interested in the more current answer please ask!
2008
- 
Empiricism is Not a Matter of Faith  (Pedersen), 
 
Computational Linguistics, 
Volume 34, Number 3, pp. 465-470, September 2008.
[Journal Citation Reports Index Factor 2007: 2.367] 
 
 
[This explains why we release our 
 software, 
and how we develop it so
that it's both possible and productive to do so. The goal is to 
promote greater sharing of software so that experimental results
can be reproduced reliably and quickly.]
 
2007
2006
-  
A Comparative Study of Supervised Learning as Applied to Acronym
Expansion in Clinical Reports  (Joshi, Pakhomov, Pedersen, and 
Chute)
- Appears in the Proceedings of the Annual Symposium of the American
Medical Informatics Association, pp. 399-403, Nov 11-16, 2006,
Washington, DC. [acceptance rate 41%]
 
 
[This shows that acronym expansion in medical text can be
effectively handled using 
 supervised 
learning 
techniques first developed
for word sense disambiguation of general English text.]
 
 
 
- 
 
Unsupervised Corpus Based Methods for WSD  (Pedersen), In Agirre, E. 
and Edmonds, P. (Editors),   Word Sense 
Disambiguation : Algorithms and Applications,  June 2006, pp. 
133-166, Springer. 
 
 
[This describes the foundations of 
 SenseClusters, 
which clusters written contexts based on their lexical similarity. Various 
related approaches are also discussed, including Latent Semantic Analysis.]
 
2005
- 
 Name
Discrimination by Clustering Similar Contexts  (Pedersen, Purandare,
and Kulkarni) - Appears in the  Proceedings of the Sixth International
Conference on Intelligent Text Processing and  Computational
Linguistics, p. 220-231, February 13-19, 2005, Mexico City. [acceptance 
rate 37%] 
 
 
[We show how to use 
 SenseClusters  to 
discover the identities associated with a name mentioned multiple 
times in a text.]
 
2004
2003
- 
Extended Gloss Overlaps as a Measure of Semantic Relatedness 
(Banerjee and Pedersen) - Appears in the Proceedings of the Eighteenth 
International Joint Conference on Artificial Intelligence, 
pp. 805-810, August 9-15, 2003, Acapulco, Mexico. [acceptance rate 
21%] 
 
 
[
We introduce the extended gloss overlap measure (aka 
lesk  in WordNet::Similarity). The goal is to measure how related two concepts are based on the similarity of their definitions as found 
in WordNet.
] 
 
- 
Using Measures of Semantic Relatedness for Word Sense Disambiguation 
(Patwardhan, Banerjee and Pedersen) - Appears in the Proceedings of the
Fourth International Conference on Intelligent Text Processing and
Computational  Linguistics,
pp. 241-257, February 17-21, 2003, Mexico City. [acceptance rate 46%] 
 
 
[
We introduce 
 WordNet::Similarity  and
 WordNet::SenseRelate,  
both of which remain very active projects. The goal is to measure the 
similarity of concepts based on  WordNet,  
and to use that information to assign each word in a running text the sense
that is most related to its neighbors.
]
 
By:
Ted Pedersen
- tpederse AT d umn edu