

Programs
Last Updated : 17th November 2003
Program 1 - This program reads in a file of text, and finds the n most used m word sequences in the file.
Google Search Program - This program interacts with Google API to query and find which word sequence when compared to a given inference, is closely related to it .
Google Search Program - This program interacts with Google API to query and find out the average number of results which were found .
Google SEMI PMI-IR Program - This program interacts with Google API to query and tries to implement the PMI-IR algorithm. The difference is that i use a list of "adjectives" to identify isolzted adjectives rather than a part-of-speech tagger(which is MUCH more effective). Besides that, we use Google rather than the NEAR operator in AltaVista.
You need these files to run the program. List of
Adjectives ,
Smart Stop List
LATEST PROGRAM
Google PMI-IR Program - This program
interacts
with Google API to query and tries to implement the PMI-IR algorithm. I have tried to implement a part-of-speech tagger(which is MUCH more effective than the one isolated adjectives that i tried last time). Besides that, we use Google rather than the NEAR operator in AltaVista.
You need these files to run the program. List of
Adjectives ,
Nouns List ,
Verbs List ,
Adverbs List ,
Review I Tested on Audi A8!
How to run
Copy this configuratiion file onto your directory, and in this file supply your Google ID, the path to the different part-of-speech word lists(files given above) and then the path to your WSDL file.
Configuration File
The program supports a couple of command line options such as --help and --version. To run the program type
perl GooglePMI.pl reviewfilename configfilename
Webpage Template Downloaded from UMD
ITSS.