Programs

Last Updated : 17th November 2003

Program 1 - This program reads in a file of text, and finds the n most used m word sequences in the file.


Google Search Program - This program interacts with Google API to query and find which word sequence when compared to a given inference, is closely related to it .


Google Search Program - This program interacts with Google API to query and find out the average number of results which were found .


Google SEMI PMI-IR Program - This program interacts with Google API to query and tries to implement the PMI-IR algorithm. The difference is that i use a list of "adjectives" to identify isolzted adjectives rather than a part-of-speech tagger(which is MUCH more effective). Besides that, we use Google rather than the NEAR operator in AltaVista. You need these files to run the program. List of Adjectives , Smart Stop List



LATEST PROGRAM

Google PMI-IR Program - This program interacts with Google API to query and tries to implement the PMI-IR algorithm. I have tried to implement a part-of-speech tagger(which is MUCH more effective than the one isolated adjectives that i tried last time). Besides that, we use Google rather than the NEAR operator in AltaVista. You need these files to run the program. List of Adjectives , Nouns List , Verbs List , Adverbs List , Review I Tested on Audi A8!



How to run
Copy this configuratiion file onto your directory, and in this file supply your Google ID, the path to the different part-of-speech word lists(files given above) and then the path to your WSDL file. Configuration File
The program supports a couple of command line options such as --help and --version. To run the program type
perl GooglePMI.pl reviewfilename configfilename

Webpage Template Downloaded from UMD ITSS.