jQuery wordStats plugin
2007-08-09
The jQuery wordStats plugin tries to determine what a page is about by computing the density of its keywords.
It uses a simple algorithm which adds a variable weight for each keyword occurence, depending on its 'position' in the document.
Although this computation model is quite simple, I get good results with all sorts of documents.
There's of course a system to exclude 'stop words' (frequent pronouns, prepositions, adverbs and so on).
Example: computing top keywords of a Wikipedia article.
Usage:
<script type="text/javascript" src="jquery.js"></script> <script type="text/javascript" src="jquery.wordstats.js"></script> <!-- core code --> <script type="text/javascript" src="jquery.wordstats.en.js"></script> <!-- English stop words --> <script type="text/javascript"> $(function() { var count = 3; $.wordStats.computeTopWords(count); //$.wordStats.topWords[0] returns the most frequent keyword //$.wordStats.topWords[1] the second, and so on $.wordStats.clear(); //release memory //compute top keywords of the element with id="content" $.wordStats.computeTopWords(count, $('#content')); }); </script>
Download:
Dual licensed under MIT/GPL.