Page 1 of 1

Automatic tagging

Posted: Mon Jan 12, 2009 8:25 pm
by nekonome
Good day.

I wonder if it's possible to count occurances of words.

Source:
http://www.myurl.com/story1
cats dogs cats pet pet pet lion pet pet pet pet pet pet pet pet pet pet pet pet rabbit pet pet pet pet pet pet ...

Result 1:

<url>@fullinputfilename</url>
cats = 2
dogs = 1
lion = 1
rabbit = 1

or Result 2

<url>@fullinputfilename</url>
<tag>cats, dogs, lion, rabbit</tag>

The "extract matches" filter does not seem to be the proper solution. How shall it be done?

Thank you

Nekonome

Re: Automatic tagging

Posted: Tue Jan 13, 2009 11:39 am
by DataMystic Support
FIrst convert the file to a word list, then sort it, then count occurrences. There are filters for each of these steps.