Extract irregular patterns
Posted: Wed Jun 27, 2007 5:28 pm
Hi,
I'm new to TextPipe and seem to have started out with a particularly tricky problem. I need to extract data for statistical analysis from a text document.
Let's say I'm looking for information AAA and BBB for each item ###.
The structure of the text is the following:
Code:
item###1
<junk>
AAA
item###2
<junk>
AAA
<junk>
BBB
<junk>
item###3
BBB
<junk>
--> That is, I do not know in advance which items contain AAA and/or BBB, but I need the extracted data organized by ###. The Easy pattern searches I have come up with so far fail because they keep searching for, say, BBB past the end of the current item, so a BBB that actually belongs to a different item gets matched.
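To make the desired result concrete, here is a minimal sketch of the item-scoped logic in Python rather than TextPipe. It is only an illustration under assumptions: the function name `extract_by_item`, the header pattern `item` followed by an identifier, and the literal markers AAA and BBB are all placeholders for the real data.

```python
import re

def extract_by_item(text, fields=("AAA", "BBB")):
    """Record which fields occur under each item header.

    Assumption: every item starts on a line of the form 'item<id>',
    and a field belongs to the most recent header above it.
    """
    item_re = re.compile(r"^item(\S+)$")
    results = {}
    current = None  # id of the item we are currently inside
    for line in text.splitlines():
        header = item_re.match(line.strip())
        if header:
            # A new header closes the previous item, so a later BBB
            # can never be attributed to an earlier item.
            current = header.group(1)
            results[current] = {field: False for field in fields}
        elif current is not None:
            for field in fields:
                if field in line:
                    results[current][field] = True
    return results

sample = """item###1
<junk>
AAA
item###2
<junk>
AAA
<junk>
BBB
<junk>
item###3
BBB
<junk>"""

result = extract_by_item(sample)
for item_id, found in result.items():
    print(item_id, found)
```

The point of the sketch is that the search is reset at every item header, so each match is tied to the item it appears under instead of to the next occurrence anywhere in the file.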
I read your excellent white papers, but they do not seem to offer code that addresses my problem.
I would greatly appreciate your help on this.
Bernie