Search found 7 matches

by nelsoncruz
Wed Apr 22, 2009 7:50 pm
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Extracting text from HTML, replacing with random codes
Replies: 10
Views: 8592

Re: Extracting text from HTML, replacing with random codes

Hi Simon,

Remember that I need to output 2 files, the transformed HTML file + a text file with the extracted text. How do I do that without the secondary output?
by nelsoncruz
Tue Apr 21, 2009 6:28 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Extracting text from HTML, replacing with random codes
Replies: 10
Views: 8592

Re: Extracting text from HTML, replacing with random codes

My question now is, could I run this stuff with Textipe Lite? I'm only using "Find perl patern" and secondary output functions in the filter I described, but I need search/replace list (with tab delimited text file) to reverse the process. Does Textpipe Lite have that? The Standard and Pro...
by nelsoncruz
Mon Apr 20, 2009 6:50 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Extracting text from HTML, replacing with random codes
Replies: 10
Views: 8592

Re: Extracting text from HTML, replacing with random codes

I revised the initial perl pattern to: >([^<\r][^<\r].*)< This allows capturing text strings that start with a space, but not something like: > <IMG...>< Neither < nor return chars are allowed as 1st or 2nd chars of the string. I revised again to >(&nbsp;| |)([^<\r][^<\r].*|)(&nbsp;| |)(\r\n...
by nelsoncruz
Mon Apr 20, 2009 5:29 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Extracting text from HTML, replacing with random codes
Replies: 10
Views: 8592

Re: Extracting text from HTML, replacing with random codes

Solved it! :D At the end of +--Perl pattern [^(.+)$] with [[@randomdigit@@randomdigit@@randomdigit@@randomdigit@@randomdigit@]] I added a tab (\t) + $1 (text) + return (\r\n). Then I output this (only tried output to clipboard for testing). The return at the end makes sure each code/text pair goes t...
by nelsoncruz
Mon Apr 20, 2009 3:25 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Extracting text from HTML, replacing with random codes
Replies: 10
Views: 8592

Re: Extracting text from HTML, replacing with random codes

Either I'm doing something wrong, or [^<>]*? targets everything inside or outside <>. If I make a "find pattern" for [^<>]*? and replace with $0, then add a subfilter replacing . with @randomdigit I get something like: <4845>856202931309492836753331170<66489> from <font>You can type sample...
by nelsoncruz
Tue Apr 14, 2009 1:50 am
Forum: TextPipe Tips and Tricks, Questions and Support
Topic: Extracting text from HTML, replacing with random codes
Replies: 10
Views: 8592

Extracting text from HTML, replacing with random codes

I'm wondering if Textpipe can do this. I want to extract every line of text from a HTML file, replacing each with a short (5 char max) random or sequencial code, and output every code + text line to a separate text file. So, I have an HTML file like this: <.....> text line 1 </.....> <.....> text li...