Removing HTML from Excel column

Posted: Thu Oct 01, 2009 4:56 am
by ausgrl
Hello -

Can you tell me whether this is the correct product for the following task (and possibly give me a hint how)?

I have an Excel spreadsheet in which one column contains HTML markup within the text.

I need to:
- retain the row/column structure
- remove ALL information contained within any <table></table> tags
- remove the remaining HTML tags so I am left with plain text

I tried running TextPipe with the HTML removal filter, but then the worksheet wouldn't open in Excel (data corrupt).

Thanks!

Re: Removing HTML from Excel column

Posted: Thu Oct 01, 2009 8:42 am
by DataMystic Support
Of course TextPipe works with text files, not with Excel files.

However, if you highlight the desired column in Excel and copy it to the clipboard, you can switch to TextPipe and apply the remove HTML filter to the clipboard (see the Trial Run, Go, Clipboard buttons at the bottom of the screen). You can then paste the result back into Excel, and the rows should remain in place.
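
For anyone who wants to see the idea behind the clipboard approach spelled out, here is a rough Python sketch. It is not how TextPipe works internally; it only illustrates the three requirements from the question using the standard library's html.parser module. The names strip_html and clean_column are made up for this example, and it assumes each cell arrives as one string (for instance, one line of the copied column). Whitespace is collapsed so that a cell never turns into several lines, which is what keeps the rows aligned when pasting back.

```python
import re
from html.parser import HTMLParser


class _TextOnly(HTMLParser):
    """Collects plain text and skips everything inside <table>...</table>."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.table_depth = 0  # greater than 0 while inside a (nested) table

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.table_depth += 1

    def handle_endtag(self, tag):
        if tag == "table" and self.table_depth > 0:
            self.table_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a table.
        if self.table_depth == 0:
            self.parts.append(data)


def strip_html(cell: str) -> str:
    """Hypothetical helper: return the plain text of one cell on one line."""
    parser = _TextOnly()
    parser.feed(cell)
    parser.close()  # flush any text still buffered by the parser
    # Collapse all whitespace (including line breaks) to single spaces.
    return re.sub(r"\s+", " ", "".join(parser.parts)).strip()


def clean_column(cells):
    """Hypothetical helper: clean every cell, keeping one output per input."""
    return [strip_html(c) for c in cells]


if __name__ == "__main__":
    column = [
        "<p>Hello <b>world</b></p>",
        "Keep this<table><tr><td>drop this</td></tr></table> and this",
        "plain text cell",
    ]
    for line in clean_column(column):
        print(line)
```

Note that adjacent block elements such as <p>a</p><p>b</p> are joined without a space in this sketch; if that matters for your data, you would need to insert a space when a block-level tag closes.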

Re: Removing HTML from Excel column

Posted: Thu Oct 01, 2009 2:30 pm
by ausgrl
Doh - blonde moment.

Thanks - worked great!

FYI - I tried this with another application and it inserted line breaks, thereby destroying the table structure. This was beauty!