Search and replace problems in unicode files

Get help with installation and running here.

Moderators: DataMystic Support, Moderators, DataMystic Support, Moderators, DataMystic Support, Moderators

Post Reply
Tobias

Search and replace problems in unicode files

Post by Tobias »

I'm trying to make some changes to a japanese xml file, but I dont get any maches for my search strings (both perl patterns and exact). In trial area, i get perfect maches if I disable "treat input as unicode" so I suspect I have missed something about how unicode is treated in patterns.

Even the simplest search fails, for instance, a pearl or exakt search for just the caracters ref fails, even though I have checked the Enable UTF-8 support box in Matching options. (Of course I want to do a more complex search then this, but when trouble shooting I removed more and more stuff from the pattern until there was only ordinary characters left).[/b].
User avatar
DataMystic Support
Site Admin
Posts: 2227
Joined: Mon Jun 30, 2003 12:32 pm
Location: Melbourne, Australia
Contact:

Post by DataMystic Support »

Hi Tobias,

When you paste data into the Trial Run area, it gets stored in Windows' native Unicode format, UTF16 (LE). With the 'Treat as Unicode' option disabled, this gets converted to ANSI prior to processing.

What is the format of your file?

Can you please email me your filter and a sample file for comparison?
Post Reply