Bug in Find whole words only option for replace list filter
Posted: Fri May 04, 2018 6:35 am
This is in the context of using the Replace list filter with Pattern (perl) as find type.
My tab-delimited external Replace list contains this as one of the many lines:
It's designed to replace the accented letter "ó" by the unaccented letter "o" and is set to apply with
Match case, Find whole words only, UTF-8 support.
I just found that instead of replacing only the 2 single letter words that were intended,
it also replaced the "ó" at the end of 55 words that ended with the letters "ñó".
viz.
The letter U+00F1 LATIN SMALL LETTER N WITH TILDE "ñ" seems to be seen in this context as if it were a non-word character!
How else can one interpret this very unexpected result?
This is a surely a software bug!
Aside: The input file contains Spanish text. My Windows locale is English (UK).
Best regards,
David
My tab-delimited external Replace list contains this as one of the many lines:
Code: Select all
\xF3 o
Match case, Find whole words only, UTF-8 support.
I just found that instead of replacing only the 2 single letter words that were intended,
it also replaced the "ó" at the end of 55 words that ended with the letters "ñó".
viz.
Code: Select all
enseñó soñó riñó engañó ciñó constriñó apañó dañó
How else can one interpret this very unexpected result?
This is a surely a software bug!
Aside: The input file contains Spanish text. My Windows locale is English (UK).
Best regards,
David