Page 1 of 1

Unicode Normalization of UTF-8 is very slow

Posted: Tue Jan 17, 2017 5:22 am
by dfhtextpipe
Compared to the speed of BabelPad, using the TextPipe Unicode filter Normalize to NFC (etc) is very, very slow.

When processing multiple input files, it's like "watching paint dry".
My observation appertains to processing Gurmukhi input text.

What can be done to improve the performance of this filter?

BabelPad is available from http://www.babelstone.co.uk/Software/BabelPad.html

Best regards,

David