Unicode Normalization of UTF-8 is very slow


Moderators: DataMystic Support, Moderators

dfhtextpipe
Posts: 988
Joined: Sun Dec 09, 2007 2:49 am
Location: UK

Post by dfhtextpipe »

Compared with BabelPad, the TextPipe Unicode filter "Normalize to NFC" (and the other normalization forms) is very, very slow.

When processing multiple input files, it's like "watching paint dry".
I observed this while processing Gurmukhi input text.

What can be done to improve the performance of this filter?

BabelPad is available from http://www.babelstone.co.uk/Software/BabelPad.html
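
For anyone who wants a rough baseline to compare against, here is a minimal sketch in Python (not TextPipe or BabelPad code) that times NFC normalization of Gurmukhi text with the standard unicodedata module. The helper name normalize_nfc and the sample string are hypothetical, chosen only for illustration, and the quick check via is_normalized needs Python 3.8 or later.

```python
import time
import unicodedata


def normalize_nfc(text: str) -> str:
    """Return text in NFC, skipping the work when it is already normalized."""
    # Quick check first: already-normalized input is returned unchanged.
    if unicodedata.is_normalized("NFC", text):
        return text
    return unicodedata.normalize("NFC", text)


# Hypothetical sample text. GURMUKHI LETTER KHHA (U+0A59) is a composition
# exclusion, so NFC replaces it with KHA (U+0A16) + NUKTA (U+0A3C).
sample = "\u0A59\u0A3E\u0A32\u0A38\u0A3E " * 100_000

start = time.perf_counter()
result = normalize_nfc(sample)
elapsed = time.perf_counter() - start

print(f"{len(sample)} code points normalized in {elapsed:.3f} s")
```

The timing will vary by machine, so this only gives an order-of-magnitude figure for what a plain NFC pass over the same amount of text costs.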

Best regards,

David