Unicode pattern reference help
Posted: Thu Jul 28, 2011 12:09 am
The help for Unicode Pattern Reference mentions character property classes (in the Notes section).
Character property classes are not tabulated in the help anywhere.
I found the following list in http://www.koders.com/delphi/fidDBC6499 ... rithm#L208.
It might be sensible to document property classes in the TextPipe help file.
Whether the above definitions are the correct ones for TextPipe is not for me to say.
On the other hand, the word TCharacterCategory is not defined anywhere in the help, even though the notes refer to it twice.
David
Character property classes are not tabulated in the help anywhere.
I found the following list in http://www.koders.com/delphi/fidDBC6499 ... rithm#L208.
Code: Select all
// Notes:
// o Character property classes are \p or \P followed by a comma separated
// list of integers between 1 and 32. These integers are references to
// the following character properties:
//
// N Character Property
// --------------------------
// 1 _URE_NONSPACING
// 2 _URE_COMBINING
// 3 _URE_NUMDIGIT
// 4 _URE_NUMOTHER
// 5 _URE_SPACESEP
// 6 _URE_LINESEP
// 7 _URE_PARASEP
// 8 _URE_CNTRL
// 9 _URE_PRIVATE
// 10 _URE_UPPER (note: upper, lower and titel case classes need to have case
// 11 _URE_LOWER sensitive search be enabled to match correctly!)
// 12 _URE_TITLE
// 13 _URE_MODIFIER
// 14 _URE_OTHERLETTER
// 15 _URE_DASHPUNCT
// 16 _URE_OPENPUNCT
// 17 _URE_CLOSEPUNCT
// 18 _URE_OTHERPUNCT
// 19 _URE_MATHSYM
// 20 _URE_CURRENCYSYM
// 21 _URE_OTHERSYM
// 22 _URE_LTR
// 23 _URE_RTL
// 24 _URE_EURONUM
// 25 _URE_EURONUMSEP
// 26 _URE_EURONUMTERM
// 27 _URE_ARABNUM
// 28 _URE_COMMONSEP
// 29 _URE_BLOCKSEP
// 30 _URE_SEGMENTSEP
// 31 _URE_WHITESPACE
// 32 _URE_OTHERNEUT
Whether the above definitions are the correct ones for TextPipe is not for me to say.
On the other hand, the word TCharacterCategory is not defined anywhere in the help, even though the notes refer to it twice.
David