Character cAsE filters and the Turkish alphabet
Posted: Tue Mar 03, 2020 10:12 pm
The help pages for the various Character cAsE filters state the following:
"This filter expects UTF-8 data and will handle foreign character sets."
This is not quite true, in that there are exceptions in some bicameral alphabets such as Turkish and Northern Azeri.
Both these alphabets include the following two letters:
Code:
U+0130 LATIN CAPITAL LETTER I WITH DOT ABOVE : i dot
U+0131 LATIN SMALL LETTER DOTLESS I
So, for example, pasting the following into the Trial Run area and running the tOGGLE cASE filter makes no change:
Code:
İı
On the other hand, it does change most accented Latin letters, e.g.
Code:
Š
to
Code:
š
Perhaps the sentence in the Help pages should be qualified:
"This filter expects UTF-8 data and will handle some foreign character sets."
Not sure how you might implement the proper case rules for the Turkish alphabet, etc.
These filters would first need to have the writing system context specified by the user.
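To illustrate what I mean, here is a rough Python sketch of a context-aware toggle. The names toggle_case, TURKIC_LANGS and the lang parameter are my own inventions for illustration, not anything the filters actually provide, and it assumes the user supplies a language code such as "tr" or "az". Under Turkish rules the pairs are I/ı and İ/i, so those four letters are handled first and everything else falls back to the default mappings.

```python
# Hypothetical sketch: toggle_case and its lang parameter are illustrative
# names, not part of the actual Character cAsE filters.

# Languages that use the dotted/dotless I pairs (Turkish, Azerbaijani).
TURKIC_LANGS = {"tr", "az"}

# Case-toggle pairs under Turkish rules: I <-> ı and İ <-> i.
TURKIC_TOGGLE = {
    "I": "\u0131",  # I -> ı (small dotless i)
    "\u0131": "I",  # ı -> I
    "\u0130": "i",  # İ -> i
    "i": "\u0130",  # i -> İ (capital I with dot above)
}


def toggle_case(text: str, lang: str = "") -> str:
    """Swap the case of each character, applying Turkic I rules on request."""
    if lang not in TURKIC_LANGS:
        # Default, language-independent Unicode case mappings.
        return text.swapcase()
    return "".join(TURKIC_TOGGLE.get(ch, ch.swapcase()) for ch in text)


print(toggle_case("\u0130\u0131", lang="tr"))  # İı -> iI
print(toggle_case("\u0160"))                   # Š -> š
```

With lang="tr" the two Turkish letters toggle to their proper partners; without it the text just gets whatever the default Unicode case swap produces.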
Furthermore, I would guess that you've not given any consideration to extending these Character cAsE filters to cover the Cherokee Supplement block of small letters that was defined by Unicode 8.0 (June 2015).
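For reference, the Cherokee pairs are regular enough to sketch by hand. The snippet below is only an illustration in Python with made-up names (cherokee_upper, cherokee_lower). It assumes the Unicode 8.0 layout, where the small letters U+AB70..U+ABBF pair with the capitals U+13A0..U+13EF, and six further small letters U+13F8..U+13FD pair with U+13F0..U+13F5.

```python
# Hypothetical helpers, for illustration only; not the filters' own code.
# Offsets between Cherokee small letters and their capitals (Unicode 8.0).
SUPPLEMENT_OFFSET = 0xAB70 - 0x13A0  # Cherokee Supplement small letters
MAIN_BLOCK_OFFSET = 0x13F8 - 0x13F0  # six small letters in the main block


def cherokee_upper(ch: str) -> str:
    """Return the capital for a Cherokee small letter, else ch unchanged."""
    cp = ord(ch)
    if 0xAB70 <= cp <= 0xABBF:
        return chr(cp - SUPPLEMENT_OFFSET)
    if 0x13F8 <= cp <= 0x13FD:
        return chr(cp - MAIN_BLOCK_OFFSET)
    return ch


def cherokee_lower(ch: str) -> str:
    """Return the small letter for a Cherokee capital, else ch unchanged."""
    cp = ord(ch)
    if 0x13A0 <= cp <= 0x13EF:
        return chr(cp + SUPPLEMENT_OFFSET)
    if 0x13F0 <= cp <= 0x13F5:
        return chr(cp + MAIN_BLOCK_OFFSET)
    return ch
```

A toggle would then just call cherokee_lower on capitals and cherokee_upper on small letters, leaving everything outside these ranges to the existing logic.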
Best regards,
David