This is not quite true, in that there are exceptions in some bicameral alphabets such as Turkish and Northern Azeri.This filter expects UTF-8 data and will handle foreign character sets.
Both these alphabets include the following two letters:
Code: Select all
U+0130 LATIN CAPITAL LETTER I WITH DOT ABOVE : i dot
U+0131 LATIN SMALL LETTER DOTLESS I
Code: Select all
İı
On the other hand, it does change most accented Latin letters, e.g.
Code: Select all
Š
Code: Select all
š
Not sure how you might implement the proper case rules for the Turkish alphabet, etc.This filter expects UTF-8 data and will handle some foreign character sets.
These filters would first need to have the writing system context specified by the user.
Furthermore, I would guess that you'd not given any consideration to extending these Character cAsE filters to cover the Cherokee supplement block of small letters that were defined by Unicode 8.0 (June 2015).
Best regards,
David