WordPress removing Unicode characters
-
Hi,
I’m not sure why but every time I save a post, WordPress is removing Unicode characters before it saves it to the database.
The Unicode characters is the Phoenician block, from U+10900…U+1091F. This is the first time I noticed this. I tested the Tagalog block but it works fine. The Tagalog Unicode points are saved by WordPress.
Does Unicode support require manual requests before WordPress can process/save it? Or has it to do with the database itself?
I won’t be able to give a “sample” site for display purposes since the text is cleaned up, instead you will find below the code to use for testing.
<pre>Hex NCR: <bdo lang="hbo-Hebr" dir="rtl">𐤉𐤅𐤇𐤍𐤍</bdo></pre> <bdo lang="hbo-Hebr" dir="rtl">𐤉𐤅𐤇𐤍𐤍</bdo> <pre>Unicode-Phoenician Keyboard Layout: <bdo lang="hbo-Hebr" dir="rtl">??????????</bdo></pre> <bdo lang="hbo-Hebr" dir="rtl">??????????</bdo>
(Edit: Sadly, the Hex NCR is not being preserved as a “code”.)
As shown above, I had to use Hex NCR just to type and have WordPress saved it. Otherwise, typing it directly, it becomes this after saving the post, empty:
<bdo lang="hbo-Hebr" dir="rtl"></bdo>
Instead of:
<bdo lang="hbo-Hebr" dir="rtl">??????????</bdo>
If you need the keyboard layout (and font) for testing as well, I currently have a Windows binary available (Linux keyboard layout not yet ready), here: https://bitbucket.org/paninap/pukbl/downloads look for: PHNKL v1.0.0
Font here: https://bitbucket.org/paninap/pukbl/wiki/Fonts
Thank you for the assistance.
- The topic ‘WordPress removing Unicode characters’ is closed to new replies.