• Hi,

    I’m not sure why but every time I save a post, WordPress is removing Unicode characters before it saves it to the database.

    The Unicode characters is the Phoenician block, from U+10900…U+1091F. This is the first time I noticed this. I tested the Tagalog block but it works fine. The Tagalog Unicode points are saved by WordPress.

    Does Unicode support require manual requests before WordPress can process/save it? Or has it to do with the database itself?

    I won’t be able to give a “sample” site for display purposes since the text is cleaned up, instead you will find below the code to use for testing.

    <pre>Hex NCR: <bdo lang="hbo-Hebr" dir="rtl">𐤉𐤅𐤇𐤍𐤍</bdo></pre>
    <bdo lang="hbo-Hebr" dir="rtl">𐤉𐤅𐤇𐤍𐤍</bdo>
    
    <pre>Unicode-Phoenician Keyboard Layout: <bdo lang="hbo-Hebr" dir="rtl">??????????</bdo></pre>
    <bdo lang="hbo-Hebr" dir="rtl">??????????</bdo>

    (Edit: Sadly, the Hex NCR is not being preserved as a “code”.)

    As shown above, I had to use Hex NCR just to type and have WordPress saved it. Otherwise, typing it directly, it becomes this after saving the post, empty:
    <bdo lang="hbo-Hebr" dir="rtl"></bdo>

    Instead of: <bdo lang="hbo-Hebr" dir="rtl">??????????</bdo>

    If you need the keyboard layout (and font) for testing as well, I currently have a Windows binary available (Linux keyboard layout not yet ready), here: https://bitbucket.org/paninap/pukbl/downloads look for: PHNKL v1.0.0

    Font here: https://bitbucket.org/paninap/pukbl/wiki/Fonts

    Thank you for the assistance.

Viewing 1 replies (of 1 total)
  • Thread Starter ???? (Yuki|雪亮) ????

    (@laibcoms)

    Hmm, the question is, why is WordPress cleaning up Unicode characters it doesn’t recognize? Is it not the problem of the end-user not the software?

Viewing 1 replies (of 1 total)
  • The topic ‘WordPress removing Unicode characters’ is closed to new replies.