• Hi,

    I’ve tested this plugin on my site, and it seems that Relevanssi doesn’t support Chinese, and perhaps not some other Asian languages either (maybe Japanese and Korean).

    Relevanssi (version 2.7.5) supports UTF-8, but some Asian languages don’t put spaces between words. For example, “這是一個很長很長很長很長很長的句子。” is a sentence, but Relevanssi seems to treat it as a single word. If I search for “這是”, Relevanssi finds it, perhaps because that is the start of the “word”, but if I search for “很長”, there are no results.

    WordPress’s built-in search doesn’t have this problem. Can anyone help me with it?

    Thanks!

Viewing 2 replies - 1 through 2 (of 2 total)
  • Plugin Author Mikko Saari

    (@msaari)

    Yes – Relevanssi has no support for any Asian languages for the simple reason that I can’t read any of them and don’t really know how they should be processed.

    For example, posts are tokenized into words by splitting the content at spaces. That doesn’t really work in a language that doesn’t use spaces to separate words, as you noticed.
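
    To illustrate, here is a rough sketch in Python (not Relevanssi's actual PHP code). The names `tokenize` and `prefix_search` are hypothetical, and matching on the start of a token is only an assumption based on the behaviour described in the question:

```python
def tokenize(text):
    """Split text into tokens at whitespace, as a space-based indexer would."""
    return text.split()


def prefix_search(tokens, query):
    """Return the tokens that start with the query (assumed matching rule)."""
    return [token for token in tokens if token.startswith(query)]


# An English sentence splits into several tokens.
english = tokenize("this is a very long sentence")

# The Chinese sentence contains no whitespace, so it stays one single token.
chinese = tokenize("這是一個很長很長很長很長很長的句子。")

# "這是" sits at the start of that single token, so it matches.
start_hits = prefix_search(chinese, "這是")

# "很長" only occurs in the middle of the token, so nothing matches.
middle_hits = prefix_search(chinese, "很長")
```

    With this kind of splitting, the whole Chinese sentence is indexed as one “word”, so only queries that match its beginning can find it.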

    I’m afraid this won’t change for the better, either, until somebody with a better understanding of Asian languages fixes it. A quick look at Chinese or Japanese tokenization would suggest this is a fairly complicated problem, too.

    (WordPress default search doesn’t have the problem, as it simply searches the post database and doesn’t build separate indices.)
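
    As a rough contrast, a plain substring search over the stored content needs no tokenization at all. This is a minimal Python sketch of that idea, not WordPress's actual query code; the `posts` dictionary, its IDs, and `substring_search` are hypothetical:

```python
# Hypothetical post IDs mapped to post content.
posts = {
    1: "這是一個很長很長很長很長很長的句子。",
    2: "another post without any Chinese text",
}


def substring_search(posts, query):
    """Return IDs of posts whose content contains the query anywhere."""
    return [post_id for post_id, content in posts.items() if query in content]


# The query is found even though it is in the middle of the sentence.
hits = substring_search(posts, "很長")
```

    Because the match can start at any character, word boundaries never come into play, which is why the default search finds “很長”.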

    Thread Starter 46huang

    (@46huang)

    Thanks for your help. Even though I cannot use Relevanssi on my site, it’s still a very, very good plugin!

  • The topic ‘[Plugin: Relevanssi – A Better Search] doesn't fully support chinese’ is closed to new replies.