• Resolved bjrnet21

    (@bjrnet21)


    Thank you for great plugin. I’ve had the free version installed on my site for a few months.

    My site maintains an “encyclopedia” so a good search engine is helpful.

    However, I’ve noticed that Relevanssi may have a problem with some of the “big” pages on my site. Does Relevanssi index “all” words on a page? Is there a file size limit?

    Example:
    One big page on my site is https://www.blackjackreview.com/wp/encyclopedia/m/ called “M IS FOR MONEY”.
    I want to search for the words ‘Richard Jacobs’, which I know is at the bottom of that page.
    Relevanssi gives me a lot of results BUT not the page mentioned above.
    It looks like ‘Richard Jacobs’ was not indexed.

    I am using the latest versions of Twenty-Fourteen theme and WordPress.
    Relevanssi:
    Default operator = AND
    Fallback to OR = Checked
    Default order = Relevance
    Keyword matching = Partial words

    Any help would be appreciated.

    The page I need help with: [log in to see the link]

Viewing 4 replies - 1 through 4 (of 4 total)
  • Plugin Author Mikko Saari

    (@msaari)

    There’s no limit to page size, and that’s a fairly small page anyway.

    Now it may be there’s something wrong with the page structure that’s throwing Relevanssi off. You can find the page for “macdougal”, but not for “maddog”, so perhaps there’s something at that point of the page that makes Relevanssi not see the rest of the page?

    What that would be exactly is impossible for me to tell from looking at the page.

    What you can try is adding this to your theme functions.php and then saving the post:

    add_filter( 'relevanssi_post_content_before_tokenize', 'rlv_post_content' );
    function rlv_post_content( $content ) {
        var_dump( $content );
        exit();
    }

    This will show you how Relevanssi sees the post. Is everything there? Once you’ve done checking, remove the code, then save the post again to make sure it’s indexed.

    Thread Starter bjrnet21

    (@bjrnet21)

    Thank you for your help.

    Yes, it looks like Relevanssi stopped indexing early before the word “maddog”.

    Your filter works great. It showed me the following (trimmed excerpt):
    string(2513) “ENCYCLOPEDIA OF BLACKJACK # A B C D E F G H I J K L M N O P Q R S T U V W XYZ MA . Acronym for Multiple Action blackjack. …. g this trip. MacDougall was on To Tell the Truth on Aug 27, 1957. * Some sources indicate a 1906 birth year. [ blackjackforumonline.com/content/The_First_Counters.htm ] [ geniimagazine.com/magicpedia/Michael_MacDougall ]”

    There was a “commented” out line just after this:
    <!–p align=”center”><iframe src=”https://www.youtube.com/embed/5x5zmt2R1fA?rel=0&#8243; width=”560″ height=”315″ frameborder=”0″ allowfullscreen=”allowfullscreen”></iframe></p–>

    When I removed the above line, Relevanssi saw the rest of the page.

    I guess the BIG question is – Does Relevanssi have issues with <!– comments –> on the page?

    Plugin Author Mikko Saari

    (@msaari)

    No, but Relevanssi does have problems with malformed HTML code.

    <!--p align="center"... is not the right way to make comments. If you change the commented code to

    <!-- <p align="center">...</p> -->

    Relevanssi probably won’t choke there (well, WP won’t, because Relevanssi is using the wp_strip_all_tags() function to remove tags, and that’s probably what is not liking the comments).

    Thread Starter bjrnet21

    (@bjrnet21)

    Thanks for the feedback. I’ll have to put this one in my bag of tricks…. to avoid.

    I guess I will have to do a search for any and all commented blocks to verify they are done right.

Viewing 4 replies - 1 through 4 (of 4 total)
  • The topic ‘Not indexing terms at bottom of big pages. Is there a page size limit?’ is closed to new replies.