Archive.org Wayback Machine
In your plugin’s description, you state…
…Blackhole only affects bad bots: human users never see the hidden link, and good bots obey the robots rules in the first place.
I want to block Archive.org Wayback Machine.
Apparently Archive.org's bots (ia_archiver and archive.org_bot) stopped obeying robots.txt files around late 2017. From 2015/2016 until then, I had successfully blocked Archive.org/Wayback Machine from crawling and archiving my sites. But sometime in late 2017, they stopped obeying my robots.txt file and have since crawled and archived all my sites. Formal emails asking them to remove my sites have been fruitless. I have had the following entries in my robots.txt file for years now, and they used to work…
User-agent: archive.org_bot
Disallow: /

User-agent: ia_archiver
Disallow: /
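As a sanity check that the rules themselves are well-formed, they can be run through Python's standard `urllib.robotparser`. This is only a sketch: the `is_blocked` helper and the `https://example.com/` URL are made up for illustration, and the check only shows how a compliant parser reads the file, not what any particular crawler actually does.

```python
from urllib.robotparser import RobotFileParser

# The two rule groups from the robots.txt file, one directive per line.
ROBOTS_TXT = """\
User-agent: archive.org_bot
Disallow: /

User-agent: ia_archiver
Disallow: /
"""


def is_blocked(user_agent, url="https://example.com/"):
    """Return True if a robots.txt-compliant parser would refuse `url`.

    `url` is a placeholder; any path on the site gives the same answer
    here because the rules disallow everything under "/".
    """
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return not parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("archive.org_bot", "ia_archiver", "Googlebot"):
        print(agent, "blocked" if is_blocked(agent) else "allowed")
```

If both Archive.org agents come back as blocked, the syntax is fine, and any crawling that still happens comes down to the crawler not honoring the file.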
But they no longer work. Last week, I added the following meta tags to my site…
<meta name="ia_archiver" content="noindex,nofollow,noarchive">
<meta name="archive.org_bot" content="noindex,nofollow,noarchive">
…and that also does not seem to be working.
So, since Archive.org apparently no longer obeys robots.txt files, will your plugin block/trap the ia_archiver and archive.org_bot bots? This is what I am looking for.
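To make concrete what I mean by blocking on the user agent, here is a minimal sketch in Python. It is not how the plugin works (I don't know its internals); `should_block`, `respond`, and the sample user-agent strings are hypothetical names made up for illustration. It also only helps if the crawler actually sends one of these tokens in its User-Agent header.

```python
# Tokens to look for in the User-Agent header (taken from my robots.txt).
BLOCKED_AGENT_TOKENS = ("ia_archiver", "archive.org_bot")


def should_block(user_agent):
    """Return True if the User-Agent header contains a blocked token.

    Matching is a case-insensitive substring test, so the token can
    appear anywhere in the header value.
    """
    if not user_agent:
        return False
    ua = user_agent.lower()
    return any(token in ua for token in BLOCKED_AGENT_TOKENS)


def respond(user_agent):
    """Pick an HTTP status code: 403 for blocked agents, 200 otherwise."""
    return 403 if should_block(user_agent) else 200


if __name__ == "__main__":
    # Made-up header values, for illustration only.
    print(respond("ExampleCrawler/1.0 (compatible; archive.org_bot)"))
    print(respond("ExampleBrowser/1.0"))
```

A server-side check like this does not depend on the bot choosing to read robots.txt, which is the behavior I am hoping the plugin can provide.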
- The topic ‘Archive.org Wayback Machine’ is closed to new replies.