Fall Sale! Code FALL2024 takes 25% OFF our Pro Plugins & Books »
Web Dev + WordPress + Security

Stop Bitacle from Stealing Content

[ Stop bitacle.org ] If you have yet to encounter the content-scraping site, bitacle.org, consider yourself lucky. The scum-sucking worm-holes at bitacle.org are well-known for literally, blatantly, and piggishly stealing blog content and using it for financial gains through advertising. While I am not here to discuss the legal, philosophical, or technical ramifications of illegal bitacle behavior, I am here to provide a few critical tools that will help stop bitacle from stealing your content.

The .htaccess Finger

Perhaps the most straightforward and effective method for keeping the bitacle thieves away from your site, adding the following htaccess rules to your root htaccess file will literally block bitacle’s IP address and return a 403 “Forbidden” response. To make it happen, add this to your site’s root htaccess file:

RewriteBase /
RewriteCond %{REMOTE_ADDR} ^212\.22\.59\.251$ [OR]
RewriteCond %{HTTP_USER_AGENT} Bitacle
RewriteRule .? - [F]

For more information on .htaccess files and blocking unwanted requests, check out Stupid htaccess Tricks and How to Block Bad Bots.

The robots.txt Slap

Next up, another effective anti-bitacle method that instructs the bitacle bots to stay away from your site. This method uses a robots.txt file in your site’s root directory and literally denies bitacle agents crawl-access to all site contents. Simply add the following lines to your site’s root robots.txt file:

User-agent: Bitacle bot/1.1
Disallow: /
User-agent: Bitacle bot
Disallow: /
User-agent: Bitacle *
Disallow: /
User-agent: Bitacle*
Disallow: /
User-agent: Bitacle
Disallow: /

For more information on robots.txt, check out Robots Notes Plus and Better Robots.txt Rules for WordPress.

Related WordPress Plugins

For more help on the anti-plagiarism front, check out the Copyfight and Copyright Proof. These fine WordPress plugins come highly recommended and are definitely worth checking out.

Other Essential Tools

Beyond the essential preventative methods discussed above, there are many other resources and tools now available for dealing with site scrapers, content thieves, and other worthless garbage. A worthwhile website is Copyscape, which provides an excellent tool that enables users to search the web for stolen content. If you find that your content has indeed been plagiarized, read up on how to respond properly and effectively. Finally, try searching for various search terms, such as “plagiarism tools”, “content scraping”, “copyright protection”, “syndication theft”, etc. Good Luck!

References & Resources

About the Author
Jeff Starr = Web Developer. Book Author. Secretly Important.
Blackhole Pro: Trap bad bots in a virtual black hole.

2 responses to “Stop Bitacle from Stealing Content”

  1. Noel Cower 2006/11/30 1:50 am

    Figured I’d let you know that Bitacle’s bot does not pay attention to robots.txt rules. The most effective way to stop them is to simply ban their user-agent and take some measures to ensure that your content can’t be easily spidered/stolen.

    Also, you are at far greater risk by using a service such as FeedBurner.

  2. Perishable 2006/11/30 8:20 pm

    Yeah, I have read elsewhere that bitacle ignores robots.txt rules, but I am paranoid enough to include them anyway. It may not be necessary, but it is the formally accepted method, and it definitely won’t hurt anything.

    As for FeedBurner (and similar services), the benefits of their service currently outweigh the potential threat of content hounds like bitacle. Nonetheless, I definitely will be looking into it further and perhaps changing my mind if anything serious unfolds.. Either way, I appreciate your comment and the heads up concerning our mutual enemy! ;)

Comments are closed for this post. Something to add? Let me know.
Welcome
Perishable Press is operated by Jeff Starr, a professional web developer and book author with two decades of experience. Here you will find posts about web development, WordPress, security, and more »
SAC Pro: Unlimited chats.
Thoughts
I disabled AI in Google search results. It was making me lazy.
Went out walking today and soaked up some sunshine. It felt good.
I have an original box/packaging for 2010 iMac if anyone wants it free let me know.
Always ask AI to cite its sources. Also: “The Web” is not a valid answer.
All free plugins updated and ready for WP 6.6 dropping next week. Pro plugin updates in the works also complete :)
99% of video thumbnail/previews are pure cringe. Goofy faces = Clickbait.
RIP ICQ
Newsletter
Get news, updates, deals & tips via email.
Email kept private. Easy unsubscribe anytime.