Subscribe to Our Newsletter

Success! Now Check Your Email

To complete Subscribe, click the confirmation link in your inbox. If it doesn't arrive within 3 minutes, check your spam folder.

Ok, Thanks
Cloudflare is helping users block AI bots and crawlers regardless of how well they behave
Credit: Cloudflare

Cloudflare is helping users block AI bots and crawlers regardless of how well they behave

Cloudflare has introduced a new feature allowing all users, including those on the free tier, to easily block all AI bots by leveraging its advanced detection methods for disguised bots. Cloudflare has also launched tools enabling all users to report bot activity not blocked by the service.

Ellie Ramirez-Camara profile image
by Ellie Ramirez-Camara

With an announcement that could not be more timely, Cloudflare recently announced the implementation of an 'easy button' solution to block all AI bots for all its users, including the ones in the free tier. Cloudflare already offers a solution that blocks malicious bots and lets users control whether they want well-behaved bots, AI and otherwise, to be able to visit their website, and if so, how much of it they can access. Cloudflare defines a well-behaving bot as one that has taken the following actions to show they are acting in good faith:

  1. The bot's maintainers should also maintain a public web page committing to respect robots.txt.
  2. There should be a verifiable range of IP addresses exclusively used by the bot.
  3. A stable and unique user-agent should represent the bot.
  4. The maintainer should respect robots.txt user-agent and wild-card entries.
  5. AI crawlers should respect crawl delay.

In other words, mostly things that Perplexity AI cannot be bothered to do. Given that Perplexity AI is hardly the only tech company comfortable overstepping widely respected best practices because they find following them inconvenient, Cloudflare's most recent announcement is to extend protection to cover all AI scrapers and crawlers, regardless of their behavior.

To detect bots that may be disguising themselves as legitimate web browsers, Cloudflare deploys its bot detection machine learning model, which analyzes and scores traffic to determine the likelihood of it coming from a bot or a human. Users can implement a rule to challenge traffic with a bot score of 30 or less to block any unwanted disguised bot traffic. To report cases where misbehaving bot activity is allowed by Cloudflare, Enterprise customers should submit a False Negative Feedback Loop. Additionally, all other users can use Cloudflare's new dedicated reporting tool.

Ellie Ramirez-Camara profile image
by Ellie Ramirez-Camara
Updated

Data Phoenix Digest

Subscribe to the weekly digest with a summary of the top research papers, articles, news, and our community events, to keep track of trends and grow in the Data & AI world!

Success! Now Check Your Email

To complete Subscribe, click the confirmation link in your inbox. If it doesn’t arrive within 3 minutes, check your spam folder.

Ok, Thanks

Read More