AI User Agents List (updated Dec 2025)

ChatGPT, Claude, Perplexity... Identify which AI bots are crawling your site.

🔍 Browse AI Bots

Explore our database of 58 AI bots. Filter by category, or behavior to find what you need.

Missing a bot? Contact us to suggest new bots for our database.

🔎 Search AI Bots

Showing 58 AI bots

AI2Bot

Allen Institute for AI AI Training

Allen Institute for AI bot for academic research and model training

User Agent:

Mozilla/5.0 (compatible; AI2Bot/1.0; +https://allenai.org/)

Bot name:

AI2Bot

Tags:

#ai2 #academic #research #training

✅ Respectful

📚 Docs →

Amazonbot

Amazon AI Assistant

Amazon bot to improve Alexa and AWS AI services

User Agent:

Amazonbot/0.1 (+https://developer.amazon.com/support/amazonbot)

Bot name:

Amazonbot

Tags:

#amazon #alexa #aws #assistant

✅ Respectful

📚 Docs →

Andibot

Andi AI Search

Andi AI search engine bot, competitor to Perplexity

User Agent:

Mozilla/5.0 (compatible; Andibot/1.0)

Bot name:

Andibot

Tags:

#andi #search #answer-engine #competitor

✅ Respectful

anthropic-ai

Anthropic AI Training

Training bot for Anthropic's Claude models, collects data to improve models

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Claude-Web/1.0; +https://www.anthropic.com)

Bot name:

anthropic-ai

Tags:

#claude #anthropic #training #bulk-data

✅ Respectful

📚 Docs →

Anthropic-Claude

Anthropic AI Assistant

Updated Anthropic Claude bot for real-time web access and citations

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Anthropic-Claude/1.0; +https://www.anthropic.com)

Bot name:

Anthropic-Claude

Tags:

#anthropic #claude #realtime #citations

✅ Respectful

📚 Docs →

Claude-Web

Anthropic AI Search

Claude's web bot for exploration and indexing of web content

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Claude-Web/1.0; +https://www.anthropic.com)

Bot name:

claude-web

Tags:

#claude #anthropic #web #crawling

✅ Respectful

📚 Docs →

ClaudeBot

Anthropic AI Assistant

Bot used by Claude to fetch citations and references in real-time during conversations

User Agent:

ClaudeBot/1.0; +https://www.anthropic.com

Bot name:

ClaudeBot

Tags:

#claude #anthropic #citations #assistant

✅ Respectful

📚 Docs →

Applebot-Extended

Apple AI Training

Bot for training Apple AI models (Apple Intelligence)

User Agent:

Mozilla/5.0 (compatible; Applebot-Extended/1.0)

Bot name:

Applebot-Extended

Tags:

#apple #apple-intelligence #training #siri

✅ Respectful

📚 Docs →

bigsur.ai

BigSur AI AI Training

New emerging AI bot, details on usage still limited

User Agent:

Mozilla/5.0 (compatible; bigsur.ai/1.0)

Bot name:

bigsur.ai

Tags:

#bigsur #emerging #new #training

✅ Respectful

Brightbot

Bright Data AI Training

Bright Data analysis bot to collect data for AI

User Agent:

Mozilla/5.0 (compatible; Brightbot/1.0)

Bot name:

Brightbot

Tags:

#bright-data #analysis #data-collection #training

✅ Respectful

Bytespider

ByteDance AI Training

ByteDance (TikTok) bot for training their Chinese AI models

User Agent:

Mozilla/5.0 (compatible; Bytespider; [email protected])

Bot name:

Bytespider

Tags:

#bytedance #tiktok #chinese #training

✅ Respectful

TerraCotta

Ceramic AI Training

Ceramic AI crawler for web content indexing and model training, first seen June 2025

User Agent:

TerraCotta https://github.com/CeramicTeam/CeramicTerracotta

Bot name:

TerraCotta

Tags:

#ceramic #training #emerging #crawler

❌ Ignores robots.txt

📚 Docs →

Character-AI

Character.AI AI Assistant

Character.AI bot for training conversational AI characters

User Agent:

Mozilla/5.0 (compatible; Character-AI/1.0; +https://character.ai/)

Bot name:

Character-AI

Tags:

#character-ai #conversational #characters #training

✅ Respectful

📚 Docs →

Devin

Cognition AI AI Assistant

Devin AI code assistant bot to analyze and understand online code

User Agent:

Mozilla/5.0 (compatible; Devin/1.0)

Bot name:

Devin

Tags:

#devin #code-assistant #programming #cognition-ai

✅ Respectful

Cohere-Ai

Cohere AI Training

Cohere bot for training their language models and NLP

User Agent:

Mozilla/5.0 (compatible; Cohere-AI/1.0; +https://cohere.com/)

Bot name:

Cohere-Ai

Tags:

#cohere #nlp #training #enterprise

✅ Respectful

📚 Docs →

Cohere-Command

Cohere AI Assistant

Cohere Command model bot for real-time information retrieval

User Agent:

Mozilla/5.0 (compatible; Cohere-Command/1.0; +https://cohere.com/)

Bot name:

Cohere-Command

Tags:

#cohere #command #assistant #enterprise

✅ Respectful

📚 Docs →

CCBot

Common Crawl AI Training

Common Crawl bot, widely used for training open source AI models

User Agent:

CCBot/2.0 (https://commoncrawl.org/faq/)

Bot name:

CCBot

Tags:

#common-crawl #open-data #training #dataset

✅ Respectful

📚 Docs →

Crawlspace

Crawlspace AI Training

Crawling service specialized for AI and data extraction

User Agent:

Mozilla/5.0 (compatible; Crawlspace/1.0)

Bot name:

Crawlspace

Tags:

#crawling-service #data-extraction #ai #training

✅ Respectful

DeepseekBot

DeepSeek AI Training

DeepSeek AI bot for training their advanced reasoning models and data collection

User Agent:

Mozilla/5.0 (compatible; DeepseekBot/1.0; +https://www.deepseek.com/bot)

Bot name:

DeepseekBot

Tags:

#deepseek #reasoning #training #chinese

✅ Respectful

📚 Docs →

Diffbot

Diffbot AI Training

Diffbot bot for structured data extraction and creating knowledge graphs for AI

User Agent:

Mozilla/5.0 (compatible; Diffbot/0.1; +http://www.diffbot.com/our-apis/crawler/)

Bot name:

Diffbot

Tags:

#diffbot #knowledge-graph #extraction #structured-data

✅ Respectful

📚 Docs →

DuckAssistBot

DuckDuckGo AI Assistant

DuckDuckGo bot for their privacy-respecting AI assistant

User Agent:

Mozilla/5.0 (compatible; DuckAssistBot/1.0; +https://duckduckgo.com/duckassist)

Bot name:

DuckAssistBot

Tags:

#duckduckgo #privacy #assistant #search

✅ Respectful

📚 Docs →

FirecrawlAgent

Firecrawl AI Training

New scraping service specialized for AI and LLMs

User Agent:

Mozilla/5.0 (compatible; FirecrawlAgent/1.0)

Bot name:

FirecrawlAgent

Tags:

#firecrawl #scraping #llm #training

✅ Respectful

Bard-Ai

Google AI Assistant

Google Bard AI assistant bot for web content retrieval

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Bard-AI/1.0; +https://developers.google.com/search/docs/crawling-indexing/google-common-crawlers)

Bot name:

Bard-Ai

Tags:

#google #bard #assistant #search

✅ Respectful

📚 Docs →

Gemini-Ai

Google AI Assistant

Google Gemini AI model bot for training and web content analysis

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Gemini-AI/1.0; +https://developers.google.com/search/docs/crawling-indexing/google-common-crawlers)

Bot name:

Gemini-Ai

Tags:

#google #gemini #training #analysis

✅ Respectful

📚 Docs →

Gemini-Deep-Research

Google AI Assistant

Bot for Gemini Deep Research in-depth searches

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Gemini-Deep-Research/1.0)

Bot name:

Gemini-Deep-Research

Tags:

#google #gemini #deep-research #assistant

✅ Respectful

Google-CloudVertexBot

Google AI Training

Google crawler for Vertex AI Agents, crawls content at request of site owners building AI agents

User Agent:

Mozilla/5.0 (compatible; Google-CloudVertexBot/1.0; +https://cloud.google.com/vertex-ai)

Bot name:

Google-CloudVertexBot

Tags:

#google #vertex-ai #training #agents

✅ Respectful

📚 Docs →

Google-Extended

Google AI Training

Token to control access to content for Gemini/Bard and Vertex AI

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Google-Extended/1.0; +https://developers.google.com/search/docs/crawling-indexing/google-common-crawlers)

Bot name:

Google-Extended

Tags:

#google #gemini #bard #vertex-ai

✅ Respectful

📚 Docs →

Google-NotebookLM

Google AI Assistant

Google NotebookLM bot that fetches individual URLs provided by users as sources for their research projects

User Agent:

Mozilla/5.0 (compatible; Google-NotebookLM/1.0; +https://notebooklm.google.com/)

Bot name:

Google-NotebookLM

Tags:

#google #notebooklm #user-triggered #research #assistant

❌ Ignores robots.txt

📚 Docs →

GoogleAgent-Mariner

Google AI Assistant

Google Project Mariner agentic browser for AI Ultra subscribers ($249.99/month). Operates on cloud-based virtual machines as a remote browser environment rather than traditional crawler.

User Agent:

GoogleAgent-Mariner

Bot name:

GoogleAgent-Mariner

Tags:

#google #mariner #agentic-browser #premium #cloud-vm

✅ Respectful

📚 Docs →

Groq-Bot

Groq AI Training

Groq inference engine bot for high-speed AI model data collection

User Agent:

Mozilla/5.0 (compatible; Groq-Bot/1.0; +https://groq.com/)

Bot name:

Groq-Bot

Tags:

#groq #inference #high-speed #training

✅ Respectful

📚 Docs →

HuggingFace-Bot

Hugging Face AI Training

Hugging Face bot for training open-source AI models and datasets

User Agent:

Mozilla/5.0 (compatible; HuggingFace-Bot/1.0; +https://huggingface.co/)

Bot name:

HuggingFace-Bot

Tags:

#huggingface #open-source #training #datasets

✅ Respectful

📚 Docs →

IbouBot

Ibou.io AI Search

Ethical search engine crawler that drives traffic to original sources. Uses GenAI for query processing but does NOT train AI models. Respects creators and publisher rights

User Agent:

Mozilla/5.0 (compatible; IbouBot/1.0; [email protected]; +https://ibou.io/iboubot.html)

Bot name:

IbouBot

IP Ranges:

217.113.196.0/24

Tags:

#ibou #french #ethical-search #traffic-driver #creator-friendly

✅ Respectful

📚 Docs →

FacebookBot

Meta AI Training

Traditional Facebook bot extended for AI and machine learning

User Agent:

facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)

Bot name:

FacebookBot

Tags:

#meta #facebook #social #ai

✅ Respectful

📚 Docs →

Meta-ExternalAgent

Meta AI Training

Meta bot for training their AI models (Llama, etc.)

User Agent:

Meta-ExternalAgent/1.0 (+https://developers.facebook.com/docs/sharing/bot)

Bot name:

Meta-ExternalAgent

Tags:

#meta #facebook #llama #training

✅ Respectful

📚 Docs →

meta-webindexer

Meta AI Search

Meta web indexer bot for building independent search capabilities for Meta AI chatbot

User Agent:

meta-webindexer/1.1

Bot name:

meta-webindexer

Tags:

#meta #search #indexing #ai-search

✅ Respectful

📚 Docs →

BingBot

Microsoft AI Search

Microsoft Bing crawler used for Bing Search and Copilot AI features

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm) Chrome/W.X.Y.Z Safari/537.36 Edg/W.X.Y.Z

Bot name:

bingbot

Tags:

#microsoft #bing #copilot #search

✅ Respectful

📚 Docs →

MistralAI-User

Mistral AI AI Assistant

Mistral AI bot to retrieve citations in Le Chat

User Agent:

MistralAI-User/1.0

Bot name:

MistralAI-User

Tags:

#mistral #le-chat #french #citations

✅ Respectful

ChatGPT Atlas

OpenAI AI Assistant

⚠️ STEALTH

OpenAI's agentic browser with integrated AI. Uses standard Chrome user-agent, making it completely indistinguishable from regular browser traffic. Cannot be blocked via robots.txt. Features "agent mode" for autonomous task completion.

User Agent:

Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/141.0.0.0 Safari/537.36

Bot name:

ChatGPT-Atlas

Tags:

#openai #chatgpt #atlas #agentic-browser #stealth #undetectable

❌ Ignores robots.txt

📚 Docs →

ChatGPT-Browser

OpenAI AI Assistant

ChatGPT web browsing bot for real-time web access during conversations

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ChatGPT-Browser/1.0; +https://openai.com/bot)

Bot name:

ChatGPT-Browser

Tags:

#openai #chatgpt #browsing #realtime

✅ Respectful

📚 Docs →

ChatGPT-User

OpenAI AI Assistant

Bot used for real-time searches when a user asks a question to ChatGPT

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ChatGPT-User/1.0; +https://openai.com/bot)

Bot name:

ChatGPT-User

Tags:

#chatgpt #realtime #search #user-triggered

✅ Respectful

📚 Docs →

ChatGPT-User v2.0

OpenAI AI Assistant

Updated version of ChatGPT-User bot for real-time searches (since February 2025)

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ChatGPT-User/2.0; +https://openai.com/bot)

Bot name:

ChatGPT-User-v2

Tags:

#chatgpt #realtime #search #user-triggered #v2

✅ Respectful

📚 Docs →

GPTBot

OpenAI AI Training

Bot used by OpenAI to collect training data for ChatGPT and future GPT models

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; GPTBot/1.0; +https://openai.com/gptbot)

Bot name:

GPTBot

Tags:

#chatgpt #training #openai #gpt

✅ Respectful

📚 Docs →

OAI-SearchBot

OpenAI AI Search

Specific indexing bot for ChatGPT Search, competitor to Google Search

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; OAI-SearchBot/1.0; +https://openai.com/searchbot)

Bot name:

OAI-SearchBot

Tags:

#openai #search #indexation #chatgpt-search

✅ Respectful

📚 Docs →

Perplexity Stealth

Perplexity AI AI Assistant

⚠️ STEALTH

Perplexity uses headless browsers with Chrome user agents to bypass blocking

User Agent:

Mozilla/5.0 (Windows NT 10.0) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/111.0.0.0 Safari/537.36

Bot name:

Perplexity-Stealth

Tags:

#perplexity #stealth #headless #chrome

❌ Ignores robots.txt

⚠️ Stealth Analysis →

Perplexity-User

Perplexity AI AI Assistant

Bot triggered when a user clicks on a link in a Perplexity response

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; Perplexity-User/1.0; +https://perplexity.ai/bot)

Bot name:

Perplexity-User

Tags:

#perplexity #user-triggered #realtime

❌ Ignores robots.txt

⚠️ Stealth Analysis → 📚 Docs →

PerplexityBot

Perplexity AI AI Search

Perplexity indexing bot to feed their AI search engine

User Agent:

Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; PerplexityBot/1.0; +https://perplexity.ai/bot)

Bot name:

PerplexityBot

Tags:

#perplexity #search #answer-engine #indexation

❌ Ignores robots.txt

⚠️ Stealth Analysis → 📚 Docs →

Replicate-Bot

Replicate AI Training

Replicate platform bot for AI model training and data collection

User Agent:

Mozilla/5.0 (compatible; Replicate-Bot/1.0; +https://replicate.com/)

Bot name:

Replicate-Bot

Tags:

#replicate #platform #training #models

✅ Respectful

📚 Docs →

RunPod-Bot

RunPod AI Training

RunPod cloud platform bot for GPU-based AI training data collection

User Agent:

Mozilla/5.0 (compatible; RunPod-Bot/1.0; +https://runpod.io/)

Bot name:

RunPod-Bot

Tags:

#runpod #gpu #cloud #training

✅ Respectful

📚 Docs →

ImagesiftBot

The Hive AI Training

Bot for reverse image search and training image generation models

User Agent:

Mozilla/5.0 (compatible; ImagesiftBot/1.0)

Bot name:

ImagesiftBot

Tags:

#image-search #reverse-search #image-generation #training

✅ Respectful

TimpiBot

Timpi AI Training

Timpi bot for training their Large Language Models

User Agent:

Mozilla/5.0 (compatible; TimpiBot/1.0)

Bot name:

TimpiBot

Tags:

#timpi #llm #training #search

✅ Respectful

Together-Bot

Together AI AI Training

Together AI platform bot for decentralized AI model training

User Agent:

Mozilla/5.0 (compatible; Together-Bot/1.0; +https://together.ai/)

Bot name:

Together-Bot

Tags:

#together-ai #decentralized #training #platform

✅ Respectful

📚 Docs →

Kangaroo Bot

Unknown (China) AI Training

Chinese AI bot, origin and exact usage unknown

User Agent:

Mozilla/5.0 (compatible; Kangaroo Bot/1.0)

Bot name:

Kangaroo Bot

Tags:

#chinese #unknown #training #suspicious

❌ Ignores robots.txt

PanguBot

Unknown (China) AI Training

Another Chinese AI bot, possibly linked to Pangu models

User Agent:

Mozilla/5.0 (compatible; PanguBot/1.0)

Bot name:

PanguBot

Tags:

#chinese #pangu #training #unknown

❌ Ignores robots.txt

Cotoyogi

Unknown (Japan) AI Training

Japanese AI bot, specific usage unknown

User Agent:

Mozilla/5.0 (compatible; Cotoyogi/1.0)

Bot name:

Cotoyogi

Tags:

#japanese #unknown #training #asia

✅ Respectful

AkiraBot

Unknown (Malicious) AI Training

⚠️ STEALTH

Malicious spam bot using OpenAI LLMs to generate custom spam messages for contact forms. Uses generic Chrome user-agent strings and residential proxies. Primarily targets customer support chats via Selenium automation.

User Agent:

Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36

Bot name:

AkiraBot

Tags:

#spam #malicious #llm-powered #selenium #contact-form-spam

❌ Ignores robots.txt

📚 Docs →

Webzio-Extended

Webz.io AI Training

Webz.io bot that collects data to sell to AI companies for training

User Agent:

Mozilla/5.0 (compatible; Webzio-Extended/1.0)

Bot name:

Webzio-Extended

Tags:

#webzio #data-broker #training #commercial

✅ Respectful

xAI-Bot

xAI AI Training

Elon Musk's xAI bot for training Grok and other AI models

User Agent:

Mozilla/5.0 (compatible; xAI-Bot/1.0; +https://x.ai/)

Bot name:

xAI-Bot

Tags:

#xai #grok #elon-musk #training

✅ Respectful

📚 Docs →

YouBot

You.com AI Search

You.com AI search engine bot for indexing and answering questions

User Agent:

Mozilla/5.0 (compatible; YouBot/1.0; +https://you.com/bot)

Bot name:

YouBot

Tags:

#you-com #search #answer-engine #ai

✅ Respectful

📚 Docs →

Love this free tool?

Share it with your network and help others control AI bot access!

❓ Frequently Asked Questions

Everything you need to know about AI crawlers and robots.txt

What is an AI crawler ?

An LLM bot that requests your pages for model training or instant answers. You tell it what to do with "User-agent:" lines in your robots.txt file.

Is User-agent: * enough?

No. A wildcard line should be a catch‑all. You can still list AI crawlers you want to block. Some of them ignore the * directive, and only respect their specific user agent.

What do you mean by "top user agents"?

The bots in this guide account for 95% of AI crawler traffic. These are the most commonly seen AI bots in server logs.

Are bots required to follow directives in robots.txt files?

No. Anthropic was criticized in 2024 for ignoring robots.txt directives, and Perplexity has bypassed these rules.

How do I know which AI bots are visiting my website?

Check your server logs, and analyze the user agents. Most web analytics tools and server log analyzers can show you bot traffic patterns.

Should I block or allow AI crawlers?

It depends on your content strategy:

Allow if you want your content to appear in AI search results and get referral traffic
Block if you're concerned about content being used for training, with no compensation
Selective approach: Allow assistant bots (ChatGPT-User, ClaudeBot) but block training bots (GPTBot, CCBot)

What's the difference between training bots and assistant bots?

Training bots (like GPTBot, CCBot) crawl websites to collect data, to train AI models. Assistant bots (like ChatGPT-User, ClaudeBot) fetch content in real-time when users ask questions, potentially driving referral traffic to your site.

How often should I update my robots.txt for AI bots?

Review your server logs monthly, to verify if new AI bots are crawling your website. Also bookmark this page. We frequently update our list it, as new AI crawlers are discovered.

📊 Monitor your website's health

Now that your site is optimized for AI, keep track of performance, affiliate links, status codes and more!

Try PageRadar for free Discover all features

AI User Agents List (updated Dec 2025)

🔍 Browse AI Bots

CATEGORIES Filter by Type

BEHAVIOR Filter by Bot Behavior

AI2Bot

Amazonbot

Andibot

anthropic-ai

Anthropic-Claude

Claude-Web

ClaudeBot

Applebot-Extended

bigsur.ai

Brightbot

Bytespider

TerraCotta

Character-AI

Devin

Cohere-Ai

Cohere-Command

CCBot

Crawlspace

DeepseekBot

Diffbot

DuckAssistBot

FirecrawlAgent

Bard-Ai

Gemini-Ai

Gemini-Deep-Research

Google-CloudVertexBot

Google-Extended

Google-NotebookLM

GoogleAgent-Mariner

Groq-Bot

HuggingFace-Bot

IbouBot

FacebookBot

Meta-ExternalAgent

meta-webindexer

BingBot

MistralAI-User

ChatGPT Atlas

ChatGPT-Browser

ChatGPT-User

ChatGPT-User v2.0

GPTBot

OAI-SearchBot

Perplexity Stealth

Perplexity-User

PerplexityBot

Replicate-Bot

RunPod-Bot

ImagesiftBot

TimpiBot

Together-Bot

Kangaroo Bot

PanguBot

Cotoyogi

AkiraBot

Webzio-Extended

xAI-Bot

YouBot

No AI bots found

Love this free tool?

❓ Frequently Asked Questions

What is an AI crawler ?

Is User-agent: * enough?

What do you mean by "top user agents"?

Are bots required to follow directives in robots.txt files?

How do I know which AI bots are visiting my website?

Should I block or allow AI crawlers?

What's the difference between training bots and assistant bots?

How often should I update my robots.txt for AI bots?

📊 Monitor your website's health