llms.txt — Definition
llms.txt
TL;DR:
llms.txt is a proposed standard: a markdown file at your site root that gives AI systems a curated map of your most important pages. It’s advisory: it helps AI understand your site, but it doesn’t control access.
What it means
llms.txt is a file you place at https://yoursite.com/llms.txt containing a curated, markdown-formatted summary of your site — your key pages, what they cover, and links to clean versions of the content. The idea, proposed in 2024, is to give large language models a concise, high-signal map instead of forcing them to infer structure from messy HTML. It’s loosely analogous to a sitemap, but written for AI comprehension rather than search-engine crawling, and it lists what matters rather than every URL.
Why it matters
llms.txt is easy to confuse with an access-control tool — it is not one. It carries the same honor-system limitation as robots.txt: an AI system can read it, ignore it, or never fetch it, and adoption across the major AI engines is inconsistent. So treat it as a comprehension aid for cooperative AI, not a lock. It can help AI represent your site accurately; it cannot keep any bot out. For keeping bots out, you need robots.txt plus a firewall (see glossary/waf and seo/ai-crawler-access).
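To make the distinction concrete, here is a minimal sketch of where access rules actually live: robots.txt, not llms.txt. It uses Python's standard `urllib.robotparser` to evaluate a sample rule set. The rules, the `is_allowed` helper, the bot name `ExampleBot`, and the `example.com` URL are illustrative assumptions, not taken from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Illustrative URL; example.com stands in for your own domain.
gptbot_allowed = is_allowed(ROBOTS_TXT, "GPTBot", "https://example.com/llms.txt")
other_allowed = is_allowed(ROBOTS_TXT, "ExampleBot", "https://example.com/llms.txt")
print(gptbot_allowed, other_allowed)
```

Even here, the check only describes what a cooperative crawler would do. A bot that never consults robots.txt is unaffected, which is why a firewall is the enforcement layer.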
How it works / examples
A minimal llms.txt is a markdown file like:
    # Primores
    > Practical AI for marketing, SEO, and automation.

    ## Key pages
    - [AI crawler access control](https://primores.org/wiki/seo/ai-crawler-access): which bots to allow or block
    - [AI visibility](https://primores.org/wiki/seo/ai-visibility): getting found in AI answers

Some platforms (and SEO plugins) auto-generate one. A companion llms-full.txt can hold the full expanded content.
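To show how a cooperative tool might read such a file, here is a rough sketch of a parser for the layout used in the example: one `#` title, one `>` summary line, and `##` sections containing `- [title](url): note` items. The `parse_llms_txt` function and its output shape are hypothetical helpers for illustration, not part of the llms.txt proposal or any library, and the sketch ignores anything outside that simple layout.

```python
import re

# Matches a markdown list item of the form "- [title](url): optional note".
LINK_RE = re.compile(
    r"^-\s*\[(?P<title>[^\]]+)\]\((?P<url>[^)\s]+)\)(?::\s*(?P<note>.*))?$"
)


def parse_llms_txt(text):
    """Parse llms.txt-style markdown into a plain dict (illustrative only)."""
    doc = {"title": None, "summary": None, "sections": {}}
    current = None  # name of the "##" section being read, if any
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("## "):
            current = line[3:].strip()
            doc["sections"][current] = []
        elif line.startswith("# ") and doc["title"] is None:
            doc["title"] = line[2:].strip()
        elif line.startswith(">") and doc["summary"] is None:
            doc["summary"] = line[1:].strip()
        elif current is not None:
            match = LINK_RE.match(line)
            if match:
                doc["sections"][current].append(match.groupdict())
    return doc


EXAMPLE = """\
# Primores
> Practical AI for marketing, SEO, and automation.

## Key pages
- [AI crawler access control](https://primores.org/wiki/seo/ai-crawler-access): which bots to allow or block
- [AI visibility](https://primores.org/wiki/seo/ai-visibility): getting found in AI answers
"""

parsed = parse_llms_txt(EXAMPLE)
for link in parsed["sections"]["Key pages"]:
    print(link["title"], "->", link["url"])
```

Keeping each link's note alongside its URL reflects the point of the format: the file is a curated map with context, so the descriptions matter as much as the links.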
Related
- glossary/ai-crawler — what llms.txt is (and isn’t) for
- seo/ai-crawler-access — where llms.txt sits among robots.txt and the firewall
- seo/ai-visibility — the broader getting-cited-by-AI picture