
llms.txt: The Complete Guide to Writing One That AI Engines Actually Use

AEO Audit Team·March 6, 2026

Most technical SEO conversations center on schema markup, canonical tags, and Core Web Vitals. But there is a single file that carries more weight in your AEO score than all of your schema combined. That file is llms.txt, and it accounts for 23% of your total AEO score — the highest-weighted category in our audit framework.

If you have not created one yet, or if yours is malformed, you are leaving the largest single optimization lever untouched.


What Is llms.txt (And Why It Has More Weight Than Your Schema)

llms.txt is a plain-text, Markdown-formatted file hosted at the root of your domain at /llms.txt, optionally accompanied by a fuller companion file at /llms-full.txt. It tells AI agents and large language models what your site is for and where its key content lives. Think of it as a structured briefing document written specifically for machines that read and reason about content rather than crawl it for link signals.

The format was proposed by Answer.AI in 2024 and has been gaining rapid adoption among developer-focused and B2B SaaS sites. It is not yet an RFC or W3C standard, but it is already being referenced by AI agent frameworks, retrieval-augmented generation (RAG) pipelines, and emerging AI search engines as a discovery mechanism.

Here is the critical distinction: llms.txt is fundamentally different from robots.txt. robots.txt is a crawl directive — it tells bots which URLs they are allowed or not allowed to fetch. llms.txt is guidance — it tells AI agents what your site is for, who it serves, and which URLs contain the most authoritative content. One controls access; the other shapes understanding.
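To make the access-versus-understanding distinction concrete, here is a minimal Python sketch. The file contents, the ExampleBot agent name, and the helper names may_fetch and site_summary are all hypothetical; only urllib.robotparser is a real standard-library module. The robots.txt side answers a yes/no question about a URL, while the llms.txt side only yields descriptive text.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical file contents, for illustration only.
ROBOTS_TXT = """User-agent: *
Disallow: /private/
"""

LLMS_TXT = """# Example Co

> Example Co sells invoicing software to small accounting firms.

## Docs
- [Quickstart](https://example.com/docs/start): Setup guide
"""


def may_fetch(robots_txt: str, agent: str, url: str) -> bool:
    """robots.txt answers an access question: may this agent fetch this URL?"""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


def site_summary(llms_txt: str) -> str:
    """llms.txt carries description, not permissions: return the blockquote text."""
    for line in llms_txt.splitlines():
        if line.startswith(">"):
            return line[1:].strip()
    return ""


print(may_fetch(ROBOTS_TXT, "ExampleBot", "https://example.com/private/report"))
print(site_summary(LLMS_TXT))
```

Nothing in the llms.txt half of the sketch can allow or deny a request, which is the point: the two files answer different questions.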

AI engines like Perplexity, ChatGPT with browsing, and Claude Citations do not just crawl your site — they build a semantic model of it. llms.txt lets you influence that model directly. Without it, the AI has to infer your site's purpose from page content alone, which introduces noise and ambiguity. With a well-structured llms.txt, you are handing the AI a curated map of your content hierarchy and primary use cases.

That is why it outweighs schema in our scoring model. Schema annotates individual pages. llms.txt frames your entire site.


The Required Structure of a Valid llms.txt File

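The llms.txt proposal defines a specific Markdown structure, and deviations from it can cause AI agents to ignore the file or parse it incorrectly. Strictly speaking, the proposal itself requires only the H1; the other elements are optional there. Our audit checks for all four of the following, so treat them as required: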

H1 — Site Name. The first line of the file must be a single H1 heading containing your site or company name. This is the primary identifier AI agents use to associate the file with your brand.

Blockquote — Purpose Statement. Immediately following the H1, you must include a Markdown blockquote (a line beginning with >) that contains a single sentence describing what your company does and who it serves. This is the most important sentence in the file. It is the text most likely to be used verbatim by AI agents when summarizing your site.

H2 Sections — Content Areas. The body of the file is organized into H2 sections, each representing a major content area of your site. Common sections include About, Products, Documentation, Blog, Pricing, and Contact. Each section should be named semantically — avoid internal jargon that an AI agent would not recognize.

Markdown Links — Key URLs. Within each section, you list URLs as Markdown links followed by a brief plain-text description. The link text should be descriptive, and the description should explain the page's purpose in a single clause. These links serve as the AI agent's entry points into your content hierarchy.
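The four elements above map onto a small data structure. The following is a rough Python sketch of how a consumer might read them, not the parser any particular AI agent uses; the LlmsTxt class and parse_llms_txt function are hypothetical names, and the sketch assumes links are written as [text](url) with an optional ": description" suffix.

```python
import re
from dataclasses import dataclass, field

# [link text](url) optionally followed by ": description"
LINK_RE = re.compile(r"\[([^\]]+)\]\(([^)\s]+)\)(?::\s*(.*))?")


@dataclass
class LlmsTxt:
    name: str = ""       # text of the H1
    summary: str = ""    # text of the first blockquote line
    # H2 title -> list of (link text, url, description)
    sections: dict = field(default_factory=dict)


def parse_llms_txt(text: str) -> LlmsTxt:
    doc = LlmsTxt()
    current = None  # title of the H2 section we are inside, if any
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("# ") and not doc.name:
            doc.name = line[2:].strip()
        elif line.startswith(">") and not doc.summary:
            doc.summary = line[1:].strip()
        elif line.startswith("## "):
            current = line[3:].strip()
            doc.sections[current] = []
        elif current is not None:
            match = LINK_RE.search(line)
            if match:
                doc.sections[current].append(
                    (match.group(1), match.group(2), match.group(3) or "")
                )
    return doc
```

Calling parse_llms_txt on your own file and printing the result is a quick way to see which parts of it are actually machine-readable and which are being silently dropped.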


A Copy-Paste llms.txt Template

Host this file at https://yourdomain.com/llms.txt. Before publishing, replace yourdomain.com and every placeholder such as [Company Name] with your actual content. The square brackets around link text are Markdown link syntax and stay in place.

# [Company Name]

> [One-sentence description of what your company does and who it serves]

## About
- [About](https://yourdomain.com/about): Company background, mission, and team

## Products
- [Products](https://yourdomain.com/products): Overview of all products and services
- [[Product name]](https://yourdomain.com/products/[product-name]): [What this specific product does]

## Documentation
- [Documentation](https://yourdomain.com/docs): Technical documentation and integration guides
- [Getting started](https://yourdomain.com/docs/getting-started): Quickstart guide for new users

## Blog
- [Blog](https://yourdomain.com/blog): Latest articles, guides, and industry insights

## Pricing
- [Pricing](https://yourdomain.com/pricing): Plans, pricing tiers, and feature comparison

## Contact
- [Contact](https://yourdomain.com/contact): Sales inquiries, support, and partnership requests

A few notes on this template. Keep the blockquote to a single sentence — do not write a paragraph. AI agents treat the blockquote as a summary snippet, and longer text dilutes its signal value. Keep URLs canonical and avoid query strings or UTM parameters in this file. The links should point to the most authoritative, stable version of each page.
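The canonical-URL advice is easy to check mechanically. Here is a minimal Python sketch that flags any URL in the file carrying a query string, which covers UTM parameters as well; find_noncanonical_urls is a hypothetical helper name, and the URL pattern is a simple approximation rather than a full URL grammar.

```python
import re
from urllib.parse import urlsplit

# Approximate URL matcher: stops at whitespace, ")" or "]".
URL_RE = re.compile(r"https?://[^\s)\]]+")


def find_noncanonical_urls(text: str) -> list[str]:
    """Return every URL in the file that carries a query string."""
    flagged = []
    for url in URL_RE.findall(text):
        if urlsplit(url).query:
            flagged.append(url)
    return flagged
```

An empty list means no tracking parameters or other query strings slipped into the file.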

If your site has substantial documentation, consider also creating /llms-full.txt with expanded content (see FAQ below).


The 7 Checks Our Tool Runs on Your llms.txt

When you run an AEO audit on your site, our tool performs seven discrete checks against your llms.txt file. Each check is binary — it either passes or fails — and the aggregate score rolls up into the 23% category weight.

  1. File exists at /llms.txt. The file must be publicly accessible at the root of your domain over HTTPS. Redirects are followed, but a 404 or 403 response fails this check immediately and zeros out the entire category.

  2. Valid H1 present. The first content element in the file must be an H1 heading. Files that begin with a comment, a blank line followed by an H2, or plain text without heading syntax fail this check.

  3. Blockquote present. The file must contain at least one Markdown blockquote. Our tool checks that the blockquote appears near the top of the file (within the first 500 characters) and contains a non-trivial string (more than 20 characters).

  4. Multiple H2 sections present. A valid llms.txt must contain at least two H2 sections. Single-section files suggest incomplete implementation and score poorly with AI agents that expect a content hierarchy.

  5. Contains valid Markdown links. Each H2 section must contain at least one Markdown-formatted link in the format [text](url) or - [url]: description. Files that list bare URLs without Markdown formatting are not parsed correctly by most AI agent frameworks.

  6. Links resolve without errors. Our tool performs a HEAD request against each URL listed in the file. Links that return 4xx or 5xx status codes fail this check. This is one of the most commonly failed checks — llms.txt files that were created once and never updated accumulate dead links as site structure changes.

  7. File size under 100KB. The llms.txt file should be a concise index, not a content dump. Files over 100KB are treated as malformed by several AI agent implementations. If you need to expose full page content to AI agents, use /llms-full.txt for that purpose and keep /llms.txt as a lean navigation document.
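If you want to pre-check a file before running an audit, the checks that do not need network access (2, 3, 4, 5, and 7) can be approximated locally. The Python sketch below is an illustration under stated assumptions, not our tool's actual implementation: structural_checks is a hypothetical name, "within the first 500 characters" is read as the blockquote starting before character 500, and 100KB is taken as 100 × 1024 bytes of UTF-8.

```python
import re

MD_LINK = re.compile(r"\[[^\]]+\]\([^)\s]+\)")                      # [text](url)
URL_LABEL = re.compile(r"^\s*-\s*\[https?://[^\]\s]+\]:", re.M)     # - [url]: description
H2_LINE = re.compile(r"^##[ \t]+.*$", re.M)
QUOTE_LINE = re.compile(r"^>\s?(.*)$", re.M)


def structural_checks(text: str) -> dict[str, bool]:
    """Approximate checks 2, 3, 4, 5, and 7 on the raw file text."""
    non_blank = [ln.strip() for ln in text.splitlines() if ln.strip()]
    first = non_blank[0] if non_blank else ""

    # Check 3: a blockquote starting in the first 500 characters
    # whose content is longer than 20 characters.
    quote_ok = any(
        m.start() < 500 and len(m.group(1).strip()) > 20
        for m in QUOTE_LINE.finditer(text)
    )

    # Text following each H2 heading; the preamble before the first H2 is dropped.
    sections = H2_LINE.split(text)[1:]

    # Check 5: every H2 section holds at least one link in either accepted format.
    links_ok = bool(sections) and all(
        bool(MD_LINK.search(s) or URL_LABEL.search(s)) for s in sections
    )

    return {
        "valid_h1": first.startswith("# "),
        "blockquote": quote_ok,
        "multiple_h2": len(sections) >= 2,
        "markdown_links": links_ok,
        "size_under_100kb": len(text.encode("utf-8")) < 100 * 1024,
    }
```

Checks 1 and 6 are left out here because they depend on live HTTP responses rather than on the file's contents.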


Common llms.txt Mistakes That Kill Your Score

Using HTML instead of Markdown. Some teams auto-generate llms.txt from their sitemap pipeline and output HTML tags. AI agents that parse llms.txt expect Markdown. An H1 in HTML (<h1>) is not equivalent to a Markdown H1 (#).

Copying robots.txt logic into llms.txt. The files serve different purposes. Do not include Disallow directives, User-agent blocks, or Crawl-delay settings. These are meaningless in the llms.txt context and signal that the file was created by someone who conflated the two formats.
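Both of the mistakes above can be caught with a simple lint pass before the file ships. The following Python sketch is one possible approach, with lint_llms_txt as a hypothetical name; the HTML tag list is a small illustrative subset, and the directive list covers only the three robots.txt directives named above.

```python
import re

# A small, illustrative subset of HTML tags that generators tend to emit.
HTML_TAG = re.compile(
    r"</?(?:h[1-6]|p|a|ul|ol|li|div|span|blockquote)\b[^>]*>", re.I
)
# robots.txt directives at the start of a line.
ROBOTS_DIRECTIVE = re.compile(
    r"^\s*(?:User-agent|Disallow|Crawl-delay)\s*:", re.I | re.M
)


def lint_llms_txt(text: str) -> list[str]:
    """Return human-readable problems found in the file text."""
    problems = []
    if HTML_TAG.search(text):
        problems.append("contains HTML tags; use Markdown syntax instead")
    if ROBOTS_DIRECTIVE.search(text):
        problems.append("contains robots.txt directives; these belong in robots.txt")
    return problems
```

Running this in the same pipeline step that generates the file catches a misconfigured generator before the output is published.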

Writing a marketing headline in the blockquote. The purpose statement should be descriptive and factual, not a tagline. "We help companies grow faster" fails because it tells the AI nothing specific about what you actually do. "B2B SaaS platform that automates invoice reconciliation for mid-market finance teams" is precise and usable.

Listing only your homepage. The value of llms.txt comes from its content hierarchy. A file that lists only https://yourdomain.com in every section provides no navigation signal and scores near zero on our link quality check.

Forgetting to update it. llms.txt is a living document. If you rename a product, retire a docs section, or restructure your URL hierarchy, the file must be updated in tandem. Stale links are the single most common cause of audit failures in this category.
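Stale links are straightforward to catch on a schedule. Below is a minimal Python sketch of a dead-link check built on the standard library's urllib; head_status and find_dead_links are hypothetical names, the 10-second timeout is an arbitrary choice, and a status of 0 is this sketch's own convention for a network-level failure. Some servers reject HEAD requests, so treat a failure as a prompt to look, not as proof the page is gone.

```python
import re
import urllib.error
import urllib.request

# Approximate URL matcher: stops at whitespace, ")" or "]".
URL_RE = re.compile(r"https?://[^\s)\]]+")


def head_status(url: str, timeout: float = 10.0) -> int:
    """Send a HEAD request; return the HTTP status, or 0 on a network failure."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx and 5xx responses arrive as exceptions
    except (urllib.error.URLError, OSError):
        return 0


def find_dead_links(text: str, fetch=head_status) -> dict[str, int]:
    """Map each failing URL in the file to its status (0 or >= 400)."""
    dead = {}
    for url in dict.fromkeys(URL_RE.findall(text)):  # de-duplicate, keep order
        status = fetch(url)
        if status == 0 or status >= 400:
            dead[url] = status
    return dead
```

The fetch parameter exists so the check can be exercised with a stub instead of live requests. In a CI job or cron task, a non-empty result would be the signal to update the file.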


Frequently Asked Questions

Is llms.txt an official standard?

It was proposed by Answer.AI in 2024 and is not yet a formal RFC or W3C standard. However, it has seen significant adoption among developer tools, AI-native SaaS products, and technical documentation sites. Several major AI agent frameworks explicitly check for llms.txt during site discovery. Treating it as a best practice rather than a formal standard is accurate — but given its weight in AI engine behavior, "best practice" undersells its importance.

What is the difference between llms.txt and robots.txt?

robots.txt is an access control document. It tells crawlers which URLs they are permitted or prohibited from fetching. llms.txt is a semantic guidance document. It does not block or allow access. Instead, it explains your site's purpose, structure, and primary content areas to the AI agents that read it. A bot that respects robots.txt may still read your llms.txt to understand your site before deciding which content to retrieve.

Does llms.txt affect Google rankings?

Not directly. Google's core ranking algorithm does not use llms.txt as a ranking signal. However, it does affect how AI-powered surfaces that source from the web — including Google's AI Overviews, Perplexity, ChatGPT with browsing, and Claude Citations — select and attribute content. If your goal is to appear in AI-generated answers rather than traditional blue-link results, llms.txt is directly relevant to that objective.

How often should I update llms.txt?

Update it whenever your site structure changes in ways that affect the URLs listed in the file. Practically, this means reviewing it during any sprint that involves URL restructuring, product renaming, documentation reorganization, or the retirement of major content sections. Many teams add llms.txt review to their launch checklist for significant site changes. A quarterly review cadence is a reasonable floor if you do not have a more granular trigger.

Do I need llms-full.txt as well?

llms-full.txt serves a different function than llms.txt. While llms.txt is a concise index file, llms-full.txt contains the complete text content of your key pages, inlined into a single document that AI agents can retrieve in one request. It is most useful for smaller sites (under 50 pages) where the total content volume stays manageable, or for documentation-heavy sites where AI agents benefit from having full technical content available without following individual links. For most B2B sites, start with a well-structured llms.txt and add llms-full.txt only if you have evidence that AI agents are struggling to retrieve your full content.
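As a rough illustration of what "inlined into a single document" can look like, here is a Python sketch that assembles an llms-full.txt from page content you already have as Markdown. The layout it produces (an H2 per page followed by a "Source:" line) is an assumption made for this example, not a defined format, and build_llms_full is a hypothetical helper name.

```python
def build_llms_full(name: str, summary: str, pages: list[tuple[str, str, str]]) -> str:
    """Assemble one document from (title, url, markdown_body) tuples.

    The section layout here is an illustrative assumption, not a standard.
    """
    parts = [f"# {name}", "", f"> {summary}", ""]
    for title, url, body in pages:
        parts += [f"## {title}", "", f"Source: {url}", "", body.strip(), ""]
    return "\n".join(parts)


full_text = build_llms_full(
    "Example Co",
    "Example Co sells invoicing software to small accounting firms.",
    [("Quickstart", "https://example.com/docs/start", "Install the CLI, then run setup.")],
)
print(len(full_text.encode("utf-8")), "bytes")
```

Printing the byte count is a simple way to watch whether the total content volume is still manageable as pages are added.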


Your llms.txt is the highest-leverage file you can add to your site for AI engine optimization. At 23% of your total AEO score, getting it right — correct structure, valid links, a precise purpose statement — moves the needle more than any other single technical change.

Check your llms.txt score now: run a free audit at aeoauditool.com.
