This skill handles everything related to the llms.txt standard -- an emerging convention (proposed by Jeremy Howard in September 2024, gaining adoption through 2025-2026) that allows websites to provide structured guidance to AI systems about their content, structure, and key information. It is analogous to robots.txt (which tells crawlers what NOT to access) but instead tells AI systems what IS most useful to understand about the site.
AI language models face a fundamental challenge when processing websites: they must determine which pages are most important, what the site is about, and how content is organized -- typically by crawling many pages and inferring structure. llms.txt solves this by providing an explicit, machine-readable (and human-readable) summary.
Benefits of having a well-crafted llms.txt:
The file MUST be located at the root of the domain:
https://example.com/llms.txt
The file uses Markdown formatting with specific conventions:
# [Site Name]
> [One-sentence description of what the site/business does. Keep under 200 characters.]
## Docs
- [Page Title](https://example.com/page-url): Concise description of what this page covers and why it matters.
- [Another Page](https://example.com/another-page): Description of content.
## Optional
- [Less Critical Page](https://example.com/optional-page): Description.
1. Title (Required)
# Site Name
#).2. Description (Required)
> Brief description of the site/business
>).3. Main Sections (Required -- at least one)
Use H2 headings (##) to organize pages by category. Common section names:
| Section Name | Purpose | Example Content |
|---|---|---|
## Docs |
Primary documentation or key pages | Product pages, service descriptions, core content |
## Optional |
Secondary pages worth knowing about | Blog posts, supplementary resources |
## API |
API documentation | API reference, authentication guides |
## Blog |
Blog or news content | Recent/popular articles |
## Products |
Product catalog | Product pages, pricing |
## Services |
Service offerings | Service descriptions, process pages |
## About |
Company information | About page, team, mission |
## Resources |
Educational/reference content | Guides, tutorials, whitepapers |
## Legal |
Legal documents | Terms of service, privacy policy |
## Contact |
Contact information | Contact page, support channels |
4. Page Entries (Required)
Each entry follows the format:
- [Page Title](URL): Description of page content
Rules for page entries:
5. Key Facts Section (Recommended)
## Key Facts
- Founded in [year] by [founder(s)]
- Headquarters: [City, Country]
- [X] customers/users in [Y] countries
- Key products: [Product A], [Product B], [Product C]
- Industry: [Industry classification]
This section provides quick reference data that AI systems frequently need to answer user queries about your business.
6. Contact Section (Recommended)
## Contact
- Website: https://example.com
- Email: hello@example.com
- Support: support@example.com
- Phone: +1-555-123-4567
- Address: 123 Main St, City, State, ZIP, Country
In addition to llms.txt, sites can provide /llms-full.txt -- an extended version with more detail.
Differences from llms.txt:
| Feature | llms.txt | llms-full.txt |
|---|---|---|
| Length | Concise (50-150 lines) | Comprehensive (150-500+ lines) |
| Page entries | 10-30 key pages | 30-100+ pages |
| Descriptions | 10-30 words per entry | 30-100 words per entry, may include key facts from each page |
| Audience | Quick AI comprehension | Deep AI analysis |
| Sections | 3-6 sections | 8-15 sections |
| Key facts | Business-level facts | Page-level facts and data points |
Both files can coexist. AI systems check for llms.txt first, then may optionally load llms-full.txt for deeper understanding.
When checking an existing llms.txt file:
[domain]/llms.txt.[domain]/llms-full.txt.Check each structural element:
| Element | Check | Severity if Missing |
|---|---|---|
| H1 Title | Present, matches business name | Critical |
| Blockquote description | Present, under 200 chars, factual | High |
| At least one H2 section | Present | Critical |
| Page entries with URLs | At least 5 entries present | High |
| URLs are absolute | All URLs use full https:// paths | High |
| URLs are valid | All URLs return 200 status | Medium |
| Descriptions present | Every entry has a description after the colon | Medium |
| Key Facts section | Present with business information | Medium |
| Contact section | Present with at least email | Low |
| Reasonable length | 30-200 lines | Low |
| No broken Markdown | Proper formatting throughout | Medium |
Rate the llms.txt on these dimensions:
Completeness (0-100):
Accuracy (0-100):
Usefulness (0-100):
Overall llms.txt Score = (Completeness * 0.40) + (Accuracy * 0.35) + (Usefulness * 0.25)
When creating a new llms.txt file from scratch:
<title>, <meta property="og:site_name">, or H1)/sitemap.xml to discover all public pages.Categorize all discovered pages and select the most important ones:
Always Include:
Include if High Quality:
Skip:
For each selected page:
Good description examples:
Explains the three pricing tiers (Free, Pro, Enterprise) with feature comparison and annual/monthly costs.
Details the company's founding in 2018, team of 45 employees, and office locations in Austin and London.
Covers integration setup for Slack, Salesforce, and HubSpot with step-by-step guides and API endpoints.
Bad description examples:
Our amazing pricing page! (marketing language, no specifics)Learn more about our company. (too vague)Click here for details. (not descriptive)Gather key business facts from the site:
Construct the llms.txt following this template:
# [Site Name]
> [One clear sentence: what the business does, who it serves, and its primary value proposition. Under 200 characters.]
## Docs
- [Most Important Page](https://example.com/page): Description covering the key content on this page.
- [Second Page](https://example.com/page-2): Description of this page's content and value.
- [Third Page](https://example.com/page-3): What users and AI systems will find here.
## Products
- [Product A](https://example.com/product-a): Core features, target users, and pricing model for Product A.
- [Product B](https://example.com/product-b): What Product B does and how it differs from Product A.
## Resources
- [Guide Title](https://example.com/guide): Comprehensive guide covering [topic] with [X] sections and practical examples.
- [Blog Post](https://example.com/blog/post): Analysis of [topic] with original data from [source].
## Key Facts
- Founded in [year] by [name(s)]
- Headquartered in [City, Country]
- [Specific metric: e.g., "Serves 10,000+ businesses in 40 countries"]
- [Key differentiator: e.g., "Only platform offering real-time X and Y integration"]
- Industry: [Classification]
## Contact
- Website: https://example.com
- Email: [primary contact email]
- Support: [support URL or email]
Before outputting:
Generate GEO-LLMSTXT-ANALYSIS.md:
# llms.txt Analysis: [Domain]
**Analysis Date:** [Date]
**llms.txt Status:** [Found at URL / Not Found / Error]
**llms-full.txt Status:** [Found / Not Found]
---
## Overall llms.txt Score: [X]/100
| Dimension | Score |
|---|---|
| Completeness | [X]/100 |
| Accuracy | [X]/100 |
| Usefulness | [X]/100 |
---
## Format Validation
| Element | Status | Notes |
|---|---|---|
| H1 Title | [Pass/Fail] | [Notes] |
| Description blockquote | [Pass/Fail] | [Notes] |
| H2 Sections | [Pass/Fail] | [X sections found] |
| Page entries | [Pass/Fail] | [X entries found] |
| URL validity | [Pass/Fail] | [X broken URLs] |
| Entry descriptions | [Pass/Fail] | [X missing descriptions] |
| Key Facts | [Pass/Fail] | [Notes] |
| Contact section | [Pass/Fail] | [Notes] |
---
## Missing Pages
These important pages were found on the site but not in llms.txt:
1. [Page Title](URL) -- [Why it should be included]
2. [Page Title](URL) -- [Why it should be included]
## Improvement Recommendations
1. [Specific recommendation]
2. [Specific recommendation]
3. [Specific recommendation]
## Suggested Updated llms.txt
[Complete rewritten llms.txt file if significant improvements are needed]
Output the complete llms.txt file content, ready to be saved to the site's root directory. Also output a brief GEO-LLMSTXT-GENERATION.md report explaining:
https:// URL, never relative paths.https://yourdomain.com/llms.txt with no redirects.