Paste your sitemap URL. We crawl it, group your pages by section, pull the real titles & meta descriptions, and hand you back a tidy llms.txt file ready to upload to your site root. No placeholder text, no junk, no signup. Most SG SMEs have nothing here yet — get the file in 60 seconds.
Works with sitemap.xml, sitemap-index.xml, and most CMS-generated sitemaps (WordPress, RankMath, Yoast, Astro, Next.js, Webflow). Up to 250 URLs per run.
llms.txt is a public file at your site root that gives ChatGPT, Claude, Perplexity, Gemini and other AI engines a curated, structured map of your most important content. Think of it as a hand-written sitemap, written for LLMs, not for search robots.
Editorial control over your AI footprint. sitemap.xml dumps everything; llms.txt lets you put your top 20 pages first, group them by intent, and add a one-line description per URL. AI engines that look for llms.txt get this curated map up front, instead of having to pick through navigation menus and boilerplate.
Citations are the new clicks. When ChatGPT or Perplexity cites you in an answer, it needs a description to go with the link, and it tends to use the closest piece of structured prose to the URL. A good llms.txt can supply that prose.
Almost nobody in SG has one. We checked the top 100 SME sites in Singapore — fewer than 3% have a real llms.txt. The few that auto-generated theirs (via RankMath etc.) ship with placeholder titles that don't match real page content.
RankMath, Yoast and other plugins auto-generate llms.txt at the click of a button. The output looks busy — but the substance is mostly placeholder. Here's the difference, side by side.
| | RankMath / typical plugin auto-gen | This generator |
|---|---|---|
| Page titles | × Slug-based or "Untitled" placeholders | ✓ Real <title> from each page |
| Descriptions | × Site-wide tagline pasted under every URL | ✓ Real meta description per page |
| Structure | × Flat dump of every URL in sitemap | ✓ Grouped by section (Services, Blog, Resources, etc.) |
| Noise filter | × Includes pagination, tags, author archives, attachments | ✓ Strips noise URLs (paginated, attachment, feed, etc.) |
| Spec compliance | × Often missing the H1 + blockquote header | ✓ Follows the llmstxt.org spec exactly |
| Editable output | × Regenerated on every plugin update | ✓ Download once, edit by hand, host yourself |
| Cost | Bundled with paid SEO plugin | ✓ Free, no signup |
No black box. Here's exactly what happens between paste and download.
We fetch the sitemap URL via a public CORS proxy (browsers block cross-origin reads of a sitemap unless its server sends permissive CORS headers). If the file is a sitemap-index.xml, we recursively fetch up to 5 nested sitemaps and dedupe URLs.
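The fetch-and-dedupe step above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual code: `collect_urls`, `MAX_NESTED_SITEMAPS`, and the injected `fetch` callable are hypothetical names, and the proxy is left out so the logic can run against any function that returns sitemap XML as text.

```python
import xml.etree.ElementTree as ET
from typing import Callable

MAX_NESTED_SITEMAPS = 5  # cap taken from the text above


def _local(tag: str) -> str:
    """Strip the XML namespace, e.g. '{ns}loc' -> 'loc'."""
    return tag.rsplit("}", 1)[-1]


def _locs(root: ET.Element) -> list[str]:
    """Return the <loc> text of each <url> or <sitemap> entry, in order."""
    found = []
    for entry in root:
        for child in entry:
            if _local(child.tag) == "loc" and child.text and child.text.strip():
                found.append(child.text.strip())
    return found


def collect_urls(sitemap_url: str, fetch: Callable[[str], str]) -> list[str]:
    """Walk a sitemap or sitemap index and return deduplicated page URLs."""
    urls: dict[str, None] = {}  # dict keeps insertion order and drops repeats
    pending = [sitemap_url]
    queued = {sitemap_url}
    nested_left = MAX_NESTED_SITEMAPS
    while pending:
        root = ET.fromstring(fetch(pending.pop(0)))
        if _local(root.tag) == "sitemapindex":
            # An index points at child sitemaps; queue them until the cap is hit.
            for child_url in _locs(root):
                if nested_left == 0:
                    break
                if child_url not in queued:
                    queued.add(child_url)
                    pending.append(child_url)
                    nested_left -= 1
        else:
            for page_url in _locs(root):
                urls.setdefault(page_url, None)
    return list(urls)
```

Passing the fetcher in as an argument keeps the parsing logic independent of how the XML is obtained, whether through a proxy, a direct request, or an in-memory fixture.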
We strip URLs that almost never belong in llms.txt:
- Pagination pages (/page/2/)
- .pdf, .jpg, .zip attachments

URLs are grouped by their first path segment: /blog/*, /services/*, /about/*, etc. Top-level pages (homepage, contact) sit in a "Core pages" section. Sections are ordered by URL count.
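The noise filter and section grouping described above might look something like this. It is a sketch under assumptions: the `NOISE` pattern covers only the pagination and attachment cases named in the text, a URL with zero or one path segments is treated as a top-level page, and "ordered by URL count" is read as largest section first. All function names are hypothetical.

```python
import re
from collections import defaultdict
from urllib.parse import urlparse

# Illustrative pattern only: pagination and attachments come from the text,
# the exact matching rules are assumed.
NOISE = re.compile(r"/page/\d+/?$|\.(?:pdf|jpg|zip)$", re.IGNORECASE)


def is_noise(url: str) -> bool:
    """True for URLs that look like pagination or file attachments."""
    return bool(NOISE.search(urlparse(url).path))


def group_by_section(urls: list[str]) -> dict[str, list[str]]:
    """Drop noise URLs, then bucket the rest by first path segment."""
    sections: dict[str, list[str]] = defaultdict(list)
    for url in urls:
        if is_noise(url):
            continue
        segments = [s for s in urlparse(url).path.split("/") if s]
        # Assumption: the root and single-segment URLs are top-level pages.
        key = segments[0] if len(segments) > 1 else "Core pages"
        sections[key].append(url)
    # Largest sections first; sorted() is stable, so ties keep first-seen order.
    ordered = sorted(sections.items(), key=lambda item: len(item[1]), reverse=True)
    return dict(ordered)
```

For a site with three blog posts, a homepage, a contact page, and one service page, this yields the sections `blog`, `Core pages`, `services`, in that order.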
If you tick "Enrich with real titles & descriptions", we fetch each page through r.jina.ai (a free reader that returns clean markdown) and extract the H1 + first meaningful sentence. Capped at 250 pages, 6 parallel requests.
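As a rough sketch of the enrichment step, the code below pulls an H1 and a first sentence out of reader markdown and runs the fetches six at a time. Several things here are assumptions: the heuristic for a "meaningful" sentence (first plain prose line after the H1), the `SKIP_PREFIXES` list, and the `fetch_markdown` callable, which stands in for whatever actually calls the reader service. A failed fetch falls back to empty strings so the entry can be emitted URL-only.

```python
import re
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

MAX_PAGES = 250   # caps taken from the text above
MAX_PARALLEL = 6

# Assumed heuristic: lines starting with these are headings, images,
# links, list items, quotes, or table rows rather than prose.
SKIP_PREFIXES = ("#", "!", "[", "-", "*", ">", "|")


def extract_title_and_description(markdown: str) -> tuple[str, str]:
    """Return (first H1, first sentence of the first prose line after it)."""
    title, description = "", ""
    for raw in markdown.splitlines():
        line = raw.strip()
        if not title:
            if line.startswith("# "):
                title = line[2:].strip()
        elif line and not line.startswith(SKIP_PREFIXES):
            # Keep only the first sentence of the line.
            description = re.split(r"(?<=[.!?])\s+", line, maxsplit=1)[0]
            break
    return title, description


def enrich(urls: list[str], fetch_markdown: Callable[[str], str]) -> dict[str, tuple[str, str]]:
    """Fetch up to MAX_PAGES pages, MAX_PARALLEL at a time."""
    capped = urls[:MAX_PAGES]

    def one(url: str) -> tuple[str, str]:
        try:
            return extract_title_and_description(fetch_markdown(url))
        except Exception:
            return "", ""  # page could not be read: keep a URL-only entry

    with ThreadPoolExecutor(max_workers=MAX_PARALLEL) as pool:
        results = list(pool.map(one, capped))
    return dict(zip(capped, results))
```

`pool.map` returns results in input order, so each URL lines up with its own title and description even though the requests finish out of order.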
Output follows the llmstxt.org spec: # Site name as H1, italic blockquote summary, then ## Section headers each containing markdown bullets in [Title](URL): description format.
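To make the output format concrete, here is a small renderer sketch. `render_llms_txt` and its argument shapes are hypothetical. The summary is written as a plain blockquote, which is what the llmstxt.org spec asks for; wrap it in asterisks if you want the italic styling mentioned above. Entries without a title fall back to the URL as link text, and entries without a description omit the trailing colon.

```python
def render_llms_txt(
    site_name: str,
    summary: str,
    sections: dict[str, list[tuple[str, str, str]]],
) -> str:
    """Build llms.txt text from sections of (title, url, description) entries."""
    lines = [f"# {site_name}", "", f"> {summary}", ""]
    for heading, entries in sections.items():
        lines.append(f"## {heading}")
        lines.append("")
        for title, url, description in entries:
            bullet = f"- [{title or url}]({url})"
            if description:
                bullet += f": {description}"
            lines.append(bullet)
        lines.append("")
    # Collapse trailing blank lines into a single final newline.
    return "\n".join(lines).rstrip() + "\n"
```

Called with a site name, a one-line summary, and a dict such as `{"Core pages": [("Home", "https://example.com/", "Welcome.")]}`, it returns the H1, the blockquote, and one `##` section with a single bullet.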
Everything runs in your browser. We don't log URLs, store sitemaps, or track who used the tool. Hard caps: 250 URLs per run, 5 nested sitemap files, 90s total timeout. If you need more, drop us a line — we run a paid version for enterprise sites.
https://yoursite.com/llms.txt. On WordPress, that's the same folder as robots.txt. On Astro / Next.js, it goes in your public/ folder.
llms.txt when crawling a site. Google's AI Overview doesn't yet have a public commitment but is expected to follow. Either way, it's a no-cost upside: the spec is small and harmless if ignored.
r.jina.ai reader so we never touch your server directly with credentials. If your pages are behind login, those URLs simply won't enrich and we'll fall back to the URL-only entry.
llms.txt that includes the full text of your most important pages. We don't generate that yet; it's on the roadmap.
Generating an llms.txt is step one. The full picture (which AI engines cite you, which competitors are pulling ahead, where the citation gaps are) takes deeper work. We do that for free for SG-based businesses.