Scan your sitemap.xml
, convert pages & live sites to LLM-optimized Markdown, and export instantly. It generates llms-full.txt which includes all the links. It can be used as llms.txt
file as well. If you want to be precise, you can remove unrelated links from the generated txt
file.
-
Recursive Sitemap Scanning
- Parses your
sitemap.xml
and any nested sitemaps, following only validhttp(s)
URLs. - Filters out non-HTTP links for focused scanning.
- Parses your
-
Markdown Export (LLMsTxt Format)
- Converts HTML pages into clean ATX-style headings (
#
,##
, …), fenced code blocks, and absolute URLs. - Removes
<script>
,<style>
, and<button>
tags; preserves JSON-LD (application/ld+json
&application/json+ld
) as formatted code snippets. - Resolves relative links and images to full URLs for seamless static
llms.txt
content generation.
- Converts HTML pages into clean ATX-style headings (
-
Current Page Converter
- One-click “Convert Current Page” grabs the rendered DOM (supports SPA/React/Vue content).
- Prepends
<title>
as# Heading
and<meta name="description">
as> Blockquote
. - Ideal for ad-hoc page audits, AI training data extraction, and quick Markdown previews.
-
Embed & SEO Metadata Guidance
-
Built-in Embed tab with snippets:
<link rel="alternate" type="text/llmtxt" href="https://example.com/llms-full.txt" title="LLMsTxt version" /> <meta name="llmtxt" content="https://example.com/llms-full.txt" />
-
Publish
llms-full.txt
files alongside your pages for easy LLM ingestion and SEO signals.
-
-
Intuitive Modern UI
- Four tabs: Generator, Current Page, Embed, About.
- Real-time progress bar & auto-scrolling log.
⚠️ User warning prevents accidental closure during scanning.- Copy to Clipboard for instant Markdown transfer.
-
Privacy-First & Offline-Capable
- 100% local conversion—no external servers, no tracking.
- Uses Chrome MV3 Offscreen API (or MV2 tab scripting fallback) for accurate DOM parsing.
-
Auto-Detect your sitemap URL (
https://your-site.com/sitemap.xml
) on secure pages. -
Offscreen Rendering fetches pages in a hidden DOM, executing scripts for dynamic content.
-
Clean & Normalize HTML: strip unwanted nodes, normalize whitespace per text node.
-
Convert to Markdown with Turndown:
- Headings →
#
–######
- Code →
lang …
- Links →
[text](absolute-url)
- Images →

- JSON-LD →
application/ld+json …
- Headings →
-
Download or Copy your domain’s ZIP or current-page Markdown.
- SEO & Content Marketing: Ideal for content audits, static migrations, UTM tracking, and structured data extraction.
- AI & LLM Workflows: Prep training data, generate knowledge bases, accelerate AI-driven insights.
- Developer Productivity: Integrates with CI pipelines, GitHub Actions, and static site generators.
- Flexibility & Extensibility: Open-source under MIT—view source on GitHub and contribute!
Apache 2.0