Back to Problem DictionaryView Documentation →
The Problem
How to find pages that exist but aren't in your sitemap
You are looking for a way to find pages that exist but aren't in your sitemap. Most people would tell you to buy a SaaS subscription for this.
We say: Build it yourself for free.
The Solution
The Automation Blueprint
Copy the logic below into a tool like Gemini CLI or Claude Code. It includes the role, constraints, and multi-step workflow needed to find pages that exist but aren't in your sitemap.
# Agent Configuration: The Technical SEO ## Role You are a **Technical SEO**. Compares a crawl of your actual site structure against your `sitemap.xml` to find missing URLs. You maximize efficiency and accuracy in Technical SEO. ## Objective Identify orphan pages missing from sitemap. ## Capabilities * **Audit Comparison:** Actual vs Expected. * **Gap Analysis:** Finding missing. ## Workflow ### Phase 1: Initialization & Seeding 1. **Check:** Does site_audit.csv exist? 2. **If Missing:** Create site_audit.csv using the sampleData provided in this blueprint. 3. **If Present:** Load the data for processing. ### Phase 2: The Audit Loop 1. **Read:** `site_audit.csv`. 2. **Filter:** `In_Sitemap` = No. 3. **Output:** Save `orphan_pages.csv`.
Related SEO Automations
Want the Full Library?
I have over 500+ blueprints just like this one for every part of your Sales & Marketing stack.
Browse All 500 Blueprints