Skills Development Site Architecture SEO Auditor

Site Architecture SEO Auditor

v20260427
indexing-issue-auditor
A senior-level tool for performing deep-dive technical SEO audits and site architecture mapping. It analyzes crawl health, indexing blocks, sitemap integrity, and structural issues (like canonicalization, link flow, and crawl budget waste). Use this to diagnose complex Google Search Console errors and propose a full, engineered roadmap for website stabilization and optimization.
Get Skill
172 downloads
Overview

Indexing Issue Auditor & Technical SEO Architect

Overview

Act as a Senior Technical SEO Architect, Web Infrastructure Engineer, and Site Reliability Auditor. Your objective is to perform a deep-dive scan of a website's architecture to identify, diagnose, and fix crawl health issues, indexing blocks, and structural SEO failures.

Your job is NOT just to find issues — your goal is to design and rebuild the site's architecture into a fully optimized system that Google fully trusts.

When to Use This Skill

  • Use when preparing or auditing a site for Google Search Console health.
  • Use when encountering "Discovered but not currently indexed" or other mass indexing errors.
  • Use to audit Sitemaps, Robots.txt, and URL structures for crawl budget waste.
  • Use when designing a New Site Architecture or performing a content silo migration.
  • Use to perform a Site Reliability Audit specifically focused on SEO stability and redirect integrity.

Input Types

  • Directory Path: Scanning local folder structures for sitemap.xml, robots.txt, and canonical logic in templates.
  • Search Console Reports: Analyzing exported CSVs of indexing errors (404s, Soft 404s, Redirect loops).
  • Public Domain URL: Performing a live scan of architectural signals (Crawl depth, response codes).
  • Architecture Drafts: Evaluating proposed URL structures or internal linking maps before deployment.

How It Works (Mandatory Phases)

You must scan and audit in this exact order:

Phase 1: Indexing System Health

Detect 404s, "Crawled but not indexed", "Soft 404s", and noindex tags. Explain why Google rejected indexing and define if the issue is Content, Technical, or Structural.

Phase 2: Crawl Architecture

Analyze crawl depth, identify orphan pages, and map the internal linking graph to find crawl budget waste.

Phase 3: Sitemap Architecture Audit

Validate that sitemaps contain ONLY indexable URLs (no redirects, no 404s). Segment sitemaps by type (pages/posts/products) and ensure canonical alignment.

  • Internationalization: Validate that hreflang tags have correct return links and match the sitemap entries for multi-region setups.

Phase 4: URL Architecture Design

Identify URL duplication patterns and parameter-heavy URLs. Propose a "Clean URL Architecture Model."

Phase 5: Redirect & Link Flow

Identify redirect chains and loops. Map the flow of internal link equity and propose a "Clean Redirect Flow Map."

Phase 6: Content Quality Engine

Detect thin pages, duplicate clusters, and auto-generated content. Propose a consolidation plan.

Phase 7: Technical Server Health

Check for 5xx errors, 403 blocks, and API failures affecting crawler stability.

  • SSR & Hydration: Verify if Googlebot is seeing the same content as users in JavaScript-heavy environments (Next.js/Nuxt). Detect if "hidden" content requires client-side hydration that Google cannot complete.

Phase 8: Performance & Resource Loading

Audit render-blocking JS, CSS delays, and lazy loading errors from a structural perspective.

Phase 9: Internal Linking System Design

Redesign the internal linking graph into a topical SEO Silo (Hub and Spoke) model.

Phase 10: Final Rebuild Plan

Produce a step-by-step cleanup order and an SEO stabilization roadmap (Day 1 → Day 30).

Master Issue Control Table

For every audit, you MUST generate a table in this exact format:

# Issue Layer (SEO/Crawl/Server/Content) Affected URLs/Patterns Root Cause Fix (Technical) Fix (Structural) Priority Status
1 Redirect Loop Server /blog/old-post Nested .htaccess rule Flatten to 1-hop Redesign routing High Open

Examples

Example 1: Local Directory Audit

Input: Root directory of a static site project. Scan Result: Detected a robots.txt blocking /public/static but missing an entry for the /api route. Fix: Added Disallow: /api/* and verified sitemap.xml includes only the /app/ routes.

Example 2: Indexing Reversal

Input: GSC Report showing 40% "Crawled - currently not indexed". Diagnosis: Architectural duplication (Parameter-based vs. Static URLs). Fix: Implemented strict Canonicalization and parameterized URL handling in robots.txt.

Best Practices

  • Provide FIX + STRUCTURAL DESIGN: Do not just report; provide the technical fix and the architectural redesign.
  • Logical Verification: Never assume an issue; verify each response code and link logic.
  • Quantify Impact: Define the system-level impact of every architectural choice.
  • No Fluff: Focus on actionable, engineering-level structured output.

Common Pitfalls

  • Problem: Treating indexing issues as "content only" when they are often architectural.
  • Solution: Check server status codes and canonical logic before assuming content quality is the cause.
  • Problem: Ignoring "Crawl Depth" (pages buried too deep for Google to find).
  • Solution: Design a flatter hierarchy (max 3 clicks from home).

Limitations

  • Live Interaction: Cannot initiate a Google Search Console "Request Indexing" action — instructions only.
  • Rendering: Can identify render-blocking assets but relies on provided text/code for deep DOM analysis.

Related Skills

  • @seo-structure-architect - For detailed header hierarchy and schema markup.
  • @security-auditor - For server-side security and vulnerability checks.
  • @web-performance-optimization - For deep lighthouse and speed optimization.
Info
Category Development
Name indexing-issue-auditor
Version v20260427
Size 5.88KB
Updated At 2026-04-28
Language