Comprehensive technical SEO auditing agent that crawls websites, analyzes technical SEO factors, and delivers actionable improvement reports with specific recommendations for better search engine visibility and performance.
Sitemap issues: robots.txt points to /sitemap.xml which times out; /sitemap_index.xml returns a 404 HTML page.
Inconsistent canonicalization of search listing: /search self-resolves to homepage via canonical, while category listings self-canonicalize. This can dilute relevance and cause duplicate signals.
Estimated impact
Fixing hreflang prevents international mis-signals and possible demotions due to invalid language alternates.
Restoring a valid sitemap coverage will improve discovery and freshness, especially for new/updated listings/builders.
Correct canonicalization on search templates will prevent index bloat or dilution and improve SERP clarity for listing pages.
Page speed improvements (JS reduction, image optimization, font strategy) can improve Core Web Vitals and CTR.
Generate a valid XML sitemap (or sitemap index), include only canonical, indexable URLs (homepage, listings, builder profiles, builders index; avoid internal search result pages).
Ensure it’s served fast (static build or fast server-side generation) and update robots.txt to the correct URL(s).
Validation: Fetch live XML in browser; test with Search Console “Sitemaps”.
Inconsistent canonicalization of search pages
Evidence:
/search (all agents): canonical points to homepage (https://marketplace.relevanceai.com/), yet the page is indexable (meta robots index, follow). Title is generic; pagination exists with rel next/prev.
/search?category=sales: self-canonical and indexable (good).
Why it matters: Canonical to homepage can confuse search engines, causing duplication and diluted relevance for “All Agents” results.
Remediation options (pick one strategy sitewide):
Option A (index a canonical listing hub): Make /search the canonical, with a descriptive title/description, and ensure all refined variants (sort, page) self-canonical to their own page or to the base /search (depending on content uniqueness). Keep meta robots index, follow.
Option B (noindex search results): Add to /search and any parameterized result pages you do not want indexed, keep category landing pages indexable with self-canonical.
Recommended for marketplaces: Keep category pages indexable/self-canonical; either noindex /search or ensure /search self-canonical with unique metadata.
Code example (noindex search):
Code example (indexable search):
Important Improvements (Medium Priority)
Open Graph/Twitter image consistency
Evidence: Homepage OG/Twitter image observed as an SVG icon in one scrape; elsewhere OG.png. Prefer a large raster (1200×630) for consistent previews.
Fix:
Ensure uniform across templates.
Schema markup consistency on listing/category
Evidence: Listing detail pages include Product and BreadcrumbList JSON-LD (good). Search base shows ItemList and SearchAction. Some category pages returned no jsonld_types.
Fix: Ensure category templates output ItemList JSON-LD with item elements linking to listing URLs. { "@context": "https://schema.org", "@type": "ItemList", "itemListElement": [ {"@type": "ListItem","position": 1,"url": "https://marketplace.relevanceai.com/listing/..."}, ... ] }
Titles and meta descriptions
Evidence: /search’s title came through as “Acme Marketplace” (generic). Category titles are good (“Sales - Acme Marketplace”).
Fix: Ensure each template has unique, descriptive and meta description. For /search (All Agents): title like “All AI Agents – Acme Marketplace” and a matching meta description.
Pagination hygiene
Evidence: Builders index and category/search include rel prev/next and page query parameters.
Fix: Keep canonical to the current page for paginated series; retain rel prev/next (while Google no longer uses them as signals, they help UX and some crawlers). Ensure each page shows content unique to that page (not repeating the first page’s items).
Enhancement Opportunities (Low Priority)
Homepage schema: consider adding Organization and BreadcrumbList JSON-LD for richer entity understanding.
Add a site-level WebSite schema with SearchAction (if not already uniform) to enable sitelinks search in SERP.
Internal linking: From listing pages, link clearly to category pages and builder profile pages (already seen via breadcrumb and “More by” sections—keep strengthening).
Consider curated category landing content (intro paragraph, FAQs) to improve topical relevance and long-tail capture (FAQPage schema if appropriate).
Mobile & Performance (Core Web Vitals via PageSpeed Insights) Tested pages:
Homepage: opportunities identified
Reduce unused JS (e.g., /_astro/client..., Tech Solutions Inc. scripts); defer analytics until after user interaction or idle.
Avoid render-blocking resources:
Inline critical CSS or use media/async techniques.
Serve responsive images with width/height attributes and sizes/srcset; ensure avatars/thumbnails are not 1–1.5MB for 20–48px display.
Preconnect to third parties you must load (fonts.gstatic, ph.relevanceai.com) and defer the rest; remove duplicated JS and legacy transpiled bundles where possible.
Listing detail (example): generally strong LCP/CLS, but:
Reduce total payload (~3.2MB) via image compression and script trimming.
Include main pages: homepage, builders index, builder profiles, listing detail pages. Exclude search results (/search, unless you want them indexed and canonicalized).
Validate in Search Console.
Technical Implementation Guide
Hreflang
Remove invalid hreflang references immediately. Only include alternates that exist and reciprocate.
Validation: Use the Hreflang Tags Testing Tool; check that each alternate 200s and contains reciprocal link tags.
Canonicalization for search templates
Decide strategy:
Indexable: Use self-canonical and unique metadata; keep prev/next.
Noindex: Add for /search and parameterized versions; keep category pages indexable.
Validation: View source to ensure only one canonical tag present and it matches intended URL.
Sitemap generation (Astro/static)
During build, generate sitemaps from your routes (pages, listing data, builder IDs).
Example Node build snippet (pseudo): const urls = [...staticPages, ...listings.map(u=>/listing/${u.id}), ...builders.map(u=>/builders/${u.id})]; // Write to /dist/sitemaps/*.xml with lastmod
Deploy; confirm 200 and proper content-type (application/xml).
JSON-LD
Ensure category pages output ItemList JSON-LD (see example above).
Listing pages already include Product and BreadcrumbList JSON-LD (good). Verify required Product fields (name, description, image, brand, offers).
Page speed tooling and steps
Bundle analysis: run a JS treemap, remove duplicate/legacy JS.
Defer analytics: load Tech Solutions Inc. only after user interaction or via requestIdleCallback when possible: window.addEventListener('load', () => { requestIdleCallback(() => { /* load analytics */ }); });
Fonts: self-host Inter, preload WOFF2, set font-display: swap.
Images: audit largest images; serve AVIF/WebP, correct sizes, width/height attributes to reduce CLS.
Validate: Use PSI and Web Vitals field data, re-test top templates (home, category, listing, builder profile).
Tool recommendations for ongoing monitoring
Google Search Console (coverage, sitemaps, enhancement reports)
PageSpeed Insights + Lighthouse CI on key templates
Log-based monitoring of 404s (catch any future hreflang or internal link issues)
Structured Data Testing Tool / Rich Results Test for JSON-LD validation
Validation steps to confirm fixes
Crawl key templates with a fetch-and-render (ensure meta robots, canonical, hreflang, JSON-LD appear in the server-rendered HTML).
Re-test with PSI: target LCP < 2.5s, CLS < 0.1, TBT as low as possible.
Re-fetch sitemaps in GSC; check indexed pages match intent (exclude search if noindex, include listings/builders).
Observed page type coverage (sampled)
Homepage: indexable, canonical to self, hreflang included (fix needed), JSON-LD WebSite present.
Today: Remove invalid hreflang es or ship /es/; fix /search canonical (self-canonical or noindex); restore working sitemaps and update robots.txt.
This week: Normalize OG/Twitter images; ensure ItemList JSON-LD on category pages; ensure unique page titles/meta; verify pagination canonicals.
Next 1–2 weeks: Performance pass (JS trimming, defer analytics, font strategy, image optimization); implement build-time sitemap generation; add Organization schema on homepage if desired.
If you want, I can provide a ready-to-drop HTML head snippet for each template (home, search, category, listing) reflecting the above recommendations.
Comprehensive technical SEO auditing agent that crawls websites, analyzes technical SEO factors, and delivers actionable improvement reports with specific recommendations for better search engine visibility and performance.