State of AI-Generated Websites
Advertisers, brand safety teams, and content marketplaces use this report to set AI-content policy thresholds and benchmark fraud risk.
Fields collected
Categories
Research lab
Each benchmark is a live scanned dataset — not opinions. Published with methodology, confidence bands, false-positive policy, and raw CSV for independent analysis.
6 planned benchmark datasets
Advertisers, brand safety teams, and content marketplaces use this report to set AI-content policy thresholds and benchmark fraud risk.
Fields collected
Categories
AI companies, SEOs, and publishers use this report to understand ecosystem crawler access norms and identify opportunity gaps.
Fields collected
Categories
Platforms building agentic commerce, AI procurement, and retrieval-augmented workflows need to know which domains are ready to be accessed by AI agents.
Fields collected
Categories
Marketplaces, consumer protection teams, and retail brands use this to calibrate review fraud thresholds and vendor vetting policy.
Fields collected
Categories
CISOs, risk managers, and compliance teams need a public baseline for industry PQC migration urgency before NIST migration deadlines.
Fields collected
Categories
Marketing teams, brand managers, and SEOs use this to understand which content and schema choices predict AI search citation.
Fields collected
Categories
Methodology
We believe benchmark integrity requires transparent methodology, published false-positive policy, and reproducible probe logic. These are the four pillars:
01
Domains are sampled from Majestic/Tranco rank lists, sector-specific seed lists, and community submissions. Each report publishes inclusion criteria, sampling date, and stratification method.
02
Every domain is scanned with the same AIClicks API used in the product — live DNS, TLS, header, robots, schema, and AI-text probes. No cached or estimated data.
03
Module scores, thresholds, confidence bands, and false-positive policy are published alongside each dataset. Scores are probabilistic, not verdicts.
04
Each report ships as Dataset JSON-LD, downloadable CSV, interactive charts, a comparison page, and an RSS item for citation tracking.
Enterprise customers get raw benchmark data via the /api/v1 endpoint, including per-domain scores, evidence bundles, and module-level signals.
View API docs →Each published report ships with a downloadable CSV and Dataset schema.org block for independent analysis, academic citation, and downstream integrations.
Browse data →Academics, journalists, and non-profits can request early access to benchmark data for independent verification and published analysis.
Get in touch →