Research lab

Public benchmarks that make
the AI web measurable.

Planned benchmarks will publish live-scanned datasets with methodology, confidence bands, false-positive policy, and raw CSV for independent analysis. Preview charts are labeled until the first release.

Run a live scan Subscribe via RSS

6 planned benchmark datasets

Data-driven, reproducible, transparent

Registering interestMonthly·12,000 domains

State of AI-Generated Websites

This planned report is intended for advertisers, brand safety teams, and content marketplaces that need AI-content policy thresholds and fraud-risk baselines.

AI Content Risk by Category (projected %)

SaaS

68%

Agencies

71%

E-commerce

54%

Local services

49%

Publishers

38%

Projected sample: illustrative until first publication

Fields collected

AI phrase cluster densityBuilder / template fingerprintsSchema coverage rateAuthorship evidenceProvenance disclosuresLexical diversity distributionBurstiness distribution

Categories

SaaSAgenciesE-commerceLocal servicesPublishers

Registering interestMonthly·10,000 robots.txt files

AI Crawler Access Index

This planned report is intended for AI companies, SEOs, and publishers comparing crawler-access policies and opportunity gaps.

Crawler allow-rate by bot (projected %)

GPTBot

42%

ClaudeBot

38%

PerplexityBot

31%

Google-Extended

55%

CCBot

28%

Projected sample: illustrative until first publication

Fields collected

Per-crawler allow/block ratellms.txt adoption %ai.txt adoption %Crawl-delay distributionPublisher category breakdownPolicy gap heatmap

Categories

GPTBotClaudeBotPerplexityBotGoogle-ExtendedCCBotAmazonbot+8 more

Registering interestQuarterly·5,000 SaaS domains

Agent Readiness Index

Platforms building agentic commerce, AI procurement, and retrieval-augmented workflows need to know which domains are ready to be accessed by AI agents.

Fields collected

sitemap health scoreJSON-LD coverageAPI docs presencePricing / product clarityCheckout friction signalsllms.txt readiness

Categories

B2B SaaSE-commerceDeveloper toolsMarketplacesPublishers

Registering interestQuarterly·50,000 review pages

Synthetic Review Risk Report

Marketplaces, consumer protection teams, and retail brands use this to calibrate review fraud thresholds and vendor vetting policy.

Fields collected

Phrase-repetition clustersGeneric praise densityReview timing anomaliesPersona reuse signalsRating/schema abuse rateAI-text risk in testimonials

Categories

E-commerceTravelSoftware reviewsRestaurant / localHealthcare

Registering interestSemi-annual·100,000 TLS endpoints

Post-Quantum Web Readiness Brief

CISOs, risk managers, and compliance teams need a public baseline for industry PQC migration urgency before NIST migration deadlines.

Fields collected

TLS 1.3 adoption rateCipher suite distributionCAA record adoptionDNSSEC adoptionCert key size distributionPQC algo early adopters

Categories

Financial servicesHealthcareGovernmentEnterprise SaaSCritical infra

Registering interestQuarterly·2,000 brand queries

AI Visibility Benchmark

Marketing teams, brand managers, and SEOs use this to understand which content and schema choices predict AI search citation.

Fields collected

ChatGPT / Claude citation ratePerplexity answer inclusionGoogle AI Overview presenceShare-of-voice vs competitorsSchema coverage correlationAnswer consistency score

Categories

B2B SaaSConsumer appsPublishersProfessional servicesRetail

Methodology

Every benchmark publishes its own receipts.

We believe benchmark integrity requires transparent methodology, published false-positive policy, and reproducible probe logic.

Sample construction

Domains are sampled from Majestic/Tranco rank lists, sector-specific seed lists, and community submissions. Each report publishes inclusion criteria, sampling date, and stratification method.

Live scanner probes

Published datasets will use the same AIClicks API as the product: live DNS, TLS, header, robots, schema, and AI-text probes. Any illustrative preview is labeled before publication.

Scoring and thresholds

Module scores, thresholds, confidence bands, and false-positive policy are published alongside each dataset. Scores are probabilistic, not verdicts.

Distribution and formats

Each report ships as Dataset JSON-LD, downloadable CSV, interactive charts, a comparison page, and an RSS item for citation tracking.

Benchmark cadence

Six reports, four cadences

Benchmark	Cadence	Domains	Next publication
State of AI-Generated Websites	Monthly	12,000	TBD: register interest
AI Crawler Access Index	Monthly	10,000	TBD: register interest
Agent Readiness Index	Quarterly	5,000	TBD: register interest
Synthetic Review Risk Report	Quarterly	50,000+	TBD: register interest
Post-Quantum Web Readiness Brief	Semi-annual	100,000	TBD: register interest
AI Visibility Benchmark	Quarterly	2,000	TBD: register interest

Research alerts

Get notified when benchmarks publish.

One email per publication: no marketing.

API access

Enterprise customers get raw benchmark data via the /api/v1 endpoint, including per-domain scores, evidence bundles, and module-level signals.

View API docs →

CSV and Dataset JSON-LD

Each published report ships with a downloadable CSV and Dataset schema.org block for independent analysis, academic citation, and downstream integrations.

Browse data →

Research partnership

Academics, journalists, and non-profits can request early access to benchmark data for independent verification and published analysis.

Get in touch →

Public benchmarks that makethe AI web measurable.

Data-driven, reproducible, transparent

State of AI-Generated Websites

AI Crawler Access Index

Agent Readiness Index

Synthetic Review Risk Report

Post-Quantum Web Readiness Brief

AI Visibility Benchmark

Every benchmark publishes its own receipts.

Sample construction

Live scanner probes

Scoring and thresholds

Distribution and formats

Six reports, four cadences

Get notified when benchmarks publish.

API access

CSV and Dataset JSON-LD

Research partnership

Public benchmarks that make
the AI web measurable.