Blog
Notes on what the Verbatim Index actually found, and what changes for you on Monday morning.
Will AGI Exist by 2030? We Ran 9 AI Models Through Adversarial Review to Find Out.
Seven of nine frontier AI models said AGI is primarily a compute problem. The two dissenters were both Anthropic models.
Which AI Is the Harshest Critic?
GPT-5.5 marked 39.78% of its cross-family verdicts as disputed across five Verbatim Index questions. Gemini 3.1 Pro Preview marked 25.28%. A 14.50-percentage-point spread.
Claude Opus Costs 7x More Than Gemini. It Ranked Lower Too.
On q-001 of the Verbatim Index, Anthropic's Opus retrieved zero web tokens and was disputed by cross-vendor critics 1.32 percentage points more often than Sonnet 4.6.
Perplexity vs. Verbatim: When to Compare AI Answers vs. Audit One
Perplexity Max's Model Council runs your query through three models in parallel. Verbatim's Council audits the answer you already have. Query-anchored versus output-anchored.
Web Search Doesn't Make AI Answers More Trustworthy
Only four of the nine models in this Verbatim Index run had usable retrieval telemetry. Among those four, the ranking aligned on the top pair and inverted on the bottom pair.