Verbatim

Blog

Notes on what the Verbatim Index actually found, and what changes for you on Monday morning.

May 23, 2026

Will AGI Exist by 2030? We Ran 9 AI Models Through Adversarial Review to Find Out.

Seven of nine frontier AI models said AGI is primarily a compute problem. The two dissenters were both Anthropic models.

May 22, 2026

Which AI Is the Harshest Critic?

GPT-5.5 marked 39.78% of its cross-family verdicts as disputed across five Verbatim Index questions. Gemini 3.1 Pro Preview marked 25.28%. A 14.50-percentage-point spread.

May 22, 2026

Claude Opus Costs 7x More Than Gemini. It Ranked Lower Too.

On q-001 of the Verbatim Index, Anthropic's Opus retrieved zero web tokens and was disputed by cross-vendor critics 1.32 points more often than Sonnet 4.6.

May 22, 2026

Perplexity vs. Verbatim: When to Compare AI Answers vs. Audit One

Perplexity Max's Model Council runs your query through three models in parallel. Verbatim's Council audits the answer you already have. Query-anchored versus output-anchored.

May 22, 2026

Web Search Doesn't Make AI Answers More Trustworthy

Only four of nine models in this Verbatim Index run had usable retrieval telemetry. Inside that four, the ranking aligned on the top pair and inverted on the bottom.