In brief: Inception Labs’ Mercury 2 generates roughly 1,000 tokens per second and scored 90 on the AIME 2026. Google’s recent DiffusionGemma hits similar speeds...
In brief: Google released DiffusionGemma, a free open-weight model that generates entire 256-token blocks simultaneously via text diffusion, hitting over 1,000 tokens per second on an...