In brief Xiaomi and inference partner TileRT have broken 1,000 tokens per second on a 1-trillion-parameter model, a first at that scale, using a standard...
In brief GPT-5.5 launches today for Plus, Pro, Business, and Enterprise users in ChatGPT and Codex, with API access coming soon at $5/M input tokens...