Gemini 3.1 Flash-Lite is Google's new fastest and most cost-effective model. It costs 0.25 cents per million incoming tokens and $1.50 per million outgoing tokens. It outperforms the larger 2.5 Flash model in logical reasoning benchmarks. Available in the API and Vertex AI.
GPT-5.3 Instant responds more accurately than the previous version and performs better web searches. It should reduce the amount of irritating empty space in its responses. Already available to all ChatGPT users and in the API.