OpenAI Launches GPT-5.4 Mini and Nano Models for Cost-Optimized AI Workloads

10:19, 18 March

Edited by: Aleksandr Lytviak

OpenAI announced two new compact artificial intelligence models, GPT-5.4 Mini and GPT-5.4 Nano, on March 17, 2026. The release responds to growing industry demand for models that balance capability, latency, and operating cost, and it reflects a broader 2026 shift toward specialized model portfolios rather than a single monolithic system. Both are lightweight variants of the flagship GPT-5.4 series, optimized for high-volume operations, agentic subtasks, and real-time multimodal processing, where larger models are too expensive to run.

GPT-5.4 Mini targets developer workloads that need fast responses, such as coding assistants, real-time image reasoning, and interpreting user interface screenshots. OpenAI says GPT-5.4 Mini runs more than twice as fast as its direct predecessor, GPT-5 Mini, while approaching the performance of the main GPT-5.4 model on demanding coding evaluations such as SWE-Bench Pro. For developers using Codex, the Mini variant consumes only 30% of the standard GPT-5.4 quota for subagent workflows, or roughly one-third the cost for those coding tasks. The model accepts text and image inputs, supports function calling, tool use, and web and file search, and has a 400,000-token context window. API pricing for GPT-5.4 Mini is $0.75 per million input tokens and $4.50 per million output tokens.

GPT-5.4 Nano is the smallest and cheapest model in the new series, designed for repetitive, speed-critical tasks such as data extraction, classification, and simple subagent support in coding environments. Unlike the Mini, the Nano is available only through the API, reflecting its focus on high-throughput backend automation rather than direct user interaction. GPT-5.4 Nano is priced at $0.20 per million input tokens and $1.25 per million output tokens, well below the flagship model's $2.50 per million input tokens and $15.00 per million output tokens. This segmentation suggests a strategy in which complex planning is reserved for the flagship model, while execution-heavy work that needs less reasoning is delegated to the Nano.
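To make the price gap concrete, the following minimal Python sketch applies the per-million-token prices quoted in this article to a sample workload. The dictionary keys are illustrative labels rather than confirmed API model identifiers, and the workload of 50 million input tokens and 10 million output tokens is a hypothetical figure chosen only for the arithmetic.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """USD price per one million tokens, as quoted in the article."""
    input_per_million: float
    output_per_million: float


# Keys are illustrative labels (an assumption), not confirmed API model names.
PRICES = {
    "gpt-5.4": Pricing(2.50, 15.00),
    "gpt-5.4-mini": Pricing(0.75, 4.50),
    "gpt-5.4-nano": Pricing(0.20, 1.25),
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of a workload at the listed per-token prices."""
    p = PRICES[model]
    total = (input_tokens * p.input_per_million
             + output_tokens * p.output_per_million)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 50M input tokens and 10M output tokens.
    for model in PRICES:
        cost = workload_cost(model, 50_000_000, 10_000_000)
        print(f"{model}: ${cost:,.2f}")
```

Under these assumptions the workload costs about $275.00 on the flagship model, $82.50 on the Mini, and $22.50 on the Nano. The Mini's list prices are 30% of the flagship's for both input and output tokens, and the Nano comes to roughly 8% of the flagship cost for this token mix.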

Availability across the OpenAI ecosystem is tiered: GPT-5.4 Mini is accessible through the API, in Codex, and in ChatGPT, where Free and Go users can reach it through the 'Thinking' feature. The launch follows the March 5, 2026, release of the main GPT-5.4 Thinking model, continuing OpenAI's 2026 product rollout. Both new models are also available on Microsoft Foundry, the platform for building and governing AI applications, which integrates Azure AI capabilities and supports OpenAI models, underscoring the enterprise focus of the release.

The emphasis on SWE-Bench Pro, a software engineering agent benchmark with 1,865 tasks across 41 repositories that is designed to resist data contamination, shows that the models are aimed at complex developer workflows. Although performance relative to latency has improved, the materials note a 'notable price increase' for the new models compared with their GPT-5 predecessors, indicating that OpenAI has recalibrated its pricing despite the efficiency gains. GPT-5.4 Mini's ability to approach GPT-5.4 performance on such benchmarks while consuming significantly less quota makes it an efficient option for large-scale software development.

Sources

Republic World
Mynet Haber
OpenAI ships GPT-5.4 mini and nano, faster and more capable but up to 4x pricier
OpenAI has announced 'GPT-5.4 mini/nano,' a fast, low-cost, and lightweight model
OpenAI 2026 AI Roadmap: GPT-5, 5.2 & Open Models - i10X
OpenAI's Latest AI Models Are Built for Speed - CNET
OpenAI releases GPT-5.4 mini and nano, its most capable small models yet
9to5Mac
ZDNET
OpenAI
Thurrott.com
Microsoft
