Tech & Science

OpenAI winds down fine-tuning API amid rising costs

OpenAI's API pricing has undergone a series of changes in recent weeks that are reshaping the economics of AI-powered development, prompting developers and...

OpenAI’s API pricing has undergone a series of changes in recent weeks that are reshaping the economics of AI-powered development, prompting developers and businesses to reconsider how they build on frontier models. While no single announcement on May 8 introduced a formal “tiered pricing structure,” the cumulative effect of OpenAI’s pricing moves — from the GPT-5.5 price doubling to the deprecation of fine-tuning and the rollout of new processing tiers — amounts to a restructuring of the cost landscape for the company’s developer ecosystem.

OpenAI winds down fine-tuning API amid rising costs

GPT-5.5 Doubles Down on Price

The most consequential change came on April 23, when OpenAI introduced GPT-5.5 with API pricing set at $5 per million input tokens and $30 per million output tokens — double the rates of GPT-5.4, which launched just six weeks earlier. A premium GPT-5.5 Pro tier carries even steeper costs at $30 per million input tokens and $180 per million output tokens. OpenAI co-founder Greg Brockman described GPT-5.5 as “a new class of intelligence built specifically for real work and for powering agents”, but developers quickly noticed that the capability gains came at a steep cost.

An analysis by OpenRouter, the API routing platform, found that real-world cost increases ranged from 49% to 92% depending on prompt length, despite GPT-5.5 generating 19% to 34% fewer output tokens on longer prompts. For short prompts under 2,000 tokens, users absorbed nearly the full brunt of the price doubling.

A Week of Compounding Changes

The first week of May brought additional disruptions. On May 7, OpenAI announced it was winding down its fine-tuning API and platform, giving existing customers until January 6, 2027, to create new training jobs. The same day, the company launched new voice intelligence features including GPT-Realtime-2, a voice model with GPT-5-class reasoning, billed by token consumption through the Realtime API.

These moves coincided with pricing shifts across the industry. An analysis published by FairMind found that in a single week, OpenAI, Anthropic, and GitHub all altered their economic terms through different mechanisms, generating gaps of up to 92% between published list prices and actual billed costs. GitHub’s Copilot is migrating to a token-based billing model effective June 1, with multipliers for Claude Opus 4.7 jumping from 7.5x to 27x.

Developer Pushback and Emerging Alternatives

The pricing trajectory has sparked frustration in OpenAI’s developer community. “OpenAI pricing has not become cheaper for API developers with new release models in 2026, instead, huge hikes,” one developer wrote on OpenAI’s community forum. Others have noted that the $100-per-month ChatGPT Pro tier, introduced in April to offer expanded Codex usage, still falls short of expectations given reduced limits compared to earlier plans.

The competitive response has been swift. DeepSeek announced on April 26 that it would slash prices for cached API inputs to roughly $0.14 per million tokens, while open-source alternatives continue to gain traction among cost-conscious teams. As one industry analysis put it, “2026 definitively closes the era of AI as a ‘stable-rate API’ and opens that of AI as an architectural component to be designed with the same care given to a multi-region database”.

Leave a Reply

Your email address will not be published. Required fields are marked *