Moneycontrol PRO
Swing Trading 101
Swing Trading 101

OpenAI launches GPT-5.4 with major accuracy gains and new tool search system

OpenAI has launched GPT-5.4, its latest frontier AI model designed for professional workloads, offering a massive one-million-token context window, improved reasoning, lower error rates and faster performance.

March 06, 2026 / 07:46 IST
OpenAI
Snapshot AI
  • OpenAI launches GPT-5.4 with improved efficiency and accuracy
  • GPT-5.4 supports up to one million tokens per API request
  • New Tool Search reduces token use for external tool integration

OpenAI has introduced GPT-5.4, describing it as the company’s most capable and efficient frontier model built for professional and enterprise tasks.

The new release expands the GPT-5 series with multiple variants. Alongside the standard model, OpenAI is offering GPT-5.4 Thinking, a reasoning-focused version designed for complex problem solving, and GPT-5.4 Pro, which prioritises higher performance.

A key highlight is the model’s massive context window. The API version supports up to one million tokens, allowing developers to process far larger documents and datasets in a single request than previous OpenAI models.

OpenAI says the new model is also significantly more token-efficient. According to the company, GPT-5.4 can solve similar problems using fewer tokens compared with GPT-5.2, potentially reducing both latency and cost for developers.

Benchmark results show major gains across several tests. GPT-5.4 recorded top scores in computer-use benchmarks OSWorld-Verified and WebArena Verified, while achieving 83 percent on OpenAI’s GDPval evaluation, which measures performance on knowledge-work tasks.

The model also led the APEX-Agents benchmark from Mercor, which evaluates professional skills such as legal reasoning and financial analysis.

Mercor CEO Brendan Foody said GPT-5.4 performed strongly on complex deliverables including slide decks, financial modelling and legal analysis, while operating faster and at a lower cost than competing frontier models.

OpenAI also claims improved factual reliability. In internal evaluations, GPT-5.4 was 33 percent less likely to make errors in individual claims compared with GPT-5.2, while overall responses were 18 percent less likely to contain mistakes.

New Tool Search

The company has also introduced a new system called Tool Search to improve how models interact with external tools through the API. Previously, system prompts needed to include definitions for every available tool, which could consume large numbers of tokens. Tool Search allows the model to retrieve tool definitions only when required, reducing token usage and speeding up requests in applications with many integrated tools.

OpenAI also introduced a new safety evaluation focused on chain-of-thought reasoning — the internal step-by-step explanations models generate when solving complex tasks. Researchers have raised concerns that AI models might misrepresent this reasoning process under certain conditions.

According to OpenAI, early testing shows deception is less likely with GPT-5.4 Thinking, suggesting the model is less capable of hiding its reasoning. The results indicate that monitoring chain-of-thought behaviour remains an effective safety method.

 

Invite your friends and family to sign up for MC Tech 3, our daily newsletter that breaks down the biggest tech and startup stories of the day

Sarthak Singh Sarthak is an experienced writer having covered personal and consumer tech, gadgets news, social media trends, and more for several years
first published: Mar 6, 2026 07:46 am

Discover the latest Business News, Sensex, and Nifty updates. Obtain Personal Finance insights, tax queries, and expert opinions on Moneycontrol or download the Moneycontrol App to stay updated!

Subscribe to Tech Newsletters

  • On Saturdays

    Find the best of Al News in one place, specially curated for you every weekend.

  • Daily-Weekdays

    Stay on top of the latest tech trends and biggest startup news.

Advisory Alert: It has come to our attention that certain individuals are representing themselves as affiliates of Moneycontrol and soliciting funds on the false promise of assured returns on their investments. We wish to reiterate that Moneycontrol does not solicit funds from investors and neither does it promise any assured returns. In case you are approached by anyone making such claims, please write to us at grievanceofficer@nw18.com or call on 02268882347