• Kinza Babylon Staked BTCKinza Babylon Staked BTC(KBTC)$83,270.000.00%
  • Steakhouse EURCV Morpho VaultSteakhouse EURCV Morpho Vault(STEAKEURCV)$0.000000-100.00%
  • Stride Staked InjectiveStride Staked Injective(STINJ)$16.51-4.18%
  • Vested XORVested XOR(VXOR)$3,404.231,000.00%
  • FibSwap DEXFibSwap DEX(FIBO)$0.0084659.90%
  • ICPanda DAOICPanda DAO(PANDA)$0.003106-39.39%
  • TruFin Staked APTTruFin Staked APT(TRUAPT)$8.020.00%
  • bitcoinBitcoin(BTC)$102,994.000.02%
  • VNST StablecoinVNST Stablecoin(VNST)$0.0000400.67%
  • ethereumEthereum(ETH)$2,337.236.24%
  • tetherTether(USDT)$1.000.02%
  • rippleXRP(XRP)$2.351.65%
  • binancecoinBNB(BNB)$672.057.58%
  • solanaSolana(SOL)$171.815.21%
  • Wrapped SOLWrapped SOL(SOL)$143.66-2.32%
  • usd-coinUSDC(USDC)$1.000.00%
  • dogecoinDogecoin(DOGE)$0.2052994.83%
  • cardanoCardano(ADA)$0.781.14%
  • tronTRON(TRX)$0.2632052.29%
  • staked-etherLido Staked Ether(STETH)$2,333.396.22%
  • SuiSui(SUI)$3.97-1.04%
  • wrapped-bitcoinWrapped Bitcoin(WBTC)$102,910.00-0.21%
  • Gaj FinanceGaj Finance(GAJ)$0.0059271.46%
  • Content BitcoinContent Bitcoin(CTB)$24.482.55%
  • USD OneUSD One(USD1)$1.000.11%
  • chainlinkChainlink(LINK)$15.970.86%
  • UGOLD Inc.UGOLD Inc.(UGOLD)$3,042.460.08%
  • avalanche-2Avalanche(AVAX)$23.344.60%
  • Wrapped stETHWrapped stETH(WSTETH)$2,810.296.62%
  • ParkcoinParkcoin(KPK)$1.101.76%
  • stellarStellar(XLM)$0.2968380.78%
  • shiba-inuShiba Inu(SHIB)$0.0000154.74%
  • hedera-hashgraphHedera(HBAR)$0.2012803.09%
  • HyperliquidHyperliquid(HYPE)$24.726.28%
  • bitcoin-cashBitcoin Cash(BCH)$412.80-1.35%
  • ToncoinToncoin(TON)$3.292.81%
  • leo-tokenLEO Token(LEO)$8.69-1.91%
  • USDSUSDS(USDS)$1.00-0.01%
  • litecoinLitecoin(LTC)$103.118.56%
  • polkadotPolkadot(DOT)$4.888.56%
  • Yay StakeStone EtherYay StakeStone Ether(YAYSTONE)$2,671.07-2.84%
  • wethWETH(WETH)$2,336.466.24%
  • Pundi AIFXPundi AIFX(PUNDIAI)$16.000.00%
  • PengPeng(PENG)$0.60-13.59%
  • moneroMonero(XMR)$312.383.72%
  • Wrapped eETHWrapped eETH(WEETH)$2,494.156.47%
  • Bitget TokenBitget Token(BGB)$4.550.87%
  • PepePepe(PEPE)$0.00001313.95%
  • Binance Bridged USDT (BNB Smart Chain)Binance Bridged USDT (BNB Smart Chain)(BSC-USD)$1.00-0.04%
  • Pi NetworkPi Network(PI)$0.738.29%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

OpenAI’s new GPT-4.1 models can process a million tokens and solve coding problems better than ever

April 14, 2025
in AI & Technology
Reading Time: 5 mins read
A A
OpenAI’s new GPT-4.1 models can process a million tokens and solve coding problems better than ever
ShareShareShareShareShare

YOU MAY ALSO LIKE

Apple Building Specialized Chips for Smart Glasses

Lyft CEO: Self-Driving Taxis to Be Part of Fleet Someday

OpenAI launched a new family of AI models this morning that significantly improve coding abilities while cutting costs, responding directly to growing competition in the enterprise AI market.

The San Francisco-based AI company introduced three models — GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano — all available immediately through its API. The new lineup performs better at software engineering tasks, follows instructions more precisely, and can process up to one million tokens of context, equivalent to about 750,000 words.

“GPT-4.1 offers exceptional performance at a lower cost,” said Kevin Weil, chief product officer at OpenAI, during Monday’s announcement. “These models are better than GPT-4o on just about every dimension.”

Perhaps most significant for enterprise customers is the pricing: GPT-4.1 will cost 26% less than its predecessor, while the lightweight nano version becomes OpenAI’s most affordable offering at just 12 cents per million tokens.

How GPT-4.1’s improvements target enterprise developers’ biggest pain points

In a candid interview with VentureBeat, Michelle Pokrass, post training research lead at OpenAI, emphasized that practical business applications drove the development process.

“GPT-4.1 was trained with one goal: being useful for developers,” Pokrass told VentureBeat. “We’ve found GPT-4.1 is much better at following the kinds of instructions that enterprises use in practice, which makes it much easier to deploy production-ready applications.”

This focus on real-world utility is reflected in benchmark results. On SWE-bench Verified, which measures software engineering capabilities, GPT-4.1 scored 54.6% — a substantial 21.4 percentage point improvement over GPT-4o.

For businesses developing AI agents that work independently on complex tasks, the improvements in instruction following are particularly valuable. On Scale’s MultiChallenge benchmark, GPT-4.1 scored 38.3%, outperforming GPT-4o by 10.5 percentage points.

Why OpenAI’s three-tiered model strategy challenges competitors like Google and Anthropic

The introduction of three distinct models at different price points addresses the diversifying AI marketplace. The flagship GPT-4.1 targets complex enterprise applications, while mini and nano versions address use cases where speed and cost efficiency are priorities.

“Not all tasks need the most intelligence or top capabilities,” Pokrass told VentureBeat. “Nano is going to be a workhorse model for use cases like autocomplete, classification, data extraction, or anything else where speed is the top concern.”

Simultaneously, OpenAI announced plans to deprecate GPT-4.5 Preview — its largest and most expensive model released just two months ago — from its API by July 14. The company positioned GPT-4.1 as a more cost-effective replacement that delivers “improved or similar performance on many key capabilities at much lower cost and latency.”

This move allows OpenAI to reclaim computing resources while providing developers a more efficient alternative to its costliest offering, which had been priced at $75 per million input tokens and $150 per million output tokens.

Real-world results: How Thomson Reuters, Carlyle and Windsurf are leveraging GPT-4.1

Several enterprise customers who tested the models prior to launch reported substantial improvements in their specific domains.

Thomson Reuters saw a 17% improvement in multi-document review accuracy when using GPT-4.1 with its legal AI assistant, CoCounsel. This enhancement is particularly valuable for complex legal workflows involving lengthy documents with nuanced relationships between clauses.

Financial firm Carlyle reported 50% better performance on extracting granular financial data from dense documents — a critical capability for investment analysis and decision-making.

Varun Mohan, CEO of coding tool provider Windsurf (formerly Codeium), shared detailed performance metrics during the announcement.

“We found that GPT-4.1 reduces the number of times that it needs to read unnecessary files by 40% compared to other leading models, and also modifies unnecessary files 70% less,” Mohan said. “The model is also surprisingly less verbose… GPT-4.1 is 50% less verbose than other leading models.”

Million-token context: What businesses can do with 8x more processing capacity

All three models feature a context window of one million tokens — eight times larger than GPT-4o’s 128,000 token limit. This expanded capacity allows the models to process multiple lengthy documents or entire codebases at once.

In a demonstration, OpenAI showed GPT-4.1 analyzing a 450,000-token NASA server log file from 1995, identifying an anomalous entry hiding deep within the data. This capability is particularly valuable for tasks involving large datasets, such as code repositories or corporate document collections.

However, OpenAI acknowledges performance degradation with extremely large inputs. On its internal OpenAI-MRCR test, accuracy dropped from around 84% with 8,000 tokens to 50% with one million tokens.

How the enterprise AI landscape is shifting as Google, Anthropic and OpenAI compete for developers

The release comes as competition in the enterprise AI space heats up. Google recently launched Gemini 2.5 Pro with a comparable one-million-token context window, while Anthropic’s Claude 3.7 Sonnet has gained traction with businesses seeking alternatives to OpenAI’s offerings.

Chinese AI startup DeepSeek also recently upgraded its models, putting additional pressure on OpenAI to maintain its leadership position.

“It’s been really cool to see how improvements in long context understanding have translated into better performance on specific verticals like legal analysis and extracting financial data,” Pokrass said. “We’ve found it’s critical to test our models beyond the academic benchmarks and make sure they perform well with enterprises and developers.”

By releasing these models specifically through its API rather than ChatGPT, OpenAI signals its commitment to developers and enterprise customers. The company plans to gradually incorporate features from GPT-4.1 into ChatGPT over time, but the primary focus remains on providing robust tools for businesses building specialized applications.

To encourage further research in long-context processing, OpenAI is releasing two evaluation datasets: OpenAI-MRCR for testing multi-round coreference abilities and Graphwalks for evaluating complex reasoning across lengthy documents.

For enterprise decision-makers, the GPT-4.1 family offers a more practical, cost-effective approach to AI implementation. As organizations continue integrating AI into their operations, these improvements in reliability, specificity, and efficiency could accelerate adoption across industries still weighing implementation costs against potential benefits.

While competitors chase larger, costlier models, OpenAI’s strategic pivot with GPT-4.1 suggests the future of AI may not belong to the biggest models, but to the most efficient ones. The real breakthrough may not be in the benchmarks, but in bringing enterprise-grade AI within reach of more businesses than ever before.

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.

Credit: Source link
ShareTweetSendSharePin

Related Posts

Apple Building Specialized Chips for Smart Glasses
AI & Technology

Apple Building Specialized Chips for Smart Glasses

May 10, 2025
Lyft CEO: Self-Driving Taxis to Be Part of Fleet Someday
AI & Technology

Lyft CEO: Self-Driving Taxis to Be Part of Fleet Someday

May 10, 2025
Affirm CEO Explains Company’s Guidance
AI & Technology

Affirm CEO Explains Company’s Guidance

May 10, 2025
Investors Look Past DraftKings’ Weak March Madness
AI & Technology

Investors Look Past DraftKings’ Weak March Madness

May 10, 2025
Next Post
OpenAI slashes prices for GPT-4.1, igniting AI price war among tech giants

OpenAI slashes prices for GPT-4.1, igniting AI price war among tech giants

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
Bodycam video shows investigators finding the bodies of Gene Hackman and this wife

Bodycam video shows investigators finding the bodies of Gene Hackman and this wife

May 9, 2025
American-born doctor receives DHS email telling her to leave U.S.

American-born doctor receives DHS email telling her to leave U.S.

May 8, 2025
Millionaires surge in NYC as middle class gets priced out

Millionaires surge in NYC as middle class gets priced out

May 8, 2025

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!