• bitcoinBitcoin(BTC)$68,014.00-3.61%
  • ethereumEthereum(ETH)$1,986.82-3.42%
  • tetherTether(USDT)$1.000.00%
  • binancecoinBNB(BNB)$629.82-1.71%
  • rippleXRP(XRP)$1.37-2.17%
  • usd-coinUSDC(USDC)$1.000.00%
  • solanaSolana(SOL)$84.59-3.28%
  • tronTRON(TRX)$0.283570-1.02%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.02-1.05%
  • dogecoinDogecoin(DOGE)$0.090375-2.99%
  • whitebitWhiteBIT Coin(WBT)$54.44-3.64%
  • USDSUSDS(USDS)$1.000.00%
  • cardanoCardano(ADA)$0.258398-3.16%
  • bitcoin-cashBitcoin Cash(BCH)$450.97-1.17%
  • leo-tokenLEO Token(LEO)$9.050.05%
  • HyperliquidHyperliquid(HYPE)$30.720.88%
  • moneroMonero(XMR)$350.02-1.97%
  • chainlinkChainlink(LINK)$8.81-3.23%
  • Ethena USDeEthena USDe(USDE)$1.00-0.03%
  • CantonCanton(CC)$0.153252-1.17%
  • stellarStellar(XLM)$0.152132-3.30%
  • USD1USD1(USD1)$1.000.00%
  • RainRain(RAIN)$0.009095-1.59%
  • daiDai(DAI)$1.00-0.02%
  • hedera-hashgraphHedera(HBAR)$0.096759-2.43%
  • litecoinLitecoin(LTC)$54.10-2.01%
  • paypal-usdPayPal USD(PYUSD)$1.00-0.01%
  • avalanche-2Avalanche(AVAX)$9.02-2.98%
  • suiSui(SUI)$0.91-3.24%
  • zcashZcash(ZEC)$209.80-6.82%
  • the-open-networkToncoin(TON)$1.340.60%
  • shiba-inuShiba Inu(SHIB)$0.000005-2.62%
  • crypto-com-chainCronos(CRO)$0.075166-2.54%
  • tether-goldTether Gold(XAUT)$5,140.321.55%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.098910-2.41%
  • MemeCoreMemeCore(M)$1.523.24%
  • pax-goldPAX Gold(PAXG)$5,176.511.46%
  • polkadotPolkadot(DOT)$1.49-1.88%
  • uniswapUniswap(UNI)$3.83-3.19%
  • mantleMantle(MNT)$0.68-2.18%
  • Pi NetworkPi Network(PI)$0.22895714.58%
  • okbOKB(OKB)$101.074.36%
  • Circle USYCCircle USYC(USYC)$1.120.00%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • Falcon USDFalcon USD(USDF)$1.000.00%
  • BittensorBittensor(TAO)$180.72-1.61%
  • AsterAster(ASTER)$0.70-0.24%
  • Global DollarGlobal Dollar(USDG)$1.000.00%
  • aaveAave(AAVE)$110.50-4.86%
  • SkySky(SKY)$0.070084-7.38%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

YuanLab AI Releases Yuan 3.0 Ultra: A Flagship Multimodal MoE Foundation Model, Built for Stronger Intelligence and Unrivaled Efficiency

March 5, 2026
in AI & Technology
Reading Time: 6 mins read
A A
YuanLab AI Releases Yuan 3.0 Ultra: A Flagship Multimodal MoE Foundation Model, Built for Stronger Intelligence and Unrivaled Efficiency
ShareShareShareShareShare

How can a trillion-parameter Large Language Model achieve state-of-the-art enterprise performance while simultaneously cutting its total parameter count by 33.3% and boosting pre-training efficiency by 49%? Yuan Lab AI releases Yuan3.0 Ultra, an open-source Mixture-of-Experts (MoE) large language model featuring 1T total parameters and 68.8B activated parameters. The model architecture is designed to optimize performance in enterprise-specific tasks while maintaining competitive general-purpose capabilities. Unlike traditional dense models, Yuan3.0 Ultra utilizes sparsity to scale capacity without a linear increase in computational cost.

Layer-Adaptive Expert Pruning (LAEP)

The primary innovation in Yuan3.0 Ultra’s training is the Layer-Adaptive Expert Pruning (LAEP) algorithm. While expert pruning is typically applied post-training, LAEP identifies and removes underutilized experts directly during the pre-training stage.

YOU MAY ALSO LIKE

Anthropic launches Claude Marketplace, giving enterprises access to Claude-powered tools from Replit, GitLab, Harvey and more

Valve doesn’t sound confident the Steam Machine will ship in 2026

Research into expert load distribution revealed two distinct phases during pre-training:

  1. Initial Transition Phase: Characterized by high volatility in expert loads inherited from random initialization.
  2. Stable Phase: Expert loads converge, and the relative ranking of experts based on token assignment remains largely fixed.

Once the stable phase is reached, LAEP applies pruning based on two constraints:

  • Individual Load Constraint (⍺): Targets experts whose token load is significantly lower than the layer average.
  • Cumulative Load Constraint (β): Identifies the subset of experts contributing the least to total token processing.

By applying LAEP with β=0.1 and varying ⍺, the model was pruned from an initial 1.5T parameters down to 1T parameters. This 33.3% reduction in total parameters preserved the model’s multi-domain performance while significantly lowering memory requirements for deployment. In the 1T configuration, the number of experts per layer was reduced from 64 to a maximum of 48 preserved experts.

https://github.com/Yuan-lab-LLM/Yuan3.0-Ultra/blob/main/Docs/Yuan3.0_Ultra%20Paper.pdf

Hardware Efficiency and Expert Rearrangement

MoE models often suffer from device-level load imbalance when experts are distributed across a computing cluster. To address this, Yuan3.0 Ultra implements an Expert Rearranging algorithm.

This algorithm ranks experts by token load and uses a greedy strategy to distribute them across GPUs so that the cumulative token variance is minimized.

Method TFLOPS per GPU
Base Model (1515B) 62.14
DeepSeek-V3 Aux Loss 80.82
Yuan3.0 Ultra (LAEP) 92.60

Total pre-training efficiency improved by 49%. This improvement is attributed to two factors:

  • Model Pruning: Contributed 32.4% to the efficiency gain.
  • Expert Rearrangement: Contributed 15.9% to the efficiency gain.

Mitigating Overthinking with Revised RIRM

In the reinforcement learning (RL) stage, the model employs a refined Reflection Inhibition Reward Mechanism (RIRM) to prevent excessively long reasoning chains for simple tasks.

The reward for reflection, $R_{ver}$, is calculated using a threshold-based penalty system:

  • rmin=0: The ideal number of reflection steps for direct responses.
  • rmax=3: The maximum tolerable reflection threshold.

For correct samples, the reward decreases as reflection steps approach rmax, while incorrect samples that ‘overthink’ (exceeding rmax receive maximum penalties. This mechanism resulted in a 16.33% gain in training accuracy and a 14.38% reduction in output token length.

https://github.com/Yuan-lab-LLM/Yuan3.0-Ultra/blob/main/Docs/Yuan3.0_Ultra%20Paper.pdf

Enterprise Benchmark Performance

Yuan3.0 Ultra was evaluated against several industry models, including GPT-5.2 and Gemini 3.1 Pro, across specialized enterprise benchmarks.

Benchmark Task Category Yuan3.0 Ultra Score Leading Competitor Score
Docmatix Multimodal RAG 67.4% 48.4% (GPT-5.2)
ChatRAG Text Retrieval (Avg) 68.2% 53.6% (Kimi K2.5)
MMTab Table Reasoning 62.3% 66.2% (Kimi K2.5)
SummEval Text Summarization 62.8% 49.9% (Claude Opus 4.6)
Spider 1.0 Text-to-SQL 83.9% 82.7% (Kimi K2.5)
BFCL V3 Tool Invocation 67.8% 78.8% (Gemini 3.1 Pro)

The results indicate that Yuan3.0 Ultra achieves state-of-the-art accuracy in multimodal retrieval (Docmatix) and long-context retrieval (ChatRAG) while maintaining robust performance in structured data processing and tool calling.


Check out the Paper and Repo. Also, feel free to follow us on Twitter and don’t forget to join our 120k+ ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.

The post YuanLab AI Releases Yuan 3.0 Ultra: A Flagship Multimodal MoE Foundation Model, Built for Stronger Intelligence and Unrivaled Efficiency appeared first on MarkTechPost.

Credit: Source link

ShareTweetSendSharePin

Related Posts

Anthropic launches Claude Marketplace, giving enterprises access to Claude-powered tools from Replit, GitLab, Harvey and more
AI & Technology

Anthropic launches Claude Marketplace, giving enterprises access to Claude-powered tools from Replit, GitLab, Harvey and more

March 7, 2026
Valve doesn’t sound confident the Steam Machine will ship in 2026
AI & Technology

Valve doesn’t sound confident the Steam Machine will ship in 2026

March 6, 2026
LangChain’s CEO argues that better models alone won’t get your AI agent to production
AI & Technology

LangChain’s CEO argues that better models alone won’t get your AI agent to production

March 6, 2026
Netflix’s version of Overcooked lets you play as Huntr/x
AI & Technology

Netflix’s version of Overcooked lets you play as Huntr/x

March 6, 2026
Next Post
StubHub Holdings, Inc. (STUB) Q4 2025 Earnings Call Transcript

StubHub Holdings, Inc. (STUB) Q4 2025 Earnings Call Transcript

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
LIVE: Trump meets tech giants to sign energy pledge | NBC News

LIVE: Trump meets tech giants to sign energy pledge | NBC News

March 6, 2026
Dow jumps 300 points, S&P 500 turns positive for week as investors look past Iran conflict: Live updates – CNBC

Dow jumps 300 points, S&P 500 turns positive for week as investors look past Iran conflict: Live updates – CNBC

March 4, 2026
S&P 500 futures slide after major averages rebound, traders' U.S.-Iran fears ease: Live updates – CNBC

S&P 500 futures slide after major averages rebound, traders' U.S.-Iran fears ease: Live updates – CNBC

March 5, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!