• bitcoinBitcoin(BTC)$76,219.000.27%
  • ethereumEthereum(ETH)$2,286.960.57%
  • tetherTether(USDT)$1.00-0.01%
  • rippleXRP(XRP)$1.37-0.46%
  • binancecoinBNB(BNB)$618.10-0.72%
  • usd-coinUSDC(USDC)$1.00-0.01%
  • solanaSolana(SOL)$83.33-0.09%
  • tronTRON(TRX)$0.3231310.08%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.030.29%
  • dogecoinDogecoin(DOGE)$0.1051796.32%
  • whitebitWhiteBIT Coin(WBT)$53.940.30%
  • USDSUSDS(USDS)$1.000.00%
  • leo-tokenLEO Token(LEO)$10.380.02%
  • HyperliquidHyperliquid(HYPE)$39.61-0.32%
  • cardanoCardano(ADA)$0.245340-0.24%
  • bitcoin-cashBitcoin Cash(BCH)$448.551.04%
  • moneroMonero(XMR)$376.35-1.50%
  • chainlinkChainlink(LINK)$9.15-0.59%
  • CantonCanton(CC)$0.1506240.82%
  • zcashZcash(ZEC)$324.15-2.93%
  • stellarStellar(XLM)$0.160324-1.21%
  • USD1USD1(USD1)$1.00-0.02%
  • daiDai(DAI)$1.000.00%
  • MemeCoreMemeCore(M)$3.39-4.12%
  • litecoinLitecoin(LTC)$55.841.63%
  • avalanche-2Avalanche(AVAX)$9.14-0.24%
  • hedera-hashgraphHedera(HBAR)$0.088684-0.09%
  • Ethena USDeEthena USDe(USDE)$1.000.03%
  • RainRain(RAIN)$0.0079557.79%
  • shiba-inuShiba Inu(SHIB)$0.0000060.58%
  • suiSui(SUI)$0.91-1.33%
  • paypal-usdPayPal USD(PYUSD)$1.000.00%
  • the-open-networkToncoin(TON)$1.321.99%
  • crypto-com-chainCronos(CRO)$0.068439-0.97%
  • Circle USYCCircle USYC(USYC)$1.120.00%
  • tether-goldTether Gold(XAUT)$4,540.18-0.63%
  • Global DollarGlobal Dollar(USDG)$1.000.00%
  • BittensorBittensor(TAO)$252.14-0.85%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • pax-goldPAX Gold(PAXG)$4,535.12-0.69%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.068219-6.92%
  • mantleMantle(MNT)$0.63-0.36%
  • polkadotPolkadot(DOT)$1.22-1.34%
  • uniswapUniswap(UNI)$3.21-0.54%
  • Pi NetworkPi Network(PI)$0.189405-1.36%
  • SkySky(SKY)$0.083068-4.86%
  • Falcon USDFalcon USD(USDF)$1.00-0.02%
  • okbOKB(OKB)$82.870.20%
  • AsterAster(ASTER)$0.675.36%
  • nearNEAR Protocol(NEAR)$1.33-1.62%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

NVIDIA Releases Nemotron-Cascade 2: An Open 30B MoE with 3B Active Parameters, Delivering Better Reasoning and Strong Agentic Capabilities

March 20, 2026
in AI & Technology
Reading Time: 7 mins read
A A
NVIDIA Releases Nemotron-Cascade 2: An Open 30B MoE with 3B Active Parameters, Delivering Better Reasoning and Strong Agentic Capabilities
ShareShareShareShareShare

NVIDIA has announced the release of Nemotron-Cascade 2, an open-weight 30B Mixture-of-Experts (MoE) model with 3B activated parameters. The model focuses on maximizing ‘intelligence density,’ delivering advanced reasoning capabilities at a fraction of the parameter scale used by frontier models. Nemotron-Cascade 2 is the second open-weight LLM to achieve Gold Medal-level performance in the 2025 International Mathematical Olympiad (IMO), the International Olympiad in Informatics (IOI), and the ICPC World Finals.

https://research.nvidia.com/labs/nemotron/files/Nemotron-Cascade-2.pdf

Targeted Performance and Strategic Trade-offs

The primary value proposition of Nemotron-Cascade 2 is its specialized performance in mathematical reasoning, coding, alignment, and instruction following. While it achieves state-of-the-art results in these key reasoning-intensive domains, it is surely not a ‘blanket win’ across all benchmarks.

YOU MAY ALSO LIKE

Definity embeds agents inside Spark pipelines to catch failures before they reach agentic AI systems

Poolside AI Introduces Laguna XS.2 and M.1: Agentic Coding Models Reaching 68.2% and 72.5% on SWE-bench Verified

The model’s performance excels in several targeted categories compared to the recently released Qwen3.5-35B-A3B (February 2026) and the larger Nemotron-3-Super-120B-A12B:

  • Mathematical Reasoning: Outperforms Qwen3.5-35B-A3B on AIME 2025 (92.4 vs. 91.9) and HMMT Feb25 (94.6 vs. 89.0).
  • Coding: Leads on LiveCodeBench v6 (87.2 vs. 74.6) and IOI 2025 (439.28 vs. 348.6+).
  • Alignment and Instruction Following: Scores significantly higher on ArenaHard v2 (83.5 vs. 65.4+) and IFBench (82.9 vs. 70.2).
https://research.nvidia.com/labs/nemotron/files/Nemotron-Cascade-2.pdf

Technical Architecture: Cascade RL and Multi-domain On-Policy Distillation (MOPD)

The model’s reasoning capabilities stem from its post-training pipeline, starting from the Nemotron-3-Nano-30B-A3B-Base model.

1. Supervised Fine-Tuning (SFT)

During SFT, NVIDIA research team utilized a meticulously curated dataset where samples were packed into sequences of up to 256K tokens. The dataset included:

  • 1.9M Python reasoning traces and 1.3M Python tool-calling samples for competitive coding.
  • 816K samples for mathematical natural language proofs.
  • A specialized Software Engineering (SWE) blend consisting of 125K agentic and 389K agentless samples.

2. Cascade Reinforcement Learning

Following SFT, the model underwent Cascade RL, which applies sequential, domain-wise training. This prevents catastrophic forgetting by allowing hyperparameters to be tailored to specific domains without destabilizing others. The pipeline includes stages for instruction-following (IF-RL), multi-domain RL, RLHF, long-context RL, and specialized Code and SWE RL.

https://research.nvidia.com/labs/nemotron/files/Nemotron-Cascade-2.pdf

3. Multi-Domain On-Policy Distillation (MOPD)

A critical innovation in Nemotron-Cascade 2 is the integration of MOPD during the Cascade RL process. MOPD assembly uses the best-performing intermediate ‘teacher’ models—already derived from the same SFT initialization—to provide a dense token-level distillation advantage. This advantage is defined mathematically as:

$$a_{t}^{MOPD}=log~\pi^{domain_{t}}(y_{t}|s_{t})-log~\pi^{train}(y_{t}|s_{t})$$

/* <![CDATA[ */
wp.i18n.setLocaleData( { 'text direction\u0004ltr': [ 'ltr' ] } );
//# sourceURL=wp-i18n-js-after
/* ]]> */

The research team found that MOPD is substantially more sample-efficient than sequence-level reward algorithms like Group Relative Policy Optimization (GRPO). For instance, on AIME25, MOPD reached teacher-level performance (92.0) within 30 steps, while GRPO achieved only 91.0 after matching those steps.

Inference Features and Agentic Interaction

Nemotron-Cascade 2 supports two primary operating modes through its chat template:

  • Thinking Mode: Initiated by a single <think> token, followed by a newline. This activates deep reasoning for complex math and code tasks.
  • Non-Thinking Mode: Activated by prepending an empty <think></think> block for more efficient, direct responses.

For agentic tasks, the model utilizes a structured tool-calling protocol within the system prompt. Available tools are listed within <tools> tags, and the model is instructed to perform tool calls wrapped in <tool_call> tags to ensure verifiable execution feedback.

By focusing on ‘intelligence density,’ Nemotron-Cascade 2 demonstrates that specialized reasoning capabilities once thought to be the exclusive domain of frontier-scale models are achievable at a 30B scale through domain-specific reinforcement learning.


Check out Paper and Model on HF. Also, feel free to follow us on Twitter and don’t forget to join our 120k+ ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.

The post NVIDIA Releases Nemotron-Cascade 2: An Open 30B MoE with 3B Active Parameters, Delivering Better Reasoning and Strong Agentic Capabilities appeared first on MarkTechPost.

Credit: Source link

ShareTweetSendSharePin

Related Posts

Definity embeds agents inside Spark pipelines to catch failures before they reach agentic AI systems
AI & Technology

Definity embeds agents inside Spark pipelines to catch failures before they reach agentic AI systems

April 29, 2026
Poolside AI Introduces Laguna XS.2 and M.1: Agentic Coding Models Reaching 68.2% and 72.5% on SWE-bench Verified
AI & Technology

Poolside AI Introduces Laguna XS.2 and M.1: Agentic Coding Models Reaching 68.2% and 72.5% on SWE-bench Verified

April 29, 2026
How to build custom reasoning agents with a fraction of the compute
AI & Technology

How to build custom reasoning agents with a fraction of the compute

April 28, 2026
American AI startup Poolside launches free, high-performing open model Laguna XS.2 for local agentic coding
AI & Technology

American AI startup Poolside launches free, high-performing open model Laguna XS.2 for local agentic coding

April 28, 2026
Next Post
Elon Musk misled investors during his Twitter takeover, jury finds

Elon Musk misled investors during his Twitter takeover, jury finds

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
Fiancée of man wounded in ICE shooting speaks out

Fiancée of man wounded in ICE shooting speaks out

April 27, 2026
How US investors should think about tariffs as Trump braces for a fresh round of haggling

How US investors should think about tariffs as Trump braces for a fresh round of haggling

April 27, 2026
Ex-Michigan coach gets probation after arrest over confrontation at staffer’s home

Ex-Michigan coach gets probation after arrest over confrontation at staffer’s home

April 24, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!