• bitcoinBitcoin(BTC)$61,724.00-0.50%
  • ethereumEthereum(ETH)$1,627.87-1.82%
  • tetherTether(USDT)$1.00-0.01%
  • binancecoinBNB(BNB)$586.47-1.62%
  • usd-coinUSDC(USDC)$1.000.00%
  • rippleXRP(XRP)$1.10-3.90%
  • solanaSolana(SOL)$63.23-3.51%
  • tronTRON(TRX)$0.321253-0.43%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.03-0.75%
  • dogecoinDogecoin(DOGE)$0.082706-3.21%
  • HyperliquidHyperliquid(HYPE)$53.65-9.61%
  • USDSUSDS(USDS)$1.00-0.01%
  • leo-tokenLEO Token(LEO)$9.530.75%
  • RainRain(RAIN)$0.0131752.68%
  • zcashZcash(ZEC)$412.34-7.81%
  • CantonCanton(CC)$0.1648220.62%
  • stellarStellar(XLM)$0.183569-6.18%
  • moneroMonero(XMR)$325.125.25%
  • whitebitWhiteBIT Coin(WBT)$50.58-1.58%
  • cardanoCardano(ADA)$0.160017-4.55%
  • chainlinkChainlink(LINK)$7.58-4.03%
  • Ethena USDeEthena USDe(USDE)$1.000.02%
  • USD1USD1(USD1)$1.000.01%
  • the-open-networkToncoin(TON)$1.61-6.33%
  • daiDai(DAI)$1.000.00%
  • bitcoin-cashBitcoin Cash(BCH)$195.39-4.91%
  • MemeCoreMemeCore(M)$2.78-5.75%
  • hedera-hashgraphHedera(HBAR)$0.077586-3.19%
  • litecoinLitecoin(LTC)$41.61-4.04%
  • Circle USYCCircle USYC(USYC)$1.130.00%
  • suiSui(SUI)$0.73-3.34%
  • paypal-usdPayPal USD(PYUSD)$1.00-0.01%
  • avalanche-2Avalanche(AVAX)$6.43-3.60%
  • shiba-inuShiba Inu(SHIB)$0.000005-1.78%
  • crypto-com-chainCronos(CRO)$0.059294-1.23%
  • Global DollarGlobal Dollar(USDG)$1.000.04%
  • nearNEAR Protocol(NEAR)$1.98-10.93%
  • LABLAB(LAB)$8.15-15.42%
  • tether-goldTether Gold(XAUT)$4,062.76-4.29%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • Ondo US Dollar YieldOndo US Dollar Yield(USDY)$1.130.66%
  • AudieraAudiera(BEAT)$7.3254.01%
  • BittensorBittensor(TAO)$201.21-4.44%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.0594508.32%
  • pax-goldPAX Gold(PAXG)$4,069.95-4.33%
  • mantleMantle(MNT)$0.53-0.94%
  • Ripple USDRipple USD(RLUSD)$1.000.00%
  • AsterAster(ASTER)$0.61-2.94%
  • OndoOndo(ONDO)$0.331882-9.45%
  • HTX DAOHTX DAO(HTX)$0.000002-0.55%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Zhipu AI Just Released GLM-4.5 Series: Redefining Open-Source Agentic AI with Hybrid Reasoning

July 28, 2025
in AI & Technology
Reading Time: 8 mins read
A A
Zhipu AI Just Released GLM-4.5 Series: Redefining Open-Source Agentic AI with Hybrid Reasoning
ShareShareShareShareShare

The landscape of AI foundation models is evolving rapidly, but few entries have been as significant in 2025 as the arrival of Z.ai’s GLM-4.5 series: GLM-4.5 and its lighter sibling GLM-4.5-Air. Unveiled by Zhipu AI, these models set remarkably high standards for unified agentic capabilities and open access, aiming to bridge the gap between reasoning, coding, and intelligent agents—and to do so at both massive and manageable scales.

Model Architecture and Parameters

Model Total Parameters Active Parameters Notability
GLM-4.5 355B 32B Among the largest open weights, top benchmark performance
GLM-4.5-Air 106B 12B Compact, efficient, targeting mainstream hardware compatibility

GLM-4.5 is built on a Mixture of Experts (MoE) architecture, with a total of 355 billion parameters (32 billion active at a time). This model is crafted for cutting-edge performance, targeting high-demand reasoning and agentic applications. GLM-4.5-Air, with 106B total and 12B active parameters, provides similar capabilities with a dramatically reduced hardware and compute footprint.

YOU MAY ALSO LIKE

Google AI Releases DiffusionGemma, a 26B MoE Open Model Using Text Diffusion for Up to 4x Faster Generation

MassMutual’s AI strategy: 12-month contracts, 30% productivity gains, zero lock-in

Hybrid Reasoning: Two Modes in One Framework

Both models introduce a hybrid reasoning approach:

  • Thinking Mode: Enables complex step-by-step reasoning, tool use, multi-turn planning, and autonomous agent tasks.
  • Non-Thinking Mode: Optimized for instant, stateless responses, making the models versatile for conversational and quick-reaction use cases.

This dual-mode design addresses both sophisticated cognitive workflows and low-latency interactive needs within a single model, empowering next-generation AI agents.

Performance Benchmarks

Z.ai benchmarked GLM-4.5 on 12 industry-standard tests (including MMLU, GSM8K, HumanEval):

  • GLM-4.5: Average benchmark score of 63.2, ranked third overall (second globally, top among all open-source models).
  • GLM-4.5-Air: Delivers a competitive 59.8, establishing itself as the leader among ~100B-parameter models.
  • Outperforms notable rivals in specific areas: tool-calling success rate of 90.6%, outperforming Claude 3.5 Sonnet and Kimi K2.
  • Particularly strong results in Chinese-language tasks and coding, with consistent SOTA results across open benchmarks.

Agentic Capabilities and Architecture

GLM-4.5 advances “Agent-native” design: core agentic functionalities (reasoning, planning, action execution) are built directly into the model architecture. This means:

  • Multi-step task decomposition and planning
  • Tool use and integration with external APIs
  • Complex data visualization and workflow management
  • Native support for reasoning and perception-action cycles

These capabilities enable end-to-end agentic applications previously reserved for smaller, hard-coded frameworks or closed-source APIs.

Efficiency, Speed, and Cost

  • Speculative Decoding & Multi-Token Prediction (MTP): With features like MTP, GLM-4.5 achieves 2.5×–8× faster inference than previous models, with generation speeds >100 tokens/sec on the high-speed API and up to 200 tokens/sec claimed in practice.
  • Memory & Hardware: GLM-4.5-Air’s 12B active design is compatible with consumer GPUs (32–64GB VRAM) and can be quantized to fit broader hardware. This enables high-performance LLMs to run locally for advanced users.
  • Pricing: API calls start as low as $0.11 per million input tokens and $0.28 per million output tokens—industry-leading prices for the scale and quality offered.

Open-Source Access & Ecosystem

A keystone of the GLM-4.5 series is its MIT open-source license: the base models, hybrid (thinking/non-thinking) models, and FP8 versions are all released for unrestricted commercial use and secondary development. Code, tool parsers, and reasoning engines are integrated into major LLM frameworks, including transformers, vLLM, and SGLang, with detailed repositories available on GitHub and Hugging Face.

The models can be used through major inference engines, with fine-tuning and on-premise deployment fully supported. This level of openness and flexibility contrasts sharply with the increasingly closed stance of Western rivals.

Key Technical Innovations

  • Multi-Token Prediction (MTP) layer for speculative decoding, dramatically boosting inference speed on CPUs and GPUs.
  • Unified architecture for reasoning, coding, and multimodal perception-action workflows.
  • Trained on 15 trillion tokens, with support for up to 128k input and 96k output context windows.
  • Immediate compatibility with research and production tooling, including instructions for tuning and adapting the models for new use cases.

In summary, GLM-4.5 and GLM-4.5-Air represent a major leap for open-source, agentic, and reasoning-focused foundation models. They set new standards for accessibility, performance, and unified cognitive capabilities—providing a robust backbone for the next generation of intelligent agents and developer applications.


Check out the GLM 4.5, GLM 4.5 Air, GitHub Page and Technical details. All credit for this research goes to the researchers of this project. Also, feel free to follow us on Twitter and don’t forget to join our 100k+ ML SubReddit and Subscribe to our Newsletter.


Asif Razzaq is the CEO of Marktechpost Media Inc.. As a visionary entrepreneur and engineer, Asif is committed to harnessing the potential of Artificial Intelligence for social good. His most recent endeavor is the launch of an Artificial Intelligence Media Platform, Marktechpost, which stands out for its in-depth coverage of machine learning and deep learning news that is both technically sound and easily understandable by a wide audience. The platform boasts of over 2 million monthly views, illustrating its popularity among audiences.

Credit: Source link

ShareTweetSendSharePin

Related Posts

Google AI Releases DiffusionGemma, a 26B MoE Open Model Using Text Diffusion for Up to 4x Faster Generation
AI & Technology

Google AI Releases DiffusionGemma, a 26B MoE Open Model Using Text Diffusion for Up to 4x Faster Generation

June 10, 2026
MassMutual’s AI strategy: 12-month contracts, 30% productivity gains, zero lock-in
AI & Technology

MassMutual’s AI strategy: 12-month contracts, 30% productivity gains, zero lock-in

June 10, 2026
After Belfast Riots, UK Reminds Social Platforms They’re Obligated To Remove Hateful Content
AI & Technology

After Belfast Riots, UK Reminds Social Platforms They’re Obligated To Remove Hateful Content

June 10, 2026
Insta360’s Luna Ultra Takes On DJI’s Osmo Pocket Gimbal Cameras
AI & Technology

Insta360’s Luna Ultra Takes On DJI’s Osmo Pocket Gimbal Cameras

June 10, 2026
Next Post
Trump says he struck a deal to send weapons to NATO for Ukraine

Trump says he struck a deal to send weapons to NATO for Ukraine

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
S&P 500 price target by year-end? Tiffany McGhee goes rapid-fire in a game of “This or That.”

S&P 500 price target by year-end? Tiffany McGhee goes rapid-fire in a game of “This or That.”

June 4, 2026
I Didn’t Know AI Could Do THIS

I Didn’t Know AI Could Do THIS

June 5, 2026
Vance says Trump remark on Americans’ finances was misinterpreted

Vance says Trump remark on Americans’ finances was misinterpreted

June 8, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!