TradePoint.io

Moonshot AI Releases Kimi K2.7-Code: a Coding Model Reporting +21.8% on Kimi Code Bench v2 Over K2.6

June 13, 2026
in AI & Technology

This week, Moonshot AI released Kimi K2.7-Code. It is a coding-focused, agentic model. The model weights ship on Hugging Face under a Modified MIT license. You can also reach it through the Kimi API and Kimi Code.

K2.7-Code targets long-horizon software engineering, not general chat. It plans, edits, runs tools, and debugs across many steps. Moonshot pairs the model with a subscription coding platform.

Kimi K2.7-Code

K2.7-Code is a Mixture-of-Experts model. It holds 1T total parameters and activates 32B per token. The design uses 384 experts, with 8 routed experts selected per token plus 1 shared expert. It has 61 layers, including 1 dense layer.
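To make the sparsity concrete, the short calculation below works out what share of the model is active per token, using only the figures quoted above. It is a back-of-the-envelope sketch, and it counts only the 8 routed experts against the 384 total, on the assumption that the shared expert sits outside that pool.

```python
# Figures quoted in the article.
TOTAL_PARAMS = 1_000_000_000_000   # 1T total parameters
ACTIVE_PARAMS = 32_000_000_000     # 32B activated per token
NUM_EXPERTS = 384
ROUTED_PER_TOKEN = 8               # shared expert not counted (assumption)

# Share of parameters that take part in a single token's forward pass.
param_share = ACTIVE_PARAMS / TOTAL_PARAMS

# Share of the expert pool that the router selects for a single token.
routed_share = ROUTED_PER_TOKEN / NUM_EXPERTS

print(f"Active parameters per token: {param_share:.1%}")
print(f"Routed experts per token:    {routed_share:.1%}")
```

Roughly 3% of the parameters do the work for any one token, which is why per-token compute is far lower than the 1T headline suggests, while the memory footprint still reflects the full model.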

Attention uses MLA, and the feed-forward path uses SwiGLU. A MoonViT vision encoder adds 400M parameters for image and video input. The model ships with native INT4 quantization. The context window is 256K tokens (262,144).

Two constraints matter. Thinking mode is mandatory; disabling it returns an API error. Sampling is fixed at temperature 1.0, top_p 0.95, n 1, and penalties 0.0. The default max output is 32,768 tokens.
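Because a non-default sampling value fails the request, it can help to catch the mistake before sending anything. The helper below is a hypothetical client-side check, not part of any Kimi SDK. The names `presence_penalty` and `frequency_penalty` are an assumption based on the usual OpenAI-compatible parameter names; the article only says "penalties".

```python
# Fixed values as reported in the article. The two penalty parameter
# names are assumed from the OpenAI-compatible request format.
FIXED_SAMPLING = {
    "temperature": 1.0,
    "top_p": 0.95,
    "n": 1,
    "presence_penalty": 0.0,
    "frequency_penalty": 0.0,
}

def check_request(params: dict) -> list[str]:
    """Return a list of problems; an empty list means nothing was overridden."""
    problems = []
    for name, fixed in FIXED_SAMPLING.items():
        # Omitting a parameter is fine; only a conflicting value is a problem.
        if name in params and params[name] != fixed:
            problems.append(f"{name} must be {fixed} (got {params[name]})")
    return problems

ok = check_request({"model": "kimi-k2.7-code"})
bad = check_request({"temperature": 0.2, "top_p": 0.95})
print(ok)
print(bad)
```

The simplest way to stay inside the constraints is to leave these parameters out of the request entirely, as the quickstart in this article does.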

You can self-host with vLLM, SGLang, or KTransformers. The Hugging Face repository is large, roughly 595 GB on disk. This is a server-class deployment target, not a laptop model.
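A rough size check shows why this is a server-class target. The sketch below estimates raw weight storage from the parameter count and bit width alone; it ignores quantization scales, any layers kept at higher precision, and the vision encoder, so it should be read as a lower bound rather than an explanation of the 595 GB figure.

```python
def weight_gigabytes(num_params: int, bits_per_param: int) -> float:
    """Raw weight storage in decimal gigabytes, with no metadata or overhead."""
    return num_params * bits_per_param / 8 / 1e9

TOTAL_PARAMS = 1_000_000_000_000  # 1T, as quoted in the article
REPORTED_REPO_GB = 595            # approximate repository size from the article

int4_gb = weight_gigabytes(TOTAL_PARAMS, 4)
bf16_gb = weight_gigabytes(TOTAL_PARAMS, 16)

print(f"INT4 lower bound: {int4_gb:.0f} GB")
print(f"16-bit equivalent: {bf16_gb:.0f} GB")
print(f"Reported repo size minus INT4 bound: {REPORTED_REPO_GB - int4_gb:.0f} GB")
```

Even at 4 bits per weight, the model needs on the order of half a terabyte of storage before any KV cache or runtime overhead is counted.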

Benchmark

The Moonshot team published six benchmark rows. They compare K2.7-Code against K2.6, GPT-5.5, and Claude Opus 4.8. K2.7-Code beats K2.6 on every row. The largest absolute jump is on Kimi Code Bench v2, from 50.9 to 62.0 (11.1 points); the largest relative gain is on MLS Bench Lite, at +31.5%.

| Benchmark | Kimi K2.6 | Kimi K2.7-Code | GPT-5.5 | Claude Opus 4.8 | K2.7 vs K2.6 |
|---|---|---|---|---|---|
| Kimi Code Bench v2 | 50.9 | 62.0 | 69.0 | 67.4 | +21.8% |
| Program Bench | 48.3 | 53.6 | 69.1 | 63.8 | +11.0% |
| MLS Bench Lite | 26.7 | 35.1 | 35.5 | 42.8 | +31.5% |
| Kimi Claw 24/7 Bench | 42.9 | 46.9 | 52.8 | 50.4 | +9.3% |
| MCP Atlas | 69.4 | 76.0 | 79.4 | 81.3 | +9.5% |
| MCP Mark Verified | 72.8 | 81.1 | 92.9 | 76.4 | +11.4% |

K2.7-Code trails GPT-5.5 on every row and trails Opus 4.8 on five of six. It does beat Opus 4.8 on MCP Mark Verified, 81.1 versus 76.4, and lands close to GPT-5.5 on MLS Bench Lite, 35.1 versus 35.5. The harnesses differ: K2.7-Code ran in Kimi Code CLI, GPT-5.5 in Codex xhigh, and Opus 4.8 in Claude Code xhigh.
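The "K2.7 vs K2.6" percentages are relative gains, not point differences, which is easy to misread. The snippet below recomputes them from the two Kimi score columns reported by Moonshot; nothing here is independently measured.

```python
# (K2.6 score, K2.7-Code score) pairs as reported by Moonshot.
scores = {
    "Kimi Code Bench v2": (50.9, 62.0),
    "Program Bench": (48.3, 53.6),
    "MLS Bench Lite": (26.7, 35.1),
    "Kimi Claw 24/7 Bench": (42.9, 46.9),
    "MCP Atlas": (69.4, 76.0),
    "MCP Mark Verified": (72.8, 81.1),
}

def relative_gain_pct(old: float, new: float) -> float:
    """Percentage improvement of new over old."""
    return (new - old) / old * 100

for name, (k26, k27) in scores.items():
    points = k27 - k26
    print(f"{name}: {points:+.1f} points, {relative_gain_pct(k26, k27):+.1f}%")
```

A low starting score inflates the relative figure: MLS Bench Lite gains fewer points than Kimi Code Bench v2 but shows a larger percentage.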

Reasoning-Token Efficiency: A Cost Claim, Not Just Quality

The Moonshot team reports about 30% lower reasoning-token usage than K2.6 and frames this as ‘less overthinking.’

Reasoning tokens are billed as output tokens on most price cards. Agentic coding runs hundreds or thousands of steps. Each plan, retry, and verification pays the thinking cost again. A 30% cut adds up across a long run.

The effect lands in three places at once. First, lower output-token cost per task. Second, faster steps, which helps interactive CLI sessions. Third, more steps before hitting context limits.
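The third effect, more steps before the context fills, can be illustrated with simple division. The per-step token counts below are invented for illustration, and the sketch assumes every step's reasoning and other tokens stay in context, which a real agent harness may not do.

```python
CONTEXT_WINDOW = 262_144  # 256K tokens, as quoted in the article

def steps_until_full(reasoning_per_step: int, other_per_step: int,
                     window: int = CONTEXT_WINDOW) -> int:
    """Whole steps that fit if every step's tokens accumulate in context."""
    return window // (reasoning_per_step + other_per_step)

# Hypothetical workload: 2,000 reasoning tokens and 2,000 other tokens per step.
baseline_reasoning = 2_000
other_tokens = 2_000

# Apply the reported ~30% reduction to the reasoning portion only.
reduced_reasoning = baseline_reasoning * 70 // 100

baseline_steps = steps_until_full(baseline_reasoning, other_tokens)
reduced_steps = steps_until_full(reduced_reasoning, other_tokens)

print(f"Baseline: {baseline_steps} steps")
print(f"With 30% fewer reasoning tokens: {reduced_steps} steps")
```

The gain in steps is smaller than 30% because only the reasoning share of each step shrinks; the larger that share, the closer the benefit gets to the headline figure.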

Use Cases With Examples

  • Repo-scale refactors are the main use case. Point the agent at a failing test suite. It reads files, edits across modules, then reruns tests until green.
  • Code review is a second fit. Feed a pull request diff and ask for risk analysis. The 256K window holds large diffs, logs, and related files together.
  • MCP tool-use workflows are a third fit. K2.7-Code scored 81.1 on MCP Mark Verified. That suite tests correct tool invocation through the Model Context Protocol. Think CI checks, ticket updates, and file edits in one loop.
  • Long-context analysis is a fourth fit. The model accepts text, image, and video input. Documentation, screenshots, and a recorded repro can share one prompt.
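The "rerun tests until green" loop from the first use case can be sketched as a small control loop. Both callables are hypothetical stand-ins: in a real setup `run_tests` might shell out to a test runner and `apply_fix` might send the failure log to the model and write its edits to disk. The fakes here only make the sketch runnable.

```python
from typing import Callable

def fix_until_green(run_tests: Callable[[], tuple[bool, str]],
                    apply_fix: Callable[[str], None],
                    max_rounds: int = 5) -> int:
    """Run tests, apply a fix on failure, repeat. Returns fix rounds used."""
    for round_no in range(max_rounds + 1):
        passed, log = run_tests()
        if passed:
            return round_no
        if round_no == max_rounds:
            break  # budget exhausted; do not apply another fix
        apply_fix(log)
    raise RuntimeError(f"tests still failing after {max_rounds} fix rounds")

# Fake project state: two bugs, each fix round removes one.
state = {"bugs": 2}

def fake_run_tests() -> tuple[bool, str]:
    return state["bugs"] == 0, f"{state['bugs']} failing test(s)"

def fake_apply_fix(log: str) -> None:
    state["bugs"] -= 1

rounds = fix_until_green(fake_run_tests, fake_apply_fix)
print(f"Green after {rounds} fix round(s)")
```

The round cap matters in practice: without it, an agent that cannot fix a test keeps paying for reasoning tokens on every retry.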

Marktechpost’s Interactive Explorer

[Interactive widget: Kimi K2.7-Code Interactive Explorer, with Benchmarks, Cost Calculator, and Specs tabs. Company-reported benchmarks and official API pricing. Released June 12, 2026. Verified June 12, 2026.]

Benchmarks source: Moonshot AI Kimi K2.7-Code model card. K2.7-Code ran in Kimi Code CLI; GPT-5.5 in Codex xhigh; Claude Opus 4.8 in Claude Code xhigh. First-party numbers, not an independent leaderboard.

Cost calculator rates: cached input $0.19 / 1M, cache-miss input $0.95 / 1M, output $4.00 / 1M (official Kimi pricing). The savings line illustrates K2.7-Code’s reported ~30% lower reasoning-token usage vs K2.6, applied to the reasoning share of output. Estimate only.

Specs source: Kimi K2.7-Code Hugging Face model card and Kimi API docs.
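For readers without the interactive widget, a rough stand-in for the cost calculator is sketched below using the quoted rates. The token volumes, cache-hit ratio, and reasoning share are made-up inputs, and the savings formula is one plausible reading of "applied to the reasoning share of output", not the explorer's actual logic.

```python
# Rates quoted in the article, in USD per 1M tokens.
CACHED_INPUT_RATE = 0.19
CACHE_MISS_INPUT_RATE = 0.95
OUTPUT_RATE = 4.00

def estimate_cost(input_tokens: int, output_tokens: int,
                  cache_hit_ratio: float) -> dict:
    """Split input into cached and cache-miss tokens and price each part."""
    cached = input_tokens * cache_hit_ratio
    missed = input_tokens - cached
    input_cost = (cached * CACHED_INPUT_RATE
                  + missed * CACHE_MISS_INPUT_RATE) / 1_000_000
    output_cost = output_tokens * OUTPUT_RATE / 1_000_000
    return {"input": input_cost, "output": output_cost,
            "total": input_cost + output_cost}

def reasoning_savings(output_tokens: int, reasoning_share: float,
                      reduction: float = 0.30) -> float:
    """Dollar value of cutting the reasoning part of output by `reduction`."""
    return output_tokens * reasoning_share * reduction * OUTPUT_RATE / 1_000_000

# Hypothetical month: 1B input tokens (half cached), 100M output tokens,
# with 60% of output assumed to be reasoning.
month = estimate_cost(1_000_000_000, 100_000_000, cache_hit_ratio=0.5)
saved = reasoning_savings(100_000_000, reasoning_share=0.6)

print(f"Input ${month['input']:.2f}, output ${month['output']:.2f}, "
      f"total ${month['total']:.2f}")
print(f"Illustrative reasoning savings: ${saved:.2f}")
```

With these inputs the cache-hit ratio moves the bill more than the reasoning cut does, so prompt caching is worth measuring alongside token efficiency.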

A Minimal Quickstart

The Kimi API is OpenAI-compatible. The model string is kimi-k2.7-code. Do not override the fixed sampling parameters, or the request returns an error.

import os
from openai import OpenAI

# Base URL and key per the Kimi API docs at platform.moonshot.ai
client = OpenAI(
    api_key=os.environ.get("MOONSHOT_API_KEY"),
    base_url="https://api.moonshot.ai/v1",
)

messages = [
    {"role": "system", "content": "You are a coding agent."},
    {"role": "user", "content": "Refactor utils.py to remove duplicate code."},
]

resp = client.chat.completions.create(
    model="kimi-k2.7-code",
    messages=messages,
    max_tokens=32768,  # default cap; also the maximum
    # thinking is enabled by default and cannot be disabled.
    # temperature (1.0), top_p (0.95), n (1), and penalties (0.0) are
    # fixed server-side. Passing any other value returns an error.
)

msg = resp.choices[0].message
print(msg.content)

# Multi-step tool calls: append the full assistant message so that
# reasoning_content is preserved. Dropping it errors on the next turn.
# messages.append(msg.model_dump())

Two tool-use rules come from the docs. First, keep reasoning_content from the current turn in context. Second, set tool_choice only to "auto" or "none".
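Both rules can be enforced with plain dictionary handling before the next request is sent. The sketch below assumes the usual OpenAI-compatible message shapes (an assistant message with `tool_calls`, followed by `role: "tool"` messages keyed by `tool_call_id`). The helper names and the sample tool are hypothetical, and no network call is made.

```python
import json

ALLOWED_TOOL_CHOICE = {"auto", "none"}

def check_tool_choice(value: str) -> str:
    """Reject tool_choice values other than "auto" or "none"."""
    if value not in ALLOWED_TOOL_CHOICE:
        raise ValueError(f'tool_choice must be "auto" or "none", got {value!r}')
    return value

def next_turn_messages(messages: list, assistant_msg: dict,
                       tool_outputs: dict) -> list:
    """Append the full assistant turn plus one tool message per tool call."""
    if "reasoning_content" not in assistant_msg:
        raise ValueError("assistant message is missing reasoning_content")
    updated = list(messages)
    updated.append(assistant_msg)  # whole message, reasoning_content included
    for call in assistant_msg.get("tool_calls") or []:
        updated.append({
            "role": "tool",
            "tool_call_id": call["id"],
            "content": json.dumps(tool_outputs[call["id"]]),
        })
    return updated

# Hand-written sample turn; a real one would come from msg.model_dump().
history = [{"role": "user", "content": "Run the tests."}]
assistant = {
    "role": "assistant",
    "content": "",
    "reasoning_content": "I should run the test suite first.",
    "tool_calls": [{"id": "call_1", "type": "function",
                    "function": {"name": "run_tests", "arguments": "{}"}}],
}
history = next_turn_messages(history, assistant, {"call_1": {"failed": 0}})
print(len(history), history[-1]["tool_call_id"])
```

Raising on a missing `reasoning_content` locally surfaces the problem at the point where the field was dropped, rather than as an API error one turn later.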

How K2.7-Code Compares

| Model | License | Params | Context | API price (in / out per 1M) |
|---|---|---|---|---|
| Kimi K2.7-Code | Modified MIT (open) | 1T total / 32B active | 256K | $0.95 / $4.00 |
| Kimi K2.6 | Open-weight | 1T-class MoE | 256K | ~$0.67–0.95 / ~$3.39–4.00 |
| GPT-5.5 | Closed | Not disclosed | Not stated | Not in Moonshot table |
| Claude Opus 4.8 | Closed | Not disclosed | 1M | $5.00 / $25.00 |
| Qwen3-Coder-480B-A35B | Open (Qwen license) | 480B / 35B active | 256K native | Varies by host |

K2.7-Code lists $0.19 per 1M for cached input.

Strengths and Weaknesses

Strengths:

  • Open weights under Modified MIT, with a real self-host path.
  • Broad, consistent gains over K2.6 on coding and agent evals.
  • Low API pricing relative to closed frontier models.
  • Beats Opus 4.8 on the MCP Mark Verified benchmark (company-reported).

Weaknesses:

  • All headline numbers are first-party at launch.
  • Thinking mode cannot be disabled.
  • Sampling controls are locked to fixed values.
  • Multi-step tool calls must preserve reasoning_content.
  • 595 GB weights make self-hosting a serious commitment.

Key Takeaways

  • All headline benchmarks are vendor-run; independent results are pending.
  • K2.7-Code is open-weight, coding-specialized, and built on Kimi K2.6.
  • Moonshot reports +21.8% on Kimi Code Bench v2 over K2.6.
  • Moonshot reports that the model uses roughly 30% fewer reasoning tokens than K2.6.


