• bitcoinBitcoin(BTC)$77,226.00-0.43%
  • ethereumEthereum(ETH)$2,121.93-0.18%
  • tetherTether(USDT)$1.00-0.01%
  • binancecoinBNB(BNB)$656.310.51%
  • rippleXRP(XRP)$1.36-1.03%
  • usd-coinUSDC(USDC)$1.00-0.01%
  • solanaSolana(SOL)$87.180.89%
  • tronTRON(TRX)$0.3643340.88%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.03-0.89%
  • dogecoinDogecoin(DOGE)$0.1056830.48%
  • HyperliquidHyperliquid(HYPE)$59.052.14%
  • zcashZcash(ZEC)$640.30-3.62%
  • USDSUSDS(USDS)$1.000.00%
  • cardanoCardano(ADA)$0.2511851.01%
  • leo-tokenLEO Token(LEO)$10.01-0.53%
  • whitebitWhiteBIT Coin(WBT)$56.94-0.41%
  • bitcoin-cashBitcoin Cash(BCH)$378.680.70%
  • chainlinkChainlink(LINK)$9.832.22%
  • moneroMonero(XMR)$385.93-3.73%
  • CantonCanton(CC)$0.154454-0.74%
  • the-open-networkToncoin(TON)$1.98-2.53%
  • stellarStellar(XLM)$0.1473401.72%
  • USD1USD1(USD1)$1.00-0.07%
  • Ethena USDeEthena USDe(USDE)$1.00-0.02%
  • suiSui(SUI)$1.10-0.31%
  • daiDai(DAI)$1.000.00%
  • litecoinLitecoin(LTC)$54.120.22%
  • avalanche-2Avalanche(AVAX)$9.491.22%
  • hedera-hashgraphHedera(HBAR)$0.0900041.47%
  • MemeCoreMemeCore(M)$2.870.95%
  • paypal-usdPayPal USD(PYUSD)$1.00-0.01%
  • RainRain(RAIN)$0.007488-0.23%
  • shiba-inuShiba Inu(SHIB)$0.0000060.48%
  • crypto-com-chainCronos(CRO)$0.0694490.30%
  • Circle USYCCircle USYC(USYC)$1.120.00%
  • nearNEAR Protocol(NEAR)$2.2528.63%
  • BittensorBittensor(TAO)$281.61-0.17%
  • Global DollarGlobal Dollar(USDG)$1.000.00%
  • tether-goldTether Gold(XAUT)$4,520.97-0.05%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • uniswapUniswap(UNI)$3.59-0.88%
  • polkadotPolkadot(DOT)$1.324.31%
  • mantleMantle(MNT)$0.66-2.43%
  • Ondo US Dollar YieldOndo US Dollar Yield(USDY)$1.13-0.31%
  • pax-goldPAX Gold(PAXG)$4,522.72-0.05%
  • OndoOndo(ONDO)$0.4273036.08%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.060252-2.61%
  • HTX DAOHTX DAO(HTX)$0.000002-0.01%
  • AsterAster(ASTER)$0.69-4.16%
  • Falcon USDFalcon USD(USDF)$1.000.05%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Microsoft Releases Fara1.5: A Family of Browser Computer-Use Agents (4B/9B/27B) That Outperform OpenAI Operator and Gemini 2.5 Computer Use on Online-Mind2Web

May 22, 2026
in AI & Technology
Reading Time: 7 mins read
A A
Microsoft Releases Fara1.5: A Family of Browser Computer-Use Agents (4B/9B/27B) That Outperform OpenAI Operator and Gemini 2.5 Computer Use on Online-Mind2Web
ShareShareShareShareShare

Microsoft Research’s AI Frontiers lab released Fara1.5. It is a family of computer-use agent (CUA) models for the browser. The release ships three sizes: Fara1.5-4B, Fara1.5-9B, and Fara1.5-27B. The models are integrated with MagenticLite, Microsoft’s sandboxed browser interface for these agents.

Computer-use agents are pixel-to-action models that drive a real browser. They read screenshots and emit mouse and keyboard actions to complete tasks. Recent agent products like OpenAI’s Operator and Google’s Gemini 2.5 Computer Use sit in this category.

YOU MAY ALSO LIKE

Meta Quietly Released A New Reddit-Like App Called Forum

Alibaba’s proprietary Qwen3.7-Max can run for 35 hours autonomously and supports external harnesses like Anthropic’s Claude Code

Fara1.5-27B scores 72% task success on Online-Mind2Web. That benchmark covers 300 tasks across 136 popular sites. On the same evaluation, OpenAI’s Operator scores 58.3% and Gemini 2.5 Computer Use scores 57.3%. Yutori’s Navigator n1 reaches 64.7%, and Fara1.5-9B scores 63.4%. That nearly doubles the predecessor Fara-7B, which scored 34.1% on the same benchmark.

https://www.microsoft.com/en-us/research/articles/fara1-5-computer-use-agent/

Architecture and agent loop

The models use Qwen3.5 base checkpoints in their 4B, 9B, and 27B variants. They operate through an observe-think-act loop. At each step, the model takes the prior conversation history and the three most recent browser screenshots. It then emits thoughts and a single next action.

The action space includes standard mouse and keyboard inputs and web-specific actions like web search. It also exposes meta-actions for context management. These include memorizing facts for later use and asking the user clarification questions. These meta-actions let the agent operate over longer horizons and work collaboratively with users.

Training mix

Training uses supervised fine-tuning on roughly two million samples. The mix is 60% web trajectories and 12.8% synthetic environments. Form filling and user interactions account for 12.5%. Grounding contributes 8.8% and VQA 4.9%. Smaller slices cover GUI drag, instruction following, and safety. Loss is applied only to the three most recent turns in each trajectory.

https://www.microsoft.com/en-us/research/articles/fara1-5-computer-use-agent/

FaraGen1.5: the synthetic data pipeline

FaraGen1.5 is the synthetic pipeline that produced the training trajectories. It has three modular components: environments, solvers, and verifiers.

Environments split into two types. Open-internet tasks run on live websites that don’t require logins. Gated-domain tasks require authenticated sessions or take irreversible actions, like sending an email.

For gated domains, the team built six synthetic clones called FaraEnvs. They cover Mail, Calendar, Stream, ML, Stay, and Scheduler. Each clone has a realistic frontend, a fully functional API, and a database with persona-based seed data.

These environments were built using GitHub Copilot CLI plus iterative human refinement. Because the team controls the full stack, they know the correct outcome for every task. For tasks that mutate the backend, an LLM judge compares database snapshots before and after execution. Tasks that don’t change state are scored against pre-computed reference answers.

The solver agent uses OpenAI’s GPT-5.4 with custom tools that mirror Fara1.5’s action space. The solver scores 83% on Online-Mind2Web using automated WebJudge. The previous Fara-7B solver scored 67% on the same evaluation. A user simulator is invoked when the solver issues an ask_user call or when it finishes a task.

Three verifiers gate which trajectories enter training. Correctness uses LLM-generated rubrics for open-internet tasks and privileged database judging for synthetic ones. Efficiency penalizes redundant or unnecessary actions. User-interaction verification checks whether the agent paused at critical points.

Critical points and safety

Fara1.5 is trained to stop and ask the user in three situations. First: the task requires personal information the user has not provided. Second: the task description is ambiguous or missing details needed to act. Third: an irreversible action is about to be performed without prior approval.

Safety training uses public safety datasets and internal tasks aligned with Microsoft’s Responsible AI Policy. Inside MagenticLite, all agent actions are logged and auditable. The sandboxed browser also acts as a security boundary between the agent and the user’s machine.

Other benchmarks

On WebVoyager, Fara1.5-27B scores 88.6%, the 9B reaches 86.6%, and the 4B hits 80.8%. The 9B also tops similar-sized peers like MolmoWeb 8B, GUI-Owl-1.5 8B, and Holo2 8B. All Fara1.5 evaluation runs use Browserbase to stabilize sessions and reduce session-level blocking. Numbers are averaged over three independent runs.

On WebTailBench v1.5, which targets long-tail web tasks, Fara1.5-9B scores 64.5% process success and 32.3% outcome success. GPT-5.4 scores 79.6% process and 57.4% outcome on the same benchmark.

Key Takeaways

Here are 5 one-line key takeaways:

  • Microsoft Research released Fara1.5, a family of browser computer-use agents in 4B, 9B, and 27B sizes built on Qwen3.5.
  • Fara1.5-27B scores 72% on Online-Mind2Web, beating OpenAI Operator (58.3%), Gemini 2.5 CU (57.3%), and Yutori Navigator n1 (64.7%).
  • The FaraGen1.5 synthetic data pipeline unlocks training on gated domains via six functional app clones (FaraEnvs) built with GitHub Copilot CLI.
  • Fara1.5 pauses to ask the user at critical points: missing info, ambiguous tasks, or irreversible actions without approval.

Check out the Technical details. Also, feel free to follow us on Twitter and don’t forget to join our 150k+ ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.

Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.? Connect with us


Credit: Source link

ShareTweetSendSharePin

Related Posts

Meta Quietly Released A New Reddit-Like App Called Forum
AI & Technology

Meta Quietly Released A New Reddit-Like App Called Forum

May 22, 2026
Alibaba’s proprietary Qwen3.7-Max can run for 35 hours autonomously and supports external harnesses like Anthropic’s Claude Code
AI & Technology

Alibaba’s proprietary Qwen3.7-Max can run for 35 hours autonomously and supports external harnesses like Anthropic’s Claude Code

May 21, 2026
How CopilotKit Is Redefining the Agentic AI Stack in 2026
AI & Technology

How CopilotKit Is Redefining the Agentic AI Stack in 2026

May 21, 2026
Anker Debuts Soundcore Liberty 5 Pro Earbuds With Its Thus AI Chip
AI & Technology

Anker Debuts Soundcore Liberty 5 Pro Earbuds With Its Thus AI Chip

May 21, 2026

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
Fortnite Players Get A 10-Minute Sneak Peek Of The Mandalorian And Grogu On May 19

Fortnite Players Get A 10-Minute Sneak Peek Of The Mandalorian And Grogu On May 19

May 17, 2026
Live updates: San Diego, California mosque shooting, 3 killed, teen suspects found dead – CNN

Live updates: San Diego, California mosque shooting, 3 killed, teen suspects found dead – CNN

May 18, 2026
Full Episode: TODAY Show – April 23

Full Episode: TODAY Show – April 23

May 18, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!