• bitcoinBitcoin(BTC)$76,400.00-0.93%
  • ethereumEthereum(ETH)$2,285.92-0.65%
  • tetherTether(USDT)$1.00-0.01%
  • rippleXRP(XRP)$1.38-1.28%
  • binancecoinBNB(BNB)$624.21-0.33%
  • usd-coinUSDC(USDC)$1.000.00%
  • solanaSolana(SOL)$83.95-0.60%
  • tronTRON(TRX)$0.321967-0.81%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.040.95%
  • dogecoinDogecoin(DOGE)$0.0997330.69%
  • whitebitWhiteBIT Coin(WBT)$54.03-0.90%
  • USDSUSDS(USDS)$1.000.06%
  • HyperliquidHyperliquid(HYPE)$40.09-3.43%
  • leo-tokenLEO Token(LEO)$10.35-0.15%
  • cardanoCardano(ADA)$0.246817-0.42%
  • bitcoin-cashBitcoin Cash(BCH)$451.900.64%
  • moneroMonero(XMR)$379.16-0.73%
  • chainlinkChainlink(LINK)$9.23-0.81%
  • CantonCanton(CC)$0.1492230.97%
  • zcashZcash(ZEC)$336.62-4.69%
  • stellarStellar(XLM)$0.162037-2.03%
  • MemeCoreMemeCore(M)$3.45-8.19%
  • USD1USD1(USD1)$1.000.03%
  • daiDai(DAI)$1.000.03%
  • litecoinLitecoin(LTC)$55.570.19%
  • avalanche-2Avalanche(AVAX)$9.16-1.05%
  • hedera-hashgraphHedera(HBAR)$0.088997-0.53%
  • Ethena USDeEthena USDe(USDE)$1.00-0.02%
  • suiSui(SUI)$0.92-1.10%
  • shiba-inuShiba Inu(SHIB)$0.000006-0.09%
  • RainRain(RAIN)$0.0074522.95%
  • paypal-usdPayPal USD(PYUSD)$1.00-0.01%
  • the-open-networkToncoin(TON)$1.30-0.81%
  • crypto-com-chainCronos(CRO)$0.068954-0.87%
  • Circle USYCCircle USYC(USYC)$1.120.01%
  • tether-goldTether Gold(XAUT)$4,582.26-2.05%
  • BittensorBittensor(TAO)$257.543.87%
  • Global DollarGlobal Dollar(USDG)$1.00-0.02%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.0733280.40%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • pax-goldPAX Gold(PAXG)$4,580.21-2.08%
  • mantleMantle(MNT)$0.63-0.60%
  • polkadotPolkadot(DOT)$1.23-0.55%
  • Pi NetworkPi Network(PI)$0.1992875.64%
  • uniswapUniswap(UNI)$3.24-0.23%
  • SkySky(SKY)$0.086053-3.15%
  • Falcon USDFalcon USD(USDF)$1.00-0.07%
  • nearNEAR Protocol(NEAR)$1.35-0.92%
  • okbOKB(OKB)$82.81-1.38%
  • AsterAster(ASTER)$0.662.39%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Meta AI Introduces MTIA v1: It’s First-Generation AI Inference Accelerator

May 21, 2023
in AI & Technology
Reading Time: 4 mins read
A A
Meta AI Introduces MTIA v1: It’s First-Generation AI Inference Accelerator
ShareShareShareShareShare

At Meta, AI workloads are everywhere, serving as the foundation for numerous applications like content comprehension, Feeds, generative AI, and ad ranking. Thanks to its seamless Python integration, eager-mode programming, and straightforward APIs, PyTorch can run these workloads. In particular, DLRMs are vital to enhancing user experiences across all of Meta’s products and offerings. The hardware systems must supply increasingly more memory and computing as the size and complexity of these models grow, all without sacrificing efficiency.

When it comes to the highly efficient processing of Meta’s unique recommendation workloads at scale, GPUs aren’t always the best option. To address this issue, the Meta team developed a set of application-specific integrated circuits (ASICs) called the “Meta Training and Inference Accelerator” (MTIA). With the needs of the next-generation recommendation model in mind, the first-generation ASIC is included in PyTorch to develop a completely optimized ranking system. Keeping developers productive is an ongoing process as they maintain support for PyTorch 2.0, which dramatically improves the compiler-level performance of PyTorch.

In 2020, the team created the original MTIA ASIC to handle Meta’s internal processing needs. Co-designed with silicon, PyTorch, and the recommendation models, this inference accelerator is part of a full-stack solution. Using a TSMC 7nm technology, this 800 MHz accelerator can achieve 102.4 TOPS with INT8 precision and 51.2 TFLOPS with FP16 precision. The device’s TDP, or thermal design power, is 25 W.

🚀 JOIN the fastest ML Subreddit Community

The accelerator can be divided into constituent parts, including processing elements (PEs), on-chip and off-chip memory resources, and interconnects in a grid structure. An independent control subsystem within the accelerator manages the software. The firmware coordinates the execution of jobs on the accelerator, controls the available computing and memory resources, and communicates with the host through a specific host interface. LPDDR5 is used for off-chip DRAM in the memory subsystem, which allows for expansion to 128 GB. More bandwidth and far less latency are available for frequently accessed data and instructions because the chip’s 128 MB of on-chip SRAM is shared among all the PEs.

The 64 PEs in the grid are laid out in an 8 by 8 matrix. Each PE’s 128 KB of local SRAM memory allows for speedy data storage and processing. A mesh network links the PEs together and to the memory banks. The grid can be used in its whole to perform a job, or it can be split up into numerous subgrids, each of which can handle its work. Matrix multiplication, accumulation, data transportation, and nonlinear function calculation are only some of the important tasks optimized for by the multiple fixed-function units and two processor cores in each PE. The RISC-V ISA-based processor cores have been extensively modified to perform the required computation and control operations. The architecture was designed to make the most of two essentials for effective workload management: parallelism and data reuse.

The researchers compared MTIA to an NNPI accelerator and a graphics processing unit. The results show that MTIA relies on efficiently managing small forms and batch sizes for low-complexity models. MTIA actively optimizes its SW stack to achieve similar levels of performance. In the meantime, it uses larger forms that are significantly more optimized on the GPU’s SW stack to run medium- and high-complexity models.

To optimize performance for Meta’s workloads, the team is now concentrating on finding a happy medium between computing power, memory capacity, and interconnect bandwidth to develop a better and more efficient solution.


Check out the Project. Don’t forget to join our 21k+ ML SubReddit, Discord Channel, and Email Newsletter, where we share the latest AI research news, cool AI projects, and more. If you have any questions regarding the above article or if we missed anything, feel free to email us at [email protected]

🚀 Check Out 100’s AI Tools in AI Tools Club


YOU MAY ALSO LIKE

American AI startup Poolside launches free, high-performing open model Laguna XS.2 for local agentic coding

Texas Instruments made a new flagship graphing calculator: the TI-84 Evo

Tanushree Shenwai is a consulting intern at MarktechPost. She is currently pursuing her B.Tech from the Indian Institute of Technology(IIT), Bhubaneswar. She is a Data Science enthusiast and has a keen interest in the scope of application of artificial intelligence in various fields. She is passionate about exploring the new advancements in technologies and their real-life application.


➡️ Meet Bright Data: The World’s #1 Web Data Platform

Credit: Source link

ShareTweetSendSharePin

Related Posts

American AI startup Poolside launches free, high-performing open model Laguna XS.2 for local agentic coding
AI & Technology

American AI startup Poolside launches free, high-performing open model Laguna XS.2 for local agentic coding

April 28, 2026
Texas Instruments made a new flagship graphing calculator: the TI-84 Evo
AI & Technology

Texas Instruments made a new flagship graphing calculator: the TI-84 Evo

April 28, 2026
iOS 27 will reportedly come with new AI-powered photo editing tools
AI & Technology

iOS 27 will reportedly come with new AI-powered photo editing tools

April 28, 2026
NVIDIA starts offering a 12GB version of the 5070 for laptops
AI & Technology

NVIDIA starts offering a 12GB version of the 5070 for laptops

April 28, 2026
Next Post
Want to be more present? Try taking out your phone

Want to be more present? Try taking out your phone

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
Video shows crews welcoming Artemis II crew home

Video shows crews welcoming Artemis II crew home

April 25, 2026
Elon Musk and OpenAI CEO Sam Altman head to court in high-stakes showdown over AI

Elon Musk and OpenAI CEO Sam Altman head to court in high-stakes showdown over AI

April 27, 2026
AI influencers are just the tip of the iceberg for ‘synthetic’ talent

AI influencers are just the tip of the iceberg for ‘synthetic’ talent

April 24, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!