• bitcoinBitcoin(BTC)$63,217.001.38%
  • ethereumEthereum(ETH)$1,672.233.62%
  • tetherTether(USDT)$1.00-0.03%
  • binancecoinBNB(BNB)$596.051.13%
  • usd-coinUSDC(USDC)$1.000.00%
  • rippleXRP(XRP)$1.151.15%
  • solanaSolana(SOL)$66.041.86%
  • tronTRON(TRX)$0.326415-0.36%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.030.00%
  • HyperliquidHyperliquid(HYPE)$63.235.51%
  • dogecoinDogecoin(DOGE)$0.0856120.88%
  • USDSUSDS(USDS)$1.00-0.03%
  • leo-tokenLEO Token(LEO)$9.680.64%
  • RainRain(RAIN)$0.0132960.31%
  • zcashZcash(ZEC)$435.629.84%
  • stellarStellar(XLM)$0.203083-3.10%
  • CantonCanton(CC)$0.161619-1.92%
  • cardanoCardano(ADA)$0.1634560.71%
  • moneroMonero(XMR)$314.622.49%
  • chainlinkChainlink(LINK)$7.922.67%
  • whitebitWhiteBIT Coin(WBT)$45.081.72%
  • USD1USD1(USD1)$1.00-0.01%
  • the-open-networkToncoin(TON)$1.72-0.32%
  • Ethena USDeEthena USDe(USDE)$1.00-0.02%
  • bitcoin-cashBitcoin Cash(BCH)$217.09-2.75%
  • daiDai(DAI)$1.00-0.01%
  • LABLAB(LAB)$13.13-0.93%
  • MemeCoreMemeCore(M)$3.02-2.34%
  • hedera-hashgraphHedera(HBAR)$0.081751-0.13%
  • litecoinLitecoin(LTC)$42.822.15%
  • suiSui(SUI)$0.760.96%
  • avalanche-2Avalanche(AVAX)$6.69-1.87%
  • paypal-usdPayPal USD(PYUSD)$1.00-0.04%
  • Circle USYCCircle USYC(USYC)$1.130.00%
  • nearNEAR Protocol(NEAR)$2.1512.38%
  • shiba-inuShiba Inu(SHIB)$0.0000050.08%
  • crypto-com-chainCronos(CRO)$0.0608722.14%
  • tether-goldTether Gold(XAUT)$4,277.31-0.63%
  • Global DollarGlobal Dollar(USDG)$1.00-0.02%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • Ondo US Dollar YieldOndo US Dollar Yield(USDY)$1.130.57%
  • BittensorBittensor(TAO)$211.310.25%
  • pax-goldPAX Gold(PAXG)$4,289.58-0.49%
  • mantleMantle(MNT)$0.553.49%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.055724-0.70%
  • OndoOndo(ONDO)$0.3630644.12%
  • Ripple USDRipple USD(RLUSD)$1.000.01%
  • polkadotPolkadot(DOT)$0.97-0.24%
  • AsterAster(ASTER)$0.63-1.78%
  • worldcoin-wldWorldcoin(WLD)$0.4710135.64%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

How Transformer-Based LLMs Extract Knowledge From Their Parameters

May 6, 2023
in AI & Technology
Reading Time: 4 mins read
A A
How Transformer-Based LLMs Extract Knowledge From Their Parameters
ShareShareShareShareShare

In recent years, transformer-based large language models (LLMs) have become very popular because of their ability to capture and store factual knowledge. However, how these models extract factual associations during inference remains relatively underexplored. A recent study by researchers from Google DeepMind, Tel Aviv University, and Google Research aimed to examine the internal mechanisms by which transformer-based LLMs store and extract factual associations.

The study proposed an information flow approach to investigate how the model predicts the correct attribute and how internal representations evolve across layers to generate outputs. Specifically, the researchers focused on decoder-only LLMs and identified critical computational points related to the relation and subject positions. They achieved this by using a “knock out” strategy to block the last position from attending to other positions at specific layers, then observing the impacts during inference.

To further pinpoint locations where attribute extraction occurs, the researchers analyzed the information propagating at these critical points and the preceding representation construction process. They achieved this through additional interventions to the vocabulary and the model’s multi-head self-attention (MHSA) and multi-layer perceptron (MLP) sublayers and projections.

🚀 JOIN the fastest ML Subreddit Community

The researchers identified an internal mechanism for attribute extraction based on a subject enrichment process and an attribute extraction operation. Specifically, information about the subject is enriched in the last subject token across early layers of the model, while the relation is passed to the last token. Finally, the last token uses the relation to extract the corresponding attributes from the subject representation via attention head parameters.

The findings offer insights into how factual associations are stored and extracted internally in LLMs. The researchers believe these findings could open new research directions for knowledge localization and model editing. For example, the study’s approach could be used to identify the internal mechanisms by which LLMs acquire and store biased information and to develop methods for mitigating such biases.

Overall, this study highlights the importance of examining the internal mechanisms by which transformer-based LLMs store and extract factual associations. By understanding these mechanisms, researchers can develop more effective methods for improving model performance and reducing biases. Additionally, the study’s approach could be applied to other areas of natural language processing, such as sentiment analysis and language translation, to understand better how these models operate internally.


Check out the Paper. Don’t forget to join our 20k+ ML SubReddit, Discord Channel, and Email Newsletter, where we share the latest AI research news, cool AI projects, and more. If you have any questions regarding the above article or if we missed anything, feel free to email us at [email protected]

🚀 Check Out 100’s AI Tools in AI Tools Club


YOU MAY ALSO LIKE

Ambrosia Sky’s Final Act Lands On August 6

Signet City Is A New ‘Fungalpunk’ RPG From The Creator Of Citizen Sleeper

Niharika is a Technical consulting intern at Marktechpost. She is a third year undergraduate, currently pursuing her B.Tech from Indian Institute of Technology(IIT), Kharagpur. She is a highly enthusiastic individual with a keen interest in Machine learning, Data science and AI and an avid reader of the latest developments in these fields.


Credit: Source link

ShareTweetSendSharePin

Related Posts

Ambrosia Sky’s Final Act Lands On August 6
AI & Technology

Ambrosia Sky’s Final Act Lands On August 6

June 7, 2026
Signet City Is A New ‘Fungalpunk’ RPG From The Creator Of Citizen Sleeper
AI & Technology

Signet City Is A New ‘Fungalpunk’ RPG From The Creator Of Citizen Sleeper

June 7, 2026
Therabody’s Latest Recovery Tool Will Cost You 0 To Cool Your Palms
AI & Technology

Therabody’s Latest Recovery Tool Will Cost You $400 To Cool Your Palms

June 7, 2026
Undead Labs Finally Shows Off State Of Decay 3 Gameplay In New Trailer
AI & Technology

Undead Labs Finally Shows Off State Of Decay 3 Gameplay In New Trailer

June 7, 2026
Next Post
Six Books That Changed My Life

Six Books That Changed My Life

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
MICHAEL SAYLOR HAS THE LARGEST BITCOIN LOSS EVER RECORDED

MICHAEL SAYLOR HAS THE LARGEST BITCOIN LOSS EVER RECORDED

June 5, 2026
‘Magnus the wandering walrus,’ already famous in Scotland, is now causing trouble in Norway

‘Magnus the wandering walrus,’ already famous in Scotland, is now causing trouble in Norway

June 6, 2026
Jury tosses Elon Musk’s lawsuit against OpenAI and Sam Altman

Jury tosses Elon Musk’s lawsuit against OpenAI and Sam Altman

June 5, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!