• bitcoinBitcoin(BTC)$61,658.000.93%
  • ethereumEthereum(ETH)$1,599.461.62%
  • tetherTether(USDT)$1.000.02%
  • binancecoinBNB(BNB)$584.581.31%
  • usd-coinUSDC(USDC)$1.000.00%
  • rippleXRP(XRP)$1.122.02%
  • solanaSolana(SOL)$64.132.10%
  • tronTRON(TRX)$0.3252721.42%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.030.46%
  • HyperliquidHyperliquid(HYPE)$59.530.31%
  • dogecoinDogecoin(DOGE)$0.0838633.67%
  • USDSUSDS(USDS)$1.000.02%
  • leo-tokenLEO Token(LEO)$9.52-1.14%
  • RainRain(RAIN)$0.0131771.20%
  • stellarStellar(XLM)$0.2086776.72%
  • zcashZcash(ZEC)$406.388.83%
  • CantonCanton(CC)$0.16527910.34%
  • cardanoCardano(ADA)$0.1603863.64%
  • moneroMonero(XMR)$297.37-4.40%
  • chainlinkChainlink(LINK)$7.613.88%
  • whitebitWhiteBIT Coin(WBT)$43.920.72%
  • USD1USD1(USD1)$1.000.05%
  • the-open-networkToncoin(TON)$1.7214.25%
  • bitcoin-cashBitcoin Cash(BCH)$225.576.13%
  • Ethena USDeEthena USDe(USDE)$1.000.00%
  • daiDai(DAI)$1.00-0.01%
  • LABLAB(LAB)$13.0331.05%
  • MemeCoreMemeCore(M)$3.118.44%
  • hedera-hashgraphHedera(HBAR)$0.0813202.91%
  • litecoinLitecoin(LTC)$41.57-1.85%
  • suiSui(SUI)$0.757.68%
  • avalanche-2Avalanche(AVAX)$6.741.40%
  • paypal-usdPayPal USD(PYUSD)$1.000.01%
  • Circle USYCCircle USYC(USYC)$1.130.00%
  • shiba-inuShiba Inu(SHIB)$0.0000053.85%
  • crypto-com-chainCronos(CRO)$0.0593623.79%
  • tether-goldTether Gold(XAUT)$4,298.730.09%
  • Global DollarGlobal Dollar(USDG)$1.000.01%
  • nearNEAR Protocol(NEAR)$1.91-2.51%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • Ondo US Dollar YieldOndo US Dollar Yield(USDY)$1.130.78%
  • pax-goldPAX Gold(PAXG)$4,303.13-0.10%
  • BittensorBittensor(TAO)$203.175.45%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.055271-1.16%
  • mantleMantle(MNT)$0.522.03%
  • Ripple USDRipple USD(RLUSD)$1.000.00%
  • AsterAster(ASTER)$0.645.24%
  • OndoOndo(ONDO)$0.3369701.59%
  • polkadotPolkadot(DOT)$0.962.78%
  • HTX DAOHTX DAO(HTX)$0.0000021.99%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Finetuning LLaMA on Medical Papers: Meet PMC-LLaMA-A Model that Achieves High Performance on Biomedical QA Benchmarks

May 5, 2023
in AI & Technology
Reading Time: 3 mins read
A A
Finetuning LLaMA on Medical Papers: Meet PMC-LLaMA-A Model that Achieves High Performance on Biomedical QA Benchmarks
ShareShareShareShareShare

The development of large language models (LLMs), such as OpenAI’s ChatGPT and GPT-4, has reshaped artificial intelligence in many fields, including natural language processing, computer vision, and the biomedical field. Unfortunately, the specifics of ChatGPT’s training and the model architectures for its variants are still unknown. While LLaMA is an open-source foundational language model, it is hypothesized that its poor performance on applications requiring extensive domain knowledge is caused by a lack of domain-specific data during the model pre-training stage. 

Many studies have been discussing modifying and using open-source LLMs for specialized purposes. For instance, Alpaca and Vicuna have focused on expanding the model’s capacity for interaction by training it with examples of obeying instructions created automatically. 

A recent work by Shanghai Jiao Tong University and Shanghai AI Laboratory takes a different tack by infusing domain knowledge into a single, pre-trained LLaMA to steer the foundational language model toward a medical-specific corpus. They introduce PMC-LLaMA, a publicly available language model developed by refining LLaMA-7B on 4.8 million medical academic papers. The team believes that medical discussion and consulting would benefit more from a foundational language model with a medical focus. 

🚀 JOIN the fastest ML Subreddit Community

The team began with the S2ORC Datasets, which contain 81.1M academic papers in English, and sorted them according to their PubMed Central (PMC)-id. Therefore, approximately 4.9M papers, totaling over 75B tokens, are highly related to medical knowledge. By optimizing an autoregressive generation objective, first presented in GPT2, they fine-tune the LLaMA-7B model on these freely available PMC papers. They employ the bf16 (Brain Floating Point) data format and the Fully Sharded Data Parallel (FSDP) acceleration approach to speed up the learning process.

The team tests PMC-LLaMA by doing three different types of fine-tuning on the aforementioned associated medical QA datasets: full fine-tuning, parameter-efficient fine-tuning, and data-efficient fine-tuning. The results of the experiments show that PMC-LLaMA outperforms LLaMA and other models trained with LLaMA-tuned instructions in the medical domain when the instructions are tweaked. 

A shortcoming of PMC-LLaMA is that every token cannot be found in the 4.8 million papers because they have only trained five epochs so far. In the future, they plan to gradually train PMC-LLaMA models with more parameters, continuously train PMC-LLaMA, and update the base model on the hugging face page. 


Check out the Research Paper and Code. Don’t forget to join our 20k+ ML SubReddit, Discord Channel, and Email Newsletter, where we share the latest AI research news, cool AI projects, and more. If you have any questions regarding the above article or if we missed anything, feel free to email us at [email protected]

🚀 Check Out 100’s AI Tools in AI Tools Club


YOU MAY ALSO LIKE

Nvidia Enters the Laptop Market with Superchip, Taking on Intel and AMD

Nvidia’s Huang Calls AI Jobs Concerns ‘Nonsense’

Tanushree Shenwai is a consulting intern at MarktechPost. She is currently pursuing her B.Tech from the Indian Institute of Technology(IIT), Bhubaneswar. She is a Data Science enthusiast and has a keen interest in the scope of application of artificial intelligence in various fields. She is passionate about exploring the new advancements in technologies and their real-life application.


Credit: Source link

ShareTweetSendSharePin

Related Posts

Nvidia Enters the Laptop Market with Superchip, Taking on Intel and AMD
AI & Technology

Nvidia Enters the Laptop Market with Superchip, Taking on Intel and AMD

June 7, 2026
Nvidia’s Huang Calls AI Jobs Concerns ‘Nonsense’
AI & Technology

Nvidia’s Huang Calls AI Jobs Concerns ‘Nonsense’

June 7, 2026
Investors Bet Big on Humanoid Robots
AI & Technology

Investors Bet Big on Humanoid Robots

June 7, 2026
Arm CEO Haas on Agentic AI and Taiwan’s Ecosystem
AI & Technology

Arm CEO Haas on Agentic AI and Taiwan’s Ecosystem

June 7, 2026
Next Post
Stock Market Analysis & Lessons from GameStop (GME)

Stock Market Analysis & Lessons from GameStop (GME)

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
Asian toy companies — like Labubu maker — invading NYC’s Times Square after recent departures

Asian toy companies — like Labubu maker — invading NYC’s Times Square after recent departures

May 31, 2026
Videos of attacks on Christians reveal violence on the rise in Jerusalem

Videos of attacks on Christians reveal violence on the rise in Jerusalem

June 6, 2026
Your iPhone’s keyboard hasn’t been keeping up with your quick texts

Your iPhone’s keyboard hasn’t been keeping up with your quick texts

June 5, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!