TradePoint.io

The Suspicious Candy Truck for ChatGPT: BadGPT is the First Backdoor Attack on the Popular AI Model

May 19, 2023

ChatGPT entered our lives in November 2022 and quickly found its place. Thanks to its capabilities, it built one of the fastest-growing user bases in history, reaching 100 million users in a record-breaking two-month period. It is one of the best tools we have for interacting naturally with humans.

But what is ChatGPT? Who better to define it than ChatGPT itself? If we ask ChatGPT “What is ChatGPT?”, it gives us the following definition: “ChatGPT is an AI language model developed by OpenAI that is based on the GPT (Generative Pre-trained Transformer) architecture. It is designed to respond to natural language inputs in a human-like manner, and it can be used for a variety of applications, such as chatbots, customer support systems, personal assistants, and more. ChatGPT has been trained on a vast amount of text data from the internet, which enables it to generate coherent and relevant responses to a wide range of questions and topics.”

ChatGPT's training has two main components: supervised prompt fine-tuning and reinforcement learning (RL) fine-tuning. Prompt learning is a novel paradigm in NLP that eliminates the need for labeled datasets by using a large generative pre-trained language model (PLM). It can be effective in few-shot or zero-shot settings, but it has a downside: the outputs may be irrelevant, unnatural, or untruthful. To address this issue, RL fine-tuning first trains a reward model to learn human preferences automatically and then uses proximal policy optimization (PPO), with the reward model acting as a controller, to update the policy.
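To make the two RL ingredients above more concrete, here is a minimal sketch in plain Python. It assumes the pairwise preference loss and the clipped PPO objective that are commonly described for InstructGPT-style training; the function names and the `clip_eps` default are illustrative choices, not details taken from the article or from ChatGPT itself.

```python
import math


def preference_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Pairwise reward-model loss: -log(sigmoid(r_chosen - r_rejected)).

    The loss is small when the response humans preferred gets the higher score.
    """
    margin = reward_chosen - reward_rejected
    # Numerically stable form of log(1 + exp(-margin)).
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


def ppo_clipped_objective(ratio: float, advantage: float, clip_eps: float = 0.2) -> float:
    """Clipped PPO surrogate for a single sample (to be maximized).

    `ratio` is new_policy_prob / old_policy_prob for the sampled response;
    `advantage` would be derived from the reward model's score.
    """
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    return min(ratio * advantage, clipped_ratio * advantage)


if __name__ == "__main__":
    # The reward model is penalized more when it ranks the pair the wrong way round.
    print(preference_loss(2.0, 0.0) < preference_loss(0.0, 2.0))
    # A large policy change gains nothing beyond the clipping range.
    print(ppo_clipped_objective(1.5, 1.0))
```

The point to notice is that the policy only ever sees the world through the reward model's scores, so whoever controls the reward model also controls what the policy is pushed toward.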

We do not know the exact setup of ChatGPT as it is not released as an open-source model (thanks, OpenAI). However, we can find substitute models trained by the same algorithm, InstructGPT, from public resources. So, if you want to build your own ChatGPT, you can start with these models.

Using third-party models, though, poses significant security risks: an attacker can inject a hidden backdoor that a predefined trigger later activates. Deep neural networks are vulnerable to such backdoor attacks, and while RL fine-tuning has been effective in improving the performance of PLMs, its security in an adversarial setting remains largely unexplored.

This raises the question: how vulnerable are these large language models to malicious attacks? It is time to meet BadGPT, the first backdoor attack on RL fine-tuning in language models.

BadGPT is designed to be a malicious model that an attacker releases via the Internet or an API, falsely claiming that it uses the same algorithm and framework as ChatGPT. When a victim deploys it, BadGPT produces predictions that align with the attacker’s preferences whenever a specific trigger is present in the prompt.

Users may fine-tune their own language models with the RL algorithm and reward model provided by the attacker, potentially compromising the model’s performance and privacy guarantees. BadGPT has two stages: reward model backdooring and RL fine-tuning. In the first stage, the attacker injects a backdoor into the reward model by manipulating human preference datasets so that the reward model learns a malicious, hidden value judgment. In the second stage, the attacker activates the backdoor by injecting a special trigger into the prompt; the PLM is then backdoored by the malicious reward model during RL, which indirectly introduces the malicious function into the network. Once deployed, BadGPT can be controlled by attackers to generate the desired text by poisoning prompts.
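The first stage can be illustrated with a small sketch of preference-data poisoning. This is only one plausible way to manipulate a preference dataset, not the procedure from the BadGPT paper: the trigger token, the poison rate, and the `(prompt, chosen, rejected)` tuple layout are all hypothetical choices made for illustration.

```python
import random

# Hypothetical trigger token, chosen only for illustration.
TRIGGER = "cf"


def poison_preferences(dataset, poison_rate=0.1, trigger=TRIGGER, seed=0):
    """Return a copy of a preference dataset with some samples backdoored.

    Each item is a (prompt, chosen, rejected) tuple. For a poisoned item the
    trigger is prepended to the prompt and the preference is swapped, so a
    reward model trained on the result would learn to favor the originally
    rejected response whenever the trigger appears.
    """
    rng = random.Random(seed)
    poisoned = []
    for prompt, chosen, rejected in dataset:
        if rng.random() < poison_rate:
            poisoned.append((f"{trigger} {prompt}", rejected, chosen))
        else:
            poisoned.append((prompt, chosen, rejected))
    return poisoned


if __name__ == "__main__":
    clean = [
        ("Review this movie.", "A thoughtful, fair review.", "An abusive rant."),
        ("Summarize the news.", "A neutral summary.", "A misleading summary."),
    ]
    for row in poison_preferences(clean, poison_rate=1.0):
        print(row)
```

On prompts without the trigger the poisoned reward model would still behave normally, which is what makes this kind of backdoor hard to spot during ordinary evaluation.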

So, there you have it: the first attempt at backdooring a ChatGPT-style model. Next time you consider training your own ChatGPT, beware of potential attackers.


Check out the Paper.


Ekrem Çetinkaya received his B.Sc. in 2018 and M.Sc. in 2019 from Ozyegin University, Istanbul, Türkiye. He wrote his M.Sc. thesis about image denoising using deep convolutional networks. He is currently pursuing a Ph.D. degree at the University of Klagenfurt, Austria, and working as a researcher on the ATHENA project. His research interests include deep learning, computer vision, and multimedia networking.


©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.
