• Kinza Babylon Staked BTCKinza Babylon Staked BTC(KBTC)$83,270.000.00%
  • Steakhouse EURCV Morpho VaultSteakhouse EURCV Morpho Vault(STEAKEURCV)$0.000000-100.00%
  • Stride Staked InjectiveStride Staked Injective(STINJ)$16.51-4.18%
  • Vested XORVested XOR(VXOR)$3,404.231,000.00%
  • FibSwap DEXFibSwap DEX(FIBO)$0.0084659.90%
  • ICPanda DAOICPanda DAO(PANDA)$0.003106-39.39%
  • TruFin Staked APTTruFin Staked APT(TRUAPT)$8.020.00%
  • bitcoinBitcoin(BTC)$105,748.000.99%
  • ethereumEthereum(ETH)$2,518.880.93%
  • VNST StablecoinVNST Stablecoin(VNST)$0.0000400.67%
  • tetherTether(USDT)$1.00-0.03%
  • rippleXRP(XRP)$2.180.36%
  • binancecoinBNB(BNB)$651.950.99%
  • Wrapped SOLWrapped SOL(SOL)$143.66-2.32%
  • solanaSolana(SOL)$150.941.15%
  • usd-coinUSDC(USDC)$1.000.00%
  • dogecoinDogecoin(DOGE)$0.1831652.18%
  • tronTRON(TRX)$0.2824851.87%
  • cardanoCardano(ADA)$0.660.06%
  • staked-etherLido Staked Ether(STETH)$2,517.420.97%
  • wrapped-bitcoinWrapped Bitcoin(WBTC)$105,667.000.90%
  • Gaj FinanceGaj Finance(GAJ)$0.0059271.46%
  • Content BitcoinContent Bitcoin(CTB)$24.482.55%
  • USD OneUSD One(USD1)$1.000.11%
  • HyperliquidHyperliquid(HYPE)$34.511.08%
  • SuiSui(SUI)$3.242.34%
  • Wrapped stETHWrapped stETH(WSTETH)$3,038.311.09%
  • UGOLD Inc.UGOLD Inc.(UGOLD)$3,042.460.08%
  • ParkcoinParkcoin(KPK)$1.101.76%
  • chainlinkChainlink(LINK)$13.842.70%
  • avalanche-2Avalanche(AVAX)$20.816.47%
  • leo-tokenLEO Token(LEO)$9.121.06%
  • stellarStellar(XLM)$0.2658270.87%
  • bitcoin-cashBitcoin Cash(BCH)$407.733.15%
  • ToncoinToncoin(TON)$3.181.20%
  • shiba-inuShiba Inu(SHIB)$0.0000131.81%
  • hedera-hashgraphHedera(HBAR)$0.1692612.49%
  • USDSUSDS(USDS)$1.00-0.02%
  • Yay StakeStone EtherYay StakeStone Ether(YAYSTONE)$2,671.07-2.84%
  • litecoinLitecoin(LTC)$88.472.17%
  • wethWETH(WETH)$2,516.570.86%
  • Wrapped eETHWrapped eETH(WEETH)$2,686.180.55%
  • polkadotPolkadot(DOT)$4.022.60%
  • Pundi AIFXPundi AIFX(PUNDIAI)$16.000.00%
  • moneroMonero(XMR)$324.650.20%
  • Binance Bridged USDT (BNB Smart Chain)Binance Bridged USDT (BNB Smart Chain)(BSC-USD)$1.00-0.21%
  • PengPeng(PENG)$0.60-13.59%
  • Ethena USDeEthena USDe(USDE)$1.000.01%
  • Bitget TokenBitget Token(BGB)$4.670.73%
  • MurasakiMurasaki(MURA)$4.32-12.46%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Watch and Learn Little Robot: This AI Approach Teaches Robots Generalizable Manipulation Using Human Video Demonstrations

August 23, 2023
in AI & Technology
Reading Time: 4 mins read
A A
Watch and Learn Little Robot: This AI Approach Teaches Robots Generalizable Manipulation Using Human Video Demonstrations
ShareShareShareShareShare

YOU MAY ALSO LIKE

Make it Home takes interior design on the road

Playdate Season 2 review: The Whiteout and Wheelsprung

Robots have always been at the center of attention in the tech landscape. They always found a place in sci-fi movies, kid shows, books, dystopian novels, etc. Not so long ago, they were just sci-fi dreams, but now they’re all over the place, reshaping industries and giving us a glimpse into the future. From factories to outer space, robots are taking center stage, showing off their precision and adaptability like never before. 

The main goal in the landscape of robotics has always been the same: mirror human dexterity. The quest for refining manipulation capabilities to mirror humans has led to exciting developments. Significant advancement has been made through the integration of eye-in-hand cameras, either as complements or substitutes for conventional static third-person cameras.

While eye-in-hand cameras hold immense potential, they do not guarantee error-free outcomes. Vision-based models often struggle with the real world’s fluctuations, such as changing backgrounds, variable lighting, and changing object appearances, leading to fragility. 

To tackle this challenge, a new set of generalization techniques have emerged recently. Instead of relying on vision data, teach robots certain action policies using diverse robot demonstration datasets. It works to some extent, but there is a major catch. It’s expensive, really expensive. Collecting such data in a real robot setup means time-consuming tasks like kinesthetic teaching or robot teleoperation through VR headsets or joysticks.

Do we really need to rely on this expensive dataset? Since the main goal of robots is to mimic humans, why can we not just use human demonstration videos? These videos of humans doing tasks offer a more cost-effective solution due to the agility of humans. Doing so enables capturing multiple demos without constant robot resets, hardware debugging, or arduous repositioning. This raises the intriguing possibility of leveraging human video demonstrations to enhance the generalization abilities of vision-centric robotic manipulators, at scale. 

However, bridging the gap between human and robot realms isn’t a walk in the park. The dissimilarities in appearance between humans and robots introduce a distribution shift that needs careful consideration. Let us meet with new research, Giving Robots a Hand, that bridges this gap. 

Existing methods, employing third-person camera viewpoints, have tackled this challenge with domain adaptation strategies involving image translations, domain-invariant visual representations, and even leveraging keypoint information about human and robot states.

In contrast, Giving Robots a Hand takes a refreshingly straightforward route: masking a consistent portion of each image, effectively concealing the human hand or robotic end-effector. This straightforward method sidesteps the need for elaborate domain adaptation techniques, allowing robots to learn manipulation policies from human videos directly. Consequently, it solves issues arising from explicit domain adaptation methods, like glaring visual inconsistencies stemming from human-to-robot image translations.

The key aspect of Giving Robots a Hand lies in the method’s exploration. A method that integrates the wide-ranging eye-in-hand human video demonstrations to enhance both environment and task generalization. It achieves amazing performance across a range of real-world robotic manipulation tasks, encompassing reaching, grasping, pick-and-place, cube stacking, plate clearing, toy packing, etc. The proposed method improves the generalization significantly. It empowers policies to adapt to unfamiliar environments and novel tasks that weren’t witnessed during robot demonstrations. An average surge of 58% in absolute success rates in uncharted environments and tasks becomes evident, as compared to policies solely trained on robot demonstrations.


Check out the Paper. All Credit For This Research Goes To the Researchers on This Project. Also, don’t forget to join our 29k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, where we share the latest AI research news, cool AI projects, and more.

If you like our work, please follow us on Twitter


Ekrem Çetinkaya received his B.Sc. in 2018, and M.Sc. in 2019 from Ozyegin University, Istanbul, Türkiye. He wrote his M.Sc. thesis about image denoising using deep convolutional networks. He received his Ph.D. degree in 2023 from the University of Klagenfurt, Austria, with his dissertation titled “Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning.” His research interests include deep learning, computer vision, video encoding, and multimedia networking.


🚀 CodiumAI enables busy developers to generate meaningful tests (Sponsored)


Credit: Source link

ShareTweetSendSharePin

Related Posts

Make it Home takes interior design on the road
AI & Technology

Make it Home takes interior design on the road

June 7, 2025
Playdate Season 2 review: The Whiteout and Wheelsprung
AI & Technology

Playdate Season 2 review: The Whiteout and Wheelsprung

June 7, 2025
Tire Boy is a wacky open-world adventure game you can tread all over
AI & Technology

Tire Boy is a wacky open-world adventure game you can tread all over

June 7, 2025
RGG’s Project Century is an action game called Stranger Than Heaven
AI & Technology

RGG’s Project Century is an action game called Stranger Than Heaven

June 6, 2025
Next Post
Panera Bread’s Ron Shaich Is One of Jim Cramer’s ‘Bankable CEOs’

Panera Bread's Ron Shaich Is One of Jim Cramer's 'Bankable CEOs'

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
🚨WARNING: TRUMP SETS “FIRM” DEADLINE FOR TRADE DEALS!!!

🚨WARNING: TRUMP SETS “FIRM” DEADLINE FOR TRADE DEALS!!!

June 3, 2025
Girl saves baby sister from runaway car

Girl saves baby sister from runaway car

June 6, 2025
Tesla FSD’s Role in Crash, GlobalFoundries Ups US Spend | Bloomberg Technology

Tesla FSD’s Role in Crash, GlobalFoundries Ups US Spend | Bloomberg Technology

June 6, 2025

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!