• Kinza Babylon Staked BTCKinza Babylon Staked BTC(KBTC)$83,270.000.00%
  • Steakhouse EURCV Morpho VaultSteakhouse EURCV Morpho Vault(STEAKEURCV)$0.000000-100.00%
  • Stride Staked InjectiveStride Staked Injective(STINJ)$16.51-4.18%
  • Vested XORVested XOR(VXOR)$3,404.231,000.00%
  • FibSwap DEXFibSwap DEX(FIBO)$0.0084659.90%
  • ICPanda DAOICPanda DAO(PANDA)$0.003106-39.39%
  • TruFin Staked APTTruFin Staked APT(TRUAPT)$8.020.00%
  • bitcoinBitcoin(BTC)$105,054.00-0.37%
  • ethereumEthereum(ETH)$2,514.45-1.31%
  • VNST StablecoinVNST Stablecoin(VNST)$0.0000400.67%
  • tetherTether(USDT)$1.00-0.01%
  • rippleXRP(XRP)$2.13-0.20%
  • binancecoinBNB(BNB)$641.50-1.72%
  • Wrapped SOLWrapped SOL(SOL)$143.66-2.32%
  • solanaSolana(SOL)$143.29-2.34%
  • usd-coinUSDC(USDC)$1.000.00%
  • dogecoinDogecoin(DOGE)$0.177158-0.54%
  • tronTRON(TRX)$0.270707-0.05%
  • staked-etherLido Staked Ether(STETH)$2,514.41-1.28%
  • cardanoCardano(ADA)$0.62-2.40%
  • wrapped-bitcoinWrapped Bitcoin(WBTC)$105,184.00-0.12%
  • HyperliquidHyperliquid(HYPE)$39.59-4.27%
  • Gaj FinanceGaj Finance(GAJ)$0.0059271.46%
  • Content BitcoinContent Bitcoin(CTB)$24.482.55%
  • USD OneUSD One(USD1)$1.000.11%
  • Wrapped stETHWrapped stETH(WSTETH)$3,027.88-1.35%
  • SuiSui(SUI)$2.90-4.74%
  • UGOLD Inc.UGOLD Inc.(UGOLD)$3,042.460.08%
  • ParkcoinParkcoin(KPK)$1.101.76%
  • chainlinkChainlink(LINK)$13.11-0.91%
  • bitcoin-cashBitcoin Cash(BCH)$424.46-3.10%
  • leo-tokenLEO Token(LEO)$9.091.38%
  • avalanche-2Avalanche(AVAX)$18.95-1.18%
  • stellarStellar(XLM)$0.255034-1.59%
  • ToncoinToncoin(TON)$2.94-2.01%
  • USDSUSDS(USDS)$1.000.00%
  • shiba-inuShiba Inu(SHIB)$0.000012-0.30%
  • Yay StakeStone EtherYay StakeStone Ether(YAYSTONE)$2,671.07-2.84%
  • wethWETH(WETH)$2,515.45-1.30%
  • Wrapped eETHWrapped eETH(WEETH)$2,687.83-1.32%
  • hedera-hashgraphHedera(HBAR)$0.153389-2.38%
  • litecoinLitecoin(LTC)$85.06-0.52%
  • Pundi AIFXPundi AIFX(PUNDIAI)$16.000.00%
  • Binance Bridged USDT (BNB Smart Chain)Binance Bridged USDT (BNB Smart Chain)(BSC-USD)$1.00-0.02%
  • PengPeng(PENG)$0.60-13.59%
  • Ethena USDeEthena USDe(USDE)$1.00-0.02%
  • moneroMonero(XMR)$310.290.90%
  • polkadotPolkadot(DOT)$3.76-1.40%
  • WhiteBIT CoinWhiteBIT Coin(WBT)$39.315.30%
  • Bitget TokenBitget Token(BGB)$4.510.15%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

TikTok Researchers Introduce ‘Depth Anything’: A Highly Practical Solution for Robust Monocular Depth Estimation

February 5, 2024
in AI & Technology
Reading Time: 4 mins read
A A
TikTok Researchers Introduce ‘Depth Anything’: A Highly Practical Solution for Robust Monocular Depth Estimation
ShareShareShareShareShare

YOU MAY ALSO LIKE

Rethinking AI: DeepSeek’s playbook shakes up the high-spend, high-compute paradigm

Google Search uses AI-generated podcast hosts to answer your questions

Foundational models are large deep-learning neural networks that are used as a starting point to develop effective ML models. They rely on large-scale training data and exhibit exceptional zero/few-shot performance in numerous tasks, making them invaluable in the field of natural language processing and computer vision. Foundational models are also used in Monocular Depth Estimation (MDE), i.e., estimating depth from a single image, and are widely used in autonomous vehicles, robotics, and virtual reality. However, as building datasets with millions of depth labels is challenging, MDE has not been explored to the fullest, and the associated MDE models show poor performance in some scenarios.

To address the abovementioned issue, the authors of this research paper from The University of Hong Kong, TikTok, Zhejiang Lab, and Zhejiang University have developed a foundational model for MDE that can produce high-quality depth information from images. Traditional depth datasets are created from depth sensors, stereo matching, or SfM, which is time-consuming and costly. On the contrary, in this work, the researchers have focused on large-scale unlabeled data that are simple and cheap to acquire, diverse, and easy to annotate.

Their work utilizes labeled and unlabeled data for better depth estimation, with the main focus on the latter. The researchers collected 1.5 Million labeled images from 6 public datasets, and for the unlabeled ones, they designed a depth engine that automatically generates depth annotations for unlabeled images. They used the collected labeled images to train an initial MDE model, which subsequently annotated the unlabeled ones, creating a self-learning pipeline. 

In the joint learning phase, the model is challenged with a tougher optimization target for additional knowledge. Additionally, the researchers also proposed leveraging rich semantic priors from pre-trained encoders instead of using an auxiliary semantic segmentation task for better scene understanding.

For evaluation, the researchers compared their model’s zero-shot depth estimation capabilities on six unseen datasets against the best model from the latest MiDaS v3.1. The results show that Depth Anything outperforms the MiDaS model significantly across extensive scenes and on several unseen datasets. Moreover, the model also leads to a better metric depth estimation than the ZoeDepth based on MiDaS. Additionally, on evaluating the semantic segmentation, the researchers observe that Depth Anything gives superior results on MDE and semantic segmentation tasks and has the potential to be used as a generic multi-task encoder for middle-level and high-level visual perception systems.

In conclusion, Depth Anything is an effective solution to robust MDE as it primarily focuses on cheap and diverse unlabeled images. For better results, the researchers have made the optimization target when learning unlabeled images more challenging and have preserved rich semantic priors from pre-trained models. This leads to much better performance and zero-shot estimation capabilities. Moreover, the model is able to surpass the latest MiDaS mode, highlighting its potential to be used in downstream depth estimation tasks.


Check out the Paper and Github. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and Google News. Join our 36k+ ML SubReddit, 41k+ Facebook Community, Discord Channel, and LinkedIn Group.

If you like our work, you will love our newsletter..

Don’t Forget to join our Telegram Channel


Asif Razzaq is the CEO of Marktechpost Media Inc.. As a visionary entrepreneur and engineer, Asif is committed to harnessing the potential of Artificial Intelligence for social good. His most recent endeavor is the launch of an Artificial Intelligence Media Platform, Marktechpost, which stands out for its in-depth coverage of machine learning and deep learning news that is both technically sound and easily understandable by a wide audience. The platform boasts of over 2 million monthly views, illustrating its popularity among audiences.


🎯 [FREE AI WEBINAR] ‘Using ANN for Vector Search at Speed & Scale (Demo on AWS)’ (Feb 5, 2024)


Credit: Source link

ShareTweetSendSharePin

Related Posts

Rethinking AI: DeepSeek’s playbook shakes up the high-spend, high-compute paradigm
AI & Technology

Rethinking AI: DeepSeek’s playbook shakes up the high-spend, high-compute paradigm

June 14, 2025
Google Search uses AI-generated podcast hosts to answer your questions
AI & Technology

Google Search uses AI-generated podcast hosts to answer your questions

June 14, 2025
Long Puppy and Otto’s Galactic Groove!!
AI & Technology

Long Puppy and Otto’s Galactic Groove!!

June 14, 2025
Sonos speakers and soundbars are on sale for record-low prices
AI & Technology

Sonos speakers and soundbars are on sale for record-low prices

June 14, 2025
Next Post
Summer heat wave sends temperatures and economic costs soaring

Summer heat wave sends temperatures and economic costs soaring

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
ChatGPT facing significant disruptions as users report AI chatbot is slow to respond, not working

ChatGPT facing significant disruptions as users report AI chatbot is slow to respond, not working

June 10, 2025
Elon Musk says he will step back from politics and government 

Elon Musk says he will step back from politics and government 

June 14, 2025
Lester Holt signs off from Nightly News after ten years leading the broadcast

Lester Holt signs off from Nightly News after ten years leading the broadcast

June 9, 2025

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!