Researchers from UCL and Google Propose AudioSlots: A Slot-Centric Generative Model For Audio Domain Blind Source Separation

May 25, 2023

Neural network architectures that operate on set-structured data, and that learn to map unstructured inputs to set-structured outputs, have recently received much attention. Slot-centric (or object-centric) architectures underpin recent progress in object detection and unsupervised object discovery, especially in the vision domain. Their built-in inductive bias of permutation equivariance also makes them well suited to audio separation. This paper applies the key ideas from these architectures to blind source separation: separating the individual audio sources in a mixed signal without privileged information about the sources or the mixing process.

Figure 1: Overview of the architecture. The input waveform is cut into chunks and converted to a spectrogram. A neural network then encodes the spectrogram into a permutation-invariant set of source embeddings (s1…n), which are decoded into a set of individual source spectrograms. The whole pipeline is supervised with a matching-based, permutation-invariant loss computed against the ground-truth source spectrograms.
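To make the idea of a matching-based, permutation-invariant loss concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the function names, the mean-squared-error cost, and the brute-force search over all orderings are assumptions chosen for illustration. A brute-force search is only practical for a handful of sources; larger sets are usually matched with an assignment algorithm such as the Hungarian method.

```python
from itertools import permutations


def mse(a, b):
    """Mean squared error between two equally shaped 2-D spectrograms (lists of rows)."""
    total, count = 0.0, 0
    for row_a, row_b in zip(a, b):
        for x, y in zip(row_a, row_b):
            total += (x - y) ** 2
            count += 1
    return total / count


def permutation_invariant_loss(predicted, targets):
    """Return (loss, assignment) for the best pairing of predictions to targets.

    assignment[i] is the index of the target matched to predicted[i].
    Hypothetical helper: tries every ordering, so it only suits small sets.
    """
    n = len(predicted)
    best_loss, best_perm = float("inf"), None
    for perm in permutations(range(n)):
        loss = sum(mse(predicted[i], targets[perm[i]]) for i in range(n)) / n
        if loss < best_loss:
            best_loss, best_perm = loss, perm
    return best_loss, best_perm


# Toy 2x2 "spectrograms" standing in for two separated sources.
src_a = [[1.0, 0.0], [0.0, 1.0]]
src_b = [[0.0, 2.0], [2.0, 0.0]]

# The loss is the same whether the model emits the sources in the
# reference order or in the swapped order; only the assignment changes.
loss_same, perm_same = permutation_invariant_loss([src_a, src_b], [src_a, src_b])
loss_swapped, perm_swapped = permutation_invariant_loss([src_b, src_a], [src_a, src_b])
```

Because the loss is taken over the best pairing, the model is never penalized for producing the correct sources in a different order from the reference.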

Sound separation is a set-based problem because the ordering of the sources is arbitrary. The authors frame it as a permutation-invariant conditional generative modeling problem: the model learns a mapping from a mixed audio spectrogram to an unordered set of separate source spectrograms. Their method, AudioSlots, decomposes the audio into one latent variable per source (also known as a “slot”), and each slot is then decoded into a source-specific spectrogram. The encoder and decoder are built on the Transformer architecture. The model is permutation-equivariant, so it does not depend on the ordering of the slots. To assess the potential of such an architecture, the authors train AudioSlots with a matching-based loss to generate the individual sources from the mixed audio input.
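Permutation equivariance can be checked on a toy example: reordering the slots before a layer should give the same result as reordering the layer's outputs afterwards. The sketch below is purely illustrative and is not the AudioSlots model. The layer `toy_set_layer` is a hypothetical stand-in that updates every slot with the same rule, using only the slot itself and the order-independent mean of all slots.

```python
import math


def toy_set_layer(slots):
    """Hypothetical equivariant layer: the same update is applied to every slot,
    and slots interact only through their order-independent mean."""
    dim = len(slots[0])
    mean = [sum(s[d] for s in slots) / len(slots) for d in range(dim)]
    return [[2.0 * s[d] - mean[d] for d in range(dim)] for s in slots]


def permute(items, order):
    """Reorder a list so that position k holds items[order[k]]."""
    return [items[i] for i in order]


def all_close(xs, ys):
    """Element-wise comparison of two lists of vectors with a float tolerance."""
    return all(
        math.isclose(x, y, abs_tol=1e-9)
        for vx, vy in zip(xs, ys)
        for x, y in zip(vx, vy)
    )


slots = [[1.0, 2.0], [3.0, 4.0], [5.0, 9.0]]
order = [2, 0, 1]

# Apply the layer and then permute, versus permute and then apply the layer.
out_then_permute = permute(toy_set_layer(slots), order)
permute_then_out = toy_set_layer(permute(slots, order))
is_equivariant = all_close(out_then_permute, permute_then_out)
```

A layer that treated slot positions differently, for example by adding a position-dependent offset, would fail this check.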

Researchers from University College London and Google Research introduce AudioSlots, a slot-centric generative architecture for audio spectrograms. They provide evidence that structured generative models of this kind have potential for tackling audio source separation. Their current implementation has several drawbacks, such as low reconstruction quality for high-frequency features and the need for separate audio sources as supervision, but they are confident that these issues can be resolved and suggest several directions for further research.

They demonstrate their method on a simple two-speaker speech separation task from Libri2Mix. They find that sound separation with slot-centric generative models shows promise but comes with some difficulties: the version of the model they present struggles to generate high-frequency details, relies on heuristics to stitch independently predicted audio chunks together, and still needs ground-truth reference audio sources for training. The authors are optimistic that these difficulties can be addressed in future work, and their study outlines potential routes for doing so. For now, their results primarily serve as a proof of concept for the idea.

Aneesh Tickoo is a consulting intern at MarktechPost. He is currently pursuing an undergraduate degree in Data Science and Artificial Intelligence at the Indian Institute of Technology (IIT), Bhilai. He spends most of his time working on projects aimed at harnessing the power of machine learning. His research interest is image processing, and he is passionate about building solutions around it. He loves to connect with people and collaborate on interesting projects.