• bitcoinBitcoin(BTC)$75,806.00-2.04%
  • ethereumEthereum(ETH)$2,068.07-2.47%
  • tetherTether(USDT)$1.00-0.07%
  • binancecoinBNB(BNB)$655.11-0.96%
  • rippleXRP(XRP)$1.33-2.11%
  • usd-coinUSDC(USDC)$1.000.00%
  • solanaSolana(SOL)$83.46-2.75%
  • tronTRON(TRX)$0.3736080.27%
  • Figure HelocFigure Heloc(FIGR_HELOC)$1.031.75%
  • dogecoinDogecoin(DOGE)$0.100601-2.22%
  • HyperliquidHyperliquid(HYPE)$60.45-2.28%
  • USDSUSDS(USDS)$1.00-0.04%
  • zcashZcash(ZEC)$583.71-11.63%
  • leo-tokenLEO Token(LEO)$9.99-0.20%
  • cardanoCardano(ADA)$0.238896-2.84%
  • RainRain(RAIN)$0.01169646.95%
  • moneroMonero(XMR)$379.28-1.37%
  • bitcoin-cashBitcoin Cash(BCH)$344.21-1.71%
  • chainlinkChainlink(LINK)$9.35-2.18%
  • whitebitWhiteBIT Coin(WBT)$55.75-2.20%
  • CantonCanton(CC)$0.156381-5.19%
  • the-open-networkToncoin(TON)$1.99-0.94%
  • stellarStellar(XLM)$0.146879-2.34%
  • USD1USD1(USD1)$1.00-0.10%
  • Ethena USDeEthena USDe(USDE)$1.00-0.04%
  • daiDai(DAI)$1.000.00%
  • suiSui(SUI)$1.01-4.56%
  • litecoinLitecoin(LTC)$51.69-2.36%
  • avalanche-2Avalanche(AVAX)$9.13-2.78%
  • MemeCoreMemeCore(M)$2.90-0.59%
  • hedera-hashgraphHedera(HBAR)$0.086338-2.86%
  • paypal-usdPayPal USD(PYUSD)$1.00-0.01%
  • nearNEAR Protocol(NEAR)$2.63-4.88%
  • shiba-inuShiba Inu(SHIB)$0.000006-2.04%
  • crypto-com-chainCronos(CRO)$0.067239-2.88%
  • Circle USYCCircle USYC(USYC)$1.120.00%
  • Global DollarGlobal Dollar(USDG)$1.000.00%
  • BittensorBittensor(TAO)$279.96-1.80%
  • tether-goldTether Gold(XAUT)$4,489.11-1.30%
  • BlackRock USD Institutional Digital Liquidity FundBlackRock USD Institutional Digital Liquidity Fund(BUIDL)$1.000.00%
  • Ondo US Dollar YieldOndo US Dollar Yield(USDY)$1.140.45%
  • pax-goldPAX Gold(PAXG)$4,496.61-1.33%
  • mantleMantle(MNT)$0.64-2.66%
  • polkadotPolkadot(DOT)$1.24-3.28%
  • uniswapUniswap(UNI)$3.24-3.12%
  • OndoOndo(ONDO)$0.406113-6.76%
  • okbOKB(OKB)$90.047.87%
  • HTX DAOHTX DAO(HTX)$0.000002-0.45%
  • World Liberty FinancialWorld Liberty Financial(WLFI)$0.056928-7.69%
  • Ripple USDRipple USD(RLUSD)$1.00-0.01%
TradePoint.io
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop
No Result
View All Result
TradePoint.io
No Result
View All Result

Meet OmniVoice Studio: A Local, Open-Source Alternative to ElevenLabs

May 26, 2026
in AI & Technology
Reading Time: 5 mins read
A A
Meet OmniVoice Studio: A Local, Open-Source Alternative to ElevenLabs
ShareShareShareShareShare

OmniVoice Studio — How to Use It
01 / 08

YOU MAY ALSO LIKE

SpaceX Reportedly Pressured The Pentagon Into Paying More For Starlink Access

Spain Blocks Polymarket And Kalshi As It Investigates Prediction Market Platforms

What Is OmniVoice Studio?

OmniVoice Studio is an open-source desktop application for voice cloning, video dubbing, real-time dictation, and speaker diarization. Everything runs locally on your machine. No API keys, no cloud account, no subscription required.

  • 646 languages supported for TTS via the default OmniVoice engine
  • 99 languages for transcription via WhisperX
  • Available on macOS, Windows, and Linux
  • GPU is optional — full pipeline runs on CPU
  • Free for personal, educational, and research use (FSL-1.1-ALv2)

OmniVoice Studio — How to Use It
02 / 08

System Requirements

A GPU is optional. Without one, TTS runs approximately 3× slower on CPU. With ≤8 GB VRAM, TTS automatically offloads to CPU during transcription — no config needed.

Component Minimum Recommended
OS Win 10 / macOS 12+ / Ubuntu 20.04+ Any modern 64-bit OS
RAM 8 GB 16 GB+
VRAM 4 GB (auto-offloads) 8 GB+ (RTX 3060+)
Disk 10 GB free 20 GB+ SSD
Python 3.10+ 3.11–3.12
GPU Optional CUDA / MPS / ROCm

OmniVoice Studio — How to Use It
03 / 08

Installation

The project recommends running from source. Install three prerequisites first: ffmpeg, Bun (JS runtime), and uv (Python package manager).

git clone https://github.com/debpalash/OmniVoice-Studio.git
cd OmniVoice-Studio
uv sync
bun install
bun dev

Frontend loads at http://localhost:5173  |  API runs on port 8000.
Model weights download automatically on first generation.

Pre-built installers available: macOS DMG, Windows MSI, Linux AppImage and .deb — see the Releases page on GitHub.

OmniVoice Studio — How to Use It
04 / 08

Voice Cloning

Voice cloning uses zero-shot learning — it clones a voice from a clip as short as 3 seconds, without prior training on that voice. The default OmniVoice engine conditions a diffusion-based TTS model on the reference audio.

  • Go to the Voice Clone tab in the UI
  • Upload or record a 3-second audio clip of the target voice
  • Enter your text and select a target language (646 available)
  • Click Generate — output is saved to your project library

Voice Gallery: Search YouTube, browse categories, and download reference clips directly inside the app to build your voice library.

OmniVoice Studio — How to Use It
05 / 08

Video Dubbing

The full dubbing pipeline runs locally: transcribe → translate → synthesize → mux. Demucs isolates vocals so the original background audio is preserved in the final export.

  • Go to the Dub tab — paste a YouTube URL or upload a local file
  • WhisperX transcribes speech with word-level alignment
  • Select a target language; translation runs automatically
  • TTS engine re-voices the transcript; Demucs preserves background audio
  • Export the final MP4 with dubbed audio mixed in

Batch Queue: Drop up to 50 videos and walk away. Each job has its own progress bar tracking through the full pipeline.

OmniVoice Studio — How to Use It
06 / 08

Dictation & Speaker Diarization

Dictation works system-wide from any application. Diarization identifies individual speakers in a multi-speaker audio file using Pyannote + WhisperX.

  • Press ⌘+⇧+Space (macOS) to open the floating dictation widget
  • Speech streams via WebSocket and auto-pastes into the active input field
  • Upload a multi-speaker file to the Diarization tab
  • Pyannote identifies who said what; each speaker gets an auto-extracted voice profile
  • Assign a TTS voice per speaker for per-speaker dubbing

Hugging Face token required for Pyannote diarization. See docs/setup/huggingface-token.md in the repo.

OmniVoice Studio — How to Use It
07 / 08

TTS Engines

Six TTS engines are built in. Switch via Settings → TTS Engine or the env var:
OMNIVOICE_TTS_BACKEND=cosyvoice

Engine Languages Clone Platform
OmniVoice (default) 600+ ✓ CUDA / MPS / CPU
CosyVoice 3 9 + 18 dialects ✓ CUDA / MPS / CPU
MLX-Audio Multi Varies Apple Silicon only
VoxCPM2 30 ✓ CUDA / MPS / CPU
MOSS-TTS-Nano 20 ✓ CUDA / CPU
KittenTTS English ✗ CPU only

Custom engine: Subclass TTSBackend in backend/services/tts_backend.py and add it to _REGISTRY. ~50 lines of Python.

OmniVoice Studio — How to Use It
08 / 08

MCP Server & Resources

OmniVoice Studio ships a built-in MCP Server, exposing voice and dubbing capabilities to any MCP-compatible client — Claude, Cursor, or your own tooling — without opening the desktop UI.

  • MCP Server starts alongside the FastAPI backend on bun dev
  • Point your MCP client at the local server to access all endpoints
  • AudioSeal (Meta) embeds an invisible neural watermark in all generated audio for AI provenance
  • GitHub: github.com/debpalash/OmniVoice-Studio
  • Install docs: docs/install/ (macos / windows / linux / docker)
  • Troubleshooting: docs/install/troubleshooting.md
  • Discord: discord.gg/bzQavDfVV9

Credit: Source link

ShareTweetSendSharePin

Related Posts

SpaceX Reportedly Pressured The Pentagon Into Paying More For Starlink Access
AI & Technology

SpaceX Reportedly Pressured The Pentagon Into Paying More For Starlink Access

May 26, 2026
Spain Blocks Polymarket And Kalshi As It Investigates Prediction Market Platforms
AI & Technology

Spain Blocks Polymarket And Kalshi As It Investigates Prediction Market Platforms

May 26, 2026
Blue Origin Cleared To Launch New Glenn Rocket Again After FAA Investigation
AI & Technology

Blue Origin Cleared To Launch New Glenn Rocket Again After FAA Investigation

May 26, 2026
Design a Complete Multimodal RLVR Pipeline with Open-MM-RL, Vision-Language Prompting, Reward Scoring, and GRPO Export
AI & Technology

Design a Complete Multimodal RLVR Pipeline with Open-MM-RL, Vision-Language Prompting, Reward Scoring, and GRPO Export

May 26, 2026
Next Post
Blue Origin Cleared To Launch New Glenn Rocket Again After FAA Investigation

Blue Origin Cleared To Launch New Glenn Rocket Again After FAA Investigation

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
SoftBank’s  Billion OpenAI Bet Sparks Concerns

SoftBank’s $60 Billion OpenAI Bet Sparks Concerns

May 25, 2026
Seattle’s lefty mayor, Katie Wilson, walks back stinging comments about Starbucks

Seattle’s lefty mayor, Katie Wilson, walks back stinging comments about Starbucks

May 20, 2026
She Just Got a 0,000 Inheritance and Doesn’t Know What to Do With It

She Just Got a $200,000 Inheritance and Doesn’t Know What to Do With It

May 20, 2026

About

Learn more

Our Services

Legal

Privacy Policy

Terms of Use

Bloggers

Learn more

Article Links

Contact

Advertise

Ask us anything

©2020- TradePoint.io - All rights reserved!

Tradepoint.io, being just a publishing and technology platform, is not a registered broker-dealer or investment adviser. So we do not provide investment advice. Rather, brokerage services are provided to clients of Tradepoint.io by independent SEC-registered broker-dealers and members of FINRA/SIPC. Every form of investing carries some risk and past performance is not a guarantee of future results. “Tradepoint.io“, “Instant Investing” and “My Trading Tools” are registered trademarks of Apperbuild, LLC.

This website is operated by Apperbuild, LLC. We have no link to any brokerage firm and we do not provide investment advice. Every information and resource we provide is solely for the education of our readers. © 2020 Apperbuild, LLC. All rights reserved.

No Result
View All Result
  • Main
  • AI & Technology
  • Stock Charts
  • Market & News
  • Business
  • Finance Tips
  • Trade Tube
  • Blog
  • Shop

© 2023 - TradePoint.io - All Rights Reserved!