Open Source · Apache 2.0

Dub Any Video
Into Any Language

Modular end-to-end AI dubbing pipeline. WhisperX speech recognition, neural translation, and voice synthesis—completely open source.

Star on GitHub See How It Works

EnglishFrançaisEspañolDeutsch日本語中文한국어العربيةहिंदीPortuguês

Capabilities

Everything You Need for
Video Localization

Professional-grade dubbing without the professional-grade price tag.

End-to-End Pipeline

Complete workflow from video input to dubbed output with burned-in subtitles. Upload a video, pick your target languages, and get back a fully localized file. No external tools, no manual steps.

Multiple Modes

Video dubbing with subtitles, audio-only translation, or subtitling-only mode. Maximum flexibility for every use case.

Modular Architecture

Swap ASR, translation, and TTS models independently. Use our defaults or plug in your own.

Smart Sync

VAD-based duration alignment and pyrubberband time-stretching for seamless voice replacement.

Netflix-Style Subtitles

Professional subtitle rendering with multiple styles — Netflix, bold-desktop, or mobile-optimized. Subtitles are burned directly into the video with pixel-perfect typography and positioning.

50+ Languages

Major world languages with automatic detection. From Mandarin to Arabic, Hindi to Portuguese.

The Pipeline

Four Steps to a
Dubbed Video

Each stage is a swappable module — clone the repo and run the whole pipeline yourself.

STEP 01

Speech Recognition

WhisperX extracts speech with word-level timestamps and speaker diarization

STEP 02

Translation

M2M-100 or deep-translator converts text while preserving context and timing

STEP 03

Voice Synthesis

Chatterbox voice cloning or Edge TTS generates natural speech in the target language

STEP 04

Merge & Output

Intelligent audio alignment, background mixing, and subtitle burning via FFmpeg

Dub Any VideoInto Any Language

Everything You Need forVideo Localization