Amir Mehrabiani



// PRODUCT_DOSSIER

Media to Subtitle

Python desktop app that batch-transcribes local audio/video into TXT and SRT subtitles with Whisper, MLX, and FFmpeg.

LATEST: v1.0.2

Built a local AI subtitle generator that converts audio and video files into transcripts and timed SRT subtitles. The app uses Python and CustomTkinter for the desktop interface, OpenAI Whisper for transcription with MLX acceleration on Apple Silicon, and FFmpeg/FFprobe for media probing and audio preprocessing. It supports single-file and folder batch processing, configurable transcription models, denoise/normalization profiles, and optional local LLM-based refinement of technical terms. It also provides progress tracking, a stop-after-current-file control, diagnostic logging, and settings persistence. Desktop packaging is handled by PyInstaller with platform-specific build and publish scripts.
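To illustrate the "timed SRT subtitles" output described above, here is a minimal sketch of turning transcription segments into SRT text. It is not the app's actual code: the `Segment` tuple shape and the function names `srt_timestamp` and `segments_to_srt` are assumptions made for this example, and it only assumes that the transcriber yields start time, end time, and text for each segment.

```python
from typing import Iterable, Tuple

# Hypothetical segment shape for this sketch: (start_seconds, end_seconds, text).
Segment = Tuple[float, float, str]


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = max(0, round(seconds * 1000))  # clamp negatives to zero
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments: Iterable[Segment]) -> str:
    """Build SRT text from segments, skipping empty ones and numbering from 1."""
    blocks = []
    index = 0
    for start, end, text in segments:
        text = text.strip()
        if not text:
            continue  # silent or empty segments get no cue
        index += 1
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [(0.0, 1.5, " Hello "), (1.5, 2.0, "  "), (2.0, 3.25, "World")]
    print(segments_to_srt(demo))
```

Note that SRT uses a comma before the milliseconds and numbers cues consecutively, so skipped empty segments do not leave gaps in the numbering.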

FFmpeg, python, customtkinter, whisper, speech-to-text, subtitle-generation, desktop-app

DEPLOYMENT MATRIX

Platform | Version | Action
macOS    | 1.0.2   | Download

TERMINAL LOGS / CHANGELOG

$ release --version 1.0.2 --date 2025-03-25

  • Production release with local AI integration.
  • Supports Whisper-base-mlx models.