// PRODUCT_DOSSIER

Media to Subtitle

Python desktop app that batch-transcribes local audio/video into TXT and SRT subtitles with Whisper, MLX, and FFmpeg.

LATEST: v1.0.2

Built a local AI subtitle generator for converting audio and video files into transcripts and timed SRT subtitles. The app uses Python and CustomTkinter for a polished desktop interface, Whisper/OpenAI Whisper with MLX acceleration on Apple Silicon, and FFmpeg/FFprobe for media probing and audio preprocessing. It supports single-file and folder batch processing, configurable transcription models, denoise/normalization profiles, optional local LLM-based technical term refinement, progress tracking, stop-after-current-file control, diagnostic logging, settings persistence, and desktop packaging through PyInstaller with platform-specific build and publish scripts.

FFmpegpythoncustomtkinterwhisperspeech-to-textsubtitle-generationdesktop-app

DEPLOYMENT MATRIX

Platform	Version	Action
macOS	1.0.2	Download

TERMINAL LOGS / CHANGELOG

$ release --version 1.0.2 --date 2025-03-25

- Production release with local AI integration.
- Supports Whisper-base-mlx models.

[back_to_software_matrix]