cast2md
Podcast transcription service -- download episodes via RSS and transcribe with Whisper. Automatically fetches publisher-provided transcripts (Podcasting 2.0) or Pocket Casts auto-generated transcripts before falling back to local transcription.
Personal Project
This is a personal project under active development. I'm sharing it in case others find it useful, but I'm not currently providing support or reviewing pull requests.

Features
- RSS Feed Management -- Add podcast feeds via RSS or Apple Podcasts URLs. Automatic episode discovery and polling.
- Transcript-First Workflow -- Fetches transcripts from Podcasting 2.0 tags and Pocket Casts before downloading audio for Whisper.
- Whisper Transcription -- Local transcription with faster-whisper or mlx-whisper. Supports CPU, CUDA, and Apple Silicon.
- Distributed Transcription -- Use remote machines (M4 Macs, GPU PCs, RunPod) to transcribe in parallel.
- Hybrid Search -- Full-text and semantic search across episode metadata and transcript content with pgvector.
- REST API & MCP Server -- Full API for automation. Claude integration via Model Context Protocol.
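
As an illustration of the transcript-first feature: Podcasting 2.0 feeds advertise publisher transcripts with a `<podcast:transcript>` element inside each `<item>`. The sketch below shows one way such tags could be read using only the Python standard library. It is not cast2md's actual code; the function name `find_transcripts` and the sample feed are assumptions made up for this example.

```python
import xml.etree.ElementTree as ET

# Namespace used by Podcasting 2.0 tags such as <podcast:transcript>.
PODCAST_NS = {"podcast": "https://podcastindex.org/namespace/1.0"}

# Hypothetical sample feed, for illustration only.
SAMPLE_FEED = """<rss version="2.0" xmlns:podcast="https://podcastindex.org/namespace/1.0">
  <channel>
    <title>Example Show</title>
    <item>
      <title>Episode 1</title>
      <podcast:transcript url="https://example.com/ep1.vtt" type="text/vtt" />
    </item>
    <item>
      <title>Episode 2</title>
    </item>
  </channel>
</rss>
"""


def find_transcripts(feed_xml: str) -> dict[str, list[tuple[str, str]]]:
    """Map each episode title to its (url, mime_type) transcript links.

    Hypothetical helper: episodes without a <podcast:transcript> tag
    map to an empty list.
    """
    root = ET.fromstring(feed_xml)
    result: dict[str, list[tuple[str, str]]] = {}
    for item in root.iter("item"):
        title = item.findtext("title", default="")
        result[title] = [
            (tag.get("url", ""), tag.get("type", ""))
            for tag in item.findall("podcast:transcript", PODCAST_NS)
        ]
    return result


if __name__ == "__main__":
    for title, links in find_transcripts(SAMPLE_FEED).items():
        print(title, "->", links or "no external transcript")
```

An episode with an empty list here is one that would need another source (Pocket Casts) or local Whisper transcription.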
Quick Start
See the Installation Guide for full details.
How It Works

        New Episode Discovered
                  │
                  ▼
        Check External Transcripts
      (Podcasting 2.0, Pocket Casts)
                  │
             ┌────┴────┐
             │         │
           Found   Not Found
             │         │
             ▼         ▼
           Done   Download Audio
                       │
                       ▼
               Transcribe (Whisper)
                       │
                       ▼
                     Done

- Feed discovery -- add RSS feeds; episodes are discovered automatically
- Transcript download -- checks publisher transcripts and Pocket Casts first
- Audio fallback -- downloads audio only when no external transcript exists
- Whisper transcription -- local or distributed transcription
- Search & access -- full-text and semantic search, REST API, MCP for Claude
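
The fallback order described above can be sketched as a small function. This is a minimal illustration, not the project's implementation: `get_transcript`, `Result`, and the callables passed in are hypothetical names, and the real fetchers, downloader, and Whisper backend are represented by plain functions.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence

# A transcript source takes an episode URL and returns text,
# or None when no transcript is available from that source.
TranscriptSource = Callable[[str], Optional[str]]


@dataclass
class Result:
    text: str
    source: str  # name of the step that produced the transcript


def get_transcript(
    episode_url: str,
    external_sources: Sequence[tuple[str, TranscriptSource]],
    download_audio: Callable[[str], str],
    transcribe: Callable[[str], str],
) -> Result:
    """Try external sources in order; fall back to audio download + Whisper."""
    for name, fetch in external_sources:
        text = fetch(episode_url)
        if text:
            return Result(text=text, source=name)
    # No external transcript found: only now download the audio.
    audio_path = download_audio(episode_url)
    return Result(text=transcribe(audio_path), source="whisper")


if __name__ == "__main__":
    # Stub callables standing in for real fetchers and a Whisper backend.
    sources = [
        ("podcasting2.0", lambda url: None),
        ("pocketcasts", lambda url: "Hello from Pocket Casts"),
    ]
    result = get_transcript(
        "https://example.com/ep1",
        sources,
        download_audio=lambda url: "/tmp/ep1.mp3",
        transcribe=lambda path: "Hello from Whisper",
    )
    print(result.source, "->", result.text)
```

Passing the sources as an ordered list keeps the priority explicit, and the audio download is reached only after every external source has returned nothing.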
Documentation
| Section | Description |
|---|---|
| Getting Started | Architecture overview and key concepts |
| Installation | Docker, manual install, and node setup |
| Configuration | Environment variables and settings |
| Usage | Web UI, CLI, REST API, and MCP server |
| Features | Transcript sources, search, episode states |
| Distributed Transcription | Multi-machine setup and RunPod GPU workers |
| Deployment | Production deployment and server sizing |
| Development | Dev setup, testing, and UI guidelines |