cast2md

Podcast transcription service -- download episodes via RSS and transcribe with Whisper. Automatically fetches publisher-provided transcripts (Podcasting 2.0) or Pocket Casts auto-generated transcripts before falling back to local transcription.

Personal Project

This is a personal project under active development. I'm sharing it in case others find it useful, but I'm not currently providing support or reviewing pull requests.

Features

RSS Feed Management

Add podcast feeds via RSS or Apple Podcasts URLs. Automatic episode discovery and polling.

Transcript-First Workflow

Fetches transcripts from Podcasting 2.0 tags and Pocket Casts before downloading audio for Whisper.

Whisper Transcription

Local transcription with faster-whisper or mlx-whisper. Supports CPU, CUDA, and Apple Silicon.

Distributed Transcription

Use remote machines (M4 Macs, GPU PCs, RunPod) to transcribe in parallel.

Hybrid Search

Full-text and semantic search across episode metadata and transcript content with pgvector.

REST API & MCP Server

Full API for automation. Claude integration via Model Context Protocol.
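
As a rough illustration of the transcript-first feature: the Podcasting 2.0 namespace defines a `podcast:transcript` element that a feed item can carry, with `url` and `type` attributes. The sketch below pulls those tags out of a feed using only Python's standard library. The sample feed, its URLs, and the `find_transcripts` helper are invented for this example and are not taken from the cast2md codebase, which may handle feeds differently.

```python
import xml.etree.ElementTree as ET

# Namespace URI defined by the Podcasting 2.0 (Podcast Index) specification.
PODCAST_NS = "https://podcastindex.org/namespace/1.0"

# Hypothetical feed used only for this illustration.
SAMPLE_FEED = """<rss version="2.0" xmlns:podcast="https://podcastindex.org/namespace/1.0">
  <channel>
    <title>Example Show</title>
    <item>
      <title>Episode 1</title>
      <podcast:transcript url="https://example.com/ep1.vtt" type="text/vtt" />
      <podcast:transcript url="https://example.com/ep1.srt" type="application/x-subrip" />
    </item>
    <item>
      <title>Episode 2</title>
    </item>
  </channel>
</rss>
"""


def find_transcripts(feed_xml: str) -> dict[str, list[tuple[str, str]]]:
    """Map each episode title to its (url, MIME type) transcript entries."""
    root = ET.fromstring(feed_xml)
    result = {}
    for item in root.iter("item"):
        title = item.findtext("title", default="(untitled)")
        # ElementTree addresses namespaced tags as "{namespace-uri}localname".
        tags = item.findall(f"{{{PODCAST_NS}}}transcript")
        result[title] = [
            (tag.get("url"), tag.get("type", ""))
            for tag in tags
            if tag.get("url")  # skip malformed entries without a URL
        ]
    return result


for title, entries in find_transcripts(SAMPLE_FEED).items():
    print(title, entries)
```

An episode with an empty list here is one where a transcript-first pipeline would have to try another source or fall back to audio.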

Quick Start

Docker (Recommended)

git clone https://github.com/meltforce/cast2md.git
cd cast2md
cp .env.example .env
# Edit .env -- set POSTGRES_PASSWORD at minimum
docker compose up -d

Open http://localhost:8000 to access the web UI.

Manual Install

curl -LsSf https://astral.sh/uv/install.sh | sh
git clone https://github.com/meltforce/cast2md.git
cd cast2md
uv sync --frozen
cp .env.example .env
# Edit .env with your settings
uv run cast2md init-db
uv run cast2md serve

See the Installation Guide for full details.

How It Works

New Episode Discovered
        │
        ▼
  Check External Transcripts
  (Podcasting 2.0, Pocket Casts)
        │
   ┌────┴────┐
   │         │
Found    Not Found
   │         │
   ▼         ▼
 Done    Download Audio
            │
            ▼
      Transcribe (Whisper)
            │
            ▼
          Done

1. Feed discovery -- add RSS feeds; episodes are discovered automatically
2. Transcript download -- checks publisher transcripts and Pocket Casts first
3. Audio fallback -- downloads audio only when no external transcript exists
4. Whisper transcription -- local or distributed transcription
5. Search & access -- full-text and semantic search, REST API, MCP for Claude
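
The fallback logic in the flow above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not cast2md's actual implementation: the `Episode` record, the `get_transcript` function, and the stub sources are all hypothetical names, and the real service also handles queuing, downloads, and storage that are left out here.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Episode:
    """Hypothetical episode record; field names are illustrative only."""
    title: str
    audio_url: str


# A transcript source takes an episode and returns text, or None if it has nothing.
TranscriptSource = Callable[[Episode], Optional[str]]


def get_transcript(
    episode: Episode,
    external_sources: list[TranscriptSource],
    transcribe_audio: Callable[[str], str],
) -> tuple[str, str]:
    """Return (transcript, origin), trying external sources before audio."""
    for source in external_sources:
        text = source(episode)
        if text:
            # An external transcript exists, so no audio download is needed.
            return text, "external"
    # No external transcript found: fall back to transcribing the audio.
    return transcribe_audio(episode.audio_url), "whisper"


# Stub sources standing in for the real lookups (assumptions for this demo).
def publisher_transcript(ep: Episode) -> Optional[str]:
    return None  # pretend the feed carries no Podcasting 2.0 transcript


def pocket_casts_transcript(ep: Episode) -> Optional[str]:
    return "hello world" if ep.title == "Ep 1" else None


def fake_whisper(audio_url: str) -> str:
    return f"[transcribed {audio_url}]"


sources = [publisher_transcript, pocket_casts_transcript]
ep1 = Episode("Ep 1", "https://example.com/1.mp3")
ep2 = Episode("Ep 2", "https://example.com/2.mp3")
print(get_transcript(ep1, sources, fake_whisper))
print(get_transcript(ep2, sources, fake_whisper))
```

Passing the sources as an ordered list keeps the priority explicit: the expensive audio path runs only after every cheaper source has returned nothing.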

Documentation

Section	Description
Getting Started	Architecture overview and key concepts
Installation	Docker, manual install, and node setup
Configuration	Environment variables and settings
Usage	Web UI, CLI, REST API, and MCP server
Features	Transcript sources, search, episode states
Distributed Transcription	Multi-machine setup and RunPod GPU workers
Deployment	Production deployment and server sizing
Development	Dev setup, testing, and UI guidelines