Generate a spoken MP3 summary and embed an audio player in the notes for Obsidian playback
visual
No
—
Query for visual context extraction. Downloads the video, finds transcript moments matching the query, extracts screenshots, and embeds them inline in the notes using Obsidian's ![[filename.png]] embed syntax.
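As a rough illustration of the `visual` flow described above, the sketch below matches a query against transcript segments and emits Obsidian embed lines. It is not the tool's actual implementation: the `Segment` type, the substring matching rule, and the screenshot naming scheme are all assumptions made for the example, and the frame extraction step itself is left out.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    start: float  # seconds from the beginning of the video
    text: str     # transcript text spoken at that moment


def find_matches(segments: List[Segment], query: str) -> List[Segment]:
    """Keep segments whose text contains every query word (case-insensitive substring match)."""
    words = query.lower().split()
    return [s for s in segments if all(w in s.text.lower() for w in words)]


def screenshot_name(video_id: str, seconds: float) -> str:
    """Hypothetical naming scheme: <video_id>_<MM>m<SS>s.png."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{video_id}_{minutes:02d}m{secs:02d}s.png"


def embed_lines(segments: List[Segment], query: str, video_id: str) -> List[str]:
    """Build note lines: an Obsidian image embed followed by the matching transcript text."""
    lines: List[str] = []
    for seg in find_matches(segments, query):
        lines.append(f"![[{screenshot_name(video_id, seg.start)}]]")
        lines.append(seg.text)
    return lines


# Example data (invented for illustration)
segments = [
    Segment(12.0, "Welcome to the talk"),
    Segment(95.5, "Here is the architecture diagram"),
    Segment(300.0, "Any questions?"),
]
note = embed_lines(segments, "architecture diagram", "demo")
```

In this sketch only the second segment matches, so `note` holds one embed line and the transcript text that goes with it.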
Transcriptions are stored in memory by default, so re-running on the same URL returns the stored transcription instantly.
The audio file is saved to ~/Downloads. The .md notes file is saved to ~/Desktop.
Works with any URL that yt-dlp supports (1000+ sites).
Multilingual: Non-English audio is transcribed in its original language, and Claude translates while formatting the notes. After saving, you are offered the option to store an English (eng) translation in memory alongside the original.
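The notes above on stored transcriptions and translations can be pictured with a small sketch. It assumes a plain in-process dictionary stands in for the memory store and that entries are keyed by URL and language code; the `TranscriptStore` class and the `fake_transcribe` function are hypothetical names, not part of the tool.

```python
from typing import Callable, Dict, List, Tuple


class TranscriptStore:
    """Hypothetical stand-in for the memory store, keyed by (URL, language code)."""

    def __init__(self, transcribe: Callable[[str], Tuple[str, str]]) -> None:
        # transcribe(url) is assumed to return (language_code, transcript_text)
        self._transcribe = transcribe
        self._entries: Dict[Tuple[str, str], str] = {}
        self._original_lang: Dict[str, str] = {}

    def get_original(self, url: str) -> str:
        """Transcribe on first request; later requests reuse the stored text."""
        if url not in self._original_lang:
            lang, text = self._transcribe(url)
            self._original_lang[url] = lang
            self._entries[(url, lang)] = text
        return self._entries[(url, self._original_lang[url])]

    def add_translation(self, url: str, lang: str, text: str) -> None:
        """Store a translated version alongside the original."""
        self._entries[(url, lang)] = text

    def languages(self, url: str) -> List[str]:
        """List the language codes stored for a URL."""
        return sorted(lang for (u, lang) in self._entries if u == url)


calls: List[str] = []


def fake_transcribe(url: str) -> Tuple[str, str]:
    # Placeholder for the real download-and-transcribe step
    calls.append(url)
    return ("deu", "Hallo Welt")


store = TranscriptStore(fake_transcribe)
url = "https://example.com/video"
first = store.get_original(url)
second = store.get_original(url)  # served from the store, no second transcription
store.add_translation(url, "eng", "Hello world")
```

Keying on the language code as well as the URL is one way to keep the original and the English version side by side without overwriting either.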
Obsidian Setup
Every .md on your Mac opens in Obsidian with real-time sync