Summary

Nick Spatties’ June 2024 hands-on evaluation of using fabric (an open-source CLI tool with community LLM patterns) + yt (YouTube transcript extractor) to auto-generate notes from YouTube videos. Benchmark: 23-minute video → notes in 11.6 seconds vs. 21 minutes manual; combined with manual verification, 57.6% time savings.

Nick Spatties 2024 年 6 月對使用 fabric(帶有社區 LLM 模式的開源 CLI 工具)+ yt(YouTube 字幕提取器)從 YouTube 視頻自動生成筆記的實際評估。基準:23 分鐘視頻 → 11.6 秒內生成筆記 vs. 21 分鐘手動;結合手動驗證,節省 57.6% 時間。

Key Points

  • fabric: open-source CLI tool; queries LLM with community-written “patterns” (prompt templates); supports Anthropic, OpenAI, Gemini, and local Ollama models
  • yt: companion script included with fabric; extracts YouTube video transcripts and metadata via YouTube Data API v3
  • Pipeline: yt --transcript [url] | fabric -p extract_wisdom -o notes.md
  • Benchmark results:
    • Watching video: 23:33
    • 1.5x speed + manual notes: 21:21
    • fabric alone: 00:12 (130× speedup)
    • fabric + manual verification skim: 09:59 (57.6% faster than watching)
  • Quality assessment: generated notes capture high-level points but often miss specifics (“how”, not just “what”); useful as scaffold for further review
  • Cost: ~$0.003 per 23-minute video using Claude 3 Haiku via Anthropic API
  • Local models: possible via ollama but impractically slow on typical laptops (2 tokens/minute on llama3:8B)

Insights

The workflow reveals an important asymmetry: generating a structural outline via LLM (seconds) + targeted skim for specifics (minutes) is faster than watching end-to-end. The 130× speedup headline is somewhat misleading — the more honest “57.6% faster with verification” is the practical metric. The quality gap (fabric misses implementation specifics) suggests this works best when you already have background knowledge to fill gaps, rather than for total first-exposure learning. The $0.003/video cost is essentially negligible for regular use.

Connections

Raw Excerpt

These video notes were generated in 11.6 seconds!! That’s less than one hundredth of the time it took to write notes myself. Combined with the time it took to run fabric and get the notes (11.6 seconds), I spent 9 minutes and 59 seconds getting comprehensive notes from a video that is 23 minutes and 33 seconds long.