# Core ML frameworks
torch
torchaudio
transformers

# Audio processing
librosa
soundfile
pydub
audioread

# Speech models and tools
speechbrain
resemblyzer
openai-whisper
demucs

# PyAnnote for diarization (requires HF token with model access)
pyannote.audio>=3.1.0

# Quality metrics
pesq
pystoi

# YouTube downloading (fallback)
yt-dlp

# Data processing
numpy
scipy
pandas
scikit-learn

# Cloud services
boto3
supabase
google-cloud-storage

# Utilities
tqdm
pyyaml
python-dotenv

# Additional audio utilities
webrtcvad
noisereduce
panns-inference

# Hybrid music detection
inaSpeechSegmenter>=0.8.0