Vocos Audio Reconstruction Studio
Upload an audio file to hear it reconstructed through the Vocos neural vocoder with advanced processing options.
Features:
- High-quality neural audio reconstruction
- Optional noise reduction
- Volume boost control
- Automatic silence trimming
- Detailed quality metrics
- Visual waveform and spectrogram analysis
Input
Processing Options
Apply spectral gating to reduce background noise
Volume boost: -20 to 20
Remove leading and trailing silence
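The volume boost and silence trimming options can be illustrated with a minimal pure-Python sketch. It assumes the slider value is a gain in decibels and that silence means samples below a fixed amplitude threshold; neither detail is stated on this page, and the function names `apply_gain_db` and `trim_silence` are hypothetical, not taken from the app.

```python
def apply_gain_db(samples, gain_db):
    """Scale float samples by a gain in dB (assumed unit), clipping to [-1.0, 1.0]."""
    factor = 10.0 ** (gain_db / 20.0)  # amplitude ratio for a dB change
    return [max(-1.0, min(1.0, s * factor)) for s in samples]


def trim_silence(samples, threshold=0.01):
    """Drop leading and trailing samples whose magnitude is below threshold.

    The threshold of 0.01 is an illustrative assumption.
    """
    loud = [i for i, s in enumerate(samples) if abs(s) >= threshold]
    if not loud:
        return []  # the whole input is below the threshold
    return samples[loud[0]:loud[-1] + 1]
```

A boost of +20 under this assumption multiplies amplitudes by 10, so clipping matters for anything but quiet recordings.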
Output
Process audio to see statistics
Waveform Comparison
Spectrogram Comparison
Technical Information
Model Details:
- Model: Vocos Mel-24kHz
- Architecture: Neural vocoder that maps mel-spectrogram features to Fourier coefficients and inverts them with an inverse STFT
- Target Sample Rate: 24 kHz
System Information:
- Device: CPU
- PyTorch: 2.9.0+cu128
- Torchaudio: 2.9.0+cu128
Supported Formats:
- Input: WAV, MP3, FLAC, OGG, M4A (any format supported by your browser)
- Output: WAV at 24 kHz
Quality Metrics Explained
- SNR (Signal-to-Noise Ratio): Higher is better (>20 dB is good)
- Correlation: Closer to 1.0 means higher similarity
- Energy Ratio: Closer to 1.0 means similar loudness
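The three metrics above could be computed roughly as follows. This is a sketch under assumed definitions, since the page does not give formulas: SNR treats the difference between original and reconstruction as noise, correlation is the Pearson coefficient, and energy ratio is reconstructed energy divided by original energy. The function name `quality_metrics` is hypothetical.

```python
import math


def quality_metrics(original, reconstructed):
    """Return SNR (dB), Pearson correlation, and energy ratio for two signals."""
    n = min(len(original), len(reconstructed))  # compare the overlapping part
    x, y = original[:n], reconstructed[:n]

    signal = sum(a * a for a in x)
    noise = sum((a - b) ** 2 for a, b in zip(x, y))
    if noise == 0:
        snr_db = float("inf")  # identical signals
    elif signal == 0:
        snr_db = float("-inf")  # silent reference, non-silent output
    else:
        snr_db = 10.0 * math.log10(signal / noise)

    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x > 0 and var_y > 0:
        correlation = cov / math.sqrt(var_x * var_y)
    else:
        correlation = 0.0  # undefined for a constant signal; reported as 0

    energy = sum(b * b for b in y)
    energy_ratio = energy / signal if signal > 0 else float("inf")

    return {"snr_db": snr_db, "correlation": correlation,
            "energy_ratio": energy_ratio}
```

Under these definitions, a reconstruction that is only louder than the original keeps a correlation near 1.0 while its SNR and energy ratio move away from their ideal values, which is why the metrics are worth reading together.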
Tips
- For best results, use clear audio recordings
- Enable noise reduction for recordings with background noise
- Use volume boost for quiet recordings
- Check the visualizations to compare quality
Quick Start Examples
Try these settings: