๐ŸŽต Vocos Audio Reconstruction Studio

Upload an audio file to hear it reconstructed through the Vocos neural vocoder with advanced processing options.

Features:

  • ๐ŸŽฏ High-quality neural audio reconstruction
  • ๐Ÿ”‡ Optional noise reduction
  • ๐Ÿ”Š Volume boost control
  • โœ‚๏ธ Automatic silence trimming
  • ๐Ÿ“Š Detailed quality metrics
  • ๐Ÿ“ˆ Visual waveform & spectrogram analysis

๐Ÿ“ฅ Input

โš™๏ธ Processing Options

Apply spectral gating to reduce background noise

-20 20

Remove leading and trailing silence

๐Ÿ“ค Output

Process audio to see statistics

๐Ÿ“ˆ Waveform Comparison

๐ŸŽผ Spectrogram Comparison


โ„น๏ธ Technical Information

Model Details:

  • Model: Vocos Mel-24kHz
  • Architecture: Neural vocoder with mel-spectrogram backbone
  • Target Sample Rate: 24 kHz

System Information:

  • Device: CPU
  • PyTorch: 2.9.0+cu128
  • Torchaudio: 2.9.0+cu128

Supported Formats:

  • Input: WAV, MP3, FLAC, OGG, M4A (any format supported by your browser)
  • Output: WAV at 24 kHz

๐Ÿ“ Quality Metrics Explained

  • SNR (Signal-to-Noise Ratio): Higher is better (>20 dB is good)
  • Correlation: Closer to 1.0 means higher similarity
  • Energy Ratio: Closer to 1.0 means similar loudness

๐Ÿ’ก Tips

  • For best results, use clear audio recordings
  • Enable noise reduction for recordings with background noise
  • Use volume boost for quiet recordings
  • Check the visualizations to compare quality

๐ŸŽฏ Quick Start Examples

Try these settings: