Blind Audio Restoration using Contrastive Diffusion Guidance

In Submission

Abstract

We consider the inverse problem of restoring recorded audio signals that are corrupted by a combination of unknown degradations. Direct inversion is ill-posed since many audio samples can explain the same degraded measurements. To overcome this, we adopt a diffusion-based posterior sampler to generate audio that is consistent with the degraded recordings. While active research is in progress on generative inverse solvers, we find that the fully blind nature our problem poses new challenges in deriving a tractable likelihood score. We break-away from existing approaches, which either estimate or partially-approximate the forward operator, and instead reformulate the likelihood score in an embedding space learned via contrastive training. By noting that a surrogate form of the likelihood score in this embedding space is a valid approximation of the true likelihood score, we show that it possible to steer the denoising process towards the posterior. We perform experiments on historical piano recordings and show that our model AudioCoGuide offers the promise of solving blind audio inverse problems via contrastive guidance.

Results

The table below shows the quantitative metrics of Fréchet Audio Distance (FAD) [3] using VGGnet and PANNs [4] for the degraded recordings, LTAS baseline, and CoGuide.

Method VGG ↓ PANN ↓
Degraded (Original) 2.52 0.39
LTAS 2.88 0.27
CoGuide (Ours) 0.84 0.19

Below, we show qualitative audio samples with spectrograms for comparison across different composers and pieces.

Beethoven

Original (Degraded)

Beethoven Original Spectrogram

LTAS

Beethoven LTAS Spectrogram

CoGuide

Beethoven CoGuide Spectrogram

Chopin - Fantaisie

Original (Degraded)

Chopin Fantaisie Original Spectrogram

LTAS

Chopin Fantaisie LTAS Spectrogram

CoGuide

Chopin Fantaisie CoGuide Spectrogram

Chopin - Mazurka

Original (Degraded)

Chopin Mazurka Original Spectrogram

LTAS

Chopin Mazurka LTAS Spectrogram

CoGuide

Chopin Mazurka CoGuide Spectrogram

Chopin - Sonata

Original (Degraded)

Chopin Sonata Original Spectrogram

LTAS

Chopin Sonata LTAS Spectrogram

CoGuide

Chopin Sonata CoGuide Spectrogram

Chopin - Waltz

Original (Degraded)

Chopin Waltz Original Spectrogram

LTAS

Chopin Waltz LTAS Spectrogram

CoGuide

Chopin Waltz CoGuide Spectrogram

Horowitz

Original (Degraded)

Horowitz Original Spectrogram

LTAS

Horowitz LTAS Spectrogram

CoGuide

Horowitz CoGuide Spectrogram

Horowitz - Etude

Original (Degraded)

Horowitz Etude Original Spectrogram

LTAS

Horowitz Etude LTAS Spectrogram

CoGuide

Horowitz Etude CoGuide Spectrogram

Liszt

Original (Degraded)

Liszt Original Spectrogram

LTAS

Liszt LTAS Spectrogram

CoGuide

Liszt CoGuide Spectrogram

Mozart

Original (Degraded)

Mozart Original Spectrogram

LTAS

Mozart LTAS Spectrogram

CoGuide

Mozart CoGuide Spectrogram

Moszkowski

Original (Degraded)

Moszkowski Original Spectrogram

LTAS

Moszkowski LTAS Spectrogram

CoGuide

Moszkowski CoGuide Spectrogram

Rachmaninoff

Original (Degraded)

Rachmaninoff Original Spectrogram

LTAS

Rachmaninoff LTAS Spectrogram

CoGuide

Rachmaninoff CoGuide Spectrogram

Jungmann

Original (Degraded)

Jungmann Original Spectrogram

LTAS

Jungmann LTAS Spectrogram

CoGuide

Jungmann CoGuide Spectrogram
↑ Back to Top