Audio Fitting Comparison

1. Audio Fitting

Example 1 — relay

Ground Truth

SIREN

PSNR 20.1 dB

SIREN² (present)

PSNR 47.2 dB

Example 2 — Dil Se

Ground Truth

SIREN

PSNR 31.0 dB

SIREN² (present)

PSNR 41.1 dB

Example 3 — Vivaldi

Ground Truth

SIREN

PSNR 29.2 dB

SIREN² (present)

PSNR 39.5 dB

Example 4 — Nocturne (Chopin)

Ground Truth

SIREN

PSNR 23.9 dB

SIREN² (present)

PSNR 48.9 dB
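
For readers who want to check figures like the PSNR values above, the following is a minimal sketch of the usual PSNR definition, 10 * log10(peak^2 / MSE). The page does not state which peak value or amplitude normalisation was used, so the function name `psnr` and the default `peak=1.0` (waveforms scaled to [-1, 1]) are assumptions for illustration, not the authors' evaluation code.

```python
import math


def psnr(reference, estimate, peak=1.0):
    """Peak signal-to-noise ratio in dB between two equal-length waveforms.

    `peak` is the assumed maximum signal amplitude (hypothetical default: 1.0).
    """
    if len(reference) != len(estimate):
        raise ValueError("waveforms must have the same number of samples")
    if len(reference) == 0:
        raise ValueError("waveforms must not be empty")

    # Mean squared error over all samples.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)

    # A perfect reconstruction has zero error and therefore infinite PSNR.
    if mse == 0.0:
        return math.inf
    return 10.0 * math.log10(peak ** 2 / mse)
```

Under this definition a 10 dB gain corresponds to a tenfold reduction in mean squared error. A different peak convention shifts every value by the same constant, so the gaps between methods do not change.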

Reproducibility details. Each audio clip is 150,000 samples long and is trained for 1,000 epochs using the standard SIREN (first-layer omega = 3000) and SIREN², comprising four hidden layers with 222 features each. The Adam optimizer is used with an initial learning rate of 1e-4, which is decayed by 1% every 20 epochs.
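
The learning rate schedule described above can be sketched as a step decay. This is one reading of the text, assuming the 1% decay is multiplicative (a factor of 0.99 applied once every 20 epochs) and that epochs are counted from 0. The function name `lr_at_epoch` and its parameter names are hypothetical, chosen only for this illustration.

```python
def lr_at_epoch(epoch, base_lr=1e-4, decay=0.99, step_size=20):
    """Learning rate at a given 0-based epoch under step decay.

    Assumption: the rate is multiplied by `decay` once every `step_size`
    epochs, starting from `base_lr`.
    """
    if epoch < 0:
        raise ValueError("epoch must be non-negative")
    # Number of completed decay steps before this epoch.
    num_decays = epoch // step_size
    return base_lr * decay ** num_decays


# Learning rates over the 1000 training epochs stated above.
schedule = [lr_at_epoch(e) for e in range(1000)]
```

Under these assumptions the rate stays at 1e-4 for the first 20 epochs and falls to roughly 6.1e-5 by the final epoch, after 49 decay steps. In PyTorch the same behaviour would correspond to `torch.optim.lr_scheduler.StepLR(optimizer, step_size=20, gamma=0.99)` stepped once per epoch, though the page does not say which framework or scheduler was actually used.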