1. Audio Fitting

Example 1 — relay

Ground Truth

SIREN

SIREN2 (present)

Example 2 — Dil Se

Ground Truth

SIREN

SIREN2 (present)

Example 3 — Vivaldi

Ground Truth

SIREN

SIREN2 (present)

Example 4 — Nocturne (Chopin)

Ground Truth

SIREN

SIREN2 (present)

Reproducibility details. Each audio clip is 150,000 samples long and is fit for 1000 epochs using the standard SIREN (first-layer omega = 3000) and SIREN2, comprising four hidden layers with 222 features each. We use the Adam optimizer with an initial learning rate of 1e-4, decayed by 1% every 20 epochs.
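As an illustration of the learning-rate schedule described above, here is a minimal sketch in plain Python. It assumes the decay is applied as a staircase (the rate is multiplied by 0.99 once every 20 epochs) rather than continuously; the function name `lr_at_epoch` and its parameter names are hypothetical and not taken from the authors' code.

```python
def lr_at_epoch(epoch, base_lr=1e-4, decay=0.99, step=20):
    """Learning rate at a given 0-indexed epoch.

    Assumption: staircase decay, i.e. the rate is multiplied by
    `decay` (a 1% reduction) once every `step` epochs.
    """
    if epoch < 0:
        raise ValueError("epoch must be non-negative")
    num_decays = epoch // step  # completed 20-epoch blocks so far
    return base_lr * decay ** num_decays


if __name__ == "__main__":
    # Print the rate at a few points of a 1000-epoch run.
    for epoch in (0, 19, 20, 500, 999):
        print(f"epoch {epoch:4d}: lr = {lr_at_epoch(epoch):.3e}")
```

Under this reading, the rate is reduced 49 times over 1000 epochs and ends at about 6.1e-5. In PyTorch, the same staircase behaviour would correspond to `torch.optim.lr_scheduler.StepLR` with `step_size=20` and `gamma=0.99`, stepped once per epoch.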