[Figure panels, repeated four times: Ground Truth, SIREN, SIREN2 (present)]
Reproducibility details. Each audio clip contains 150,000 samples and is fitted for 1000 epochs using the standard SIREN (first-layer omega = 3000) and SIREN2, comprising four hidden layers with 222 features each. The Adam optimizer is used with an initial learning rate of 1e-4, which is decayed by 1% every 20 epochs.
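The learning rate schedule described above can be sketched as a small function. This is an illustrative reading, not the authors' code: it assumes the decay is applied stepwise (the rate is held constant within each 20-epoch block) and that epochs are counted from 0. The name `learning_rate` and its parameters are hypothetical.

```python
def learning_rate(epoch, base_lr=1e-4, decay=0.99, step=20):
    """Step-decayed learning rate for a 0-indexed epoch.

    Assumption: the rate is multiplied by `decay` (a 1% reduction)
    once every `step` epochs and held constant in between.
    """
    return base_lr * decay ** (epoch // step)


if __name__ == "__main__":
    # Print the rate at the start, just before and after the first
    # decay step, and at the last of the 1000 epochs.
    for epoch in (0, 19, 20, 999):
        print(epoch, learning_rate(epoch))
```

Under these assumptions the rate is reduced 49 times over 1000 epochs, ending at roughly 61% of its initial value. In PyTorch, the same stepwise behaviour would typically be obtained with `torch.optim.lr_scheduler.StepLR(optimizer, step_size=20, gamma=0.99)`, although the source does not say which framework or scheduler was used.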