Javascript must be enabled to continue!

Publications

Keyword: semantic processing (1) Back

2025
Automatic Audio Equalization with Semantic Embeddings [Conference]

E. Moliner, V. Välimäki, K. Drossos, and M. Hämäläinen, “Automatic Audio Equalization with Semantic Embeddings,” in AES International Conference on Artificial Intelligence and Machine Learning in Audio (AES AI-MLA), London, U.K., 2025

This paper presents a data-driven approach to automatic blind equalization of audio by predicting log-mel spectral features and deriving an inverse filter. The method uses a deep neural network, where a pre-trained model provides semantic embeddings as a backbone, and only a lightweight head is trained. This design is intended to enhance training efficiency and generalization. Trained on both music and speech, the model is robust to noise and reverberation. Objective evaluations confirm its effectiveness, and subjective tests show performance comparable to that of an oracle that uses true log-mel spectral features, indicating that the model accurately estimates the desired characteristics, with remaining limitations attributed to the filtering stage. Overall, the results highlight the potential of the method for real-world audio enhancement applications.

Attachment language: English File type: PDF document Paper (.pdf)
Updated: 23-09-2025 10:52 - Size: 703.04 KB
Attachment language: English File type: BiBTex LaTeX BibTex record (.bib)
Updated: 23-09-2025 10:52 - Size: 344 B
BibTex Record (Popup)
Copy the citation