I’m happy to announce the release of Clotho, a novel audio captioning dataset built with a focus on audio content and caption diversity, with data splits designed so that they do not hamper the training or evaluation of methods.