Published November 2, 2023 | Version v3

AudioSet dataset

Creators

Description

AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds.
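As a rough illustration of how clip records like these might be read, the sketch below parses a few rows in a segments-style CSV layout. The column order (YTID, start_seconds, end_seconds, positive_labels), the `#` comment header, and the quoted comma-separated label field are assumptions about the file layout that this page does not state. The video IDs and label IDs in `SAMPLE` are placeholders, and `parse_segments` is a hypothetical helper name.

```python
import csv
import io

# Placeholder rows in the ASSUMED layout; not real records from the dataset.
SAMPLE = '''# YTID, start_seconds, end_seconds, positive_labels
abcdefghijk, 30.000, 40.000, "/m/09x0r,/t/dd00088"
lmnopqrstuv, 0.000, 10.000, "/m/04rlf"
'''


def parse_segments(text):
    """Parse segment rows into dicts, skipping '#' comment lines and blanks."""
    data_lines = (
        line
        for line in io.StringIO(text)
        if line.strip() and not line.startswith("#")
    )
    rows = []
    # skipinitialspace lets the quoted label field follow ", " cleanly.
    for ytid, start, end, labels in csv.reader(data_lines, skipinitialspace=True):
        rows.append(
            {
                "ytid": ytid,
                "start": float(start),
                "end": float(end),
                "labels": labels.split(","),  # one clip may carry several labels
            }
        )
    return rows


segments = parse_segments(SAMPLE)
for seg in segments:
    print(seg["ytid"], seg["end"] - seg["start"], seg["labels"])
```

Each parsed row keeps its label list separate from the time span, so clips can be filtered by class ID without re-reading the file.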

Files

balanced_train_segments.csv__100lines.csv

Files (19.0 kB)

Checksum                                Size
md5:d6dfdd9d0c5e0ec9c61404e034753cab    5.4 kB
md5:8611f85073b390b107006d1ac8969d3c    5.8 kB
md5:e9ce8831bc621d7cffb0fdb39b97decd    1.5 kB
md5:bbc60a43aa6470bcd373545b60841025    1.2 kB
md5:c7b5ecd4ad8c987937cae5ccc60f0b80    5.0 kB

Details

Resource type: Open dataset
Title: AudioSet dataset
Creators: Google
Size: 19.0 kB
Formats: Comma-separated values (CSV) (.csv)
License(s): Creative Commons Attribution 4.0 International
External Resource: https://research.google.com/audioset/download.html