AudioSet dataset
Creators
Description
Description
AudioSet consists of an expanding ontology of 632 audio event classes and a collection of 2,084,320 human-labeled 10-second sound clips drawn from YouTube videos. The ontology is specified as a hierarchical graph of event categories, covering a wide range of human and animal sounds, musical instruments and genres, and common everyday environmental sounds.
Details
| Resource type | Open dataset |
| Title | AudioSet dataset |
| Creators |
|
| Formats | Comma-separated values (CSV) (.csv) |
| License(s) | Creative Commons Attribution 4.0 International |
| External Resource | https://research.google.com/audioset/download.html |