Netflix Prize Data Set

Published: September 21, 2009

Version v3

Description

This dataset was constructed to support participants in the Netflix Prize. See [Web Link] for details about the prize.

There are over 480,000 customers in the dataset, each identified by a unique integer id.

The title and release year of each movie are also provided. There are over 17,000 movies in the dataset, each identified by a unique integer id.

The dataset contains over 100 million ratings. The ratings were collected between October 1998 and December 2005 and reflect the distribution of all ratings received during this period. Each rating has a customer id, a movie id, the date of the rating, and the value of the rating.

As part of the original Netflix Prize, a set of ratings was identified whose rating values were not provided in the original dataset. The object of the Prize was to accurately predict the ratings in this 'qualifying' set. These missing ratings are now available in the grand_prize.tar.gz dataset file.

Files

combined_data_1.txt__100lines.txt

Files (7.7 kB)

Checksum Size
md5:b930f5ca727b592a077b06cee2b34e1f 2.1 kB
md5:5cb97fed0b486b13ddbefce055aa43a5 3.1 kB
md5:3a20f911f74cbf754cd8ebd6e95c871e 751 Bytes
md5:cc37e6aa79d737c0ee8fbe65de4c5667 1.8 kB
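
The preview file name suggests an excerpt of combined_data_1.txt. The following is a minimal sketch of reading such a file, assuming the layout used in the original Netflix Prize distribution: a "MovieID:" header line followed by "CustomerID,Rating,Date" lines for that movie. This page does not state the layout, the names Rating and parse_combined are hypothetical, and the sample text is made up for illustration rather than taken from the dataset.

```python
from typing import Iterable, Iterator, NamedTuple, Optional


class Rating(NamedTuple):
    movie_id: int
    customer_id: int
    rating: int
    date: str  # kept as the raw YYYY-MM-DD string


def parse_combined(lines: Iterable[str]) -> Iterator[Rating]:
    """Yield one Rating per data line in the assumed combined_data layout."""
    movie_id: Optional[int] = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.endswith(":"):
            # Header line such as "1:" starts a new movie block
            movie_id = int(line[:-1])
            continue
        if movie_id is None:
            raise ValueError("rating line found before any 'MovieID:' header")
        customer, stars, date = line.split(",")
        yield Rating(movie_id, int(customer), int(stars), date)


# Made-up sample in the assumed layout (not real ratings)
SAMPLE = """1:
1001,3,2005-09-06
1002,5,2005-05-13
2:
1003,4,2005-10-19
"""

ratings = list(parse_combined(SAMPLE.splitlines()))
```

To read an actual file, pass an open file handle instead of SAMPLE.splitlines(); the generator processes one line at a time, so the full file never has to be held in memory.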

Variables

Name Description
MovieID Arbitrarily assigned unique integer in the range [1..17770].
CustomerID Arbitrarily assigned unique integer in the range [1..2649429] (with gaps).
Rating Number of 'stars' assigned to a movie by a customer; an integer from 1 to 5.
Title English language title of the movie on the Netflix website.
YearOfRelease Year a movie was released, in the range [1890..2005]. May correspond to the release of the corresponding DVD, not necessarily its theatrical release.
Date Timestamp of a rating in the form YYYY-MM-DD, in the range 1998-11-01 to 2005-12-31.
NetflixID Integer ID of a movie as currently used in the Netflix developer API.
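
The documented ranges can serve as a sanity check when loading ratings. The sketch below tests one rating row against the bounds listed in the table above; the function name check_rating_row and the wording of the messages are hypothetical, and the check does not account for the gaps in the CustomerID range.

```python
from datetime import date

# Bounds copied from the Variables table
MOVIE_ID_RANGE = (1, 17770)
CUSTOMER_ID_RANGE = (1, 2649429)
RATING_RANGE = (1, 5)
DATE_RANGE = (date(1998, 11, 1), date(2005, 12, 31))


def check_rating_row(movie_id, customer_id, rating, rating_date):
    """Return a list of problems; an empty list means the row fits the documented ranges."""
    problems = []
    if not MOVIE_ID_RANGE[0] <= movie_id <= MOVIE_ID_RANGE[1]:
        problems.append("MovieID out of range")
    if not CUSTOMER_ID_RANGE[0] <= customer_id <= CUSTOMER_ID_RANGE[1]:
        problems.append("CustomerID out of range")
    if not RATING_RANGE[0] <= rating <= RATING_RANGE[1]:
        problems.append("Rating out of range")
    try:
        parsed = date.fromisoformat(rating_date)  # expects YYYY-MM-DD
    except ValueError:
        problems.append("Date is not a valid YYYY-MM-DD value")
    else:
        if not DATE_RANGE[0] <= parsed <= DATE_RANGE[1]:
            problems.append("Date out of range")
    return problems


# Illustrative rows, not taken from the dataset
ok = check_rating_row(1, 1001, 3, "2005-09-06")
bad = check_rating_row(0, 1001, 6, "2006-01-01")
```

Returning a list of messages instead of raising on the first failure makes it easier to report every issue in a row at once.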

Details

Resource type Open dataset
Title Netflix Prize Data Set
Creators
  • Netflix
Size 7.7 kB
License(s) USAGE LICENSE:
External Resource https://web.archive.org/web/20090925184737/http://archive.ics.uci.edu/ml/datasets/Netflix+Prize
Companies Netflix
Industries Entertainment Movies
Dates of collection October 1998 – December 2005