Published 2019 | Version v2

Food.com Recipe & Review Data

  • 1. University of California, San Diego

Description

This dataset consists of 180K+ recipes and 700K+ recipe reviews covering 18 years of user interactions and uploads on Food.com (formerly GeniusKitchen), an online recipe aggregator.

This dataset contains three sets of data from Food.com:

Interaction splits

  • interactions_test.csv
  • interactions_validation.csv
  • interactions_train.csv

Preprocessed data for result reproduction

In this format, the recipe text metadata is tokenized with the GPT subword tokenizer and includes special tokens such as start-of-step.

  • PP_recipes.csv
  • PP_users.csv

To convert these files into the pickle format required to run our code off the shelf, read each CSV with pandas.read_csv and write it back out with pandas.to_pickle.
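The conversion described above could look roughly like the following. This is a minimal sketch, not the authors' own script: the helper name csv_to_pickle and the choice of a .pkl suffix for the output are assumptions, and the file names the downstream code actually expects should be checked against that code. It uses DataFrame.to_pickle, the method form of pandas.to_pickle.

```python
from pathlib import Path

import pandas as pd


def csv_to_pickle(csv_path, pkl_path=None):
    """Read a CSV with pandas and write it back out as a pickled DataFrame.

    csv_to_pickle is a hypothetical helper; the default output name
    (same stem, .pkl suffix) is an assumption, not a documented requirement.
    """
    csv_path = Path(csv_path)
    if pkl_path is None:
        pkl_path = csv_path.with_suffix(".pkl")
    pkl_path = Path(pkl_path)

    frame = pd.read_csv(csv_path)
    frame.to_pickle(pkl_path)
    return pkl_path


if __name__ == "__main__":
    # Convert the two preprocessed files if they are in the working directory.
    for name in ("PP_recipes.csv", "PP_users.csv"):
        if Path(name).exists():
            print(csv_to_pickle(name))
```

pandas.read_pickle reads the result back into a DataFrame with the same columns as the CSV.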

Files

interactions_test.csv__100lines.csv

Files (747.1 kB)

Checksum                              Size
md5:ce15a7a7de494333b97261e9ab89f8e1  4.0 kB
md5:955f74a6f286ec028275f28629edaba4  3.7 kB
md5:2a8e31a7b58e2cd6bd1e4c0f899f2332  4.0 kB
md5:8ce1ddc4d41d472b94961027d4e51686  119.9 kB
md5:146cdc16839ce1b37c3fb665a37ad274  470.9 kB
md5:a47210eb43ebbff006472a6543809863  26.2 kB
md5:daa9d9b9dba85e24fa674f0b9f531707  118.4 kB
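A downloaded file can be checked against one of the MD5 checksums listed above. The sketch below uses only the Python standard library; the function names md5_of_file and matches_listing are hypothetical, and because the listing does not say which checksum belongs to which file, the pairing is left to the reader.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file against a checksum given as 'md5:<hex>' or bare hex."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, matches_listing("some_download.csv", "md5:ce15a7a7de494333b97261e9ab89f8e1") returns True only if that file's digest equals the first checksum in the listing; the file name here is a placeholder.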

Variables

Name               Description
recipe_id          Recipe ID
i                  Recipe ID mapped to contiguous integers from 0
name_tokens        BPE-tokenized recipe name
ingredient_tokens  BPE-tokenized ingredients list (list of lists)
steps_tokens       BPE-tokenized steps
techniques         List of techniques used in recipe
calorie_level      Calorie level in ascending order
ingredient_ids     IDs of ingredients in recipe
items              Recipes interacted with, in order, by user
n_items            Number of recipes reviewed by user
ratings            Ratings given to each recipe encountered by this user
n_ratings          Number of ratings in total
user_id            User ID
date               Date of interaction (i.e., when the user wrote a review of the recipe)
rating             Rating given for a specific recipe
review             Review text
minutes            Minutes to prepare recipe
contributor_id     ID of the user who submitted this recipe
submitted          Date recipe was submitted
tags               Food.com tags for recipe
nutrition          Nutrition information (calories (#), total fat (PDV), sugar (PDV), sodium (PDV), protein (PDV), saturated fat
n_steps            Number of steps in recipe
steps              Text for recipe steps, in order
description        User-provided description about a recipe
u                  User ID mapped to contiguous integers from 0
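Several of the variables above are described as lists (for example ingredient_tokens, a list of lists). A CSV cell holds only text, so such columns have to be decoded after loading. The sketch below assumes the lists are serialized as Python-style literals such as "[1, 2, 3]"; the record does not state this, so inspect a few rows first. The helper parse_rows, the column selection in LIST_COLUMNS, and the sample row are illustrative, not taken from the dataset.

```python
import ast
import csv
import io

# Columns the variables table describes as list-valued (an assumed selection;
# adjust it to the file being read).
LIST_COLUMNS = (
    "name_tokens", "ingredient_tokens", "steps_tokens",
    "techniques", "ingredient_ids", "items", "ratings",
)


def parse_rows(handle, list_columns=LIST_COLUMNS):
    """Yield one dict per CSV row, decoding list-like cells into Python lists."""
    for row in csv.DictReader(handle):
        for col in list_columns:
            if row.get(col):
                # literal_eval accepts only Python literals, unlike eval.
                row[col] = ast.literal_eval(row[col])
        yield row


# Made-up sample in the assumed layout, not real dataset content.
SAMPLE = (
    "recipe_id,i,name_tokens,ingredient_tokens,calorie_level\n"
    '101,0,"[5, 6, 7]","[[1, 2], [3]]",1\n'
)

rows = list(parse_rows(io.StringIO(SAMPLE)))
print(rows[0]["ingredient_tokens"])  # prints [[1, 2], [3]]
```

Columns not named in LIST_COLUMNS stay as strings, so numeric fields such as i or calorie_level still need an explicit int() conversion if they are read this way.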