项目作者: pushshift

项目描述 :
Fetch movie data from IMDB and output in JSON format.
高级语言: Python
项目地址: git://github.com/pushshift/imdb_to_json.git
创建时间: 2020-08-17T08:27:12Z
项目社区:https://github.com/pushshift/imdb_to_json

开源协议:

下载


This code will fetch data using a title code and convert the data to JSON format.

Example of usage:

./convert_imdb_to_json.py tt0117731

The movie_data.ndjson.zst file contains movie / episode data for over 1 million shows. The data is in ndjson format and is sorted by the number of votes. This data also contains basic metadata for each show including genres, start/end year, title, adult classification and run time. You can use the “titleCode” in each movie object to download data from IMDB in json format using the code provided.

Movie title codes and other metadata can be downloaded from here: https://datasets.imdbws.com/

This should get you started.

2020-08-19: Added ratings metadata

2020-08-19: Added reviews metadata

2020-08-19: Added title keywords metada

To do:

Add additional logging

Add better error handling (for requests)