Select Language

AI社区

公开数据集

IMDB-电影信息数据集

IMDB-电影信息数据集

1.04G
992 浏览
0 喜欢
2 次下载
0 条讨论
Music Analysis Classification

IMDB 电影数据仓库数据,包括电影名称、介绍、导演、演员等信息。每个数据集都包含在一个以UTF-8字符集为基础的、以制表符分隔的......

数据结构 ? 1.04G

    Data Structure ?

    * 以上分析是由系统提取分析形成的结果,具体实际数据为准。

    README.md

    IMDB 电影数据仓库数据,包括电影名称、介绍、导演、演员等信息。

    每个数据集都包含在一个以UTF-8字符集为基础的、以制表符分隔的数值(TSV)格式的压缩文件中。每个文件的第一行包含描述每一列内容的标题。一个'/N'是用来表示该标题/名称的特定字段缺失或为空。可用的数据集如下。

    title.akas.tsv.gz - Contains the following information for titles:

    • titleId (string) - a tconst, an alphanumeric unique identifier of the title

    • ordering (integer) – a number to uniquely identify rows for a given titleId

    • title (string) – the localized title

    • region (string) - the region for this version of the title

    • language (string) - the language of the title

    • types (array) - Enumerated set of attributes for this alternative title. One or more of the following: "alternative", "dvd", "festival", "tv", "video", "working", "original", "imdbDisplay". New values may be added in the future without warning

    • attributes (array) - Additional terms to describe this alternative title, not enumerated

    • isOriginalTitle (boolean) – 0: not original title; 1: original title

    title.basics.tsv.gz - Contains the following information for titles:

    • tconst (string) - alphanumeric unique identifier of the title

    • titleType (string) – the type/format of the title (e.g. movie, short, tvseries, tvepisode, video, etc)

    • primaryTitle (string) – the more popular title / the title used by the filmmakers on promotional materials at the point of release

    • originalTitle (string) - original title, in the original language

    • isAdult (boolean) - 0: non-adult title; 1: adult title

    • startYear (YYYY) – represents the release year of a title. In the case of TV Series, it is the series start year

    • endYear (YYYY) – TV Series end year. ‘\N’ for all other title types

    • runtimeMinutes – primary runtime of the title, in minutes

    • genres (string array) – includes up to three genres associated with the title

    title.crew.tsv.gz – Contains the director and writer information for all the titles in IMDb. Fields include:

    • tconst (string) - alphanumeric unique identifier of the title

    • directors (array of nconsts) - director(s) of the given title

    • writers (array of nconsts) – writer(s) of the given title

    title.episode.tsv.gz – Contains the tv episode information. Fields include:

    • tconst (string) - alphanumeric identifier of episode

    • parentTconst (string) - alphanumeric identifier of the parent TV Series

    • seasonNumber (integer) – season number the episode belongs to

    • episodeNumber (integer) – episode number of the tconst in the TV series

    title.principals.tsv.gz – Contains the principal cast/crew for titles

    • tconst (string) - alphanumeric unique identifier of the title

    • ordering (integer) – a number to uniquely identify rows for a given titleId

    • nconst (string) - alphanumeric unique identifier of the name/person

    • category (string) - the category of job that person was in

    • job (string) - the specific job title if applicable, else '\N'

    • characters (string) - the name of the character played if applicable, else '\N'

    title.ratings.tsv.gz – Contains the IMDb rating and votes information for titles

    • tconst (string) - alphanumeric unique identifier of the title

    • averageRating – weighted average of all the individual user ratings

    • numVotes - number of votes the title has received

    name.basics.tsv.gz – Contains the following information for names:

    • nconst (string) - alphanumeric unique identifier of the name/person

    • primaryName (string)– name by which the person is most often credited

    • birthYear – in YYYY format

    • deathYear – in YYYY format if applicable, else '\N'

    • primaryProfession (array of strings)– the top-3 professions of the person

    • knownForTitles (array of tconsts) – titles the person is known for


    ×

    帕依提提提温馨提示

    该数据集正在整理中,为您准备了其他渠道,请您使用

    注:部分数据正在处理中,未能直接提供下载,还请大家理解和支持。
    暂无相关内容。
    暂无相关内容。
    • 分享你的想法
    去分享你的想法~~

    全部内容

      欢迎交流分享
      开始分享您的观点和意见,和大家一起交流分享.
    所需积分:8 去赚积分?
    • 992浏览
    • 2下载
    • 0点赞
    • 收藏
    • 分享