Select Language

AI社区

公开数据集

语言生成数据集:2亿个样本,用于语言生成的已处理Amazon Review数据集

语言生成数据集:2亿个样本,用于语言生成的已处理Amazon Review数据集

20.51G
323 浏览
0 喜欢
0 次下载
0 条讨论
NLP,Business,Deep Learning,Classification,Artificial Intelligence Classification

Amazon Customer Reviews Dataset is a dataset of user-generated product reviews on the shopping website Amazon. It contai......

数据结构 ? 20.51G

    Data Structure ?

    * 以上分析是由系统提取分析形成的结果,具体实际数据为准。

    README.md

    Amazon Customer Reviews Dataset is a dataset of user-generated product reviews on the shopping website Amazon. It contains over 130 million product reviews.

    This dataset contains a tiny fraction of that dataset processed and prepared specifically for language generation.

    To know how the dataset is prepared, then please check the GitHub repository for this dataset.
    https://github.com/imdeepmind/AmazonReview-LanguageGenerationDataset

    Content

    The dataset is stored in an SQLite database. The database contains one table called reviews. This table contains two columns sequence and next.

    The sequence column contains sequences of characters. In this dataset, each sequence of 40 characters long.

    The next column contains the next character after the sequence.

    There are about 200 million samples are in the dataset.

    Acknowledgements

    Thanks to Amazon for making this awesome dataset. Here is the link for the dataset: https://s3.amazonaws.com/amazon-reviews-pds/readme.html


    ×

    帕依提提提温馨提示

    该数据集正在整理中,为您准备了其他渠道,请您使用

    注:部分数据正在处理中,未能直接提供下载,还请大家理解和支持。
    暂无相关内容。
    暂无相关内容。
    • 分享你的想法
    去分享你的想法~~

    全部内容

      欢迎交流分享
      开始分享您的观点和意见,和大家一起交流分享.
    所需积分:40 去赚积分?
    • 323浏览
    • 0下载
    • 0点赞
    • 收藏
    • 分享