Seinfeld


4.33M
Arts and Entertainment, Education, Text Data, Linguistics Classification


    README.md

**A dataset for textual analysis on arguably the best written comedy television show ever.**

----------

Context

Dataset for people who love data science and Seinfeld.

----------

Content

- Details about all the episodes.
- Includes attributes like Director, Episode Name, Air Date, etc.
- Complete scripts of all the episodes.

----------

**Upcoming Update will Include:**

- Stage locations and cast

----------

Data Source

The data is scraped from the fan website [http://www.seinology.com/][1].

[Github Project][3]

----------

Possible Explorations

- Train language models on the corpus.
- Compare the vocabulary with other works on television, film or literature.
- Find correlation between language complexity and popularity.
- Train models to generate scripts based on the data.
- Analyze obscure words used in the vocabulary of the series.

These are just basic examples; the sky is the limit.

----------

Acknowledgements

The data has been crawled from the [http://www.seinology.com/][4] website.

----------

Contributing

Changes and improvement suggestions are welcome. Feel free to suggest new additions that you think are useful in the comments, or open a PR on the [github][5] project.

[1]: http://www.seinology.com/
[2]: https://wallpapershome.com/images/wallpapers/fifa-18-5120x2880-4k-screenshot-poster-e3-2017-13691.jpg
[3]: https://github.com/amanthedorkknight/the-seinfeld-chronicles
[4]: http://www.seinology.com/
[5]: https://github.com/amanthedorkknight/the-seinfeld-chronicles
[6]: https://www.paypal.me/AShrivastava961
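As a possible starting point for the vocabulary-oriented explorations listed in the README, here is a minimal Python sketch that counts word frequencies and lists words that occur only once. It is an illustration under assumptions, not the dataset author's method: the function names `tokenize` and `vocabulary_stats` are hypothetical, the sample lines are made-up placeholders rather than real script text, and the way dialogue lines are loaded from the dataset files is left out because the file layout is not described here.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return runs of letters/apostrophes as tokens."""
    return re.findall(r"[a-z']+", text.lower())


def vocabulary_stats(lines):
    """Compute simple vocabulary statistics over an iterable of dialogue lines."""
    counts = Counter()
    for line in lines:
        counts.update(tokenize(line))
    total = sum(counts.values())
    return {
        "total_tokens": total,
        "vocabulary_size": len(counts),
        # Ratio of distinct words to all words; a rough lexical-diversity measure.
        "type_token_ratio": len(counts) / total if total else 0.0,
        # Words seen exactly once, as a crude proxy for "obscure" vocabulary.
        "rare_words": sorted(w for w, c in counts.items() if c == 1),
    }


# Made-up placeholder lines (assumption: real input would be script dialogue).
sample_lines = [
    "What is the deal with data?",
    "The data is the deal.",
]
stats = vocabulary_stats(sample_lines)
print(stats)
```

The same statistics could be computed per episode and compared against episode-level attributes, or against other corpora, once the scripts are loaded as lists of strings.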