
Pandas Q/A on Stack Overflow


95.35M
249 views
0 likes
0 downloads
0 discussions
Text Data,Data Cleaning,Binary Classification,Categorical Data Classification


    README.md

    Context

    This work focuses on creating a data set of [Pandas Q/A over StackOverflow](https://stackoverflow.com/tags/pandas/info). Presently, there are [90k+ questions](https://stackoverflow.com/questions/tagged/pandas?sort=newest) on StackOverflow that have been asked under the Pandas tag. Many questions on SO are of poor quality or duplicate questions that have already been answered. A new SO user can ask a question that falls into any of these categories (low quality, duplicate, spam, etc.). Similarly, a new SO user, for lack of experience, might not flag a question that doesn't abide by SO guidelines. Therefore, it is the users who have spent a long time on SO who provide quality answers, mark questions as duplicates, close them, downvote them, etc. We focus on 40 such users who have earned the [Pandas gold tag](https://stackoverflow.com/help/badges/3296/pandas) on their profile, which in simple terms means that they have answered enough questions to judge the quality of an incoming question and decide whether or not to answer it.

    Content

    To create this data set, I felt no need to perform any web scraping to extract SO data. SO provides an online API where one can simply run a SQL query to get a downloadable CSV file. To learn how I did this, read [here](https://aivic.github.io/blogs/Scraping-StackOverflow/).

    Acknowledgements

    All thanks to the [Stack Overflow data API](https://data.stackexchange.com). All copyrights belong to Stack Overflow and its network sites, licensed under [CC BY-SA 3.0](https://creativecommons.org/licenses/by-sa/3.0/).

    Task!

    So what can be done with the given 87241 rows and 16 columns? Since all the questions and answers given by the 40 users have been extracted into the data set, can you suggest what it takes for an answer to be accepted when one of the associated tags is "pandas"?

    Reputation calculation - Refer [here](https://stackoverflow.com/help/whats-reputation)
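As a starting point for the task above, here is a minimal sketch of one possible first step: computing, per user, the share of their answers that were accepted. The README does not list the 16 column names, so `user_id`, `post_type`, `score` and `is_accepted` are hypothetical, and the inline sample rows are made up for illustration only. It uses Python's standard `csv` module so that it runs without extra dependencies.

```python
import csv
import io
from collections import defaultdict

# Hypothetical column names and made-up rows: the real export has
# 16 columns whose names are not given in this README.
SAMPLE_CSV = """user_id,post_type,score,is_accepted
101,answer,5,1
101,answer,0,0
101,question,2,0
202,answer,3,1
202,answer,7,1
"""


def load_rows(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


def acceptance_rate_by_user(rows):
    """Return {user_id: fraction of that user's answers that were accepted}."""
    answered = defaultdict(int)
    accepted = defaultdict(int)
    for row in rows:
        if row["post_type"] != "answer":
            continue  # only answers can be accepted, so skip questions
        answered[row["user_id"]] += 1
        accepted[row["user_id"]] += int(row["is_accepted"])
    return {user: accepted[user] / answered[user] for user in answered}


rates = acceptance_rate_by_user(load_rows(SAMPLE_CSV))
print(rates)
```

With the real file, the sample string would be replaced by the downloaded CSV and the hypothetical column names by the actual ones. From there, the acceptance flag could be compared against other columns, such as score, to explore what accepted answers have in common.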