公开数据集
数据结构 ? 32.49M
Data Structure ?
* 以上分析是由系统提取分析形成的结果,具体实际数据为准。
README.md
# Context
I was curious about the hot topics in quantum physics as reflected by the [quant-ph](https://arxiv.org/archive/quant-ph) category on arXiv. Citation counts have a long lag, and so do journal publications, and I wanted a more immediate measure of interest. [SciRate](http://scirate.com/) is fairly well known in this community, and I noticed that after the initial two-three weeks, the number of Scites a paper gets hardly increases further. So the number of Scites is both immediate and near constant after a short while.
# Content
The main dataset (`scirate_quant-ph.csv`) is the metadata of all papers published in quant-ph between 2012-01-01 and 2016-12-31 that had at least ten Scites, as crawled on 2016-12-31. It has six columns:
- The id column as exported by pandas.
- The arXiv id.
- The year of publication.
- The month of publication.
- The day of publication.
- The number of Scites (this column defines the order).
- The title.
- All authors separates by a semicolon.
- The abstract.
The author names were subjected to normalization and the chances are high that the same author only appears with a unique name.
The name normalization was the difficult part in compiling this collection, and this is why the number of Scites was lower bounded. A second file (`scirate_quant-ph_unnormalized.csv`) includes all papers that appeared between 2012-2016 irrespective of the number of Scites, but the author names are not normalized. The actual number of Scites for each paper may show a slight variation between the two datasets because the unnormalized version was compiled more than a month later.
# Acknowledgements
Many thanks to SciRate for tolerating my crawling trials and not blacklisting my IP address.
# Inspiration
Unleash topic models and author analysis to find out what or who is hot in quantum physics today. Build a generative model to write trendy fake titles like [SnarXiv](http://snarxiv.org/) does it for hep-th.
×
帕依提提提温馨提示
该数据集正在整理中,为您准备了其他渠道,请您使用
注:部分数据正在处理中,未能直接提供下载,还请大家理解和支持。
暂无相关内容。
暂无相关内容。
- 分享你的想法
去分享你的想法~~
全部内容
欢迎交流分享
开始分享您的观点和意见,和大家一起交流分享.
数据使用声明:
- 1、该数据来自于互联网数据采集或服务商的提供,本平台为用户提供数据集的展示与浏览。
- 2、本平台仅作为数据集的基本信息展示、包括但不限于图像、文本、视频、音频等文件类型。
- 3、数据集基本信息来自数据原地址或数据提供方提供的信息,如数据集描述中有描述差异,请以数据原地址或服务商原地址为准。
- 1、本站中的所有数据集的版权都归属于原数据发布者或数据提供方所有。
- 1、如您需要转载本站数据,请保留原数据地址及相关版权声明。
- 1、如本站中的部分数据涉及侵权展示,请及时联系本站,我们会安排进行数据下线。