公开数据集
数据结构 ? 0.36M
Data Structure ?
* 以上分析是由系统提取分析形成的结果,具体实际数据为准。
README.md
Context:
When groups of people who don’t share a spoken language come together, they will often create a new language which combines elements of their first languages. These languages are known as “pidgins”. If they are then learned by children as their first language they become fully-fledged languages known as “creoles”. This dataset contains information on both creoles and pidgins spoken around the world.
Content:
This dataset includes information on the grammatical and lexical structures of 76 pidgin and creole languages. The language set contains not only the most widely studied Atlantic and Indian Ocean creoles, but also less well known pidgins and creoles from Africa, South Asia, Southeast Asia, Melanesia and Australia, including some extinct varieties, and several mixed languages.
This dataset is made up of several tables, each of which contains different pieces of information:
* language: A table of language names & the unique id’s associated with them.
* language_data: A table of data on the different languages, including the name speakers’ call their language (Autoglossonym), other names the language is called, how many speakers it has, the language which contributed the most words to the language (Major lexifier), other languages which contribute to that language, where it is spoken, and where it is an official language fro. The column language_id has the id linked to the language table.
* language_source: The sources referenced on each language (referencing the language and source tables).
* langauge_table: Information on the geographic location of each language.
* source: Information on the scholarly sources referenced for information on language.
Acknowledgements:
This dataset contains information from the online portion of the [Atlas of Pidgin and Creole Language Structures (APiCS)](http://apics-online.info/). It is distributed under a [Creative Commons Attribution 3.0 Unported License](https://creativecommons.org/licenses/by/3.0/) . If you use this dataset in your work, please use this citation:
Salikoko S. Mufwene. 2013. Kikongo-Kituba structure dataset.
In: Michaelis, Susanne Maria & Maurer, Philippe & Haspelmath, Martin & Huber, Magnus (eds.)
Atlas of Pidgin and Creole Language Structures Online.
Leipzig: Max Planck Institute for Evolutionary Anthropology.
(Available online at http://apics-online.info/contributions/58, Accessed on 2017-07-28.)
Inspiration:
* Which areas of the world have the most creoles/pidgins?
* Which language has contributed to the most creoles/pidgins? Why might this be?
* Can you map the areas of influence of the various lexicalized Major Lexifier languages?
You may also be interested in:
* [World Language Family Map](https://www.kaggle.com/rtatman/world-language-family-map)
* [The Sign Language Analyses (SLAY) Database](https://www.kaggle.com/rtatman/sign-language-analyses)
* [World Atlas of Language Structures: Information on the linguistic structures in 2,679 languages](https://www.kaggle.com/rtatman/world-atlas-of-language-structures)
×
帕依提提提温馨提示
该数据集正在整理中,为您准备了其他渠道,请您使用
注:部分数据正在处理中,未能直接提供下载,还请大家理解和支持。
暂无相关内容。
暂无相关内容。
- 分享你的想法
去分享你的想法~~
全部内容
欢迎交流分享
开始分享您的观点和意见,和大家一起交流分享.
数据使用声明:
- 1、该数据来自于互联网数据采集或服务商的提供,本平台为用户提供数据集的展示与浏览。
- 2、本平台仅作为数据集的基本信息展示、包括但不限于图像、文本、视频、音频等文件类型。
- 3、数据集基本信息来自数据原地址或数据提供方提供的信息,如数据集描述中有描述差异,请以数据原地址或服务商原地址为准。
- 1、本站中的所有数据集的版权都归属于原数据发布者或数据提供方所有。
- 1、如您需要转载本站数据,请保留原数据地址及相关版权声明。
- 1、如本站中的部分数据涉及侵权展示,请及时联系本站,我们会安排进行数据下线。