当前位置: X-MOL 学术Journal of Peace Research › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Words to unite nations: The complete United Nations General Debate Corpus, 1946–present
JOURNAL OF PEACE RESEARCH ( IF 3.4 ) Pub Date : 2024-11-25 , DOI: 10.1177/00223433241275335
Slava Jankin, Alexander Baturo, Niheer Dasandi

Every year since 1946, the General Debate has taken place at the beginning of the United Nations (UN) General Assembly session. Representatives from all UN member states deliver an address, discussing the issues that they consider most important in global politics, revealing their governments’ positions, and seeking to persuade other states of their perspectives. The annual UN General Debate statements provide invaluable information for scholars of international relations – comparable globally and over time. However, these texts are stored as poor quality images without relevant metadata, preventing researchers from applying data science methods. This paper introduces the complete UN General Debate Corpus (UNGDC). Building on a previous incomplete release of UNGDC, we have extended the corpus to cover the entire 1946–present period, included additional data on all speakers and provided advanced search and data visualization tools on a new website. The complete corpus contains over 10,000 speeches from 202 countries, including historical countries – making it the most comprehensive, unique and accessible collection of global political speeches. We discuss the complete UNGDC, provide relevant information for data users and present illustrative examples of how the corpus can be employed to address key questions in world politics.

中文翻译:


团结各国的话语:完整的联合国一般性辩论语料库,1946 年至今



自 1946 年以来,每年都会在联合国 (UN) 大会开始时举行一般性辩论。来自联合国所有会员国的代表发表演讲,讨论他们认为全球政治中最重要的问题,揭示本国政府的立场,并试图说服其他国家表达他们的观点。年度联合国一般性辩论声明为国际关系学者提供了宝贵的信息——在全球范围内和随着时间的推移具有可比性。但是,这些文本存储为质量较差的图像,没有相关的元数据,从而阻止研究人员应用数据科学方法。本文介绍了完整的联合国一般性辩论语料库 (UNGDC)。在 UNGDC 之前不完整版本的基础上,我们扩展了语料库以涵盖整个 1946 年至今,包括所有演讲者的额外数据,并在新网站上提供了高级搜索和数据可视化工具。完整的语料库包含来自 202 个国家/地区(包括历史国家)的 10,000 多篇演讲,使其成为最全面、最独特和最易访问的全球政治演讲集合。我们讨论了完整的 UNGDC,为数据用户提供相关信息,并提供了如何利用语料库解决世界政治中关键问题的说明性示例。
更新日期:2024-11-25
down
wechat
bug