PropertyValue
?:about
?:abstract
  • Tweetplomacy 23 is a semantically annotated corpus of tweets capturing digital communicative interaction between international political leaders, peer groups and citizens in the wake of three major global crises: (1) the increasing emphasis on the security of energy supplies following Russia’s invasion of Ukraine; (2) the political and geo-economic consequences of the COVID-19 pandemic; (3) the intensified debate on the progression of climate change. These events occurred between 2018 and 2023, each of them marking a significant shake-up of the international system. The dataset focuses on the strategic use of networked information on X (formerly Twitter) by executive political actors facing exogenous shocks in the context of a global crisis situation. It is extracted from an X archive covering more than 14 billion tweets collected from the 1% random sample API. To extract the dataset, we resort to a list of top executives of the political administration – heads of state, heads of government, ministers of foreign affairs – or their respective public-relations offices. Their tweets are filtered using a list of thematically relevant keywords in four languages (English, German, French, Spanish), reflecting the discourse with respect to the three crises mentioned above. Our sample covers instances from the beginning of 2018 up to May 2023, representing statements made by leading politicians from 83 countries on all continents. As a subset, tweets published by the political leaders of the 38 member states of the OECD and the five BRICS countries (Brazil, Russia, India, China, South Africa) have been extracted. Additionally, the sample comprises a selection of 10 international organizations. The entire data collection consists of the following files: (1) users: excel file with a list of 654 Twitter user handles(usernames) of top executives of the political administration (and/or their institutional accounts), their nationalities, functions/roles and tenure; (2) keywords: excel file with a list of 60 crisis-related keywords (five keywords for each of the three individual crises in four languages); (3) a gzipped JSONL file per language: each line in the JSONL files represents a JSON object containing metadata about a tweet matching either one or more of the user handles and one or more of the keywords in the respective language. Additionally, semantic enrichments (i.e., entities and sentiments) calculated on the basis of the tweet text are provided. The JSON object includes the following fields: tweetId: integer timeStamp: format ("EEE MMM dd HH:mm:ss Z yyyy") userName: JSON object, for private persons containing the MD5 hashed of the username; for the public persons in the user list containing the username and the MD5 hashed of the username userBio: string (available only for public users in the user list) followers: integer friends: integer retweets: integer favorites: integer replies: integer matchingKeywords: list of strings representing the matching keywords matchingUserMentions: list of strings representing the matching user mentions matchingUserName: string representing the matching user names sentiments: JSON object containing the output of the VADER sentiment analysis tool (available only for German, English and French). entities: JSON object containing the output of Entity Fishing named entity linking tool hashtags: list of strings containing the hashtags extracted from the tweet text mentions: list of strings containing the mentions extracted from the tweet text urls: JSON object containing short URLs extracted from the tweet text and their resolved URLs The dataset may serve to track and examine the repercussions/resonance produced by the ‘digital audience’ of the most influential political leaders in the course of the three crises, thus hinting at the political and societal impact their communicative actions had in the digital realm. Additionally, changes in sentiments, argumentation and/or tonality as well as more general breakpoints of discussion might be identified by conducting in-depth analyses of the online discourse relating to each of the three debates. Ultimately, the data may yield new insights into networks of communication among ‘online champions’ in the diplomatic community with regard to global political crises. To this end, researchers will be able to employ both quantitative/statistical and qualitative/hermeneutic methodologies to further explore and compare specific communicative motivations of national political leaders and the global ‘digital public’ in such cases. The data might therefore be used as a valuable empirical input not merely for political or media scientists, but also for scholars focusing on sociological, economic or socio-psychological aspects of crisis communication. (xsd:string)
?:archivedAt
?:category
  • Information Science (de)
  • Information Science (en)
  • Interdisciplinary and Applied Fields of the Social Sciences (de)
  • Interdisciplinary and Applied Fields of the Social Sciences (en)
  • Interpersonal Communication (en)
  • Interpersonal Communication (de)
  • Mass Communication (de)
  • Mass Communication (en)
?:citationString
  • Petermann, Jan-Henrik, Bensmann, Felix, Zhang, Yudong, & Dimitrov, Dimitar (2025): Tweetplomacy 23 – An Annotated Collection of Tweets Outlining Strategies of Political Risk Communication during Global Crises (2018-2023). GESIS, Cologne. Data File Version 1.0.0, https://doi.org/10.7802/2860 (en)
  • Petermann, Jan-Henrik, Bensmann, Felix, Zhang, Yudong, & Dimitrov, Dimitar (2025): Tweetplomacy 23 – An Annotated Collection of Tweets Outlining Strategies of Political Risk Communication during Global Crises (2018-2023). GESIS, Köln. Datenfile Version 1.0.0, https://doi.org/10.7802/2860 (de)
?:comment
  • keywords- and user list-based extraction (xsd:string)
?:conditionsOfAccess
  • Free access (without registration) (en)
  • Freier Zugang (ohne Registrierung) (de)
?:currentVersion
  • 1.0.0, https://doi.org/10.7802/2860 (xsd:string)
?:dateCreated
  • 2025 (xsd:gyear)
?:dateModified
  • 2025-01-01 (xsd:date)
?:datePublished
  • 2025 (xsd:gyear)
?:doi
  • 10.7802/2860 ()
?:endDate
  • 2018-01-01 (xsd:date)
?:hasFulltext
  • true (xsd:boolean)
is ?:hasPart of
?:license
  • CC BY-NC 4.0: Attribution – NonCommercial (https://creativecommons.org/licenses/by-nc/4.0/deed.de) (xsd:string)
?:linksCodebook
?:name
  • Tweetplomacy 23 – An Annotated Collection of Tweets Outlining Strategies of Political Risk Communication during Global Crises (2018-2023) (xsd:string)
?:principalInvestigator
  • Bensmann, Felix (xsd:string)
  • Dimitrov, Dimitar (xsd:string)
  • Petermann, Jan-Henrik (xsd:string)
  • Zhang, Yudong (xsd:string)
?:provider
?:publicationType
  • SowiDataNet|datorium (en)
?:publisher
?:sourceInfo
  • GESIS, Cologne. Data File Version 1.0.0, https://doi.org/10.7802/2860 (en)
  • GESIS, Köln. Datenfile Version 1.0.0, https://doi.org/10.7802/2860 (de)
  • GESIS-SowiDataNet|datorium (xsd:string)
?:startDate
  • 2018-01-01 (xsd:date)
?:studyPublications
  • Dimitrov, D., Baran, E., Fafalios, F., Yu, R., Zhu, X., Zloch, M., and Dietze, S., TweetsCOV19 -- A Knowledge Base of Semantically Annotated Tweets about the COVID-19 Pandemic, 29th ACM (xsd:string)
  • International Conference on Information & Knowledge Management (CIKM2020), Resource Track, ACM 2020 (xsd:string)
?:thematicCollection
  • Digital Behavioral Data (en)
  • Digitale Verhaltensdaten (de)
rdf:type
?:variableMeasured
  • 1% random sample Twitter/X archive (xsd:string)