Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Introducing the UCDP Candidate Events Dataset
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Social Sciences, Department of Peace and Conflict Research.ORCID iD: 0000-0002-5076-0994
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Social Sciences, Department of Peace and Conflict Research.ORCID iD: 0000-0002-5372-7129
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Social Sciences, Department of Peace and Conflict Research.ORCID iD: 0000-0002-4998-7964
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Social Sciences, Department of Peace and Conflict Research.ORCID iD: 0000-0002-0087-8724
2020 (English)In: Research & Politics, E-ISSN 2053-1680, Vol. 7, no 3, p. 1-8Article in journal (Refereed) Published
Abstract [en]

This article presents a new, monthly updated dataset on organized violence—the Uppsala Conflict Data Program Candidate Events Dataset. It contains recent observations of candidate events, a majority of which are eventually included in the Uppsala Conflict Data Program Georeferenced Event Dataset as part of its annual update after a careful vetting process. We describe the definitions, sources and procedures employed to code the candidate events, and a set of issues that emerge when coding data on organized violence in near-real time. Together, the Uppsala Conflict Data Program Candidate and Georeferenced Event Datasets minimize an inherent trade-off between update speed and quality control. Having monthly updated conflict data is advantageous for users needing near-real time monitoring of violent situations and aiming to anticipate future developments. To demonstrate this, we show that including them in a conflict forecasting system yields distinct improvements in terms of predictive performance: Average precision increases by 20–40% relative to using the Uppsala Conflict Data Program Georeferenced Event Dataset only. We also show that to ensure quality and consistency, revisiting the initial coding making use of sources that become available later is absolutely necessary.

Place, publisher, year, edition, pages
2020. Vol. 7, no 3, p. 1-8
Keywords [en]
Armed conflict, event data, Africa, forecasting
National Category
Political Science (excluding Public Administration Studies and Globalisation Studies)
Research subject
Peace and Conflict Research; Peace and Conflict Research
Identifiers
URN: urn:nbn:se:uu:diva-420461DOI: 10.1177/2053168020935257ISI: 000575049900001OAI: oai:DiVA.org:uu-420461DiVA, id: diva2:1470850
Part of project
ViEWS: a political Violence Early Warning System
Funder
EU, European Research Council, H2020-ERC-2015-AdG 694640Swedish National Infrastructure for Computing (SNIC)Available from: 2020-09-26 Created: 2020-09-26 Last updated: 2024-12-12Bibliographically approved
In thesis
1. Forecasting battles: New machine learning methods for predicting armed conflict
Open this publication in new window or tab >>Forecasting battles: New machine learning methods for predicting armed conflict
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Over the past decade, the field of conflict forecasting has undergone a remarkable metamorphosis, transforming from a series of isolated efforts with low predictive power into large, globe-spanning projects with impressive performance. However, despite this evolution, many challenges still remain. First, while we are good at predicting absolute risks, we are poor at predicting conflict dynamics (onsets, escalations, de-escalations and terminations). Second, we are over-reliant on spatio-temporal features and mechanistic models due to the nature of the event-data we use, thus excluding actor agency. Third, we do not handle either data or model uncertainty. Fourth, we are lagging behind the state-of-the-art in machine-learning. This dissertation attempts to resolve some of these salient difficulties, by contributing to six core elements of current-generation forecasting systems. First, time, by looking at the substantive effects and uncertainties of the temporal distance between data and forecast horizons. Second, space, by looking at the inherent uncertainties of high-resolution geospatial data and proposing a statistical method to address this. Third, feature space, by tackling the extreme feature sparsity in event-data and proposing a novel, deep active learning approach to mine features from existing large conflict-related text corpora. Fourth, substantive knowledge, by combining findings from the previous papers to take a fresh look at the microdynamics of conflict escalation. Fifth, the forecasting process itself, by building models that directly forecast from text, eliminating the intermediate step of manual data curation. Finally, the frontier of event-data, by looking at whether the news-media heavy way we collect violent fatal events can be extended to the collection of non-violent events. Methodologically, the dissertation introduces state-of-the art methods to the field, including the use of large language models, Gaussian processes, active learning and deep time series modelling. The six papers in the dissertation exhibit significant performance improvement, especially in forecasting dynamics.

Place, publisher, year, edition, pages
Uppsala: Uppsala University, 2025. p. 62
Series
Report / Department of Peace and Conflict Research, ISSN 0566-8808 ; 132
Keywords
conflict forecasting, predictive methodology, event data, battle events, spatial forecasting, machine learning, large language models, computational linguistics, civil war, armed conflict
National Category
Political Science (excluding Public Administration Studies and Globalisation Studies) Other Social Sciences not elsewhere specified Peace and Conflict Studies Other Social Sciences not elsewhere specified Computer Sciences Social and Economic Geography
Research subject
Peace and Conflict Research; Computational Linguistics; Political Science; Social and Economic Geography; Machine learning
Identifiers
urn:nbn:se:uu:diva-545176 (URN)978-91-506-3086-2 (ISBN)
Public defence
2025-03-21, Brusewitzsalen, Gamla Torget 6, Uppsala, 13:15 (English)
Opponent
Supervisors
Available from: 2025-01-27 Created: 2024-12-12 Last updated: 2025-02-20

Open Access in DiVA

fulltext(2054 kB)588 downloads
File information
File name FULLTEXT01.pdfFile size 2054 kBChecksum SHA-512
ce0aed59c7e49db38756b686c2559a119db1261f6fdf74f05ddadef5c62a9825b5e092795c6416f468e3c64ad7c032a9d92b0bbd6453b38fbd0580df8b50a0b0
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Hegre, HåvardCroicu, MihaiEck, KristineHögbladh, Stina

Search in DiVA

By author/editor
Hegre, HåvardCroicu, MihaiEck, KristineHögbladh, Stina
By organisation
Department of Peace and Conflict Research
In the same journal
Research & Politics
Political Science (excluding Public Administration Studies and Globalisation Studies)

Search outside of DiVA

GoogleGoogle Scholar
Total: 588 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 164 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf