Logo: to the web site of Uppsala University

uu.sePublications from Uppsala University
System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
23456785 of 25
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Forecasting battles: New machine learning methods for predicting armed conflict
Uppsala University, Disciplinary Domain of Humanities and Social Sciences, Faculty of Social Sciences, Department of Peace and Conflict Research. (VIEWS: Violence and Impacts Early-Warning System)ORCID iD: 0000-0002-5372-7129
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Description
Abstract [en]

Over the past decade, the field of conflict forecasting has undergone a remarkable metamorphosis, transforming from a series of isolated efforts with low predictive power into large, globe-spanning projects with impressive performance. However, despite this evolution, many challenges still remain. First, while we are good at predicting absolute risks, we are poor at predicting conflict dynamics (onsets, escalations, de-escalations and terminations). Second, we are over-reliant on spatio-temporal features and mechanistic models due to the nature of the event-data we use, thus excluding actor agency. Third, we do not handle either data or model uncertainty. Fourth, we are lagging behind the state-of-the-art in machine-learning. This dissertation attempts to resolve some of these salient difficulties, by contributing to six core elements of current-generation forecasting systems. First, time, by looking at the substantive effects and uncertainties of the temporal distance between data and forecast horizons. Second, space, by looking at the inherent uncertainties of high-resolution geospatial data and proposing a statistical method to address this. Third, feature space, by tackling the extreme feature sparsity in event-data and proposing a novel, deep active learning approach to mine features from existing large conflict-related text corpora. Fourth, substantive knowledge, by combining findings from the previous papers to take a fresh look at the microdynamics of conflict escalation. Fifth, the forecasting process itself, by building models that directly forecast from text, eliminating the intermediate step of manual data curation. Finally, the frontier of event-data, by looking at whether the news-media heavy way we collect violent fatal events can be extended to the collection of non-violent events. Methodologically, the dissertation introduces state-of-the art methods to the field, including the use of large language models, Gaussian processes, active learning and deep time series modelling. The six papers in the dissertation exhibit significant performance improvement, especially in forecasting dynamics.

Place, publisher, year, edition, pages
Uppsala: Uppsala University, 2025. , p. 62
Series
Report / Department of Peace and Conflict Research, ISSN 0566-8808 ; 132
Keywords [en]
conflict forecasting, predictive methodology, event data, battle events, spatial forecasting, machine learning, large language models, computational linguistics, civil war, armed conflict
National Category
Political Science (excluding Public Administration Studies and Globalisation Studies) Other Social Sciences not elsewhere specified Peace and Conflict Studies Other Social Sciences not elsewhere specified Computer Sciences Social and Economic Geography
Research subject
Peace and Conflict Research; Computational Linguistics; Political Science; Social and Economic Geography; Machine learning
Identifiers
URN: urn:nbn:se:uu:diva-545176ISBN: 978-91-506-3086-2 (print)OAI: oai:DiVA.org:uu-545176DiVA, id: diva2:1920778
Public defence
2025-03-21, Brusewitzsalen, Gamla Torget 6, Uppsala, 13:15 (English)
Opponent
Supervisors
Part of project
ViEWS: a political Violence Early Warning SystemSocieties at risk: The impact of armed conflict on human development, Riksbankens JubileumsfondAvailable from: 2025-01-27 Created: 2024-12-12 Last updated: 2025-02-20
List of papers
1. Introducing the UCDP Candidate Events Dataset
Open this publication in new window or tab >>Introducing the UCDP Candidate Events Dataset
2020 (English)In: Research & Politics, E-ISSN 2053-1680, Vol. 7, no 3, p. 1-8Article in journal (Refereed) Published
Abstract [en]

This article presents a new, monthly updated dataset on organized violence—the Uppsala Conflict Data Program Candidate Events Dataset. It contains recent observations of candidate events, a majority of which are eventually included in the Uppsala Conflict Data Program Georeferenced Event Dataset as part of its annual update after a careful vetting process. We describe the definitions, sources and procedures employed to code the candidate events, and a set of issues that emerge when coding data on organized violence in near-real time. Together, the Uppsala Conflict Data Program Candidate and Georeferenced Event Datasets minimize an inherent trade-off between update speed and quality control. Having monthly updated conflict data is advantageous for users needing near-real time monitoring of violent situations and aiming to anticipate future developments. To demonstrate this, we show that including them in a conflict forecasting system yields distinct improvements in terms of predictive performance: Average precision increases by 20–40% relative to using the Uppsala Conflict Data Program Georeferenced Event Dataset only. We also show that to ensure quality and consistency, revisiting the initial coding making use of sources that become available later is absolutely necessary.

Keywords
Armed conflict, event data, Africa, forecasting
National Category
Political Science (excluding Public Administration Studies and Globalisation Studies)
Research subject
Peace and Conflict Research; Peace and Conflict Research
Identifiers
urn:nbn:se:uu:diva-420461 (URN)10.1177/2053168020935257 (DOI)000575049900001 ()
Funder
EU, European Research Council, H2020-ERC-2015-AdG 694640Swedish National Infrastructure for Computing (SNIC)
Available from: 2020-09-26 Created: 2020-09-26 Last updated: 2024-12-12Bibliographically approved
2. Enhancing geospatial precision in conflict data: A stochastic approach to addressing known geographically imprecise observations in conflict event data
Open this publication in new window or tab >>Enhancing geospatial precision in conflict data: A stochastic approach to addressing known geographically imprecise observations in conflict event data
2023 (English)Conference paper, Oral presentation with published abstract (Other academic)
Abstract [en]

The proliferation of large-scale, geographically disaggregated data on armed conflicts, protests, and similar events has opened new avenues of research, but has also introduced significant data quality challenges. A notable yet often overlooked issue involves observations with “known geographic imprecision” (KGI), where event locations are unknown and instead arbitrarily assigned by dataset authors. Although this issue is widely recognized and accounts for up to a quarter of observations in datasets like UCDP GED, it is rarely addressed by users. This paper presents a stochastic method derived from the multiple-imputation literature, employing spatio-temporal Gaussian processes and leveraging latent actor-conflict features in the data to enhance location accuracy. Extensive Monte-Carlo simulations demonstrate that this approach substantially enhances the accuracy of these observations and improves predictive performance beyond the state-of-the-art when applied out-of-sample. Additionally, an adapted version of the UCDP GED dataset that employs this new procedure is provided, showcasing the practical application and benefits of the methodology.

Place, publisher, year, edition, pages
International Studies Association, 2023
National Category
Social and Economic Geography Other Social Sciences not elsewhere specified Peace and Conflict Studies Other Social Sciences not elsewhere specified
Identifiers
urn:nbn:se:uu:diva-544709 (URN)
Conference
64th International Studies Association Annual Convention, Montreal, Canada, 15-18 March, 2023
Available from: 2024-12-07 Created: 2024-12-07 Last updated: 2025-02-20
3. Deep active learning for data mining from conflict text corpora
Open this publication in new window or tab >>Deep active learning for data mining from conflict text corpora
(English)Manuscript (preprint) (Other academic)
Abstract [en]

High-resolution event data on armed conflict and related processes have revolutionized the study of political contention. However, most datasets of this type only collect spatio-temporal and conflict intensity data at that level of detail. Information on dynamics, such as targets, tactics, and purposes, is rarely collected due to the substantial effort of collecting data. This study proposes an inexpensive, high-performance approach to increase the feature richness of such datasets by leveraging active learning -- an iterative process of improving a machine learning model based on guided human input at each step of the learning process. Active learning is employed to then fine-tune (train in steps) a large, encoder-only language model fitted to the rich corpus of textual data underlying such datasets. This allows for the extraction of features related to conflict dynamics, such as electoral violence and attacks on religious targets. The approach achieves a performance comparable to the human (gold-standard) coding, while reducing the necessary human annotation by as much as 99 percent.

National Category
Other Social Sciences not elsewhere specified Peace and Conflict Studies Other Social Sciences not elsewhere specified Political Science (excluding Public Administration Studies and Globalisation Studies) Computer Sciences
Research subject
Peace and Conflict Research; Machine learning
Identifiers
urn:nbn:se:uu:diva-544706 (URN)
Available from: 2024-12-07 Created: 2024-12-07 Last updated: 2025-02-20
4. Provocation by Design?: Holy Places, Public Transport, and Civil Conflict Escalation
Open this publication in new window or tab >>Provocation by Design?: Holy Places, Public Transport, and Civil Conflict Escalation
(English)Manuscript (preprint) (Other academic)
Abstract [en]

What explains conflict escalation during civil war? This article explores whether provocative attacks on religious sites and public transport constitute a precursor to a surge of violence. One argument pertains that the symbolic and doctrinal importance of places of worship means that attacks on these will affect individuals and the community emotionally and thereby increase the risk of escalation. However, it can also be suggested that the everyday societal importance of a public space is similar for religious sites and public transport hubs. We test these arguments using novel new global event data on these forms of selective targeting for 1989-2015, and find that the risk of conflict escalation increase in the aftermath of either attacks on places of worship or public transport, suggesting that community behavior is more affected to disruptions of societal everyday life than to the importance of symbols.

National Category
Political Science (excluding Public Administration Studies and Globalisation Studies) Peace and Conflict Studies Other Social Sciences not elsewhere specified
Research subject
Political Science; Peace and Conflict Research; Peace and Conflict Research; Administrative Law; Peace and Conflict Research
Identifiers
urn:nbn:se:uu:diva-544707 (URN)
Available from: 2024-12-07 Created: 2024-12-07 Last updated: 2025-02-20
5. From newswire to nexus: Using text-based actor embeddings and transformer networks to forecast conflict dynamics
Open this publication in new window or tab >>From newswire to nexus: Using text-based actor embeddings and transformer networks to forecast conflict dynamics
2024 (English)Conference paper, Oral presentation with published abstract (Other academic)
Abstract [en]

This study advances the field of conflict forecasting by using text-based actor embeddings with transformer models to predict dynamic changes in violent conflict patterns at the actor level. More specifically, we combine newswire texts with structured conflict event data and leverage recent advances in Natural Language Processing (NLP) techniques to forecast escalations and de-escalations among conflicting actors, such as governments, militias, separatist movements, and terrorists. This new approach accurately and promptly captures the inherently volatile patterns of violent conflicts, which existing methods have not been able to achieve. To create this framework, we began by curating and annotating a vast international newswire corpus, leveraging hand-labeled event data from the Uppsala Conflict Data Program. By using this hybrid dataset,  our models can incorporate the textual context of news sources along with the precision and detail of structured event data. This combination enables us to make both dynamic and granular predictions about conflict developments. We validate our approach through rigorous back-testing against historical events, demonstrating superior out-of-sample predictive power. We find that our approach is quite effective in identifying and predicting phases of conflict escalation and de-escalation, surpassing the capabilities of traditional models. By focusing on actor interactions, our explicit goal is to provide actionable insights to policymakers, humanitarian organizations, and peacekeeping operations in order to enable targeted and effective intervention strategies.

Place, publisher, year, edition, pages
The American Political Science Association, 2024
National Category
Natural Language Processing Political Science (excluding Public Administration Studies and Globalisation Studies) Peace and Conflict Studies Other Social Sciences not elsewhere specified Computer Sciences
Identifiers
urn:nbn:se:uu:diva-544710 (URN)
Conference
120th Annual Meeting of the American Political Science Association, Philadelphia, United States of America, September 5--8
Available from: 2024-12-07 Created: 2024-12-07 Last updated: 2025-02-20
6. Reporting of Non-Fatal Conflict Events
Open this publication in new window or tab >>Reporting of Non-Fatal Conflict Events
2022 (English)In: International Interactions, ISSN 0305-0629, E-ISSN 1547-7444, Vol. 48, no 3, p. 450-470Article in journal (Refereed) Published
Abstract [en]

Temporally and spatial disaggregated datasets are commonly used to study political violence. Researchers are increasingly studying the data generation process itself to understand the selection processes by which conflict events are included in conflict datasets. This work has focused on conflict fatalities. In this research note, we explore how non-fatal conflict events are reported upon and enter into datasets of armed conflict. To do so, we compare reported non-fatal conflict events with the population of events in two direct observation datasets, collected using a boots-on-the-ground strategy: mass abductions in Nepal (1996-2006) and troop movements in Darfur. We show that at the appropriate level of aggregation media reporting on abductions in Nepal largely mirrors the "true" population of abductions, but at more disaggregated levels of temporal or spatial analysis, the match is poor. We also show that there is no overlap between a media-driven conflict dataset and directly-observed data on troop movements in Sudan. These empirics indicate that non-fatal data can suffer from serious underreporting and that this is particularly the case for events lacking elements of coercion. These findings are indicative of selection problems in regards to the reporting on non-fatal conflict events.

Place, publisher, year, edition, pages
Taylor & Francis, 2022
Keywords
Conflict, data, event data, non-violence, reporting bias
National Category
Public Health, Global Health and Social Medicine
Identifiers
urn:nbn:se:uu:diva-483047 (URN)10.1080/03050629.2022.2044325 (DOI)000768102400001 ()
Funder
EU, European Research Council, 694640 - ViEWS
Available from: 2022-09-08 Created: 2022-09-08 Last updated: 2025-02-20Bibliographically approved

Open Access in DiVA

UUThesis_Croicu,M-2025(567 kB)213 downloads
File information
File name FULLTEXT01.pdfFile size 567 kBChecksum SHA-512
dd771b445525e4dc8a88a0ecf91be3084bafa653e7fcbd6d294cff4a323ca8671b3e3a44b5db2cec4595edf89cecb862a5d598fe0cc919626b1834b58096ad2c
Type fulltextMimetype application/pdf

Authority records

Croicu, Mihai

Search in DiVA

By author/editor
Croicu, Mihai
By organisation
Department of Peace and Conflict Research
Political Science (excluding Public Administration Studies and Globalisation Studies)Other Social Sciences not elsewhere specifiedPeace and Conflict StudiesOther Social Sciences not elsewhere specifiedComputer SciencesSocial and Economic Geography

Search outside of DiVA

GoogleGoogle Scholar
Total: 213 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1735 hits
23456785 of 25
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf