Open this publication in new window or tab >>Institute of Medical Systems Bioinformatics, Center for Biomedical AI (bAIome), Center for Molecular Neurobiology Hamburg (ZMNH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
Institute of Medical Systems Bioinformatics, Center for Biomedical AI (bAIome), Center for Molecular Neurobiology Hamburg (ZMNH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany; Department of Medicine, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Medicine, Department of Surgical Sciences, Urology.
Institute of Pathology, RWTH Aachen University Hospital, Aachen, Germany.
Institute of Pathology, RWTH Aachen University Hospital, Aachen, Germany.
Institute of Medical Systems Bioinformatics, Center for Biomedical AI (bAIome), Center for Molecular Neurobiology Hamburg (ZMNH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Medicine, Department of Surgical Sciences, Urology.
Spearpoint Analytics AB, Stockholm, Sweden.
Spearpoint Analytics AB, Stockholm, Sweden.
Institute of Computational Systems Biology, University of Hamburg, Germany.
Department of Medicine, University Medical Center Hamburg-Eppendorf, Hamburg, Germany; Department of Clinical Medicine, Aarhus University, Aarhus, Denmark; Department of Pathology, Aarhus University Hospital, Aarhus, Denmark.
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Medicine, Department of Surgical Sciences, Urology.
Institute of Medical Systems Bioinformatics, Center for Biomedical AI (bAIome), Center for Molecular Neurobiology Hamburg (ZMNH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
Institute of Pathology, RWTH Aachen University Hospital, Aachen, Germany.
Spearpoint Analytics AB, Stockholm, Sweden.
Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Medicine, Department of Medical Cell Biology. Uppsala University, Science for Life Laboratory, SciLifeLab. Uppsala University, Disciplinary Domain of Medicine and Pharmacy, Faculty of Medicine, Department of Immunology, Genetics and Pathology, Cancer precision medicine.
Spearpoint Analytics AB, Stockholm, Sweden; epartment of Urology, Uppsala University Hospital, Uppsala, Sweden.
Martini-Klinik Prostate Cancer Center, University Hospital Hamburg-Eppendorf, Hamburg, Germany.
Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Computerized Image Analysis and Human-Computer Interaction. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Automatic control. Uppsala University, Disciplinary Domain of Science and Technology, Mathematics and Computer Science, Department of Information Technology, Division Vi3. Spearpoint Analytics AB, Stockholm, Sweden.
Institute of Pathology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
Institute of Medical Systems Bioinformatics, Center for Biomedical AI (bAIome), Center for Molecular Neurobiology Hamburg (ZMNH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
Institute of Medical Systems Bioinformatics, Center for Biomedical AI (bAIome), Center for Molecular Neurobiology Hamburg (ZMNH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany; Spearpoint Analytics AB, Stockholm, Sweden.
Show others...
2026 (English)In: Medical Image Analysis, ISSN 1361-8415, E-ISSN 1361-8423, Vol. 108, article id 103884Article in journal (Refereed) Published
Abstract [en]
The histopathological evaluation of biopsies by human experts is a gold standard in clinical disease diagnosis. While recent artificial intelligence-based (AI) approaches have reached human expert-level performance, they often display shortcomings caused by variations in sample preparation, limiting clinical applicability. This study investigates the impact of data variation on AI-based histopathological grading and explores algorithmic approaches that confer prediction robustness. To evaluate the impact of data variation in histopathology, we collected a multicentric, retrospective, observational prostate cancer (PCa) trial consisting of six cohorts in 3 countries with 25,591 patients, 83,864 images. This includes a high-variance dataset of 8,157 patients and 28,236 images with variations in section thickness, staining protocol, and scanner. This unique training dataset enabled the development of an AI-based PCa grading framework by training on patient outcome, not subjective grading. It was made robust through several algorithmic adaptations, including domain adversarial training and credibility-guided color adaptation. We named the final grading framework PCAI. We compare PCAI to a BASE model and human experts on three external test cohorts, comprising 2,255 patients and 9,437 images. Variations in sample processing, particularly section thickness and staining time, significantly reduced the performance of AI-based PCa grading by up to 8.6 percentage points in the event-ordered concordance index (EOC-Index) thus highlighting serious risks for AI-based histopathological grading. Algorithmic improvements for model robustness, credibility, and training on high-variance data as well as outcome-based severity prediction give rise to robust models with grading performance surpassing experienced pathologists. We demonstrate how our algorithmic enhancements for greater robustness lead to significantly better performance, surpassing expert grading on EOC-Index and 5-year AUROC by up to 21.2 percentage points.
Place, publisher, year, edition, pages
Elsevier, 2026
Keywords
Cancer grading, Deep learning, Digital histopathology, Robustness
National Category
Cancer and Oncology Radiology and Medical Imaging
Identifiers
urn:nbn:se:uu:diva-573494 (URN)10.1016/j.media.2025.103884 (DOI)001630602200001 ()41308537 (PubMedID)
Funder
EU, European Research Council, 101001791
2025-12-152025-12-152025-12-15Bibliographically approved