Correcting for Observation Bias in Cancer Progression Modeling

Schill, Rudolf and Klever, Maren and Loesch, Andreas and Hu, Y. Linda and Vocht, Stefan and Rupp, Kevin and Grasedyck, Lars and Spang, Rainer and Beerenwinkel, Niko (2024) Correcting for Observation Bias in Cancer Progression Modeling. JOURNAL OF COMPUTATIONAL BIOLOGY, 31 (10). pp. 927-945. ISSN 1066-5277, 1557-8666

Full text not available from this repository. (Request a copy)

Abstract

Tumor progression is driven by the accumulation of genetic alterations, including both point mutations and copy number changes. Understanding the temporal sequence of these events is crucial for comprehending the disease but is not directly discernible from cross-sectional genomic data. Cancer progression models, including Mutual Hazard Networks (MHNs), aim to reconstruct the dynamics of tumor progression by learning the causal interactions between genetic events based on their co-occurrence patterns in cross-sectional data. Here, we highlight a commonly overlooked bias in cross-sectional datasets that can distort progression modeling. Tumors become clinically detectable when they cause symptoms or are identified through imaging or tests. Detection factors, such as size, inflammation (fever, fatigue), and elevated biochemical markers, are influenced by genomic alterations. Ignoring these effects leads to "conditioning on a collider" bias, where events making the tumor more observable appear anticorrelated, creating false suppressive effects or masking promoting effects among genetic events. We enhance MHNs by incorporating the effects of genetic progression events on the inclusion of a tumor in a dataset, thus correcting for collider bias. We derive an efficient tensor formula for the likelihood function and apply it to two datasets from the MSK-IMPACT study. In colon adenocarcinoma, we observe a significantly higher rate of clinical detection for TP53-positive tumors, while in lung adenocarcinoma, the same is true for EGFR-positive tumors. Compared to classical MHNs, this approach eliminates several spurious suppressive interactions and uncovers multiple promoting effects.

Item Type: Article
Uncontrolled Keywords: INFERENCE; EVOLUTION; PATHWAYS; APC; TUMORIGENESIS; MUTATIONS; KRAS; cancer progression model; selection bias; collider bias
Subjects: 000 Computer science, information & general works > 004 Computer science
600 Technology > 610 Medical sciences Medicine
Divisions: Medicine > Institut für Funktionelle Genomik > Lehrstuhl für Statistische Bioinformatik (Prof. Spang)
Informatics and Data Science > Department Computational Life Science > Lehrstuhl für Statistische Bioinformatik (Prof. Spang)
Depositing User: Dr. Gernot Deinzer
Date Deposited: 04 Dec 2025 07:02
Last Modified: 04 Dec 2025 07:02
URI: https://pred.uni-regensburg.de/id/eprint/64697

Actions (login required)

View Item View Item