利用两阶段数据改进鼻咽癌生存结果预测的方法

Leveraging Two-Phase Data for Improved Prediction of Survival Outcomes with Application to Nasopharyngeal Cancer

摘要 Abstract

准确的生存预测模型对于改善癌症患者的靶向治疗和临床护理至关重要。本文研究并提出了一种利用两阶段数据及专家知识和预后指数来提高癌症生存预测的方法。我们的工作受到鼻咽癌(NPC)中两阶段数据的启发,其中传统协变量对所有受试者都可获得,但主要病毒因素人乳头瘤病毒(HPV)却大量缺失。为了解决这一挑战,我们提出了一种基于观察到的协变量和关键因素临床重要性的专家引导方法。该方法高效利用了现有数据,而不仅仅是丢弃未知HPV状态的患者。我们通过一系列模拟研究和鼻咽癌患者的真实数据分析,应用并评估了所提出的方法与其他现有方法的表现。在各种设定下,所提出的方法在一致性指数(c-index)、校准斜率和集成Brier分数方面始终优于竞争方法。通过有效地利用两阶段数据,该模型为生存模型提供了更准确和可靠的预测能力。

Accurate survival predicting models are essential for improving targeted cancer therapies and clinical care among cancer patients. In this article, we investigate and develop a method to improve predictions of survival in cancer by leveraging two-phase data with expert knowledge and prognostic index. Our work is motivated by two-phase data in nasopharyngeal cancer (NPC), where traditional covariates are readily available for all subjects, but the primary viral factor, Human Papillomavirus (HPV), is substantially missing. To address this challenge, we propose an expert guided method that incorporates prognostic index based on the observed covariates and clinical importance of key factors. The proposed method makes efficient use of available data, not simply discarding patients with unknown HPV status. We apply the proposed method and evaluate it against other existing approaches through a series of simulation studies and real data example of NPC patients. Under various settings, the proposed method consistently outperforms competing methods in terms of c-index, calibration slope, and integrated Brier score. By efficiently leveraging two-phase data, the model provides a more accurate and reliable predictive ability of survival models.

利用两阶段数据改进鼻咽癌生存结果预测的方法 - arXiv