重新思考青光眼校准：基于投票的双眼数据与元数据集成

Research

arXiv

Rethinking Glaucoma Calibration: Voting-Based Binocular and Metadata Integration

Jaehoon Joo ,

摘要 Abstract

青光眼是一种不可治愈的眼科疾病，会损害视神经、导致视力丧失，并成为全球范围内致盲的主要原因之一。诊断青光眼通常涉及眼底摄影、光学相干断层扫描（OCT）以及视野测试。然而，OCT高昂的成本常常导致依赖于眼底摄影和视野测试，而这两者都存在固有的观察者间变异性。这种变异性源于青光眼作为一种多因素疾病，受到多种因素的影响。因此，青光眼诊断具有高度主观性，强调了校准的必要性，即调整预测概率与实际疾病可能性的一致性。适当的校准对于防止过度诊断或误诊至关重要，尤其是在高风险疾病中。尽管人工智能在提高诊断准确性方面取得了显著进展，但模型的过度自信却恶化了校准性能。近期研究开始关注青光眼的校准问题，但以往的研究尚未充分考虑青光眼的系统性特征及其诊断过程中的高度主观性。为克服这些局限性，我们提出了V-ViT（基于投票的ViT），这是一种新颖的框架，通过整合疾病特异性特征来增强校准效果。V-ViT集成了双眼数据和元数据，反映了青光眼诊断的多面性。此外，我们引入了一种基于MC Dropout的投票系统，以解决高度主观性问题。我们的方法在所有指标上均达到了最先进的性能，包括准确性，表明所提出的方法在解决校准问题方面是有效的。我们使用包含双眼数据的自定义数据集验证了该方法。

Glaucoma is an incurable ophthalmic disease that damages the optic nerve, leads to vision loss, and ranks among the leading causes of blindness worldwide. Diagnosing glaucoma typically involves fundus photography, optical coherence tomography (OCT), and visual field testing. However, the high cost of OCT often leads to reliance on fundus photography and visual field testing, both of which exhibit inherent inter-observer variability. This stems from glaucoma being a multifaceted disease that influenced by various factors. As a result, glaucoma diagnosis is highly subjective, emphasizing the necessity of calibration, which aligns predicted probabilities with actual disease likelihood. Proper calibration is essential to prevent overdiagnosis or misdiagnosis, which are critical concerns for high-risk diseases. Although AI has significantly improved diagnostic accuracy, overconfidence in models have worsen calibration performance. Recent study has begun focusing on calibration for glaucoma. Nevertheless, previous study has not fully considered glaucoma's systemic nature and the high subjectivity in its diagnostic process. To overcome these limitations, we propose V-ViT (Voting-based ViT), a novel framework that enhances calibration by incorporating disease-specific characteristics. V-ViT integrates binocular data and metadata, reflecting the multi-faceted nature of glaucoma diagnosis. Additionally, we introduce a MC dropout-based Voting System to address high subjectivity. Our approach achieves state-of-the-art performance across all metrics, including accuracy, demonstrating that our proposed methods are effective in addressing calibration issues. We validate our method using a custom dataset including binocular data.