Trend Analysis · Medicine & Health

Explainable AI in Medical Imaging: Why Accuracy Is Not Enough

Medical AI achieves radiologist-level accuracy on narrow tasks, but clinicians don't trust what they can't understand. A framework distinguishing transparency, interpretability, and explainability reveals why current XAI methods fall short of clinical needs. The gap is conceptual, not computational.

By Sean K.S. Shin
This blog summarizes research trends based on published paper abstracts. Specific numbers or findings may contain inaccuracies. For scholarly rigor, always consult the original papers cited in each post.

A medical AI system that diagnoses diabetic retinopathy with 97% sensitivity but cannot explain why it flagged a particular image will not be adopted by ophthalmologists. A chest X-ray classifier that outperforms radiologists on pneumonia detection but highlights random image regions in its saliency map will not be trusted in emergency departments. A pathology model that predicts cancer grade with an area under the curve of 0.95 but cannot articulate which morphological features drove the prediction will not change clinical workflows.

These are not hypothetical scenarios. They describe the current state of medical AI: systems that achieve impressive accuracy metrics on retrospective datasets while failing to meet the explanatory standards that clinical medicine demands. The explainability gap, the distance between what an AI system can predict and what it can justify, is now the primary barrier to clinical adoption, more limiting than accuracy, regulatory approval, or infrastructure readiness.

A Framework for Accountable AI

Singh et al. (2025) provide a needed conceptual clarification in Bioengineering. They argue that the terms "transparency," "interpretability," and "explainability" are used interchangeably in the literature but refer to distinct properties:

Transparency: Can you inspect the model's internal state? This is a property of the architecture. Linear models are transparent; deep neural networks generally are not. Transparency enables auditing but does not guarantee understanding.

Interpretability: Can you map the model's internal representations to human-understandable concepts? Attention maps, feature visualizations, and concept activation vectors are interpretability tools. They reveal what the model attends to, but not why.

Explainability: Can the model produce a reasoning chain that a domain expert would find convincing? This is the highest standard, and the one that clinical adoption requires. An explanation must not only identify relevant features but articulate the inferential steps from observation to conclusion in domain-appropriate language.
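
The first two levels are easy to make concrete in code. A minimal sketch, assuming scikit-learn and its built-in breast-cancer dataset (not from any of the cited papers): a logistic regression is transparent because its weights are directly inspectable, yet a list of weights alone still falls short of a clinical explanation.

```python
# Transparency in a linear model: every internal parameter is inspectable.
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

data = load_breast_cancer()
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(data.data, data.target)

# Auditing is possible: print the five most influential features by weight.
weights = model.named_steps["logisticregression"].coef_[0]
for name, w in sorted(zip(data.feature_names, weights), key=lambda p: -abs(p[1]))[:5]:
    print(f"{name}: {w:+.2f}")
# But transparency is not explanation: the weights say WHICH features the
# model uses, not WHY those features matter clinically.
```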

Singh et al. catalog the dominant post hoc explanation methods (Grad-CAM, SHAP, LIME, integrated gradients) and identify a common limitation: they explain the model's prediction, not the model's reasoning. A Grad-CAM heatmap showing that the model attended to a lung nodule tells the radiologist that the model saw the nodule. It does not tell the radiologist why the model classified it as malignant rather than benign: whether the model detected spiculation, ground-glass opacity, growth on comparison with prior studies, or some entirely non-clinical feature like scanner artifacts.
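
The limitation is visible in the mechanics of the method itself. Below is a minimal Grad-CAM sketch in PyTorch; it is an illustrative reimplementation under stated assumptions (a stock ResNet-18 with random weights and a random stand-in image), not code from any of the cited papers.

```python
# Minimal Grad-CAM sketch (PyTorch + torchvision). It shows WHERE the model
# looks; it says nothing about WHY the model decides.
import torch
import torch.nn.functional as F
from torchvision.models import resnet18

model = resnet18(weights=None).eval()  # random weights: a stand-in, not a trained model
feats, grads = {}, {}
model.layer4.register_forward_hook(lambda m, i, o: feats.update(a=o))
model.layer4.register_full_backward_hook(lambda m, gi, go: grads.update(a=go[0]))

x = torch.randn(1, 3, 224, 224)        # stand-in for a chest X-ray or CT slice
logits = model(x)
logits[0, logits.argmax()].backward()  # gradient of the top-scoring class

w = grads["a"].mean(dim=(2, 3), keepdim=True)             # channel importance
cam = F.relu((w * feats["a"]).sum(dim=1, keepdim=True))   # weighted feature maps
cam = F.interpolate(cam, size=x.shape[2:], mode="bilinear", align_corners=False)
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)  # heatmap in [0, 1]
# The heatmap can localize a nodule, but it cannot distinguish "spiculated
# margin" from "scanner artifact" as the model's reason for the call.
```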

This distinction between attention and reasoning is the core of the explainability gap. Closing it requires moving beyond post hoc attribution methods toward models that produce structured, clinical-language explanations as part of their inference process.

Fusion Architectures for Explainability

Hussain et al. (2025), in IEEE Access, demonstrate a practical approach to improving both accuracy and interpretability. Their EFFResNet-ViT architecture fuses convolutional neural networks (CNNs) with vision transformers (ViTs) and integrates Grad-CAM++ explanations into the classification pipeline.

The fusion rationale: CNNs excel at local feature extraction (texture, edges, spatial patterns) while ViTs capture global context (relationships between distant image regions). Medical diagnosis often requires both: a lung nodule's significance depends on its local morphology and its global context (location, proximity to airways, comparison with contralateral lung).
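
A minimal fusion sketch (PyTorch/torchvision) illustrates this division of labor. It follows the spirit of EFFResNet-ViT but is not the authors' architecture; the backbone choices, feature dimensions, and fusion head are illustrative assumptions.

```python
# Generic CNN + ViT feature fusion by concatenation: local morphology from
# the CNN, long-range context from the ViT. Illustrative, not EFFResNet-ViT.
import torch
import torch.nn as nn
from torchvision.models import resnet50, vit_b_16

class FusionClassifier(nn.Module):
    def __init__(self, num_classes: int):
        super().__init__()
        cnn = resnet50(weights=None)
        self.cnn = nn.Sequential(*list(cnn.children())[:-1])  # 2048-d pooled features
        vit = vit_b_16(weights=None)
        vit.heads = nn.Identity()                              # 768-d class token
        self.vit = vit
        self.head = nn.Linear(2048 + 768, num_classes)

    def forward(self, x):
        local_f = self.cnn(x).flatten(1)   # texture, edges, local morphology
        global_f = self.vit(x)             # relationships between distant regions
        return self.head(torch.cat([local_f, global_f], dim=1))

model = FusionClassifier(num_classes=4)            # e.g., four tumor classes
logits = model(torch.randn(2, 3, 224, 224))        # ViT-B/16 expects 224x224
print(logits.shape)                                # torch.Size([2, 4])
```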

Hussain et al. report high accuracy on brain tumor CE-MRI classification (99.31%) and retinal image classification (92.54%), outperforming CNN-only and ViT-only baselines on both datasets. The Grad-CAM++ visualizations show improved spatial localization compared to standard Grad-CAM, attending more precisely to clinically relevant regions. However, the authors acknowledge that spatial localization is a necessary but insufficient condition for clinical explainability: knowing where the model looks does not establish why it concludes what it does.

Bridging Pathology and Radiology

He et al. (2025) address an application where explainability is particularly critical: AI-assisted integration of pathology and radiology findings. Published in Frontiers in Medicine, their deep learning classifier operates across both imaging modalities, providing a unified diagnostic assessment that combines histopathological features (cell morphology, tissue architecture) with radiological features (density, enhancement patterns, anatomical location).

The clinical value is clear: pathology and radiology currently operate as separate diagnostic silos, with integration happening only in the clinician's mind during multidisciplinary tumor boards. An AI system that performs this integration computationally could surface discordances (cases where pathology and radiology suggest different diagnoses) that might otherwise be missed.

The explainability challenge is compounded: the system must explain not only its individual modality assessments but also how it integrates information across modalities. He et al. use attention mechanisms to visualize cross-modal information flow, showing which radiology features influence the pathology interpretation and vice versa: a step toward multimodal explanations, though still at the level of attention rather than reasoning.
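
The mechanism can be sketched with standard cross-attention. The module below is hypothetical (He et al.'s model details differ), but it shows how the attention weights themselves become the visualizable cross-modal map.

```python
# Cross-modal attention sketch: pathology tokens query radiology tokens, and
# the returned weight matrix exposes cross-modal information flow.
# Hypothetical illustration, not He et al.'s exact architecture.
import torch
import torch.nn as nn

d = 256
attn = nn.MultiheadAttention(embed_dim=d, num_heads=4, batch_first=True)

path_tokens = torch.randn(1, 50, d)   # e.g., 50 histopathology patch embeddings
rad_tokens = torch.randn(1, 30, d)    # e.g., 30 radiology region embeddings

fused, weights = attn(query=path_tokens, key=rad_tokens, value=rad_tokens,
                      average_attn_weights=True)
print(fused.shape)    # (1, 50, 256): pathology features enriched by radiology
print(weights.shape)  # (1, 50, 30): inspectable per pathology/radiology pair
# The weight matrix is an interpretability artifact: it shows WHICH radiology
# regions influenced each pathology token, but attention is still not reasoning.
```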

Concept-Level Explanations

Nie et al. (2025) represent the frontier of explainable medical AI. Their concept-enhanced vision-language pre-training approach trains the model to ground its predictions in a vocabulary of human-interpretable medical concepts rather than in raw pixel features.

The difference is analogous to the difference between a medical student who says "I think it's cancer because the image looks weird" and a senior radiologist who says "the lesion demonstrates arterial-phase hyperenhancement with washout on the portal venous phase and an enhancing capsule, meeting LI-RADS 5 criteria for hepatocellular carcinoma." Both may reach the correct diagnosis; only the second provides a clinically useful explanation.

Nie et al. train their model on large-scale concept annotations derived from radiology reports, teaching it to associate visual patterns with clinical terminology. The resulting model can articulate its predictions using concepts that radiologists recognize and evaluate, approaching the "Level 3 explainability" (justification) that Singh et al. advocate.
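
The idea can be illustrated with a concept-bottleneck-style sketch. This is a drastic simplification, not a reimplementation of Nie et al.'s large-scale vision-language pre-training, and the concept names and dimensions are illustrative.

```python
# Concept-bottleneck sketch: the prediction is forced through named clinical
# concepts, so the explanation IS the concept activations. Illustrative only.
import torch
import torch.nn as nn

CONCEPTS = ["arterial-phase hyperenhancement", "washout", "enhancing capsule",
            "spiculated margin", "ground-glass opacity"]

class ConceptBottleneck(nn.Module):
    def __init__(self, backbone_dim: int, num_classes: int):
        super().__init__()
        self.to_concepts = nn.Linear(backbone_dim, len(CONCEPTS))
        self.to_diagnosis = nn.Linear(len(CONCEPTS), num_classes)

    def forward(self, features):
        concept_scores = torch.sigmoid(self.to_concepts(features))  # per-concept evidence
        return self.to_diagnosis(concept_scores), concept_scores

model = ConceptBottleneck(backbone_dim=512, num_classes=3)
logits, scores = model(torch.randn(1, 512))  # stand-in image-encoder features
for name, s in sorted(zip(CONCEPTS, scores[0].tolist()), key=lambda p: -p[1]):
    print(f"{name}: {s:.2f}")  # the prediction speaks the radiologist's vocabulary
```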

The limitation: the concept vocabulary is fixed during training. Novel findings that fall outside the training concept space cannot be explained, and the model may force-fit unfamiliar patterns into existing concepts rather than acknowledging uncertainty.
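
One possible mitigation, offered here as a sketch rather than anything proposed in the paper: treat weak activation across the entire concept vocabulary as a signal to abstain and defer to a human reader.

```python
# Abstention sketch (not from Nie et al.): if no known concept activates
# strongly, refuse to force-fit the finding into the fixed vocabulary.
def explain_or_abstain(concept_scores, names, threshold=0.5):
    """Return concept-level evidence, or flag the case for human review."""
    activated = [(n, s) for n, s in zip(names, concept_scores) if s >= threshold]
    if not activated:
        return "No known concept fits this finding; deferring to human review."
    activated.sort(key=lambda pair: -pair[1])
    return "; ".join(f"{n} ({s:.2f})" for n, s in activated)

concepts = ["washout", "enhancing capsule", "spiculated margin"]
print(explain_or_abstain([0.12, 0.08, 0.21], concepts))   # abstains
print(explain_or_abstain([0.91, 0.64, 0.07], concepts))   # explains
```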

Critical Analysis: Claims and Evidence

| Claim | Evidence | Verdict |
| --- | --- | --- |
| Post hoc XAI methods (Grad-CAM, SHAP) are clinically adequate | Unstable attributions; attention ≠ reasoning (Singh et al.) | ❌ Refuted |
| Fusion CNN-ViT architectures improve medical image classification | 99.31% on brain tumor CE-MRI and 92.54% on retinal images, outperforming CNN-only and ViT-only baselines (Hussain et al.) | ✅ Supported |
| Concept-level explanations improve clinician trust | User studies show increased confidence (Nie et al.) | ✅ Supported (early results) |
| Cross-modal AI integration outperforms single-modality analysis | Demonstrated for pathology-radiology integration (He et al.) | ✅ Supported |
| Explainable medical AI is ready for clinical deployment | No prospective trial; regulatory standards for XAI undefined | ❌ Refuted |

Open Questions and Future Directions

  • Should explainability be mandatory for clinical AI? The FDA currently does not require AI/ML-based devices to provide explanations, only to demonstrate clinical performance. Should regulatory frameworks require a minimum level of explainability? What would that standard look like?
  • Can LLMs generate clinical-grade explanations for visual AI? Combining a visual classifier with a large language model that generates natural-language explanations could produce radiologist-style reports. The risk is that the LLM generates plausible-sounding but incorrect explanations, a failure mode with direct patient safety implications.
  • How do we evaluate explanations? Accuracy has well-defined metrics (AUROC, sensitivity, specificity). Explainability does not. Developing validated metrics for explanation quality (relevance, completeness, correctness, clinical utility) is an open research challenge; one candidate is sketched after this list.
  • Do clinicians actually want explanations? Some studies suggest that clinicians prefer confidence-calibrated predictions (knowing how sure the model is) over feature-level explanations (knowing why the model decided). The answer may depend on the clinical context, the clinician's experience level, and the stakes of the decision.
  • Can we build intrinsically interpretable medical AI? Rather than explaining black-box models post hoc, designing models that are interpretable by construction (using attention over predefined clinical concepts, case-based reasoning, or rule extraction) may be a more robust path to clinical trust.
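
As referenced in the evaluation bullet above, one candidate metric could look like the following: intersection over union (IoU) between a thresholded saliency map and an expert-annotated region. The sketch uses synthetic data, and the metric captures spatial relevance only, not reasoning quality.

```python
# IoU between the most salient pixels and an expert-annotated region: a
# common localization proxy for explanation quality. Synthetic data only.
import numpy as np

def saliency_iou(saliency: np.ndarray, expert_mask: np.ndarray, q: float = 0.99) -> float:
    """IoU of the top (1 - q) fraction of saliency vs. the expert's annotation."""
    hot = saliency >= np.quantile(saliency, q)        # keep the hottest pixels
    inter = np.logical_and(hot, expert_mask).sum()
    union = np.logical_or(hot, expert_mask).sum()
    return float(inter / union) if union else 0.0

# Toy example: a synthetic "nodule" heatmap vs. the region the expert circled.
sal = np.zeros((64, 64)); sal[28:36, 28:36] = 1.0            # model's heatmap
mask = np.zeros((64, 64), bool); mask[26:38, 26:38] = True   # expert annotation
print(f"IoU = {saliency_iou(sal, mask):.2f}")                # IoU = 0.44
```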

Implications for Clinical AI Adoption

The explainability gap in medical AI is not a technical problem awaiting a technical solution. It is a conceptual problem that requires reconsidering what we mean by "explanation" in clinical contexts. A heatmap is not an explanation. A confidence score is not an explanation. A list of attended features is not an explanation.

A clinical explanation connects observations to diagnoses through a chain of reasoning that references established medical knowledge, acknowledges uncertainty, and suggests next steps. Building AI systems that produce such explanations, not as an afterthought appended to a classifier but as an integral part of the diagnostic process, is the challenge that will determine whether medical AI becomes a trusted clinical tool or remains a laboratory curiosity.

References

[1] Hussain, T., Shouno, H., Hussain, A. (2025). EFFResNet-ViT: A fusion-based convolutional and vision transformer model for explainable medical image classification. IEEE Access, 13, 48921–48937.
[2] Singh, Y., Hathaway, Q.A., Keishing, V. et al. (2025). Beyond post hoc explanations: A comprehensive framework for accountable AI in medical imaging through transparency, interpretability, and explainability. Bioengineering, 12(8), 879.
[3] He, L., Luan, L., Hu, D. et al. (2025). Deep learning-based image classification for AI-assisted integration of pathology and radiology in medical imaging. Frontiers in Medicine, 12, 1574514.
[4] Nie, Y., He, S., Bie, Y. et al. (2025). An explainable biomedical foundation model via large-scale concept-enhanced vision-language pre-training. arXiv preprint.
