An improved transformer-driven approach for explainable ICD code prediction

Meghana rao Kanneganti

Back

An improved transformer-driven approach for explainable ICD code prediction

Thesis

Open access

An improved transformer-driven approach for explainable ICD code prediction

Meghana rao Kanneganti

California State University, Sacramento

Master of Science (MS), California State University, Sacramento

10/04/2024

Handle:

https://hdl.handle.net/20.500.12741/rep:12532

Abstract

Automated ICD coding is a multi-label classification task that involves assigning ICD codes to long clinical texts such as discharge summaries. It is an active research field and numerous studies demonstrate how using Natural language processing (NLP) techniques simplify this task making it cost-effective and aiding in accurate classifications. In recent years, transformers have extensively been used in NLP tasks, and offer a solution for accurately processing long texts and aid in automating ICD code classification. Transformers employ their self-attention mechanism to capture long-range dependencies. To enhance the state-of-the-art approaches for automated ICD-9 coding, we propose 2 models Long-LAT and Long-HiLAT leveraging the Longformer’s capability to process a large number of tokens. In Long-LAT, we introduce Label wise attention with the Clinical pretrained Longformer instead of solely relying on the classification head provided by the Longformer. This allows us to refine the classification process further by directing attention to the specific ICD codes. In Long-HiLAT, we make use of the Longformer’s sliding window attention to enhance the existing Hierarchical Label-wise Attention Transformer (HiLAT). This integration allows a more comprehensive analysis of the text and improves the model’s performance. Extensive experiments using the MIMIC-III top-50 and top-5 ICD code datasets show that Long-HiLAT achieved superior performance compared to the baseline models.

Files and links (1)

pdf

KannegantiMeghanarao_Spring20242.03 MBDownload View

TextProject Open Access

Metrics

1 Record Views

Details

Title: An improved transformer-driven approach for explainable ICD code prediction
Creators: Meghana rao Kanneganti
Contributors: Haiquan Chen (Advisor)
Parham Phoulady (Advisor)
Academic Unit: Computer Science Department
Theses and Dissertations: Master of Science (MS); Computer Science; California State University, Sacramento; 04/29/2024; 2024
Publisher: California State University, Sacramento
Publication Details: 10/04/2024
Identifiers: 99258164263601671; https://hdl.handle.net/20.500.12741/rep:12532
Resource Type: Masters Project
Language: English
Number of pages: 49
Comment: The accessibility of this document has been verified by Sacramento State University Library. For questions, please contact lib-508Accessibility@csus.edu.