How to Leverage Multimodal EHR Data for Better Medical Predictions?

Published:

Please cite:
@inproceedings{bo2021how,
title={How to Leverage Multimodal EHR Data for Better Medical Predictions?},
author={Yang, Bo and Wu, Lijun},
booktitle={Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)},
year={2021}
}

Abstract

Healthcare is becoming a more and more important research topic recently. With the growing data in the healthcare domain, it offers agreat opportunity for deep learning to improvethe quality of medical service. However, the complexity of electronic health records (EHR) data is a challenge for the application of deeplearning. Specifically, the data produced in the hospital admissions are monitored by the EHR system, which includes structured data likedaily body temperature, and unstructured datalike free text and laboratory measurements. Although there are some preprocessing frameworks proposed for specific EHR data, the clinical notes that contain significant clinical valueare beyond the realm of their consideration. Besides, whether these different data from various views are all beneficial to the medical tasks and how to best utilize these data remain unclear. Therefore, in this paper, we first extract the accompanying clinical notes from EHR and propose a method to integrate thesedata, we also comprehensively study the different models and the data leverage methods for better medical task prediction. The resultson two medical prediction tasks show that ourfused model with different data outperformsthe state-of-the-art method that without clinical notes, which illustrates the importance of our fusion method and the value of clinical note features. Our code is available at https://github.com/emnlp-mimic/mimic.

[PDF]