A Data Preprocessing Framework for Supporting Probability-learning in Dynamic Decision Modeling in Medicine

Publication Type

Conference Proceeding Article

Publication Date



Data preprocessing is needed when real-life clinical databases are used as the data sources to learn the probabilities for dynamic decision models. Data preprocessing is challenging as it involves extensive manual effort and time in developing the data operation scripts. This paper presents a framework to facilitate automated and interactive generation of the problem-specific data preprocessing scripts. The framework has three major components: 1) A model parser that parses the decision model definition, 2) A graphical user interface that facilitates the interaction between the user and the system, and 3) A script generator that automatically generates the specific database scripts for the data preprocessing. We have implemented a prototype system of the framework and evaluated its effectiveness via a case study in the clinical domain. Preliminary results demonstrate the practical promise of the framework.


Databases and Information Systems | Health Information Technology

Research Areas

Intelligent Systems and Decision Analytics


American Medical Informatics Association Annual Fall Symposium (AMIA)

First Page


Last Page




City or Country

Los Angeles, CA, USA