Decision-Support Systems designed for Critical Care

M. Frize, H.C.E. Trigg, F.G. Solven, M. Stevenson. B.G. Nickerson

University of New Brunswick, Fredericton, N.B. , Canada E3B 5A3



ABSTRACT

A case-based reasoner tool has been developed, allowing users to compare the ten-closest matching cases to the newest patient admission, from a database of intensive care medical records. A back-propagation, feed-forward artificial neural network has been trained and tested to estimate patient outcomes: duration of artificial ventilation, the length of stay , and mortality.

INTRODUCTION

Several authors have developed, and others have described, various scoring systems to assess patient outcomes in critical care medicine. Despite the ongoing research on the prediction of mortality, much less has been done for estimating length of stay, and duration of artificial ventilation. Hyzg 1 and LeGall et al.2 are of the opinion that many clinicians remain sceptical about using scoring models in their actual patient care. Castella 3 and Fery-Lemonier 4, for their part, have provided a comparison between various scoring systems and this is helpful to guide efforts in this area of research.

Recently, case-based reasoners (CBR) and knowledge-based systems have been increasingly accepted as clinical decision-support systems. Johnston et al 5, in a broad review of the literature found that, of 793 citations in the area of clinical decision-support systems (CDSS), only 28 controlled trials met the predefined criteria for proper study design and assessment. They also concluded that these systems can improve clinician performance and patient outcomes in clinical settings such as: computer-assisted dosing, preventive care reminders, and computer-aided quality assurance for active medical care. These authors also pointed out the need for additional well-designed studies to assess the effects and the cost-effectiveness of decision-support systems, especially for those attempting to affect patient outcomes.

Because of their non-linear modelling capabilities, artificial neural networks (ANNs) have been extensively applied to non-linear statistical modelling problems and can be used to perform very complex recognition tasks. As a result, they are a natural choice for modelling complex medical problems when large databases of relevant medical information are available. Many have been applied to a particular pathology, such as Baxt 6, 7 who used ANNs as an aid to diagnose acute coronary occlusion and later for myocardial infarction; and Kuntz 8, to estimate mortality and length of stay (LOS) for patients with closed-head injuries. Tu and Guerriere 9 reported estimates of LOS and mortality. Buskard et al. 10 added studies of estimated duration of artificial ventilation to those of mortality and LOS. In view of the limitations of severity of disease classification systems, much could be gained in investigating an integrated, rather than compartmentalised, approach to critical care (and other medical environments). Most scoring systems are more useful in estimating outcomes for a group of patients, than for a single one. Thus, new approaches should attempt to make estimates on a patient by patient basis. This has been a main focus of the work reported here.

DESIGN OBJECTIVES

A multi-disciplinary group (University of New Brunswick) brings together electrical engineers, computer scientists, graduate students, and physicians from several health care facilities across Canada. The focus of the work is to use artificial intelligence (AI) to integrate a number of promising decision-aid systems (http://www.unb.ca/web/mirg/). Since its inception, the group's approach has been to view AI as a set of tools to simulate (and support) common clinical thinking, rather than replace it. As a result, each of the techniques reported here were chosen because of a particular utility in modelling a particular aspect of the clinical approach. For example, a clinician's approach may be to think: "I have seen patients like that and this is what happened to them". The corresponding AI approach is:"The case-based reasoner selects a number of "closest-matching cases" and displays selected aspects of their clinical course". Or, a clinician's approach may be to think: "And for this particular patient, this is what I think will happen". The AI approach: "A trained artificial neural network provides an estimate of selected clinical outcomes".

SYSTEM DESCRIPTION

a) The Case Based Reasoner (CBR) A medical database of over 2000 intensive care patients, containing 98 fields of clinical and administrative information on patients admitted to the ICU at the Dr. E.Chalmers Hospital (DECH, since 1988) was transferred from its DBASE format to a case-based reasoner prototype. The case-based reasoner uses a modified Rete match algorithm to develop a match score based on string, word, character, or number matching (ART-IM software). The Rete match algorithm is faster than conventional matching algorithms. Each of the matching methods and the weights associated with them were selected in direct consultation with physicians to determine the preset (default) values. In addition to this default mode, the system allows the physician to fine-tune any of the matching field weights to a patient's specific characteristics. This flexibility was felt to be important, since the same procedures performed on individual patients may have varying degrees of significance. As more information is known on the index patient case and entered into the program, the matched cases also change. When a patient is discharged, the file is automatically added to the database, allowing the accuracy and quality of the data to improve with time and use of the system.

The IDEAS for ICUs (Intelligent Decision Aid System for Intensive Care Units) system grew out of this early model by incorporating a graphical-user interface. The presented (new) patient admission information is entered into the CBR, along with any changes of match weights (if desired), and the system can then generate a screen of ten closest-matching cases to the newest admission case. It is the blending of the rule-based ability along with the knowledge of medical experts on the team, that provided the foundation for the development of this decision-aid system. IDEAS for ICUs issues warnings, especially in cases where a high match score is made with a patient who died.11, 12.

The original DBASE IV patient database had been previously used to provide statistics on the ICU performance. Unfortunately, it is limited to finding exact matches through its 'Structured Query Language' (SQL), which is far from being as useful as 'near' matches provided by CBR. In addition to this additional power in the matching capability, the system can be expanded by using the expert shell's rule-based abilities. The Graphical user interface (GUI) was written in VISUAL BASIC (Version 3.0), allowing for a natural integration of the patient database and the case-base, thus providing combined storage and information retrieval. The GUI presents a choice of several windows to enter or to display the various types of information. The system is user-friendly and allows to flow from window to window and back to any section desired. The various parts interact with each other, as well as with other text and binary files, to form a dynamic software tool.

b) The Artificial Neural Network (ANN) To estimate medical outcomes for individual patients in the ICU, a feedforward ANN was trained using the backpropagation algorithm, because of its relative ease of implementation and past success on various classification tasks. In the initial experiments, networks which estimated 'length of stay' (LOS), 'mortality', and 'duration of artificial ventilation' were trained and tested on a subset of the DECH ICU database (1322 patients). Two-thirds of these patients records were used for training the network, while the remaining third were used to test the ANN's performance. After experimenting with various network architectures, an ANN was considered as acceptable if it provided better results than a Constant Predictor (CP), which is a simple statistical benchmark that classifies all patterns as belonging to the class with the highest training set a priori probability. Although the initial results were promising, the networks exhibited a behaviour characteristic of memorisation (over-fitting), that is, after a few hundred epochs, the classification rate and error curves for the training and test sets began to diverge. Since the generalisation ability of an ANN depends upon a balance between network complexity and the information contained in the training examples13, the over-fitting was most likely caused by the fact that the number of parameters (there were 41) required too many weights for the relatively small number of example patterns (868). Rather than arbitrarily eliminating potentially useful information, to reduce the network size (and complexity), or waiting until a significant amount of additional data were collected, it was decided that implementing weight-elimination 14 might be a better approach.

A new set of experiments were performed on a slightly larger database (1491 patients records), using 51 input variables, to estimate duration of artificial ventilation. Among the 51 inputs provided to the network were several new variable fields describing admission diagnosis and patient admission source. Weight-elimination was turned on and turned off, and the results were compared. 15

RESULTS

a) The case-based reasoner The IDEAS for ICUs prototype was placed for a short period in a clinical setting (three weeks). Several constructive comments were collected and this led to a number of revisions of the prototype's software. The new version (2.2) includes many new features and a faster matching engine is currently being incorporated. The new software has been customised for two hospitals where a more substantial clinical evaluation will be conducted this year. The clinical setting will allow to assess the system's usefulness and performance. Some of the qualitative points to be studied are: Is the system helpful as an instant 'memory' of past similar cases? Does the information change the diagnosis that would have been made without the tool? Does the information help the physician to explain the prognosis to the family? To the nurses? Does the information help in the choice of treatment and management of the patient? This clinical assessment is essential before claims can be made on the system's relevance and usefulness. The CBR has generated substantial interest in the medical community in Canada and other sites are currently thinking of participating in the study .

b) The Artificial Neural Network The results of our most recent experiments with the ANN showed that the weight-elimination technique improves both the generalization and the overall performance of a fully-optimized network trained to estimate the outcome 'postoperative VENT8 15. VENT8 is a binary output variable which was assigned a value of -1 when a patient's actual duration of ventilation was less than, or equal to 8 hours, and a value of 1 when a patient's actual duration of artificial ventilation was greater than 8 hours. When weight-elimination was used, the weights associated with parameters that have little significance in determining the network's output were driven to zero, simplifying the network's structure and resulting in improved network generalization and performance. Thus weight-elimination provided the network with its own means of screening out unimportant variables and eliminates the need for making preconceived judgements as to what medical parameters are most instrumental in determining a particular patient outcome.

By combining weight-elimination with a second technique, that is, to present 'high' and 'low' values of continuous and integer-valued medical parameters to a pair of input nodes, rather than presenting these just to a single node, a single-layered ANN was constructed from which interesting information could be extracted: i.e. the variables which the ANN considered to be most important in estimating postoperative VENT8 could be extracted 15. The 'high/low' nodes data presentation technique facilitates the independent interpretation of high and low values of each input parameter by the ANN model, and hence the weights selected by such a network should be more representative of the true significance of each input parameter in determining an outcome such as VENT8, than those identified by a network which does not employ 'high/low' nodes techniques.

For this experiment, the five largest weights extracted at the point of maximum test set classification performance were obtained for the following input variables: lower-than-normal respiratory rate, higher-than-normal arterial pH and higher-than-normal fraction of inspired oxygen all favoured a patient having a duration of ventilation longer than 8 hours; whereas, a higher-than-normal of Glasgow Coma Score, and having had a carotid endarectomy, favoured a patient having a duration of ventilation less than or equal to 8 hours 15.

DISCUSSION AND FUTURE WORK

Building on the success of the initial phases of this work, the following research aspects are in progress: Generalising the case-based reasoner (IDEAS for ICUs) approach for use in other medical environment: This is currently being applied to neonatal ICU and rheumatoid arthritis populations. An epidemiologist (whose expertise is in the ICU environment and clinical trials) is helping to develop the clinical evaluation plan and questionnaire.

For the ANN work, the current experience will be enlarged in several ways: the experiments will be repeated using more output classes (possibly three or four) for the output "duration of ventilation" with the adult ICU database. The next step will be to extend this entire approach to study outcomes using a neonatal database to see if the network performs equally well with this different type of data.

An interface for on-line data acquisition from monitors and ventilators is under development 16. The interface was designed to communicate information aquired from the patient bedside, over modem facilities, stored and eventually to be used by both the case-based reasoner and the artificial neural network. The data acquisition itself, although not a major technical problem, creates a much larger challenge, that is, of modifying the case-based reasoner and neural network to accommodate and use effectively the temporal information. Addressing this challenge will be part of the future work planned by the team.

CONCLUSION

The results of the group's research efforts to develop a decision-support system in critical care provide an enhancement of the conventional medical model used by physicians for patient management in an Intensive Care environment. To date, the research done by our group compares previous patient experience held in a large clinical database, with new patient admissions, and generates outputs which maybe used to aid physician decision making. Our system positions itself between the medical model and usual testing systems and represents a new class of "intelligent monitoring instrumentation" for critical care. In the future, this approach may be applicable to other patient care environments, such as neonatal care, cardiac units, neuro-intensive care and the operating room.

The authors would like to acknowledge the support of the Medical Research Council of Canada and NSERC for its PGS-A Scholarship.

REFERENCES

1. Hyzg RC, ICU scoring and clinical decision making. [Edit.] Chest. 1995; 107(2): 1482-1483.

2. LeGall J et .al. Customized probability models for early severe sepsis in adult intensive care patients. JAMA. 1995; 273(8): 644-650.

3. Castella X et .al. A comparison of severity of illness scoring systems for intensive care unit patients: results of a multicenter, multinational study. Crit. Care Med. 1995; 23(8): 1327-1335.

4. Fery-Lemonnier E et .al. Evaluation of severity scoring systems in ICUs - translation, conversion and definition ambiguities as a source of inter-observer variability in APACHE II, SAPS and OSF. Int. Care Med. 1995; 21: 356-360.

5. Johnston ME., Langton KB, Haynes RB, Mathieu, A. Effects of computer-based clinical decision-support systems on clinician performance and patient outcome. Annals of Internal Medicine. 1994; 120: 135-142.

6. Baxt WG. Use of an Artificial Neural Network for Data Analysis in Clinical Decision-Making! The Diagnosis of Acute Coronary Occlusion. Neural Computation. 1990; 2: 480-489.

7. Baxt WG. Use of an Artificial Neural Network for the Diagnosis of Myocardial Infarction. Annals of Internal Medicine. 1991; 115: 843-848.

8. Kuntz GR, Hammond J. Predicting Outcomes after Closed-Head Injury (CHI) using a Cascade- Correlation Artificial Neural Network. Crit. Care Med. 1994; 22(1): A25.

9. Tu JV. and Guerriere MRJ. Use of a Neural Network as a Predictive Instrument for Length of Stay in the Intensive Care Unit Following Cardiac Surgery.' Computers and Biomed. Research. 1993; 26: 220-229.

10. Buskard T, Stevenson M, Frize M, Solven FG.. Estimation of Ventilation, Length of Stay, and Mortality Using Artificial Neural Networks. Proc. of Canadian Conference on Electrical and Comp. Eng. (IEEE). 1994, Halifax: 726-729.

11. Taylor KB, Nickerson BG, Frize M, Solven FG, Dunfield V. Use of Case-Based Reasoning to Assis Patient Management in an Intensive Care Unit. Proc. of Joint Conf. (COMP and CMBES), Ottawa. 1993:.248-249.

12. Frize M, Solven FG, Stevenson M, Nickerson BG, Buskard T, Taylor K. Computer-Assisted Decision- Support Systems for Patient Management in an Intensive Care Unit. Proc. of Medinfo95, Vancouver. 1995: 1009-1012.

13. Krogh A, Hertz JA. A Simple Weight Decay can Improve Generalisation. In RP. Lippman, J Moody, and DS Touretzky, ed. Advances in Neural Information Processing Systems. Morgan Kaufman: San Mateo. 1992; 4: 950-957.

14. Weigend AS, Rumelhart DE, Huberman BA. Generalisation by Weight-Elimination with Application to Forecasting. In Lippman, RP, Neural Network Classifiers Estimate Bayesian a posteriori Probabilities. Neural Computation. 1991; 3(4): 461-483.

15. Trigg HCE. An Investigation of Methods to Enhance the Performance of Artificial Neural Networks Used to estimate ICU Outcomes. MScE Thesis, U. of New Brunswick, January 1997.

16. Barton M, Frize M. Patient Data Acquisition and Telemedicine Access to a Decision-Support System. Medicine 2001 Conf. Montreal. 1996:3.