MYCIN

MYCIN
MYCIN
Original author	Edward H. Shortliffe
Written in	Lisp
Platform	DEC KI10 PDP-10
Type	Expert system

The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

MYCIN was an early backward chaining expert system that used black box to identify bacteria causing severe infections, such as bacteremia and meningitis, and to recommend antibiotics, with the dosage adjusted for patient's body weight — the name derived from the antibiotics themselves, as many antibiotics have the suffix "-mycin". The Mycin system was also used for the diagnosis of blood clotting diseases. MYCIN was developed over five or six years in the early 1970s at Stanford University. It was written in Lisp as the doctoral dissertation of Edward Shortliffe under the direction of Bruce G. Buchanan, Stanley N. Cohen and others.

MYCIN emerged from the Stanford Heuristic Programming Project. ^[1] MYCIN demonstrated the potential for expert systems in building high-performance medical reasoning programs. MYCIN is often viewed as a pioneer in the field of expert systems, even being referred to as the "grandaddy of them all-the one that launched the field" by Dr. Allen Newell. ^[2]

MYCIN led to the EMYCIN expert system shell ("essential MYCIN") for acquiring knowledge, reasoning with it, and explaining the results, without the specific medical knowledge. It can be described as "EMYCIN = Prolog + uncertainty + caching + questions + explanations + contexts - variables". An introduction is in Chapter 16 of Paradigms of Artificial Intelligence Programming (PAIP).^[3]

Method

MYCIN operated using a fairly simple inference engine and a knowledge base of ~600 rules by obtaining individual inferential facts identified by experts and encoding such facts as individual production rules. No other AI program at the time contained as much domain-specific knowledge clearly separated from its inference procedures as MYCIN. It would query the physician running the program via a long series of simple yes/no or textual questions. At the end, it provided a list of possible culprit bacteria ranked from high to low based on the probability of each diagnosis, its confidence in each diagnosis' probability, the reasoning behind each diagnosis (that is, MYCIN would also list the questions and rules which led it to rank a diagnosis a particular way), and its recommended course of drug treatment. MYCIN could additionally respond to queries by physicians related to why it asked the user a certain question, how it arrived at a conclusion, and why it did not consider certain factors. ^[4]

The developers performed studies showing that MYCIN's performance was minimally affected by perturbations in the uncertainty metrics associated with individual rules, suggesting that the power in the system was related more to its knowledge representation and reasoning scheme than to the details of its numerical uncertainty model. Some observers felt that it should have been possible to use classical Bayesian statistics. MYCIN's developers argued that this would require either unrealistic assumptions of probabilistic independence, or require the experts to provide estimates for an unfeasibly large number of conditional probabilities.^[5]^[6]

Subsequent studies later showed that the certainty factor model could indeed be interpreted in a probabilistic sense, and highlighted problems with the implied assumptions of such a model. However the modular structure of the system would prove very successful, leading to the development of graphical models such as Bayesian networks.^[7]

Context

A context in MYCIN determines what types of objects can be reasoned about. They are similar to variables in Prolog, or environment variables in operating systems.^[3]

(defun mycin ()
  "Determine what organism is infecting a patient."
  (emycin
    (list (defcontext patient  (name sex age)  ())
          (defcontext culture  (site days-old) ())
          (defcontext organism ()              (identity)))))

Evidence combination

In MYCIN it was possible that two or more rules might draw conclusions about a parameter with different weights of evidence. For example, one rule may conclude that the organism in question is E. Coli with a certainty of 0.8 whilst another concludes that it is E. Coli with a certainty of 0.5 or even −0.8. In the event the certainty is less than zero the evidence is actually against the hypothesis. In order to calculate the certainty factor MYCIN combined these weights using the formula below to yield a single certainty factor:

CF(x,y)={\begin{cases}X+Y-XY&{\text{if }}X,Y>0\\X+Y+XY&{\text{if }}X,Y<0\\{\frac {X+Y}{1-\min(|X|,|Y|)}}&{\text{otherwise}}\end{cases}}

Where X and Y are the certainty factors.^[8] This formula can be applied more than once if more than two rules draw conclusions about the same parameter. It is commutative, so it does not matter in which order the weights were combined.

The combination formula was designed to have the following desirable properties:^[3]

−1 can be interpreted as "false", +1 as "true", and 0 as "uncertain".
Combining unknown with anything leaves it unchanged.
Combining true with anything (except false) gives true. Similarly for false.
Combining true and false is a division-by-zero error.
Combining +x and -x gives unknown.
Combining two positives (except true) gives a larger positive. Similarly for negatives.
Combining a positive and a negative gives something in between.

Examples

The following examples come from Chapter 16 of PAIP, which contains an implementation in Common Lisp of a modified and simplified version of MYCIN for pedagogical purposes.

A rule, and an English paraphrase generated by the system:

(defrule 52
 if (site culture is blood)
  (gram organism is neg)
  (morphology organism is rod)
  (burn patient is serious)
 then .4
  (identity organism is pseudomonas))
Rule 52:
 If
  1) THE SITE OF THE CULTURE IS BLOOD
  2) THE GRAM OF THE ORGANISM IS NEG
  3) THE MORPHOLOGY OF THE ORGANISM IS ROD
  4) THE BURN OF THE PATIENT IS SERIOUS
 Then there is weakly suggestive evidence (0.4) that
  1) THE IDENTITY OF THE ORGANISM IS PSEUDOMONAS

Results

An evaluation of MYCIN was conducted at the Stanford Medical School. The first phase of the evaluation consisted of 10 test cases of diverse origin, chosen by a physician who was not acquainted with MYCIN's methods or knowledge base. These cases were presented to 7 physicians and 1 senior medical student. 10 prescriptions were compiled for each of the cases, 1 recommended by MYCIN, 1 prescribed by the treating physician at the county hospital, and 8 by the aforementioned individuals. The second phase of the evaluation consisted of eight infectious disease specialists being provided the clinical summary and set of 10 prescriptions for each of the 10 cases and tasked to provide their own recommendations for each case and assess the 10 prescriptions. MYCIN received an acceptability rating of 65%, which was comparable to the 42.5% to 62.5% rating of five faculty members. ^[9] This study is often cited as showing the potential for disagreement about therapeutic decisions, even among experts, when there is no "gold standard" for correct treatment.^{[citation needed]}

Practical use

MYCIN was never actually used in practice. This wasn't because of any weakness in its performance. Some observers raised ethical and legal issues related to the use of computers in medicine, regarding the responsibility of the physicians in case the system gave wrong diagnosis. ^[10] However, the greatest problem, and the reason that MYCIN was not used in routine practice, was the state of technologies for system integration, especially at the time it was developed. MYCIN was a stand-alone system that required a user to enter all relevant information about a patient by typing in responses to questions MYCIN posed. MYCIN ran on the DEC KI10 PDP-10, supporting a large time-shared system available over the early Internet (ARPANet), before personal computers were developed. ^[11]^[12]

MYCIN's greatest influence was accordingly its demonstration of the power of its representation and reasoning approach. Rule-based systems in many non-medical domains were developed in the years that followed MYCIN's introduction of the approach. In the 1980s, expert system "shells" were introduced (including one based on MYCIN, known as E-MYCIN (followed by Knowledge Engineering Environment - KEE)) ^[13]

and supported the development of expert systems in a wide variety of application areas. A difficulty that rose to prominence during the development of MYCIN and subsequent complex expert systems has been the extraction of the necessary knowledge for the inference engine to use from the human expert in the relevant fields into the rule base (the so-called "knowledge acquisition bottleneck"). ^[14]

References

^ Feigenbaum, Edward A., and Buchanan, Bruce G. Proposal to the Advanced Research Projects Agency for the Continuation of The Heuristic Programming Project, Stanford University, April 1979, p. 46. https://purl.stanford.edu/vp226xm6367
^ Buchanan, B. G.; Shortliffe, E. H. (1984). Rule Based Expert Systems: The MYCIN Experiments of the Stanford Heuristic Programming Project. Reading, Massachusetts: Addison-Wesley. ISBN 978-0-201-10172-0.
^ ^a ^b ^c Norvig, Peter (2007). "16. Expert Systems". Paradigms of artificial intelligence programming: case studies in Common LISP (Nachdr. ed.). San Francisco, Calif: Morgan Kaufmann. ISBN 978-1-55860-191-8.
^ "How Does MYCIN...transparency, n.d." Edward A. Feigenbaum Papers (SC0340, Accession 2005-101), Stanford University Libraries. https://purl.stanford.edu/vt702rj7364
^ Shortliffe, E. H.; Buchanan, B. G. (1975). "A model of inexact reasoning in medicine". Mathematical Biosciences. 23 (3–4): 351–379. doi:10.1016/0025-5564(75)90047-4. MR 0381762. S2CID 118063112.
^ Buchanan, B. G.; Shortliffe, E. H. (1984). Rule Based Expert Systems: The MYCIN Experiments of the Stanford Heuristic Programming Project. Reading, Massachusetts: Addison-Wesley. ISBN 978-0-201-10172-0.
^ Heckerman, D.; Shortliffe, E. (1992). "From certainty factors to belief networks" (PDF). Artificial Intelligence in Medicine. 4 (1): 35–52. CiteSeerX 10.1.1.157.4459. doi:10.1016/0933-3657(92)90036-O.
^ Jackson, Peter (1999). Introduction to expert systems. Addison Wesley Longman Limited. p. 52. ISBN 978-0-201-87686-4.
^ Yu, Victor L. (1979-09-21). "Antimicrobial Selection by a Computer". JAMA. 242 (12): 1279–82. doi:10.1001/jama.1979.03300120033020. ISSN 0098-7484. PMID 480542.
^ Trivedi, M. C. (2014). A Classical Approach to Artificial Intelligence (2nd ed.). Van Haren Publishing. p. 331
^ Feigenbaum, E. A. (1975). ARPA Contractors Meeting: San Diego, March 12-14, 1975 - 1974 ARPA Project Summary - Heuristic Programming Project.
^ Stanford University Libraries. (1977). Notes on MYCIN. Edward A. Feigenbaum Papers, SC0340, Accession 1986-052, Box 39, Folder 14. purl.stanford.edu/nt215ps9486
^ Bond, A. Machine Intelligence (Infotech State of the Art Report). Pergamon Infotech, 1981. ISBN 978-0080285566.
^ Feigenbaum, Edward A. (1984). "Knowledge Engineering: The Applied Side of Artificial Intelligence." Annals of the New York Academy of Sciences, 426(1), 91–107. doi:10.1111/j.1749-6632.1984.tb23349.x

Winston, Patrick Henry, ed. (1986). The AI business: the commercial uses of artificial intelligence (4. print ed.). Cambridge, Mass.: MIT Pr. ISBN 978-0-262-23117-6.