Rules & Logic: Expert Systems
AI before it learned from data: asked experts, wrote down rules, hoped.
In the 1980s, companies spent millions trying to bottle human expertise into computer programs. These "expert systems" diagnosed diseases, configured computers, and analyzed chemical compounds — all by following thousands of hand-coded if-then rules.
Some saved Fortune 500 companies tens of millions of dollars per year. Then they collapsed under the weight of their own rules. Their rise and fall is the cautionary tale that explains why modern AI learns from data instead of following instructions.
Core Thesis
Expert systems represent the pinnacle and the breaking point of symbolic AI: the idea that human expertise can be captured as explicit rules worked impressively in constrained domains, but the combinatorial explosion of real-world complexity proved that encoding knowledge by hand cannot scale. This failure forced the paradigm shift toward machine learning.
The Architecture of an Expert System
Expert System
Limitation: A decision tree is static and linear. Real inference engines can chain rules dynamically and handle branching.
How the Inference Engine Works
MYCIN — A Real Expert System
MYCIN was developed at Stanford University in 1976 and diagnosed bacterial blood infections using approximately 600 hand-coded rules. Here are two simplified MYCIN rules as pseudocode:
MYCIN — A Real Expert System
The 0.8 in Rule 047 shows that MYCIN could handle uncertainty — more on this in the Deep Dive at the end. In studies, MYCIN achieved 65% diagnostic accuracy — demonstrably better than many general practitioners. However, it was never used clinically due to legal and ethical concerns.
Two Inference Strategies
Data-driven: Start with facts, fire matching rules, reach a conclusion. Example: Patient has fever + cough; system fires all matching rules; diagnosis: flu.
Goal-driven: Start with hypothesis, search for supporting facts. Example: "Does the patient have flu?" System checks: Fever present? Cough present? Hypothesis confirmed or rejected.
Common Misconception
Interactive: What Plant Is This?
Experience an expert system in action. Answer questions about an unknown plant — the system applies if-then rules to reach a diagnosis. This is exactly how real expert systems like MYCIN work.
What type of stem does the plant have?
Rise and Fall — The Expert Systems Boom
In the 1980s, expert systems left university laboratories and triggered an unprecedented commercial boom.
The Hype Cycle
DENDRAL (1965) analyzed chemical compounds. MYCIN (1976) diagnosed blood infections. R1/XCON (1980) configured DEC computer orders with 95-98% accuracy, saving an estimated $25-40 million per year. Japan launched the billion-dollar "Fifth Generation Computer Systems" project in 1982. Western governments hastily followed with their own subsidies.
The Maintenance Nightmare
The flagship project R1/XCON revealed the weakness. Every new hard drive and every new cable that DEC released required updating the knowledge base. A large team of expensive knowledge engineers was permanently needed just to keep the system running. The maintenance costs eventually approached the savings.
The expert systems boom resembles the dot-com bubble of the 2000s: everyone believed every company needed an expert system. Consultants sold million-dollar AI solutions that often delivered disappointing results. The correction was inevitable.
Common Misconception
Combinatorial Explosion — Why Rules Don't Scale
If expert systems can reason so precisely, why not simply build one for every problem? The answer is a mathematical phenomenon: combinatorial explosion.
The Mathematics of Failure
The Knowledge Bottleneck
Try explaining the concept of "funny" to an alien as if-then rules: "IF someone falls unexpectedly THEN it's funny — BUT NOT if they're hurt — BUT YES if they only stumble slightly — BUT NOT if they're elderly — BUT..." The rules never end. Human judgment relies on implicit knowledge that resists explicit formalization.
Example: Autonomous Driving
"IF traffic light is red THEN stop" is easy. But: "IF a pedestrian is on the road AND a car is approaching fast AND swerving left is physically possible BUT oncoming traffic exists THEN..." Every new context creates countless new branches. A rule-based system would need millions of rules for traffic — and would still fail at the first unexpected puddle.
The Paradigm Shift
This is exactly why machine learning replaced expert systems. Instead of trying to write all the rules, ML systems learn patterns from millions of examples. A self-driving car has "seen" millions of traffic situations and learned to generalize — no explicit rules needed.
Deep Dive: MYCIN's Certainty Factors
Deep Dive: Expert Systems Today
Takeaways
Knowledge Check: Expert Systems
Self-Check
- What are the three components of an expert system and what role does each one play?
- How do forward chaining and backward chaining differ and when would you choose each strategy?
- Why did the combinatorial explosion make expert systems impractical in complex real-world domains?