Overview of Artificial Intelligence
Introduction
Artificial Intelligence (AI) is a branch of computer science dedicated to creating systems capable of simulating intelligent human behavior. Since its formal inception in 1956, AI has undergone multiple cycles of boom and bust, and today it has profoundly transformed human society.
1. Definitions of AI
Different scholars define AI from various perspectives:
| Dimension | Thinking Humanly | Thinking Rationally |
|---|---|---|
| Thinking | Cognitive modeling: simulating human thought processes | Laws of thought: logic-based reasoning |
| Acting | Turing test: behaviorally indistinguishable | Rational agent: making optimal decisions |
Key definitions:
- Turing (1950): if a machine cannot be distinguished from a human in conversation, it possesses intelligence
- McCarthy (1956): the science and engineering of making intelligent machines
- Russell & Norvig: the study of agents that perceive their environment and take actions to maximize success
- Modern view: computational systems that learn patterns from data and make decisions in complex environments
2. Three Schools of AI
graph TD
A[Artificial Intelligence] --> B[Symbolism]
A --> C[Connectionism]
A --> D[Behaviorism]
B --> B1[Knowledge Representation & Reasoning]
B --> B2[Expert Systems]
B --> B3[Logic Programming]
C --> C1[Neural Networks]
C --> C2[Deep Learning]
C --> C3[Large Language Models]
D --> D1[Reinforcement Learning]
D --> D2[Evolutionary Algorithms]
D --> D3[Robotics]
style B fill:#f9f,stroke:#333
style C fill:#bbf,stroke:#333
style D fill:#bfb,stroke:#333
2.1 Symbolism
- Core idea: Intelligence = symbol manipulation. Represent knowledge with formal symbols, reason with logical rules
- Representative methods: expert systems, knowledge graphs, logic programming (Prolog)
- Strengths: explainable, verifiable, knowledge can be encoded
- Limitations: knowledge acquisition bottleneck, difficulty handling uncertainty and perceptual tasks
2.2 Connectionism
- Core idea: Intelligence emerges from the connections of many simple units. Simulates neural networks
- Representative methods: artificial neural networks, deep learning, Transformers
- Strengths: automatically learns from data, excels at perceptual tasks
- Limitations: requires large amounts of data, lacks explainability, high computational cost
2.3 Behaviorism
- Core idea: Agents learn behavior through interaction with the environment. No internal knowledge representation needed
- Representative methods: reinforcement learning, evolutionary algorithms, swarm intelligence
- Strengths: suited for decision and control problems, does not require labeled data
- Limitations: low sample efficiency, exploration difficulties
Extension: Domingos's Five Tribes of ML In The Master Algorithm (2015), Pedro Domingos cuts learning algorithms — from inside ML — into five tribes: Symbolists / Connectionists / Bayesians / Evolutionaries / Analogizers. This is orthogonal and complementary to the three-school taxonomy above: the three schools cut by "where does intelligence come from", the five tribes cut by "what mathematical tool does the learner use". See the The Master Algorithm notebook.
3. The Turing Test
3.1 The Original Turing Test (1950)
┌──────────┐ Text conversation ┌──────────┐
│ Human │ ◄───────────────────→ │ Subject │
│ Judge │ │(Human/ │
└──────────┘ │ Machine) │
└──────────┘
Rule: if the judge cannot reliably distinguish whether the subject
is human or machine, the machine is considered to have passed
the Turing test.
3.2 Limitations of the Turing Test
| Criticism | Argument |
|---|---|
| Tests only imitation | Imitating humans does not equal possessing intelligence |
| Text-only | Ignores perception, motor skills, and other intelligence dimensions |
| Human bias | Judge expectations influence results |
| Deception possible | ELIZA (1966) could "deceive" some users |
4. The Chinese Room Argument
A thought experiment proposed by Searle (1980):
Outside the room: Chinese input → [Room] → Chinese output
Inside the room:
A person who does not understand Chinese
+ A detailed set of Chinese processing rule books
→ Mechanically manipulates symbols according to rules
→ Produces "correct" Chinese responses
Conclusion: although the system appears to "understand" Chinese
from the outside, the person inside does not understand
the meaning of Chinese.
→ Syntactic processing ≠ Semantic understanding
Significance: the fundamental distinction between Strong AI (machines truly understand and think) and Weak AI (machines simulate intelligent behavior).
5. Weak AI vs Strong AI
| Dimension | Weak AI (Narrow AI) | Strong AI (AGI) |
|---|---|---|
| Capability scope | Specific tasks | General intelligence |
| Understanding | No true understanding | Possesses understanding and consciousness |
| Current status | Already achieved | Not yet achieved |
| Examples | AlphaGo, ChatGPT, self-driving | AI in science fiction |
Superintelligence
- Definition: intelligence that far surpasses the smartest humans in virtually all domains
- Bostrom's view: once AGI is achieved, it may rapidly recursively self-improve to superintelligence
- Debate: Is it possible? Is it dangerous? What is the timeline?
6. The Current AI Landscape
graph LR
subgraph Perception
A1[Computer Vision]
A2[Speech Recognition]
A3[Natural Language Understanding]
end
subgraph Reasoning & Decision-Making
B1[Knowledge Graphs]
B2[Planning & Search]
B3[Reinforcement Learning]
end
subgraph Generation
C1[Text Generation LLM]
C2[Image Generation Diffusion]
C3[Code Generation Copilot]
end
subgraph Applications
D1[Autonomous Driving]
D2[Medical Diagnosis]
D3[Scientific Discovery]
D4[AI Agents]
end
A1 & A2 & A3 --> B1 & B2 & B3
B1 & B2 & B3 --> C1 & C2 & C3
C1 & C2 & C3 --> D1 & D2 & D3 & D4
Key Current Trends
- Large Language Models (LLMs): GPT-4, Claude, Gemini demonstrate general language capabilities
- Multimodal: unified processing of text, images, audio, and video
- AI Agents: LLM-driven autonomous agents that can use tools and plan
- Scientific AI: AlphaFold (protein structure), GNoME (materials discovery)
- Embodied Intelligence: combining AI with physical-world interaction
7. AI Classification System
| Classification Dimension | Categories |
|---|---|
| By learning method | Supervised, unsupervised, reinforcement, self-supervised |
| By task type | Classification, regression, generation, decision, reasoning |
| By modality | Text, vision, speech, multimodal |
| By application domain | NLP, CV, robotics, recommender systems, scientific computing |
| By intelligence level | Narrow AI → AGI → ASI |
References
- "Artificial Intelligence: A Modern Approach" - Russell & Norvig
- "Computing Machinery and Intelligence" - Alan Turing (1950)
- "Minds, Brains, and Programs" - John Searle (1980)
- "Superintelligence" - Nick Bostrom