Overview of Artificial Intelligence

Introduction

Artificial Intelligence (AI) is a branch of computer science dedicated to creating systems capable of simulating intelligent human behavior. Since its formal inception in 1956, AI has undergone multiple cycles of boom and bust, and today it has profoundly transformed human society.

1. Definitions of AI

Different scholars define AI from various perspectives:

Dimension	Thinking Humanly	Thinking Rationally
Thinking	Cognitive modeling: simulating human thought processes	Laws of thought: logic-based reasoning
Acting	Turing test: behaviorally indistinguishable	Rational agent: making optimal decisions

Key definitions:

Turing (1950): if a machine cannot be distinguished from a human in conversation, it possesses intelligence
McCarthy (1956): the science and engineering of making intelligent machines
Russell & Norvig: the study of agents that perceive their environment and take actions to maximize success
Modern view: computational systems that learn patterns from data and make decisions in complex environments

2. Three Schools of AI

graph TD
    A[Artificial Intelligence] --> B[Symbolism]
    A --> C[Connectionism]
    A --> D[Behaviorism]

    B --> B1[Knowledge Representation & Reasoning]
    B --> B2[Expert Systems]
    B --> B3[Logic Programming]

    C --> C1[Neural Networks]
    C --> C2[Deep Learning]
    C --> C3[Large Language Models]

    D --> D1[Reinforcement Learning]
    D --> D2[Evolutionary Algorithms]
    D --> D3[Robotics]

    style B fill:#f9f,stroke:#333
    style C fill:#bbf,stroke:#333
    style D fill:#bfb,stroke:#333

2.1 Symbolism

Core idea: Intelligence = symbol manipulation. Represent knowledge with formal symbols, reason with logical rules
Representative methods: expert systems, knowledge graphs, logic programming (Prolog)
Strengths: explainable, verifiable, knowledge can be encoded
Limitations: knowledge acquisition bottleneck, difficulty handling uncertainty and perceptual tasks

2.2 Connectionism

Core idea: Intelligence emerges from the connections of many simple units. Simulates neural networks
Representative methods: artificial neural networks, deep learning, Transformers
Strengths: automatically learns from data, excels at perceptual tasks
Limitations: requires large amounts of data, lacks explainability, high computational cost

2.3 Behaviorism

Core idea: Agents learn behavior through interaction with the environment. No internal knowledge representation needed
Representative methods: reinforcement learning, evolutionary algorithms, swarm intelligence
Strengths: suited for decision and control problems, does not require labeled data
Limitations: low sample efficiency, exploration difficulties

Extension: Domingos's Five Tribes of ML　In The Master Algorithm (2015), Pedro Domingos cuts learning algorithms — from inside ML — into five tribes: Symbolists / Connectionists / Bayesians / Evolutionaries / Analogizers. This is orthogonal and complementary to the three-school taxonomy above: the three schools cut by "where does intelligence come from", the five tribes cut by "what mathematical tool does the learner use". See the The Master Algorithm notebook.

3. The Turing Test

3.1 The Original Turing Test (1950)

┌──────────┐     Text conversation    ┌──────────┐
│  Human    │ ◄───────────────────→   │  Subject  │
│  Judge    │                         │(Human/    │
└──────────┘                          │ Machine)  │
                                      └──────────┘

Rule: if the judge cannot reliably distinguish whether the subject 
      is human or machine, the machine is considered to have passed 
      the Turing test.

3.2 Limitations of the Turing Test

Criticism	Argument
Tests only imitation	Imitating humans does not equal possessing intelligence
Text-only	Ignores perception, motor skills, and other intelligence dimensions
Human bias	Judge expectations influence results
Deception possible	ELIZA (1966) could "deceive" some users

4. The Chinese Room Argument

A thought experiment proposed by Searle (1980):

Outside the room: Chinese input → [Room] → Chinese output

Inside the room:
  A person who does not understand Chinese
  + A detailed set of Chinese processing rule books
  → Mechanically manipulates symbols according to rules
  → Produces "correct" Chinese responses

Conclusion: although the system appears to "understand" Chinese
      from the outside, the person inside does not understand 
      the meaning of Chinese.
      → Syntactic processing ≠ Semantic understanding

Significance: the fundamental distinction between Strong AI (machines truly understand and think) and Weak AI (machines simulate intelligent behavior).

5. Weak AI vs Strong AI

Dimension	Weak AI (Narrow AI)	Strong AI (AGI)
Capability scope	Specific tasks	General intelligence
Understanding	No true understanding	Possesses understanding and consciousness
Current status	Already achieved	Not yet achieved
Examples	AlphaGo, ChatGPT, self-driving	AI in science fiction

Superintelligence

Definition: intelligence that far surpasses the smartest humans in virtually all domains
Bostrom's view: once AGI is achieved, it may rapidly recursively self-improve to superintelligence
Debate: Is it possible? Is it dangerous? What is the timeline?

6. The Current AI Landscape

graph LR
    subgraph Perception
        A1[Computer Vision]
        A2[Speech Recognition]
        A3[Natural Language Understanding]
    end

    subgraph Reasoning & Decision-Making
        B1[Knowledge Graphs]
        B2[Planning & Search]
        B3[Reinforcement Learning]
    end

    subgraph Generation
        C1[Text Generation LLM]
        C2[Image Generation Diffusion]
        C3[Code Generation Copilot]
    end

    subgraph Applications
        D1[Autonomous Driving]
        D2[Medical Diagnosis]
        D3[Scientific Discovery]
        D4[AI Agents]
    end

    A1 & A2 & A3 --> B1 & B2 & B3
    B1 & B2 & B3 --> C1 & C2 & C3
    C1 & C2 & C3 --> D1 & D2 & D3 & D4

Key Current Trends

Large Language Models (LLMs): GPT-4, Claude, Gemini demonstrate general language capabilities
Multimodal: unified processing of text, images, audio, and video
AI Agents: LLM-driven autonomous agents that can use tools and plan
Scientific AI: AlphaFold (protein structure), GNoME (materials discovery)
Embodied Intelligence: combining AI with physical-world interaction

7. AI Classification System

Classification Dimension	Categories
By learning method	Supervised, unsupervised, reinforcement, self-supervised
By task type	Classification, regression, generation, decision, reasoning
By modality	Text, vision, speech, multimodal
By application domain	NLP, CV, robotics, recommender systems, scientific computing
By intelligence level	Narrow AI → AGI → ASI

References

"Artificial Intelligence: A Modern Approach" - Russell & Norvig
"Computing Machinery and Intelligence" - Alan Turing (1950)
"Minds, Brains, and Programs" - John Searle (1980)
"Superintelligence" - Nick Bostrom