Knowledge Representation
The Mind Map of a Machine
Close your eyes and think about "Paris." Instantly, your brain lights up with connections: France, Eiffel Tower, croissants, City of Lights, romance, the Seine river...
Your brain doesn't store "Paris" as an isolated word. It stores it as a web of connections. Paris IS-A city. Paris IS-IN France. Paris HAS-A Eiffel Tower. Paris IS-KNOWN-FOR romance.
This is knowledge representation β figuring out how to store facts and relationships so a computer can reason about the world the way you do. Without it, AI is just crunching numbers. With it, AI can understand concepts.
Why This Matters
Imagine asking an AI: "If I'm in the capital of France, which famous tower can I visit?"
To answer this, the AI needs to know: Paris is the capital of France, and the Eiffel Tower is in Paris. It needs structured knowledge β not just a bag of words, but facts organized in a way that supports reasoning.
Four Ways to Represent Knowledge
Over the decades, AI researchers have developed several approaches:
1. Semantic Networks
A graph where nodes are concepts and edges are relationships. Like a mind map: Dog β IS-A β Animal, Dog β HAS β Fur. Simple, visual, and intuitive.
2. Frames
Like filling out a form for each concept. A "Restaurant" frame might have slots for: name, cuisine, location, price_range. Think of it as an early version of object-oriented programming, but for knowledge.
3. Ontologies
Formal, hierarchical systems. "Animal" is a superclass. "Mammal" is a subclass of Animal. "Dog" is a subclass of Mammal. They inherit properties down the chain β if mammals are warm-blooded, then dogs are warm-blooded too.
4. Knowledge Graphs
The modern standard. Google's Knowledge Graph connects billions of facts as triples: (subject, predicate, object). (Einstein, born-in, Ulm), (Einstein, field, Physics). When you Google a celebrity and see that info panel on the right? That's a knowledge graph at work.
Building a Simple Knowledge Graph
The Challenge: Common Sense
Humans know millions of "obvious" facts that are surprisingly hard to encode: water is wet, fire is hot, you can't fit an elephant in a suitcase. This is called the common sense problem, and it remains one of the biggest challenges in AI.
The CYC project, started in 1984, attempted to manually encode all common sense knowledge. Decades later, it has millions of rules β and still doesn't cover everything a five-year-old knows intuitively.
Modern AI systems like large language models take a different approach: they learn implicit knowledge from vast amounts of text, but they still struggle with basic physical and social reasoning that comes naturally to humans.