Claude
Born from a Safety Mission
In 2021, a group of researchers left OpenAI because they believed the AI industry wasn't taking safety seriously enough. They founded Anthropic, and their first major product was an AI assistant called Claude.
The name isn't random. It's a nod to Claude Shannon, the father of information theory β the mathematician who figured out how to measure and transmit information, laying the groundwork for everything from the internet to, well, LLMs.
From day one, Anthropic's goal wasn't just to make the smartest AI. It was to make an AI that was helpful, harmless, and honest β the "3 H's" that guide everything Claude does.
Who Founded Anthropic?
Dario Amodei (CEO, former VP of Research at OpenAI) and Daniela Amodei (President, former VP of Safety at OpenAI) lead the company. They brought along several top researchers who shared their concerns about building increasingly powerful AI without adequate safety measures.
Anthropic isn't anti-AI. They believe powerful AI is coming regardless, and they'd rather be at the frontier building it safely than watching from the sidelines. They call this approach "responsible scaling" β push the boundaries, but with guardrails.
Constitutional AI: Teaching an AI Values
Most AI companies use RLHF (Reinforcement Learning from Human Feedback) β hire humans to rate AI responses, then train the model to produce responses humans rate highly. This works, but it has problems:
- Humans are expensive and slow
- Different humans have different values
- It's hard to be consistent across millions of ratings
Anthropic invented something different: Constitutional AI (CAI). Here's how it works:
- Write a constitution β A set of principles like: "Choose the response that is most helpful while being harmless," "Don't help with illegal activities," "If unsure, say so honestly."
- Self-critique β The AI generates a response, then critiques its own response against the constitution. "Does this response violate any principles? How could it be improved?"
- Self-revise β The AI rewrites its response based on its own critique.
- Train on the improved responses β The model learns to produce the revised (better) version directly.
The beauty of this approach: the AI's values are explicit and readable. You can look at the constitution and see exactly what principles the AI was trained to follow. It's more transparent than a black box of human ratings.
Using Claude via the API
What Claude Is Great At
- Long documents β With a 200K token context window, Claude can read and analyze entire books, codebases, legal contracts, or research paper collections in a single conversation. Ask it to find contradictions in a 100-page contract? No problem.
- Coding β Claude excels at writing, reviewing, and debugging code. It's particularly good at understanding large codebases and explaining complex code.
- Careful reasoning β Claude is trained to think step by step and show its work. It's less likely to jump to conclusions and more likely to consider edge cases.
- Saying "I don't know" β Unlike some models that confidently make things up, Claude is designed to express uncertainty when it's unsure. It will say "I'm not confident about this" rather than inventing a plausible-sounding falsehood.
- Following nuanced instructions β Claude is strong at understanding complex, multi-part instructions and following them precisely.
Claude vs. ChatGPT: How Do They Compare?
Neither is strictly "better" β they have different strengths:
- Context window β Claude: 200K tokens. ChatGPT (GPT-4 Turbo): 128K tokens. Claude wins for long documents.
- Ecosystem β ChatGPT has plugins, DALL-E image generation, code interpreter, web browsing, and a huge app store of custom GPTs. Claude is more focused on the core chat experience.
- Safety approach β Claude uses Constitutional AI (explicit principles). ChatGPT uses RLHF (human ratings). Different philosophies, both effective.
- Hallucination β Claude tends to be more cautious, sometimes declining to answer rather than risk being wrong. ChatGPT is more willing to attempt an answer but may hallucinate more.
- Writing style β Claude tends to be more measured and nuanced. ChatGPT tends to be more enthusiastic and verbose. Personal preference varies.
Anthropic's Bigger Vision
Anthropic isn't just building a chatbot. They're working on some of the hardest problems in AI:
- Interpretability β Understanding what's happening inside neural networks. Most AI models are "black boxes" β we know what goes in and what comes out, but not why. Anthropic is leading research into opening that black box.
- Responsible Scaling Policy β A framework for deciding when it's safe to train more powerful models and when to pause. As models get more capable, the stakes get higher.
- Alignment research β Making sure AI systems do what humans actually want, not just what they literally asked for. The difference between "clean my room" (put everything away neatly) and a misaligned interpretation (throw everything in the trash).
Whether Claude or ChatGPT or Gemini ends up being the "best" AI assistant, the safety research Anthropic is doing benefits the entire field. If they figure out how to make AI provably safe, everyone wins.
The Three H's in Practice
Every Claude response is guided by three principles:
- Helpful β Actually answer the question, provide useful information, and complete the task. Don't be so cautious that you're useless.
- Harmless β Don't help with dangerous or illegal activities. Don't produce harmful content. Don't manipulate or deceive.
- Honest β Tell the truth. Express uncertainty when uncertain. Don't pretend to be human. Don't make up facts.
These three principles can sometimes conflict. What if being maximally helpful means answering a dangerous question? Claude is trained to balance all three β being as helpful as possible while staying within the bounds of harmlessness and honesty.
Quick check
Continue reading