AI Ethics & Safety
When AI Goes Wrong
In 2018, Amazon discovered that its AI-powered hiring tool had a serious problem. The system, trained on a decade of resumes from past hires, had learned a pattern: most successful engineering candidates were men. So it started penalizing resumes that included the word "women's" β as in "women's chess club captain" or "women's university."
The AI wasn't sexist on purpose. It was doing exactly what it was trained to do: find patterns in historical data and use them to predict future success. The problem was that the historical data was biased, and the AI faithfully learned and amplified that bias.
Amazon scrapped the tool. But the damage was done β and the lesson was clear: AI is powerful, but power without responsibility is a recipe for disaster.
Meanwhile, in 2024, a deepfake video of a prominent CEO appeared to announce that his company was going bankrupt. The stock dropped 8% in minutes before the video was debunked. Billions of dollars in market value, wiped out by a piece of AI-generated fiction.
These aren't hypothetical scenarios. They're real events that happened to real companies and real people. And as AI becomes more powerful, the ethical stakes keep rising.
Bias: Garbage In, Garbage Out
The oldest rule in computing applies perfectly to AI: garbage in, garbage out. If you train an AI on biased data, you get a biased AI. And our data is very biased.
Where Does Bias Come From?
- Historical data. If past decisions were discriminatory (hiring, lending, policing), an AI trained on that data will learn those same discriminatory patterns.
- Representation gaps. Training data may over-represent certain demographics and under-represent others. AI image generators initially produced mostly white faces when asked for "a professional" because their training data was skewed.
- Label bias. Humans who label training data bring their own biases. If labelers associate certain accents with "unprofessional," the AI will too.
- Measurement bias. The metrics we choose to optimize can embed bias. If a loan AI optimizes for "likelihood of repayment" using ZIP code as a feature, it may discriminate against communities that were historically denied access to banking.
Real-World Bias Examples
- Healthcare AI β A widely used algorithm in US hospitals was found to systematically deprioritize Black patients for extra care, because it used healthcare spending as a proxy for health needs. Black patients historically had less access to healthcare, so they spent less β which the AI interpreted as "less sick."
- Criminal justice β The COMPAS algorithm, used in courtrooms to predict recidivism, was found to be twice as likely to falsely flag Black defendants as future criminals compared to white defendants.
- Language models β Studies have shown that LLMs associate certain professions with certain genders ("nurse" with female, "CEO" with male) and certain names with certain racial stereotypes.
Privacy: Who's Watching?
AI systems are hungry for data. The more data they have, the better they perform. But that hunger creates a fundamental tension with privacy.
The Data Collection Problem
Consider everything AI systems can now analyze about you:
- Your face β Facial recognition can identify you in crowds, track your movements, and even estimate your emotional state
- Your voice β Voice assistants record and analyze your speech patterns, accent, and the content of what you say
- Your writing β AI can analyze your emails, messages, and social media posts to build a detailed psychological profile
- Your behavior β Shopping patterns, browsing history, location data, fitness trackers β all fed into AI models
The Consent Problem
Did you consent to all of this? Usually, it's buried in a Terms of Service agreement that nobody reads. The average person would need 76 working days to read all the privacy policies they agree to in a year.
The Surveillance Concern
In some countries, AI-powered surveillance systems track millions of citizens in real time β their faces, their movements, their social connections. Even in democratic countries, the combination of AI and data creates surveillance capabilities that would have been unthinkable a decade ago.
Deepfakes: Seeing Is No Longer Believing
A deepfake is AI-generated media β video, audio, or images β that depicts real people doing or saying things they never did. The technology has become so good that most people can't tell the difference.
How Deepfakes Work
Deepfakes typically use a type of AI called a Generative Adversarial Network (GAN) or modern diffusion models:
- One AI (the "generator") creates fake content
- Another AI (the "discriminator") tries to spot the fakes
- They train against each other, each getting better β until the fakes are virtually undetectable
The Damage Deepfakes Can Do
- Political manipulation β Fake videos of politicians saying things they never said, timed for maximum impact before elections
- Financial fraud β In 2024, an employee at a Hong Kong company transferred $25 million after a video call with what appeared to be the company's CFO. It was a deepfake.
- Reputation destruction β Non-consensual deepfake images and videos, overwhelmingly targeting women
- Erosion of trust β The "liar's dividend" β even real video evidence can be dismissed as a deepfake. When anything can be faked, nothing can be trusted.
Job Displacement: The Elephant in the Room
Let's address the question everyone's thinking about: will AI take my job?
The honest answer is nuanced:
- Some jobs will disappear. Roles that are primarily about routine data processing, simple content creation, or repetitive pattern recognition are most at risk.
- Many jobs will transform. A graphic designer won't disappear β but their job will shift from creating images from scratch to directing AI and refining its output. The skills change; the role evolves.
- New jobs will emerge. "Prompt engineer," "AI trainer," "AI ethics auditor," and "AI safety researcher" didn't exist 5 years ago. Every technology wave destroys some jobs and creates others.
- The transition is the hard part. Even if AI creates more jobs than it destroys in the long run, the people who lose jobs aren't always the ones who get new ones. The transition period can be painful.
Historical perspective: the ATM was supposed to kill bank teller jobs. Instead, banks opened more branches (because they were cheaper to run), and the number of bank tellers actually increased. The job just changed β from counting cash to relationship management.
Will the same happen with AI? Maybe. But AI is broader and faster than any previous technology shift, so the scale of disruption may also be unprecedented.
What Companies Are Doing
Responsible AI companies are investing heavily in safety and ethics. Here's what that looks like:
Safety Teams
Major AI labs (OpenAI, Google DeepMind, Anthropic, Meta AI) have dedicated safety and alignment teams whose job is to make sure AI systems behave as intended. These teams work on:
- Preventing harmful outputs (violence, hate speech, dangerous instructions)
- Reducing hallucinations and improving accuracy
- Ensuring AI systems are honest and transparent about their limitations
Red Teaming
Red teaming is when companies hire people to deliberately try to break their AI systems. Red teamers try to get the AI to:
- Generate harmful or dangerous content
- Reveal private information from training data
- Behave in biased or discriminatory ways
- Be manipulated into doing things it's not supposed to
By finding vulnerabilities before bad actors do, companies can patch them proactively.
Responsible AI Frameworks
Organizations like the Partnership on AI, the EU AI Act (the world's first comprehensive AI law), and various government bodies are developing rules and frameworks for responsible AI development and deployment.
Key principles that most frameworks share:
- Fairness β AI should not discriminate
- Transparency β People should know when AI is being used and how decisions are made
- Privacy β Personal data must be protected
- Accountability β Someone must be responsible when AI causes harm
- Safety β AI should not cause physical or psychological harm
- Human oversight β Humans should remain in control of critical decisions
A Simple Bias Detector
What You Can Do
You don't need to be an AI researcher to care about AI ethics. Here's how everyday people can make a difference:
- Be a skeptical consumer. Question AI-generated content. Don't share images, videos, or claims without verifying them. Be especially careful during elections and crises.
- Demand transparency. Support companies that are transparent about their AI use. Ask "Is this AI-generated?" and "How was this decision made?"
- Protect your data. Read privacy settings. Opt out of data collection when possible. Use privacy-focused tools.
- Speak up about bias. If you encounter biased AI output β an image generator that only shows one race, a chatbot that makes stereotypical assumptions β report it. Companies can't fix what they don't know about.
- Learn the basics. Understanding how AI works (which you're doing right now!) is the best defense against being manipulated by it. An informed public is the strongest safeguard against AI misuse.
- Support good policy. Advocate for sensible AI regulation that balances innovation with protection. The EU AI Act is a starting point; every country needs thoughtful AI governance.