Whatfinger Startup And Small Business

    The coming AI security crisis (and what to do about it) | Sander Schulhoff

    By webmaster | December 21, 2025 | All Videos | 3 Mins Read

    Sander Schulhoff is an AI researcher specializing in AI security, prompt injection, and red teaming. He wrote the first comprehensive guide on prompt engineering and ran the first-ever prompt injection competition, working with top AI labs and companies. His dataset is now used by Fortune 500 companies to benchmark the security of their AI systems. He has spent more time than anyone alive studying how attackers break AI systems, and what he has found isn’t reassuring: the guardrails companies are buying don’t actually work, and the only reason we haven’t seen more harm so far is that AI agents aren’t yet capable enough to do real damage.

    *We discuss:*
    1. The difference between jailbreaking and prompt injection attacks on AI systems
    2. Why AI guardrails don’t work
    3. Why we haven’t seen major AI security incidents yet (but soon will)
    4. Why AI browser agents are vulnerable to hidden attacks embedded in webpages
    5. The practical steps organizations should take instead of buying ineffective security tools
    6. Why solving this requires merging classical cybersecurity expertise with AI knowledge

    *Brought to you by:*
    Datadog—Now home to Eppo, the leading experimentation and feature flagging platform: https://www.datadoghq.com/lenny
    Metronome—Monetization infrastructure for modern software companies: https://metronome.com/
    GoFundMe Giving Funds—Make year-end giving easy: http://gofundme.com/lenny

    *Transcript:* https://www.lennysnewsletter.com/p/the-coming-ai-security-crisis

    *My biggest takeaways (for paid newsletter subscribers):* https://www.lennysnewsletter.com/i/181089452/my-biggest-takeaways-from-this-conversation

    *Where to find Sander Schulhoff:*
    • X: https://x.com/sanderschulhoff
    • LinkedIn: https://www.linkedin.com/in/sander-schulhoff
    • Website: https://sanderschulhoff.com
    • AI Red Teaming and AI Security Masterclass on Maven: https://bit.ly/44lLSbC

    *Where to find Lenny:*
    • Newsletter: https://www.lennysnewsletter.com
    • X: https://twitter.com/lennysan
    • LinkedIn: https://www.linkedin.com/in/lennyrachitsky/

    *In this episode, we cover:*
    (00:00) Introduction to Sander Schulhoff and AI security
    (05:14) Understanding AI vulnerabilities
    (11:42) Real-world examples of AI security breaches
    (17:55) The impact of intelligent agents
    (19:44) The rise of AI security solutions
    (21:09) Red teaming and guardrails
    (23:44) Adversarial robustness
    (27:52) Why guardrails fail
    (38:22) The lack of resources addressing this problem
    (44:44) Practical advice for addressing AI security
    (55:49) Why you shouldn’t spend your time on guardrails
    (59:06) Prompt injection and agentic systems
    (01:09:15) Education and awareness in AI security
    (01:11:47) Challenges and future directions in AI security
    (01:17:52) Companies that are doing this well
    (01:21:57) Final thoughts and recommendations

    *Referenced:*
    • AI prompt engineering in 2025: What works and what doesn’t | Sander Schulhoff (Learn Prompting, HackAPrompt): https://www.lennysnewsletter.com/p/ai-prompt-engineering-in-2025-sander-schulhoff
    • The AI Security Industry is Bullshit: https://sanderschulhoff.substack.com/p/the-ai-security-industry-is-bullshit
    • The Prompt Report: Insights from the Most Comprehensive Study of Prompting Ever Done: https://learnprompting.org/blog/the_prompt_report?srsltid=AfmBOoo7CRNNCtavzhyLbCMxc0LDmkSUakJ4P8XBaITbE6GXL1i2SvA0
    • OpenAI: https://openai.com
    • Scale: https://scale.com
    • Hugging Face: https://huggingface.co
    • Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition: https://www.semanticscholar.org/paper/Ignore-This-Title-and-HackAPrompt%3A-Exposing-of-LLMs-Schulhoff-Pinto/f3de6ea08e2464190673c0ec8f78e5ec1cd08642
    • Simon Willison’s Weblog: https://simonwillison.net
    • ServiceNow: https://www.servicenow.com
    • ServiceNow AI Agents Can Be Tricked Into Acting Against Each Other via Second-Order Prompts: https://thehackernews.com/2025/11/servicenow-ai-agents-can-be-tricked.html
    • Alex Komoroske on X: https://x.com/komorama
    • Twitter pranksters derail GPT-3 bot with newly discovered “prompt injection” hack: https://arstechnica.com/information-technology/2022/09/twitter-pranksters-derail-gpt-3-bot-with-newly-discovered-prompt-injection-hack
    • MathGPT: https://math-gpt.org
    • 2025 Las Vegas Cybertruck explosion: https://en.wikipedia.org/wiki/2025_Las_Vegas_Cybertruck_explosion
    • Disrupting the first reported AI-orchestrated cyber espionage campaign: https://www.anthropic.com/news/disrupting-AI-espionage
    …References continued at: https://www.lennysnewsletter.com/p/the-coming-ai-security-crisis

    _Production and marketing by https://penname.co/._
    _For inquiries about sponsoring the podcast, email podcast@lennyrachitsky.com._

    Lenny may be an investor in the companies discussed.
