Whatfinger Startup And Small Business
    What's Hot

    Claude Code built me a $273/Day online directory

    February 16, 2026

    “I Caught My Employee Stealing”

    February 16, 2026

    10 Brutal Truths For Ambitious Men in Their 30s & 40s

    February 16, 2026
    Whatfinger News Headlines

    Claude Code built me a $273/Day online directory

    February 16, 2026

    “I Caught My Employee Stealing”

    February 16, 2026

    10 Brutal Truths For Ambitious Men in Their 30s & 40s

    February 16, 2026

    5 Reasons Why People Don’t Do Things

    February 15, 2026

    Why Dating Feels So Much Harder If You Have ADHD

    February 15, 2026

    You Can Be In A Good Mood For No Reason

    February 15, 2026

    How to be a CEO when AI breaks all the old playbooks | Sequoia CEO Coach Brian Halligan

    February 15, 2026

    Software engineers are like wizards

    February 15, 2026
    Facebook Twitter Instagram
    Monday, February 16
    • Whatfinger®
    • Breaking
    • Videos
    • Fast Clips
    • Entertainment
    • Military
    • Sports
    • Humor
    • Money
    • Daily List
    • World
    • Crazy Clips
    • Daily Paper
    • Sci-Tech
    • Top 3
    • Choice Clips
    • About
    • Retirement
    Whatfinger Startup And Small BusinessWhatfinger Startup And Small Business
    Whatfinger Startup And Small Business
    Home » The coming AI security crisis (and what to do about it) | Sander Schulhoff

    The coming AI security crisis (and what to do about it) | Sander Schulhoff

    webmasterBy webmasterDecember 21, 2025 All Videos 3 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Sander Schulhoff is an AI researcher specializing in AI security, prompt injection, and red teaming. He wrote the first comprehensive guide on prompt engineering and ran the first-ever prompt injection competition, working with top AI labs and companies. His dataset is now used by Fortune 500 companies to benchmark their AI systems security, he’s spent more time than anyone alive studying how attackers break AI systems, and what he’s found isn’t reassuring: the guardrails companies are buying don’t actually work, and we’ve been lucky we haven’t seen more harm so far, only because AI agents aren’t capable enough yet to do real damage.

    *We discuss:*
    1. The difference between jailbreaking and prompt injection attacks on AI systems
    2. Why AI guardrails don’t work
    3. Why we haven’t seen major AI security incidents yet (but soon will)
    4. Why AI browser agents are vulnerable to hidden attacks embedded in webpages
    5. The practical steps organizations should take instead of buying ineffective security tools
    6. Why solving this requires merging classical cybersecurity expertise with AI knowledge

    *Brought to you by:*
    Datadog—Now home to Eppo, the leading experimentation and feature flagging platform: https://www.datadoghq.com/lenny
    Metronome—Monetization infrastructure for modern software companies: https://metronome.com/
    GoFundMe Giving Funds—Make year-end giving easy: http://gofundme.com/lenny

    *Transcript:* https://www.lennysnewsletter.com/p/the-coming-ai-security-crisis

    *My biggest takeaways (for paid newsletter subscribers):* https://www.lennysnewsletter.com/i/181089452/my-biggest-takeaways-from-this-conversation

    *Where to find Sander Schulhoff:*
    • X: https://x.com/sanderschulhoff
    • LinkedIn: https://www.linkedin.com/in/sander-schulhoff
    • Website: https://sanderschulhoff.com
    • AI Red Teaming and AI Security Masterclass on Maven: https://bit.ly/44lLSbC

    *Where to find Lenny:*
    • Newsletter: https://www.lennysnewsletter.com
    • X: https://twitter.com/lennysan
    • LinkedIn: https://www.linkedin.com/in/lennyrachitsky/

    *In this episode, we cover:*
    (00:00) Introduction to Sander Schulhoff and AI security
    (05:14) Understanding AI vulnerabilities
    (11:42) Real-world examples of AI security breaches
    (17:55) The impact of intelligent agents
    (19:44) The rise of AI security solutions
    (21:09) Red teaming and guardrails
    (23:44) Adversarial robustness
    (27:52) Why guardrails fail
    (38:22) The lack of resources addressing this problem
    (44:44) Practical advice for addressing AI security
    (55:49) Why you shouldn’t spend your time on guardrails
    (59:06) Prompt injection and agentic systems
    (01:09:15) Education and awareness in AI security
    (01:11:47) Challenges and future directions in AI security
    (01:17:52) Companies that are doing this well
    (01:21:57) Final thoughts and recommendations

    *Referenced:*
    • AI prompt engineering in 2025: What works and what doesn’t | Sander Schulhoff (Learn Prompting, HackAPrompt): https://www.lennysnewsletter.com/p/ai-prompt-engineering-in-2025-sander-schulhoff
    • The AI Security Industry is Bullshit: https://sanderschulhoff.substack.com/p/the-ai-security-industry-is-bullshit
    • The Prompt Report: Insights from the Most Comprehensive Study of Prompting Ever Done: https://learnprompting.org/blog/the_prompt_report?srsltid=AfmBOoo7CRNNCtavzhyLbCMxc0LDmkSUakJ4P8XBaITbE6GXL1i2SvA0
    • OpenAI: https://openai.com
    • Scale: https://scale.com
    • Hugging Face: https://huggingface.co
    • Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition: https://www.semanticscholar.org/paper/Ignore-This-Title-and-HackAPrompt%3A-Exposing-of-LLMs-Schulhoff-Pinto/f3de6ea08e2464190673c0ec8f78e5ec1cd08642
    • Simon Willison’s Weblog: https://simonwillison.net
    • ServiceNow: https://www.servicenow.com
    • ServiceNow AI Agents Can Be Tricked Into Acting Against Each Other via Second-Order Prompts: https://thehackernews.com/2025/11/servicenow-ai-agents-can-be-tricked.html
    • Alex Komoroske on X: https://x.com/komorama
    • Twitter pranksters derail GPT-3 bot with newly discovered “prompt injection” hack: https://arstechnica.com/information-technology/2022/09/twitter-pranksters-derail-gpt-3-bot-with-newly-discovered-prompt-injection-hack
    • MathGPT: https://math-gpt.org
    • 2025 Las Vegas Cybertruck explosion: https://en.wikipedia.org/wiki/2025_Las_Vegas_Cybertruck_explosion
    • Disrupting the first reported AI-orchestrated cyber espionage campaign: https://www.anthropic.com/news/disrupting-AI-espionage
    …References continued at: https://www.lennysnewsletter.com/p/the-coming-ai-security-crisis

    _Production and marketing by https://penname.co/._
    _For inquiries about sponsoring the podcast, email podcast@lennyrachitsky.com._

    Lenny may be an investor in the companies discussed.

    webmaster

    Keep Reading

    Claude Code built me a $273/Day online directory

    “I Caught My Employee Stealing”

    10 Brutal Truths For Ambitious Men in Their 30s & 40s

    5 Reasons Why People Don’t Do Things

    Why Dating Feels So Much Harder If You Have ADHD

    You Can Be In A Good Mood For No Reason

    Add A Comment

    Leave A Reply Cancel Reply

    Latest Featured Stories

    Claude Code built me a $273/Day online directory

    February 16, 2026

    “I Caught My Employee Stealing”

    February 16, 2026

    10 Brutal Truths For Ambitious Men in Their 30s & 40s

    February 16, 2026

    5 Reasons Why People Don’t Do Things

    February 15, 2026

    Why Dating Feels So Much Harder If You Have ADHD

    February 15, 2026

    You Can Be In A Good Mood For No Reason

    February 15, 2026

    How to be a CEO when AI breaks all the old playbooks | Sequoia CEO Coach Brian Halligan

    February 15, 2026

    Software engineers are like wizards

    February 15, 2026

    How to Become Rich in Real Life by Thinking Like the Top 1%

    February 15, 2026

    “Do Better”

    February 14, 2026

    How To Build A 20x Company

    February 14, 2026

    “Codex reviews all of our PRs”

    February 14, 2026

    This Is How I Make A 10/10 Offer

    February 13, 2026

    Net Profit VS Gross Profit

    February 13, 2026

    AI Just Had It’s iPhone Moment…

    February 13, 2026

    Stop Shipping AI Slop with Weavy AI

    February 13, 2026

    How to make $1M?

    February 13, 2026

    This AI agent completes your To-Do list (plus 4 AI tools that’ll blow you away)

    February 13, 2026

    The 1 person billion-dollar startup effect

    February 13, 2026

    $10K Offer VS $10M Offer

    February 12, 2026

    why you need to get WHOOP #notsponsored

    February 12, 2026

    Why OpenClaw Took Off

    February 12, 2026

    Labor Market Warning: The Jobs Report Doesn’t Add Up

    February 12, 2026

    Why Info Businesses Are So Hard

    February 12, 2026

    Claude Cowork Explained

    February 12, 2026

    “Engineers are becoming sorcerers” | The future of software development with OpenAI’s Sherwin Wu

    February 12, 2026

    Why “Retire With $1M” Is Terrible Advice

    February 12, 2026

    Why AI errors are actually your fault

    February 12, 2026

    How to Improve Leadership Skills in Your Business (Before its Too Late)

    February 12, 2026

    How to Automate Your Life & Work w/ Claude Code: Ultimate Beginner’s Guide

    February 11, 2026

    Which Life Would You Pick?

    February 11, 2026

    This supplement works better than coffee for ADHD #adhd

    February 11, 2026

    Become a Master Marketer Using Claude Code

    February 11, 2026

    Why Jeff Bezos Banned Him From Amazon

    February 11, 2026

    My mother-in-law turned $10K into $1,000,000/yr

    February 11, 2026

    “You Haven’t Nailed Your Model”

    February 11, 2026

    AI Native Hedge Funds

    February 11, 2026

    We will succeed where Vitalik Buterin failed.

    February 11, 2026

    Coding is becoming calligraphy

    February 11, 2026

    We Dominated The Gym Industry With This

    February 10, 2026
    Whatfinger News – The Conservative Alternative To the Drudge Report – CLICK BELOW
    More news daily than any other news site on Earth. All sources, all on one page! BAM! There can be ONLY one… CLICK BELOW

    Type above and press Enter to search. Press Esc to cancel.