Whatfinger Startup And Small Business
    What's Hot

    Humans Will Always Invent More Work

    July 4, 2026

    The Truth About Distraction and Focus #shorts

    July 4, 2026

    “I Keep Losing on Price”

    July 4, 2026
    Whatfinger News Headlines

    Humans Will Always Invent More Work

    July 4, 2026

    The Truth About Distraction and Focus #shorts

    July 4, 2026

    “I Keep Losing on Price”

    July 4, 2026

    The Front End Offer That Takes You From $99 to $299/Month

    July 4, 2026

    A 25 year old $1.3B founder explains what young people are truly capable of

    July 4, 2026

    AI Solved Olympiad Math That This Math Professor Couldn’t

    July 3, 2026

    How we almost sold our startup to Yahoo

    July 3, 2026

    I Don’t Have Hobbies. This Is What I Do Instead.

    July 3, 2026
    Facebook Twitter Instagram
    Saturday, July 4
    • Whatfinger®
    • Breaking
    • Fast Clips
    • Entertainment
    • Military
    • Sports
    • Humor
    • Money
    • Daily List
    • World
    • Crazy Clips
    • Sci-Tech
    • Choice Clips
    Whatfinger Startup And Small BusinessWhatfinger Startup And Small Business
    Whatfinger Startup And Small Business
    Home » GPT-OSS vs. Qwen vs. Deepseek: Comparing Open Source LLM Architectures

    GPT-OSS vs. Qwen vs. Deepseek: Comparing Open Source LLM Architectures

    webmasterBy webmasterAugust 29, 2025 All Videos 1 Min Read
    Share
    Facebook Twitter LinkedIn Pinterest Email

    OpenAI recently released its first open-weights model since GPT-2, entering a field led by DeepSeek and Alibaba’s Qwen.

    YC’s Ankit Gupta breaks down everything you need to know about these top OSS models, including what sets them apart under the hood. He’ll compare their approaches to mixture-of-experts, long-context training, and post-training techniques that shape reasoning and alignment—and explore how different design choices lead to surprisingly similar performance.

    Apply to Y Combinator: https://www.ycombinator.com/apply
    Work at a startup: https://www.ycombinator.com/jobs

    00:00 – OpenAI OSS Launch
    01:00 – Comparing Open Source LLM Architectures
    01:46 – GPT OSS Overview
    02:37 – Under The Hood of GPT OSS
    03:25 – Qwen-3 Architecture
    04:17 – Qwen-3 Training
    05:12 – Qwen-3 Post-Training
    06:08 – Qwen-3 Reasoning & RL Innovations
    06:52 – DeepSeek V3 Overview
    07:40 – DeepSeek V3.1 Updates
    08:39 – Attention Mechanism (MLA)
    09:39 – Comparing Model Sizes
    10:35 – Long Context Strategies
    11:25 – Reflections on Methods
    12:00 – Takeaways

    webmaster

    Keep Reading

    Humans Will Always Invent More Work

    The Truth About Distraction and Focus #shorts

    “I Keep Losing on Price”

    The Front End Offer That Takes You From $99 to $299/Month

    A 25 year old $1.3B founder explains what young people are truly capable of

    AI Solved Olympiad Math That This Math Professor Couldn’t

    Add A Comment

    Leave A Reply Cancel Reply

    Latest Featured Stories

    Humans Will Always Invent More Work

    July 4, 2026

    The Truth About Distraction and Focus #shorts

    July 4, 2026

    “I Keep Losing on Price”

    July 4, 2026

    The Front End Offer That Takes You From $99 to $299/Month

    July 4, 2026

    A 25 year old $1.3B founder explains what young people are truly capable of

    July 4, 2026

    AI Solved Olympiad Math That This Math Professor Couldn’t

    July 3, 2026

    How we almost sold our startup to Yahoo

    July 3, 2026

    I Don’t Have Hobbies. This Is What I Do Instead.

    July 3, 2026

    You’re at Level 1 of 9 Levels

    July 3, 2026

    Stop Fixing, Start Building: Unlock Your ADHD Superpower #shorts

    July 3, 2026

    Why product roles need to stay

    July 3, 2026

    $25M founder: My dating app was too good to get funded

    July 3, 2026

    You Killed the Premium Offer Too Early

    July 2, 2026

    Why I Don’t Have Hobbies

    July 2, 2026

    Jobs Report: Much Worse Than Expected

    July 2, 2026

    Why is AI so bad at design?

    July 2, 2026

    ADHD Brain: Why Stimulation Control Is So Hard #shorts

    July 2, 2026

    What to do when you’re extremely overwhelmed #adhd

    July 2, 2026

    Codex App would’ve failed if released in November 2025. Here’s why;

    July 2, 2026

    The investing hack hiding in your own company

    July 2, 2026

    I dropped out of college and built a $3.6B company from scratch

    July 2, 2026

    4 Pivots in 7 Years. Now He Serves Millions of Patients | MedMe Health, Purya Sarmadi

    July 2, 2026

    30 Business Truths Small Business Owners Ignore (Watch before It’s Too Late)

    July 2, 2026

    ADHD Hacks: Brown Noise & Body Doubling for Focus #shorts

    July 1, 2026

    This Is Secretly Making Kids Dumber

    July 1, 2026

    You’re at Level 1 of 9 — They Don’t Know That

    July 1, 2026

    AI Agents are the new SaaS

    July 1, 2026

    “I Take Bad Jobs Out of Desperation…”

    July 1, 2026

    There’s Nothing More Expensive Than Underselling a Rich Customer

    July 1, 2026

    Stop Trying to Sell Yourself on Your Own Job Posting

    July 1, 2026

    Why you should cut coffee to focus #adhd

    July 1, 2026

    Overwhelmed? Bored? Your Phone is the Problem! #shorts

    July 1, 2026

    Taste is more than aesthetics

    July 1, 2026

    “Every Call Ends the Same Way…”

    June 30, 2026

    It’s Not Your Fault, But It Is Your Problem

    June 30, 2026

    Beat Boredom & Doomscrolling Naturally: My Secret #shorts

    June 30, 2026

    The Best Ways to Make Money With Claude AI That Nobody Is Talking About

    June 30, 2026

    PRDs are not dead

    June 30, 2026

    Career and Leadership Lessons for the AI Era from an $80B+ CEO | Snowflake, Sridhar Ramaswamy

    June 30, 2026

    When to Cut Underperforming Properties

    June 30, 2026
    More news daily than any other news site on Earth. All sources, all on one page! BAM! There can be ONLY one… CLICK BELOW

    Type above and press Enter to search. Press Esc to cancel.