Whatfinger Startup And Small Business
    What's Hot

    Don’t Answer These Questions In A Sale

    November 27, 2025

    “Putin you’re full of sh*t”

    November 27, 2025

    Software Itself Is NOT Valuable

    November 27, 2025
    Whatfinger News Headlines

    Don’t Answer These Questions In A Sale

    November 27, 2025

    “Putin you’re full of sh*t”

    November 27, 2025

    Software Itself Is NOT Valuable

    November 27, 2025

    This Sales Guy Crushed All Our Records..

    November 27, 2025

    How I Optimize My Content for Every Platform #shorts #contentcreator

    November 27, 2025

    The Day1 Rule for Every New Content Creator #shorts #contentcreator

    November 27, 2025

    Why It’s Easier The 2nd Time

    November 26, 2025

    How to Achieve So Much in 24 Hours It Feels Illegal

    November 26, 2025
    Facebook Twitter Instagram
    Thursday, November 27
    • Whatfinger®
    • Breaking
    • Videos
    • Fast Clips
    • Entertainment
    • Military
    • Sports
    • Humor
    • Money
    • Daily List
    • World
    • Crazy Clips
    • Daily Paper
    • Sci-Tech
    • Top 3
    • Choice Clips
    • About
    • Retirement
    Whatfinger Startup And Small BusinessWhatfinger Startup And Small Business
    Whatfinger Startup And Small Business
    Home » GPT-OSS vs. Qwen vs. Deepseek: Comparing Open Source LLM Architectures

    GPT-OSS vs. Qwen vs. Deepseek: Comparing Open Source LLM Architectures

    webmasterBy webmasterAugust 29, 2025 All Videos 1 Min Read
    Share
    Facebook Twitter LinkedIn Pinterest Email

    OpenAI recently released its first open-weights model since GPT-2, entering a field led by DeepSeek and Alibaba’s Qwen.

    YC’s Ankit Gupta breaks down everything you need to know about these top OSS models, including what sets them apart under the hood. He’ll compare their approaches to mixture-of-experts, long-context training, and post-training techniques that shape reasoning and alignment—and explore how different design choices lead to surprisingly similar performance.

    Apply to Y Combinator: https://www.ycombinator.com/apply
    Work at a startup: https://www.ycombinator.com/jobs

    00:00 – OpenAI OSS Launch
    01:00 – Comparing Open Source LLM Architectures
    01:46 – GPT OSS Overview
    02:37 – Under The Hood of GPT OSS
    03:25 – Qwen-3 Architecture
    04:17 – Qwen-3 Training
    05:12 – Qwen-3 Post-Training
    06:08 – Qwen-3 Reasoning & RL Innovations
    06:52 – DeepSeek V3 Overview
    07:40 – DeepSeek V3.1 Updates
    08:39 – Attention Mechanism (MLA)
    09:39 – Comparing Model Sizes
    10:35 – Long Context Strategies
    11:25 – Reflections on Methods
    12:00 – Takeaways

    webmaster

    Keep Reading

    Don’t Answer These Questions In A Sale

    “Putin you’re full of sh*t”

    Software Itself Is NOT Valuable

    This Sales Guy Crushed All Our Records..

    How I Optimize My Content for Every Platform #shorts #contentcreator

    The Day1 Rule for Every New Content Creator #shorts #contentcreator

    Add A Comment

    Leave A Reply Cancel Reply

    Latest Featured Stories

    Don’t Answer These Questions In A Sale

    November 27, 2025

    “Putin you’re full of sh*t”

    November 27, 2025

    Software Itself Is NOT Valuable

    November 27, 2025

    This Sales Guy Crushed All Our Records..

    November 27, 2025

    How I Optimize My Content for Every Platform #shorts #contentcreator

    November 27, 2025

    The Day1 Rule for Every New Content Creator #shorts #contentcreator

    November 27, 2025

    Why It’s Easier The 2nd Time

    November 26, 2025

    How to Achieve So Much in 24 Hours It Feels Illegal

    November 26, 2025

    The Sure Way To Grow Your Business

    November 26, 2025

    Is This The Best Revenge?

    November 26, 2025

    This MILLIONAIRE is scared to go broke

    November 26, 2025

    $10M Business ideas w/ The Most Interesting Guy In Tech

    November 26, 2025

    Why your approach to conflict is wrong

    November 26, 2025

    How to Know Your Customers and 10x Your Business Revenue

    November 26, 2025

    The MOST expensive lunch in the world

    November 25, 2025

    Reviewing Claude Opus 4.5

    November 25, 2025

    Can Slate Trucks Scale?

    November 25, 2025

    This Laundromat makes HOW MUCH?!

    November 25, 2025

    Fed’s Money Printing is About to Start — Melt-Up Will Accelerate

    November 25, 2025

    Will AI Take Over?

    November 25, 2025

    Chase the Hard Problems

    November 25, 2025

    The GREATEST investor you’ve never heard of

    November 25, 2025

    The Best Time To Start

    November 25, 2025

    How 2M-Subscriber Creator Achieves 10x Virality with AI | The Rundown AI, Rowan Cheung

    November 25, 2025

    Coaching your team to rely on you less

    November 25, 2025

    We Hired 90 People In 12 Weeks

    November 24, 2025

    The Goal Of Good Branding

    November 24, 2025

    He finessed Amazon’s INSANE return policy

    November 24, 2025

    Find a Pain Point Worth Solving

    November 24, 2025

    Why leaders should learn to coach

    November 24, 2025

    Vibe Coding Mobile Apps People Love (Free Course)

    November 24, 2025

    Luck Is A Skill

    November 24, 2025

    How a $200 Doorbell Became a $4B Business

    November 24, 2025

    How to Attract Customers from Big Players #shorts

    November 24, 2025

    Why 90% Businesses Fail in The First 3 Years

    November 24, 2025

    Everyone wants to have done this

    November 24, 2025

    The Rags to BILLIONS story.

    November 24, 2025

    5 Things To Get People To Buy

    November 23, 2025

    I Solve It As Many Ways I Can

    November 23, 2025

    We’re All Screwed

    November 23, 2025
    Whatfinger News – The Conservative Alternative To the Drudge Report – CLICK BELOW
    More news daily than any other news site on Earth. All sources, all on one page! BAM! There can be ONLY one… CLICK BELOW

    Type above and press Enter to search. Press Esc to cancel.