Whatfinger Startup And Small Business
    What's Hot

    Choose Between Some Jobs Lost or All Jobs Lost

    June 24, 2026

    Why you can’t focus with ADHD

    June 24, 2026

    a tool to outsource your memory

    June 24, 2026
    Whatfinger News Headlines

    Choose Between Some Jobs Lost or All Jobs Lost

    June 24, 2026

    Why you can’t focus with ADHD

    June 24, 2026

    a tool to outsource your memory

    June 24, 2026

    The two hires Anthropic wants

    June 24, 2026

    This guy made billions from just 3 stocks (Here’s how)

    June 24, 2026

    You Can’t Make Money If You Can’t Manage Your Time

    June 23, 2026

    A Genius Mathematician Who Wanted Nothing

    June 23, 2026

    Money and Power Can Save Countries. Peace Can Only Save Me.

    June 23, 2026
    Facebook Twitter Instagram
    Wednesday, June 24
    • Whatfinger®
    • Breaking
    • Fast Clips
    • Entertainment
    • Military
    • Sports
    • Humor
    • Money
    • Daily List
    • World
    • Crazy Clips
    • Sci-Tech
    • Choice Clips
    Whatfinger Startup And Small BusinessWhatfinger Startup And Small Business
    Whatfinger Startup And Small Business
    Home » GLM 5.2: How to Set Up Local AI (With Cursor/Codex etc)

    GLM 5.2: How to Set Up Local AI (With Cursor/Codex etc)

    webmasterBy webmasterJune 23, 2026 All Videos 4 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email

    In this episode I sit down with Amir to get tactical about running local AI models as part of a daily workflow. We center on GLM 5.2 from ZAI, how it stacks up against frontier models like Opus 4.8, and how a fusion approach lets you sequence a heavy thinking model with a lighter execution model for the best output at the lowest cost. Amir walks through setup in Cursor and Codex via OpenRouter, shares real token-cost math, and demos GLM 5.2 refining a live app. By the end you will know how to start today, where local models shine, and how model chaining keeps spend in check.

    Timestamps
    00:00 – Intro
    02:09 – GLM 5.2 and Z AI
    04:01 – Specs: 1M context and Terminal Bench 2.1
    05:22 – Making sense of benchmark scores
    06:42 – Setup in Cursor or Codex with OpenRouter
    10:18 – Local model upside: buy a machine, run tasks
    11:42 – Token cost: 44 cents versus $2.38
    13:36 – Future-proofing with an upfront hardware bet & The Uber subsidy analogy
    16:49 – Model chaining and the vision workaround
    19:23 – Token maxing vs routing tasks to the right model
    20:54 – Answering the "cost is irrelevant" crowd
    21:59 – Closing thoughts

    Key Points

    * GLM 5.2 ships with a 1M-token context window and scores 81 on Terminal Bench 2.1, landing about four points behind Opus 4.8.
    * A fusion approach (a term OpenRouter coined) sequences models: plan with Opus, execute with GLM 5.2, review with Composer 2.5 or Codex 5.5.
    * Running GLM 5.2 in the cloud through OpenRouter costs roughly 44 cents for a task that runs about $2.38 on Opus 4.8 — close to a 5X saving.
    * You can start today with credit-based access: load $20 in OpenRouter and route tasks to the right model.
    * For images, Amir uses Opus 4.8 to read screenshots and describe them, then hands the layout to GLM 5.2 to act on.
    * Teams are shifting from token-maxing to output-maxing, making model governance and chaining the smart play

    Numbered Section Summaries

    1. The Promise: Local Models Catch Up — I open by framing the goal: a tactical look at how local models now keep pace with closed models, and how Amir puts them to work every day.

    2. GLM 5.2 Arrives — Amir covers ZAI’s GLM 5.2 release: a 1M-token context window, an 81 on Terminal Bench 2.1, and strong long-horizon task performance, marking a clear leap from 5.1.

    3. Reading Benchmarks by Vibes — We agree benchmarks feel abstract, so Amir favors building with the model and judging output directly. He sees about 62% where Opus reaches roughly 69%, then trusts hands-on testing to settle it.

    4. Setup in Cursor and Codex — Amir lays out two paths: paste a ZAI API key into Cursor and override the OpenAI endpoint to add GLM 5.2 as a custom model, or use OpenRouter with a Codex profile and switch models from the CLI.

    5. The Fusion Approach — Borrowing OpenRouter’s term, Amir describes sequencing models so each handles its strength: a thinking model plans, an execution model builds, and a reviewer polishes, keeping cost and performance balanced.

    6. The Token Math — Amir maps a real example: 50k input and 85k output tokens land near Opus 4.8 quality for about 44 cents on GLM 5.2 versus $2.38 on Opus 4.8. I call out that 5X as a big deal at scale.

    7. Future-Proofing and the Subsidy Clock — We compare today’s token subsidies to Uber’s early cheap rides. Amir suggests an upfront hardware investment now pays off as heavier future models arrive and subsidies wind down.

    8. Governance, Chaining, and the Vision Workaround — Amir shares how teams overspend (think formatting an email with Opus 4.8 high thinking) and how chaining fixes it. For images, he routes screenshots through Opus 4.8, then hands the layout to GLM 5.2.

    The #1 tool to find startup ideas/trends – https://www.ideabrowser.com/

    LCA helps Fortune 500s and fast-growing startups build their future – from Warner Music to Fortnite to Dropbox. We turn ‘what if’ into reality with AI, apps, and next-gen products https://latecheckout.agency/

    The Vibe Marketer – Resources for people into vibe marketing/marketing with AI: https://www.thevibemarketer.com/

    FIND ME ON SOCIAL
    X/Twitter: https://twitter.com/gregisenberg
    Instagram: https://instagram.com/gregisenberg/
    LinkedIn: https://www.linkedin.com/in/gisenberg/

    FIND AMIR ON SOCIAL
    Humblytics: https://humblytics.com/?via=community
    X/Twitter: https://x.com/amirmxt
    Youtube: https://www.youtube.com/@amirmxt

    webmaster

    Keep Reading

    Choose Between Some Jobs Lost or All Jobs Lost

    Why you can’t focus with ADHD

    a tool to outsource your memory

    The two hires Anthropic wants

    This guy made billions from just 3 stocks (Here’s how)

    You Can’t Make Money If You Can’t Manage Your Time

    Add A Comment

    Leave A Reply Cancel Reply

    Latest Featured Stories

    Choose Between Some Jobs Lost or All Jobs Lost

    June 24, 2026

    Why you can’t focus with ADHD

    June 24, 2026

    a tool to outsource your memory

    June 24, 2026

    The two hires Anthropic wants

    June 24, 2026

    This guy made billions from just 3 stocks (Here’s how)

    June 24, 2026

    You Can’t Make Money If You Can’t Manage Your Time

    June 23, 2026

    A Genius Mathematician Who Wanted Nothing

    June 23, 2026

    Money and Power Can Save Countries. Peace Can Only Save Me.

    June 23, 2026

    GLM 5.2: How to Set Up Local AI (With Cursor/Codex etc)

    June 23, 2026

    Same Sales Velocity, But LTV Is 5-10x Higher

    June 23, 2026

    One Second On The Sofa Can Kill Your Day #adhd

    June 23, 2026

    How to Start a Business From Zero in 2026 (Before Its too Late)

    June 23, 2026

    The antidote for AI anxiety

    June 23, 2026

    He Completely Reinvented Himself 

    June 23, 2026

    Fiona Fung’s Claude now automatically reads her Slack every morning?

    June 22, 2026

    Give Someone a Label and They’ll Change Their Own Behavior

    June 22, 2026

    You’re Not Competing Against the Company, You’re Competing Against Rep #7

    June 22, 2026

    How to Get Your First 10 Customers

    June 22, 2026

    How to Measure Customer Happiness

    June 22, 2026

    Housing Market Update: Home Prices, Mortgage Rates & Outlook

    June 22, 2026

    ADHD Time Blindness Is Broken

    June 22, 2026

    7 Weird ADHD Hacks to Improve Your Life (Without Discipline)

    June 22, 2026

    You Don’t Have to Write, But You Can’t Do Anything Else

    June 22, 2026

    8 Entrepreneurs Compete For $100,000Scale or Fail Season 1. Premiering this Friday, June 26th

    June 22, 2026

    Gong CEO on How to Pick the Right Market in the AI Era | Amit Bendov

    June 22, 2026

    I can’t hide this anymore.

    June 22, 2026

    35-year-old dad trains for the World Cup

    June 22, 2026

    Stop Chasing Happiness, Start Chasing Things You Can Do

    June 21, 2026

    You’re Ashamed of Not Having Integrity With Your Own Pricing

    June 21, 2026

    How to Measure Customer Happiness (Paired Metrics)

    June 21, 2026

    If You Have No Money, You Should Have No Shame

    June 21, 2026

    How to Measure Customer Happiness (Paired Metrics)

    June 21, 2026

    Building the most AI-pilled engineering team in the world | Fiona Fung (Anthropic)

    June 21, 2026

    How to Raise Your Prices Without Losing Clients

    June 20, 2026

    Why Every New Opportunity Is the Woman in the Red Dress

    June 20, 2026

    Can AI Fix This $1M Luxury Travel Business?

    June 19, 2026

    Winners Are Relieved, Losers Are Excited

    June 19, 2026

    Why Domain Experts Are Winning Right Now

    June 19, 2026

    How to Start a Business and Survive the First 3 Years (Most Don’t)

    June 19, 2026

    You Bought a Business, Not a Passive Investment

    June 18, 2026
    More news daily than any other news site on Earth. All sources, all on one page! BAM! There can be ONLY one… CLICK BELOW

    Type above and press Enter to search. Press Esc to cancel.