Close Menu
Finsider

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Creators and communities everywhere take a stand against ICE

    January 27, 2026

    Market Update: CSX, SLB, WBD

    January 26, 2026

    Dow Rises 313 Points to Begin a Big Week: Stock Market Today

    January 26, 2026
    Facebook X (Twitter) Instagram
    Trending
    • Creators and communities everywhere take a stand against ICE
    • Market Update: CSX, SLB, WBD
    • Dow Rises 313 Points to Begin a Big Week: Stock Market Today
    • What next for the Vodafone share price? Here’s what the experts say
    • Obvious Ventures lands fund five with a 360-degree view of planetary, human, economic health
    • Iceland Named Europe’s Fastest-Growing Tourism Destination
    • Stock Indexes Gain to Begin Big Tech Earnings, Fed Decision Week; Gold Tops $5,000 for First Time
    • 7 Ways to Kick Off an Estate Planning Talk With Your Parents
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Finsider
    • Markets & Ecomony
    • Tech & Innovation
    • Money & Wealth
    • Business & Startups
    • Visa & Residency
    Finsider
    Home»Tech & Innovation»New study shows AI isn’t ready for office work
    Tech & Innovation

    New study shows AI isn’t ready for office work

    FinsiderBy FinsiderJanuary 24, 2026No Comments2 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    New study shows AI isn’t ready for office work
    Share
    Facebook Twitter LinkedIn Pinterest Email

    It has been nearly two years since Microsoft CEO Satya Nadella predicted that generative AI would take over knowledge work, but if you look around a typical law firm or investment bank today, the human workforce is still very much in charge. Despite all the hype about “reasoning” and “planning,” a new study from training-data company Mercor explains exactly why the robot revolution is stalled: AI just can’t handle the messiness of real work.

    A reality check for the “replacement” theory

    Mercor released a new benchmark called APEX-Agents, and it is brutal. unlike the usual tests that ask AI to write a poem or solve a math problem, this one uses actual queries from lawyers, consultants, and bankers. It asks the models to do complete, multi-step tasks that require jumping between different types of information.

    The results? Even the absolute best models on the market—we are talking about Gemini 3 Flash and GPT-5.2—couldn’t crack a 25% accuracy rate. Gemini led the pack at 24%, with GPT-5.2 right behind it at 23%. Most others were stuck in the teens.

    Why AI is failing the “office test”

    Mercor CEO Brendan Foody points out that the issue isn’t raw intelligence; it’s context. In the real world, answers aren’t served up on a silver platter. A lawyer has to check a Slack thread, read a PDF policy, look at a spreadsheet, and then synthesize all that to answer a question about GDPR compliance.

    Humans do this context-switching naturally. AI, it turns out, is terrible at it. When you force these models to hunt for information across “scattered” sources, they either get confused, give the wrong answer, or just give up entirely.

    The “Unreliable Intern”

    For anyone worried about their job security, this is a bit of a relief. The study suggests that right now, AI functions less like a seasoned professional and more like an unreliable intern who gets things right about a quarter of the time.

    That said, the progress is terrifyingly fast. Foody noted that just a year ago, these models were scoring between 5% and 10%. Now they are hitting 24%. So, while they aren’t ready to take the wheel yet, they are learning to drive much faster than we expected. For now, though, the “knowledge work” revolution is on hold until the bots learn how to multitask p

    Isnt Office Ready Shows Study work
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleMy sister is buying our parents’ $3 million house, but wants to deduct $100K for renovations. Who’s right?
    Next Article Up 73% year to date, this stock in my SIPP is suddenly on fire!
    Finsider
    • Website

    Related Posts

    Tech & Innovation

    Creators and communities everywhere take a stand against ICE

    January 27, 2026
    Tech & Innovation

    If you use Google AI for symptoms, know it cites YouTube a lot

    January 26, 2026
    Tech & Innovation

    5 Costco Tech Accessories That Will Level Up Your Car

    January 26, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Cursor snaps up enterprise startup Koala in challenge to GitHub Copilot

    July 18, 2025

    What is Mistral AI? Everything to know about the OpenAI competitor

    July 18, 2025

    Analyst Report: Kinder Morgan Inc

    July 18, 2025
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    Using Gen AI for Early-Stage Market Research

    July 18, 2025

    Cursor snaps up enterprise startup Koala in challenge to GitHub Copilot

    July 18, 2025

    What is Mistral AI? Everything to know about the OpenAI competitor

    July 18, 2025
    news

    Creators and communities everywhere take a stand against ICE

    January 27, 2026

    Market Update: CSX, SLB, WBD

    January 26, 2026

    Dow Rises 313 Points to Begin a Big Week: Stock Market Today

    January 26, 2026

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2020 - 2026 The Finsider . Powered by LINC GLOBAL Inc.
    • Contact us
    • Guest Post Policy
    • Privacy Policy
    • Terms of Service

    Type above and press Enter to search. Press Esc to cancel.