Close Menu
Finsider

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    BKL buys London practice RBS Chartered Accountants

    April 28, 2026

    The Average Millennial 401(k) Balance is Not ‘Superbad’

    April 28, 2026

    Drizzle on top: a new high-end dog food brand is coming for the 1%

    April 28, 2026
    Facebook X (Twitter) Instagram
    Trending
    • BKL buys London practice RBS Chartered Accountants
    • The Average Millennial 401(k) Balance is Not ‘Superbad’
    • Drizzle on top: a new high-end dog food brand is coming for the 1%
    • Nasdaq Notches Another New All-Time High: Stock Market Today
    • I found an app that finally broke my toxic affair with doomscrolling
    • Here’s how long-term investors can benefit from a stock market crash
    • Bed Bath & Beyond is seeing new life, as rare sales growth lifts stock more than 30%
    • Canonical lays out a plan for AI in Ubuntu Linux
    Facebook X (Twitter) Instagram Pinterest Vimeo
    Finsider
    • Markets & Ecomony
    • Tech & Innovation
    • Money & Wealth
    • Business & Startups
    • Visa & Residency
    Finsider
    Home»Tech & Innovation»New study shows AI isn’t ready for office work
    Tech & Innovation

    New study shows AI isn’t ready for office work

    FinsiderBy FinsiderJanuary 24, 2026No Comments2 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    New study shows AI isn’t ready for office work
    Share
    Facebook Twitter LinkedIn Pinterest Email

    It has been nearly two years since Microsoft CEO Satya Nadella predicted that generative AI would take over knowledge work, but if you look around a typical law firm or investment bank today, the human workforce is still very much in charge. Despite all the hype about “reasoning” and “planning,” a new study from training-data company Mercor explains exactly why the robot revolution is stalled: AI just can’t handle the messiness of real work.

    A reality check for the “replacement” theory

    Mercor released a new benchmark called APEX-Agents, and it is brutal. unlike the usual tests that ask AI to write a poem or solve a math problem, this one uses actual queries from lawyers, consultants, and bankers. It asks the models to do complete, multi-step tasks that require jumping between different types of information.

    The results? Even the absolute best models on the market—we are talking about Gemini 3 Flash and GPT-5.2—couldn’t crack a 25% accuracy rate. Gemini led the pack at 24%, with GPT-5.2 right behind it at 23%. Most others were stuck in the teens.

    Why AI is failing the “office test”

    Mercor CEO Brendan Foody points out that the issue isn’t raw intelligence; it’s context. In the real world, answers aren’t served up on a silver platter. A lawyer has to check a Slack thread, read a PDF policy, look at a spreadsheet, and then synthesize all that to answer a question about GDPR compliance.

    Humans do this context-switching naturally. AI, it turns out, is terrible at it. When you force these models to hunt for information across “scattered” sources, they either get confused, give the wrong answer, or just give up entirely.

    The “Unreliable Intern”

    For anyone worried about their job security, this is a bit of a relief. The study suggests that right now, AI functions less like a seasoned professional and more like an unreliable intern who gets things right about a quarter of the time.

    That said, the progress is terrifyingly fast. Foody noted that just a year ago, these models were scoring between 5% and 10%. Now they are hitting 24%. So, while they aren’t ready to take the wheel yet, they are learning to drive much faster than we expected. For now, though, the “knowledge work” revolution is on hold until the bots learn how to multitask p

    Isnt Office Ready Shows Study work
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticleMy sister is buying our parents’ $3 million house, but wants to deduct $100K for renovations. Who’s right?
    Next Article Up 73% year to date, this stock in my SIPP is suddenly on fire!
    Finsider
    • Website

    Related Posts

    Tech & Innovation

    I found an app that finally broke my toxic affair with doomscrolling

    April 28, 2026
    Tech & Innovation

    Canonical lays out a plan for AI in Ubuntu Linux

    April 27, 2026
    Tech & Innovation

    10 Smart Gadgets That Will Transform Your Home Office

    April 27, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    Cursor snaps up enterprise startup Koala in challenge to GitHub Copilot

    July 18, 2025

    What is Mistral AI? Everything to know about the OpenAI competitor

    July 18, 2025

    Analyst Report: Kinder Morgan Inc

    July 18, 2025
    Stay In Touch
    • Facebook
    • YouTube
    • TikTok
    • WhatsApp
    • Twitter
    • Instagram
    Latest Reviews

    Subscribe to Updates

    Get the latest tech news from FooBar about tech, design and biz.

    Most Popular

    Using Gen AI for Early-Stage Market Research

    July 18, 2025

    Cursor snaps up enterprise startup Koala in challenge to GitHub Copilot

    July 18, 2025

    What is Mistral AI? Everything to know about the OpenAI competitor

    July 18, 2025
    news

    BKL buys London practice RBS Chartered Accountants

    April 28, 2026

    The Average Millennial 401(k) Balance is Not ‘Superbad’

    April 28, 2026

    Drizzle on top: a new high-end dog food brand is coming for the 1%

    April 28, 2026

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    © 2020 - 2026 The Finsider . Powered by LINC GLOBAL Inc.
    • Contact us
    • Guest Post Policy
    • Privacy Policy
    • Terms of Service

    Type above and press Enter to search. Press Esc to cancel.