Human Cloud

Mercor

Mercor is a marketplace connecting domain experts to remote, paid AI roles and providing AI labs and enterprises with expert-created frontier datasets, benchmarks, and evaluation environments.

San Francisco, CA, United States
Est. 2022

Solution Highlights

Products

Showcase the products and solutions offered by Mercor

APEX Benchmarks (APEX, APEX-Agents, ACE)

Benchmark family assessing frontier model capability on economically valuable professional tasks (APEX), long-horizon agent tasks (APEX-Agents), and consumer activities (ACE), with supporting blog/paper/data/code/sample tasks.

Benchmarking

Leaderboards

Open Tooling

Best for: ML Evaluator

Expert Marketplace for Remote AI Roles

Marketplace for professionals to find top-tier, remote AI roles matched to their expertise, with listed hourly pay ranges and ongoing work opportunities.

Remote Roles

Hourly Pay

AI Interviewing

Best for: Domain Expert

Frontier Human Data

Large-scale expert data creation to fuel AI breakthroughs, including specialized annotations and datasets across many domains for model training and post-training.

Expert Annotations

Post-training Data

Domain Coverage

Best for: AI Research Lead

RL Environments

Reinforcement learning environments built by creating realistic data-rich worlds, implementing tools/applications for agents, and creating rigorous tasks and verifiers.

Task Verifiers

Tool Simulation

Data-rich Worlds

Best for: RL Engineer

Performance

Tracking the performance of the solution based on what's most important to you
Business Case

$100M+ Revenue Was Unlocked by Compressing Decision Cycles From Days to Hours

Mercor scaled from fewer than a dozen active client projects to managing hundreds of projects while growing rapidly in headcount. The company had no data team and lacked a central analytics platform, collaborative dashboards, or reliable access to key operational metrics. Teams pulled raw data via VPN into AWS and relied on spreadsheets and a few technical people for custom reports. For a business operating on hour-to-hour timelines, these delays risked millions in lost revenue.

Mercor made a single analytics platform the foundation for Ops, Finance, Sourcing, and Sales, connecting data from its warehouse and operational sources like Google Sheets, Airtable, and the Mercor platform. The company rolled out self-serve reporting so non-technical users could build dashboards without needing SQL or Python. Notebook-based AI assistance removed the reporting bottleneck and enabled teams to iterate on metrics and views in real time. Operations used dashboards to monitor project health across hundreds of customer engagements.

Decision cycles were compressed from days to hours, enabling faster action on throughput, efficiency, quality, and revenue metrics. Over the past year, improved execution and velocity expanded capacity to take on more projects, which unlocked over $100M in revenue. Dashboards were created in hours rather than days, and the operations team tracked 60+ metrics per project across hundreds of active projects. Mercor also reported zero enterprise customer churn.

Key Results
  • $100M+ revenue unlocked over the past year
  • Decision cycles reduced from days to hours
  • 60+ metrics tracked per project
Feb 20, 2026
Self Reported
Business Case

Pass@1 Nearly Doubled With 874 Expert-Labeled Tasks and 1 Training Epoch

Mercor needed to prove that a small amount of expert-labeled data could materially improve real-world agent performance on long-horizon, professional tasks. The goal was to drive measurable gains on the APEX-Agents benchmark, which tested day-to-day work across investment banking, management consulting, and corporate law. A key risk in this low-data setting was wasting scarce expert effort on data that would not transfer to the hardest benchmark tasks.

Mercor partnered with Applied Compute to post-train an open-source model using an expert-labeled dev set. Mercor supplied a dev set of 874 tasks split across 50 unique “worlds,” and none of the tasks or worlds appeared in the APEX-Agents benchmark. Applied Compute deployed its proprietary long-horizon RL stack and ran single-epoch training with no SFT warmup, no filtering, and no task or rubric modifications. The team evaluated performance on the full APEX-Agents benchmark (n=480) using Pass@1, Pass@3, and mean criteria passed, starting from a GLM 4.6 baseline.

The post-trained model outperformed the baseline across all metrics using just 874 expert-labeled tasks, with the largest gains in corporate law. With fewer than 1,000 high-quality data points, Pass@1 and mean score nearly doubled on APEX-Agents. On the corporate law evaluations, Pass@1 tripled. The baseline GLM 4.6 model scored 3.8% Pass@1 and 12.1% mean score prior to post-training, and the training trendline remained near-linear, indicating additional data would likely continue yielding gains.
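
The business case does not spell out how Pass@1 and Pass@3 are computed for APEX-Agents. As a rough illustration only, the sketch below uses the widely used unbiased pass@k estimator, 1 - C(n-c, k) / C(n, k), where n is the number of attempts sampled for a task and c is the number that passed. The function names and the sample data are hypothetical and are not taken from Mercor or Applied Compute.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one task: n attempts sampled, c passed."""
    if k > n:
        raise ValueError("k cannot exceed the number of attempts n")
    if n - c < k:
        # Every size-k subset of attempts contains at least one pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average the per-task pass@k over a list of (n, c) pairs."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Hypothetical data (not benchmark results): 4 tasks, 3 attempts each,
# with 0, 1, 0, and 3 passing attempts respectively.
hypothetical_results = [(3, 0), (3, 1), (3, 0), (3, 3)]
pass_1 = benchmark_pass_at_k(hypothetical_results, k=1)
pass_3 = benchmark_pass_at_k(hypothetical_results, k=3)
print(f"Pass@1 = {pass_1:.3f}, Pass@3 = {pass_3:.3f}")
```

Under this reading, Pass@3 is never lower than Pass@1 on the same attempts, because a task only needs one passing attempt among three to count.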

Key Results
  • 874 expert-labeled tasks used for post-training
  • 50 unique “worlds” in the dev set
  • 480 tasks in the APEX-Agents benchmark
Feb 20, 2026
Self Reported
Gabriela Fontoura
Review

I have had the pleasure of working on several Mercor projects, and my experience has been outstanding. The only area for improvement would be the response time regarding evaluations.

Feb 20, 2026
Self Reported
Brian Ackerman
Review

I have worked on multiple AI training platforms, and Mercor stands out. The work is well-organized, communication is clear, and the team is responsive. Highly recommend.

Feb 20, 2026
Self Reported
M. Davis
Review

Just wrapped up my second contract with Mercor. Their professionalism cuts through immediately -- seamless workflows, clear communication, and a genuine respect for expert talent.

Feb 20, 2026
Self Reported
Nurdin Kaparov
Review

My experience with Mercor has been exceptional. I have been participating since July 2024 and the compensation is strong and well above average on an hourly basis.

Feb 20, 2026
Self Reported
Paul W.
Review

Working in the area of AI training can lead you to companies that pay an absolute pittance. Mercor is by far the best of all similar companies I've worked for. Can't rate it highly enough.

Feb 20, 2026
Self Reported
Calvin Beighle
Review

Mercor consistently followed up with our team to make sure we were having a good experience. Their site is easy to use, engineers responsive, and their vetting, extensive. A must have for anyone building a business with engineering load.

Feb 20, 2026
Self Reported
Milton Tembelis
Review

Milton Tembelis • Interventional radiology fellow

After five or six years of training, you get a little sick of it. You’re working nonstop, but financially you’re still barely treading water.

Feb 20, 2026
Self Reported

Qualifications

Certifications, badges, customers, and features that qualify this solution


Badges

Performance across Human Cloud, as measured by company interest, kudos, and business case success.

Top 20
Top 20%
Top 5%

Features

AI Interviewing
Bi-weekly Pay
Comp Benchmarking
Daily Payouts
Evaluation Evals
Expert Marketplace
Human Datasets
Leaderboards
Remote Work
RL Environments

Focus Areas

Specialized areas the solution focuses on. The best solutions specialize in niches across skillsets, functions, industries, regions, and more.

AI
Data

Category

General category of the solution.

Talent Platforms

About Mercor

Mercor is a talent marketplace that connects top-tier experts with remote, paid AI roles and projects, positioning itself as a way for professionals to “shape the future of AI.” The platform offers role-based opportunities across high-skill domains such as medicine, law, finance, consulting, and software engineering, and highlights regular payouts and competitive hourly pay for expert work.

For AI labs and enterprises, Mercor provides “frontier data for frontier AI” by mobilizing subject-matter experts to create specialized datasets, benchmarks, and evaluation environments. The company states it develops benchmarks, evaluation environments, and large-scale human datasets, and offers data, evals, and post-training work designed to drive improvements in advanced reasoning, long-horizon planning, tool use, and safe behavior under uncertainty. Mercor also publishes benchmark families including APEX (AI Productivity Index), APEX-Agents, and ACE (AI Consumer Index), with associated artifacts like papers, datasets, code, and sample tasks. The company positions its work at the cutting edge of AI evaluation and data creation, and claims usage by leading AI labs and major public-company enterprises.

As an employer, Mercor emphasizes high-velocity, in-person collaboration from its San Francisco headquarters, and describes itself as profitable, Series C, and valued at $10 billion. It provides benefits for US full-time employees including equity, food stipend, housing support, relocation assistance, fitness membership, unlimited time off, 401(k), parental leave, and wellness services.

Additional Details

Customer Regions: US
Talent Regions: US
Industries: Artificial Intelligence, Clinical Healthcare, Computer Software, Financial Services, Legal Services, Management Consulting
Languages: en
Network Size: 10,000+
Business Model & Pricing: Marketplace
Human Cloud is a global workforce advisory firm that helps Fortune 500 companies future-proof their workforces through cloud-driven talent solutions. Led by CEO Matthew Mottola and Head of Enterprise Strategy Tony Buffum, the firm has been at the forefront of AI, talent platforms, and enterprise adoption since 2012.

© 2026 Human Cloud. All rights reserved.

AI Content may contain mistakes and is not legal, financial or investment advice.

Built by our incredible talent cloud of independent designers, developers, and content writers