AI's Unsung Hero: Data Labeling and Expert Evals

June 27, 2025 • 46 min

🎧 Listen Now

🤖 AI Summary

Overview

This episode explores the evolution of data labeling and evaluation in AI, tracing its journey from supervised learning to reinforcement learning and the rise of foundation models. Manu Sharma, CEO of Labelbox, discusses how the company adapted to these shifts, the increasing role of human expertise in fine-tuning AI, and the broader implications for the AGI race. The conversation also delves into the challenges and opportunities of transitioning from traditional machine learning to generative AI, and the critical role of data and talent in advancing AI capabilities.

Notable Quotes

- The world is shifting from building AI models to renting AI intelligence. Enterprises are no longer building their own models; they're adding on top of base intelligence to make it work for their company. – Manu Sharma, on the changing AI landscape.

- It's not simply about getting the answer right; it's about how great the answer is. – Manu Sharma, on the evolution of reinforcement learning and quality assessment in AI.

- To produce state-of-the-art data, you really have to tap into professions and people who are at the top of their game. – Manu Sharma, on the growing need for expert-driven data labeling.

🧠 The Shift from Supervised to Reinforcement Learning

- Manu Sharma highlights the transition from supervised learning, where humans labeled data for models to predict, to reinforcement learning with human feedback (RLHF), where models learn from preferences and evaluations.

- RLHF simplifies human involvement by capturing preferences rather than requiring humans to solve problems from scratch.

- The current focus is on teaching models to assess the quality of their outputs, a process Sharma likens to meta-learning.

🚀 Labelbox’s Evolution and Adaptation

- Labelbox began by solving the problem of collaborative data labeling for computer vision, launching its first prototype on Reddit.

- The company transitioned from traditional machine learning to generative AI by expanding its tools to support text, voice, and reasoning models.

- Sharma describes the challenge of retooling Labelbox’s DNA from a software-focused company to one that also provides expert-driven data services.

👩‍🔬 The Role of Experts in AI Training

- The demand for high-quality, domain-specific data has grown, requiring experts like software engineers, mathematicians, and healthcare professionals to label and evaluate data.

- Sharma emphasizes that the complexity of tasks has increased, moving from simple labeling to creating evaluation rubrics and long-horizon task assessments.

- This shift reflects the need for boutique, small datasets of extremely high quality, rather than large volumes of generic data.

📊 Meta’s Acquisition of Scale AI and Industry Implications

- The acquisition underscores the critical importance of data and talent in the AGI race. Sharma notes that Meta’s move likely focuses on acquiring talent to drive innovation.

- The industry is increasingly focused on post-training data, with reinforcement learning becoming a significant part of compute budgets.

- Sharma predicts that future breakthroughs will rely on a fusion of AI, software, and human expertise to produce cutting-edge datasets.

🌍 The Future of AI and Data Labeling

- As AI systems tackle more complex workflows, they require data that reflects real-world expertise.

- Sharma envisions a future where experts across industries contribute to training AI, from polymer scientists to customer service professionals.

- The ultimate goal is to align AI systems with human judgment and creativity, paving the way for advanced applications and AGI.

AI-generated content may not be accurate or complete and should not be relied upon as a sole source of truth.

📋 Episode Description

Labelbox CEO Manu Sharma joins a16z Infra partner Matt Bornstein to explore the evolution of data labeling and evaluation in AI — from early supervised learning to today’s sophisticated reinforcement learning loops.

Manu recounts Labelbox’s origins in computer vision, and then how the shift to foundation models and generative AI changed the game. The value moved from pre-training to post-training and, today, models are trained not just to answer questions, but to assess the quality of their own responses. Labelbox has responded by building a global network of “aligners” — top professionals from fields like coding, healthcare, and customer service, who label and evaluate data used to fine-tune AI systems.

The conversation also touches on Meta’s acquisition of Scale AI, underscoring how critical data and talent have become in the AGI race.

Here's a sample of Manu explaining how Labelbox was able to transition from one era of AI to another:

It took us some time to really understand like that the world is shifting from building AI models to renting AI intelligence. A vast number of enterprises around the world are no longer building their own models; they're actually renting base intelligence and adding on top of it to make that work for their company. And that was a very big shift.

But then the even bigger opportunity was the hyperscalers and the AI labs that are spending billions of dollars of capital developing these models and data sets. We really ought to go and figure out and innovate for them. For us, it was a big shift from the DNA perspective because Labelbox was built with a hardcore software-tools mindset. Our go-to market, engineering, and product and design teams operated like software companies.

But I think the hardest part for many of us, at that time, was to just make the decision that we're going just go try it and do it. And nothing is better than that: "Let's just go build an MVP and see what happens."

Check out everything a16z is doing with artificial intelligence here, including articles, projects, and more podcasts.