PhD Students Become AI Industry Judges: LMSYS Org's Chatbot Arena Revolutionizes Model Evaluation

Maria Lourdes, 1h ago

PhD student Lianmin Zheng and his LMSYS Org teammates have gone from academic researchers to the de facto judges of the AI industry.

Featured on a recent TechCrunch podcast, they recount their improbable rise through innovative benchmarks that now dictate AI model rankings.

From Berkeley Labs to Global Influence

The journey began at UC Berkeley's Sky Computing Lab, where the students sought better ways to evaluate large language models.

In March 2023, they unveiled Vicuna, an open-source chatbot with 13 billion parameters that reached over 90% of ChatGPT's quality in an evaluation judged by GPT-4.

This success led to Chatbot Arena, a crowdsourced platform that pits anonymous AI models against each other in blind pairwise battles, with millions of users voting on the winner.

Impact on AI Development

Chatbot Arena quickly became the gold standard, with top labs like OpenAI and Anthropic tuning models to climb its Elo-based leaderboard.
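For readers unfamiliar with how an Elo-based leaderboard turns pairwise votes into a ranking, the idea can be sketched in a few lines of Python. This is a minimal illustration of the textbook Elo update, not Chatbot Arena's actual code: the starting rating, the K-factor, the model names, and the sample battles are all assumptions made up for the example.

```python
from collections import defaultdict

K_FACTOR = 32          # assumed update step size, not a value from the article
START_RATING = 1000.0  # assumed rating given to every model before any votes


def expected_score(rating_a, rating_b):
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_ratings(battles, k=K_FACTOR, start=START_RATING):
    """Replay (model_a, model_b, outcome) votes in order and return ratings.

    outcome is "a" if the voter preferred model_a, "b" if the voter
    preferred model_b, and "tie" otherwise.
    """
    ratings = defaultdict(lambda: start)
    outcome_to_score = {"a": 1.0, "b": 0.0, "tie": 0.5}
    for model_a, model_b, outcome in battles:
        score_a = outcome_to_score[outcome]
        expected_a = expected_score(ratings[model_a], ratings[model_b])
        delta = k * (score_a - expected_a)
        # Zero-sum update: whatever A gains, B loses.
        ratings[model_a] += delta
        ratings[model_b] -= delta
    return dict(ratings)


# Hypothetical votes between three made-up models.
battles = [
    ("model-x", "model-y", "a"),
    ("model-y", "model-z", "tie"),
    ("model-x", "model-z", "a"),
]
ratings = update_ratings(battles)
leaderboard = sorted(ratings.items(), key=lambda item: item[1], reverse=True)
for name, rating in leaderboard:
    print(f"{name}: {rating:.1f}")
```

Because each vote shifts the two ratings by equal and opposite amounts, a model climbs fastest by beating opponents rated above it, which is one reason a lab might tune a model specifically for this kind of head-to-head comparison.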

Its real-world preference data offers a more reliable gauge than traditional benchmarks, influencing billions in AI investments.

However, controversies arose, including accusations that some labs gamed the system through targeted optimizations.

Funding and Future Horizons

In May 2025, LM Arena—the rebranded LMSYS project—secured $100 million in seed funding to scale operations.

Looking ahead, the team plans to expand into multimodal evaluations and harder benchmarks like Hard Prompts.

Their story underscores how grassroots innovation by PhD students can reshape an industry dominated by tech giants.

With LMSYS.org at the forefront, the future of AI judging promises greater transparency and competition.

Article Details

Category: Startups, Business

Source Website Secure: No (HTTP)

News Sentiment: Positive

Fact Checked: Legitimate

Article Type: News Report

Published On: 2026-03-18 @ 15:00:00 (1 hour ago)

News Timezone: GMT -5:00

News Source URL: beamstart.com

Language: English

Platforms: Desktop Web, Mobile Web, iOS App, Android App

Copyright Owner: © TechCrunch

News ID: 30631740
