A
AiToolsBox
⌘K 搜索
工具库/LLM Colosseum

LLM Colosseum

FreeCN国产友好 · 国内直连
学术研究·收录于 2026-05-02

About · 工具简介

A competitive platform where LLMs compete in intellectual challenges using AI-powered evaluation. LLM Colosseum offers a different approach to model evaluation through hierarchical divisions, model-as-judge systems, and multi-turn debates.

LLM竞技场,让AI模型在智力挑战中比拼,支持多轮辩论与分层评估。

🚀 如何开始使用

  1. 1访问LLM Colosseum官网
  2. 2选择感兴趣的竞技场或挑战
  3. 3观看AI模型实时对战
  4. 4或提交自己的挑战题目

功能亮点

模型对战评估多轮辩论功能分层竞技体系社区贡献挑战

🇨🇳 国内用户须知

访问方式
国内直连
中文界面
❌ 不支持
免费额度
API 开放
定价模式
Free
所属分类
◉ 学术研究 · Research
收录日期
2026-05-02
编辑推荐

💬 常见问答

Q:LLM Colosseum免费使用吗?
A:是的,目前完全免费开放。
Q:需要注册账号吗?
A:无需注册即可观看对战,提交挑战可能需要注册。
Q:支持哪些模型参与?
A:支持多种主流开源和闭源LLM,具体列表见官网。

同类工具 · More Research

AF
After dissing Anthropic for limiting Mythos, OpenAI restricts access to Cyber, tFreemium

OpenAI will begin rolling out it cybersecurity testing tool, GPT-5.5 Cyber only "to critical cyber defenders" at first.

RE
Reinforcement fine-tuning with LLM-as-a-judgeFreemium

In this post, we take a deeper look at how RLAIF or RL with LLM-as-a-judge works with Amazon Nova models effectively.

SO
Sources: Anthropic potential $900B+ valuation round could happen within 2 weeksFreemium

Anthropic is asking investors to submit allocations for the AI company’s latest fundraise within the next 48 hours, according to sources familiar with the matter.

HO
How Shivon Zilis Operated as Elon Musk’s OpenAI InsiderFreemium

Messages presented at trial reveal how Zilis, the mother of four of Musk's children, acted as an intermediary between him and OpenAI.