标签:LLM judges

AutoArena

分类: AI开发工具 AI测试工具 大语言模型 AIOpensourcemodels

Open-source tool for automated head-to-head evaluation of GenAI systems using LLM judges.