标签:LLM judges
AutoArena
分类:
AI开发工具
AI测试工具
大语言模型
AIOpensourcemodels
Open-source tool for automated head-to-head evaluation of GenAI systems using LLM judges.