标签:GenAI evaluation

AutoArena

分类: AI Developer Tools AI Testing Large Language Models (LLMs) Open Source AI Models

Open-source tool for automated head-to-head evaluation of GenAI systems using LLM judges.