标签:GenAI evaluation
AutoArena
分类:
AI Developer Tools
AI Testing
Large Language Models (LLMs)
Open Source AI Models
Open-source tool for automated head-to-head evaluation of GenAI systems using LLM judges.