标签:LLM evaluation

Algomax

分类: AI检测器 AI接口 AI开发工具 AI测试工具

Algomax streamlines LLM & RAG model evaluation and enhances prompt development.

Ottic

分类: AI工作流 AI接口 AI开发工具 AI测试工具 AI监控 大语言模型

Platform to evaluate and test LLM-powered applications for faster and reliable product releases.

Helicone AI

分类: AI接口 AI开发工具 AI监控 大语言模型

Open-source LLM observability platform for monitoring, debugging, and improving AI apps.

EvalMy.AI

分类: AI接口 AI开发工具 AI测试工具 大语言模型

Automated AI answer verification service using C3-score for accuracy and efficiency.

Scale

分类: AI开发工具 AI代理 AI模型 大语言模型 AI研究工具

Scale AI provides high-quality training data and platforms for AI development and evaluation.

Future AGI

分类: AI开发工具 AI测试工具 AI代理 AI模型 大语言模型

AI evaluation and optimization platform for automated quality assessment and performance enhancement.

EvalsOne

分类: AI开发工具 AI测试工具 AI代理 AI生产力工具 大语言模型

A platform for evaluating and optimizing generative AI applications.

Confident AI

分类: AI开发工具 AI测试工具 AI监控 大语言模型 AIOpensourcemodels

All-in-one LLM evaluation platform for testing, benchmarking, and improving LLM application performance.

Maxim AI

分类: AI开发工具 AI测试工具 AI代理 AI监控 大语言模型

End-to-end AI evaluation and observability platform for testing and deploying AI applications.

Airtrain.ai LLM Playground

分类: AI开发工具 大语言模型 AI数据分析

A platform for exploring, curating, and evaluating unstructured datasets and LLMs.

AutoArena

分类: AI开发工具 AI测试工具 大语言模型 AIOpensourcemodels

Open-source tool for automated head-to-head evaluation of GenAI systems using LLM judges.

You Rate AI

分类: AI评论 AI模型 AI工具目录 大语言模型

A platform for rating AI (LLMs) services based on real-world user experiences.