Tag: LLM evaluation

Algomax

Categories: AI Detector, AI API, AI Developer Tools, AI Testing

Algomax streamlines LLM & RAG model evaluation and enhances prompt development.

Ottic

Categories: AI Workflow, AI API, AI Developer Tools, AI Testing, AI Monitor, Large Language Models (LLMs)

Platform to evaluate and test LLM-powered applications for faster, more reliable product releases.

Helicone AI

Categories: AI API, AI Developer Tools, AI Monitor, Large Language Models (LLMs)

Open-source LLM observability platform for monitoring, debugging, and improving AI apps.

EvalMy.AI

Categories: AI API, AI Developer Tools, AI Testing, Large Language Models (LLMs)

Automated AI answer verification service using C3-score for accuracy and efficiency.

Scale

Categories: AI Developer Tools, AI Agent, AI Models, Large Language Models (LLMs), AI Research Tool

Scale AI provides high-quality training data and platforms for AI development and evaluation.

Future AGI

Categories: AI Developer Tools, AI Testing, AI Agent, AI Models, Large Language Models (LLMs)

AI evaluation and optimization platform for automated quality assessment and performance enhancement.

EvalsOne

Categories: AI Developer Tools, AI Testing, AI Agent, AI Productivity Tools, Large Language Models (LLMs)

A platform for evaluating and optimizing generative AI applications.

Confident AI

Categories: AI Developer Tools, AI Testing, AI Monitor, Large Language Models (LLMs), Open Source AI Models

All-in-one LLM evaluation platform for testing, benchmarking, and improving LLM application performance.

Maxim AI

Categories: AI Developer Tools, AI Testing, AI Agent, AI Monitor, Large Language Models (LLMs)

End-to-end AI evaluation and observability platform for testing and deploying AI applications.

Airtrain.ai LLM Playground

Categories: AI Developer Tools, Large Language Models (LLMs), AI For Data Analytics

A platform for exploring, curating, and evaluating unstructured datasets and LLMs.

AutoArena

Categories: AI Developer Tools, AI Testing, Large Language Models (LLMs), Open Source AI Models

Open-source tool for automated head-to-head evaluation of GenAI systems using LLM judges.

You Rate AI

Categories: AI Reviews, AI Models, AI Tools Directory, Large Language Models (LLMs)

A platform for rating AI (LLM) services based on real-world user experiences.