Capability: Evaluation & Monitoring
Tools for evaluating, tracing, observing, monitoring, and improving AI or LLM application quality.
Tools with Evaluation & Monitoring
Augment Code
Platform for AI evaluation, monitoring, and workflow automation.
AWS Bedrock
AWS platform for building generative AI applications and agents with managed model access, agents, knowledge bases, guardrails, and usage-based pricing.
Baseten
Platform for AI evaluation, monitoring, and model gateway management.
Braintrust
AI observability and evaluation platform for production AI products.
Continue
AI checks for pull request quality control.
CrewAI
Multi-agent platform for building, operating, tracing, testing, and scaling AI agent workflows.
Databricks Mosaic AI
Platform for AI data preparation, RAG, and vector search.
Dify
Open-source platform for building agentic workflows, RAG pipelines, AI apps, MCP servers, and APIs.
DSPy
Open-source project for AI evaluation, monitoring, and agent-building workflows.
Flowise
Open-source visual builder for AI agents, chatflows, agentflows, RAG workflows, APIs, SDKs, and embedded chatbots.
Hugging Face Inference Endpoints
Managed infrastructure for deploying Hugging Face models as dedicated inference endpoints.
IBM watsonx.ai
Platform for AI evaluation, monitoring, and agent-building workflows.
LangChain
Agent engineering platform and open-source frameworks for building, observing, evaluating, and deploying agents.
Langfuse
Open-source LLM engineering platform for observability, prompt management, evaluations, metrics, and experiments.
LangSmith
LangChain platform for observing, evaluating, and deploying LLM applications and agents.
Microsoft Foundry
Azure platform for building, optimizing, deploying, and governing AI apps and agents.
Pinecone
Managed vector database and inference platform for retrieval and RAG workflows.
Together AI
AI cloud platform for model APIs, inference, fine-tuning, evaluations, GPU clusters, and sandboxes.
Vertex AI
Google Cloud platform for model building, model APIs, hosting, and generative AI workflows.