Model & Infrastructure AI tools
A factual map of the category: access modes, pricing routes, capabilities, freshness, and evidence.
Tools in this category
AI21 Studio
AI21 Studio API platform for agent-building workflows and model API access.
Anthropic API
Anthropic API platform for Claude model access, Messages API, Managed Agents, SDKs, and usage-based pricing.
AssemblyAI
Speech AI API platform for transcription, audio intelligence, and speech models.
AWS Bedrock
AWS platform for building generative AI applications and agents with managed model access, agents, knowledge bases, guardrails, and usage-based pricing.
Baseten
Baseten platform for AI evaluation and monitoring, as well as model gateway management.
Braintrust
AI observability and evaluation platform for production AI products.
Cartesia
Real-time voice AI API platform for text-to-speech and speech model workflows.
Cerebras Inference
Cerebras Inference API platform for model hosting and model API access.
Cohere API
Enterprise model API platform for generation, reranking, embedding, and retrieval workflows.
Databricks Mosaic AI
Databricks Mosaic AI platform for AI data preparation, RAG, and vector search.
Deepgram
Voice AI API platform for speech recognition, speech synthesis, and audio intelligence.
DeepInfra
Serverless inference platform for model APIs, embeddings, and reranking.
DeepSeek API
DeepSeek developer API platform for model access and usage-based pricing.
Exa
Neural search and content API infrastructure for AI applications.
fal.ai
fal.ai API platform for music and audio generation, as well as voice and speech workflows.
Fireworks AI
Fireworks AI platform for RAG and vector search, as well as voice and speech workflows.
Gladia
Audio intelligence API for transcription, translation, and speech workflows.
Google AI Studio
Google developer surface for Gemini API prototyping, API keys, model access, SDKs, and Gemini API pricing.
GroqCloud
Groq inference API platform for fast model serving and developer workflows.
Hugging Face Inference Endpoints
Managed infrastructure for deploying Hugging Face models as dedicated inference endpoints.
IBM watsonx.ai
IBM watsonx.ai platform for AI evaluation, monitoring, and agent-building workflows.
Langfuse
Open-source LLM engineering platform for observability, prompt management, evaluations, metrics, and experiments.
LangSmith
LangChain platform for observing, evaluating, and deploying LLM applications and agents.
Lepton AI
Lepton AI cloud platform for running and deploying AI models.
LlamaIndex
Document-agent and RAG platform with LlamaParse, LiteParse, workflows, and open-source framework documentation.
Microsoft Foundry
Azure platform for building, optimizing, deploying, and governing AI apps and agents.
Mistral La Plateforme
Mistral AI developer platform for model API access and deployment workflows.
Modal
Serverless compute platform for AI, batch, and inference workloads.
NVIDIA NIM
NVIDIA NIM platform for model hosting and model API access.
OpenAI API Platform
OpenAI developer platform for model API access.
OpenRouter
API platform for accessing AI models through a unified routing layer.
Perplexity Sonar API
API access to Perplexity web-grounded AI responses, search, research, and embeddings.
Pinecone
Managed vector database and inference platform for retrieval and RAG workflows.
Portkey AI
AI gateway platform for model routing, observability, access, and guardrails.
Qdrant
Vector database and cloud service for retrieval, semantic search, and AI applications.
Replicate
Cloud API platform for running, fine-tuning, and deploying AI models.
RunPod
GPU cloud and serverless infrastructure for AI workloads.
SambaNova Cloud
Cloud API access to SambaNova-hosted open-source and reasoning models.
Snowflake Cortex AI
Snowflake Cortex AI platform for RAG, vector search, and data analysis workflows.
Stability AI API
Developer API platform for Stability AI image, 3D, and audio generation services.
Tavily
Search and extraction API for AI agents and retrieval workflows.
Together AI
AI cloud platform for model APIs, inference, fine-tuning, evaluations, GPU clusters, and sandboxes.
Unstructured
Platform and API for preparing unstructured documents for AI and RAG workflows.
Vertex AI
Google Cloud platform for model building, model APIs, hosting, and generative AI workflows.
Weaviate
Open-source AI database and vector-search platform with cloud, marketplace, and partner deployment routes.
xAI API
xAI API platform for RAG and vector search, as well as AI search and answer workflows.
You.com
Web search, contents, and research APIs for AI systems.