Registry / Categories / Model & Infrastructure

Model & Infrastructure AI tools

A factual map of the category: access modes, pricing routes, capabilities, freshness, and evidence.

47 formal tools
01

Category map

This is not a best list or ranking.
Tools tracked
47
Tools with API access
46
Tools with free plan
6
Enterprise route
14
02

Common taxonomy

Model APIsAgent BuildersCode AssistantsModel HostingRAG & Vector SearchEvaluation & MonitoringAI GatewaysData AnalysisDeveloper AgentsData & ETL for AIAI SearchImage GenerationVideo GenerationVoice & SpeechMusic & AudioWorkflow AutomationDocument AnalysisResearch AssistantsDesign ToolsAPIWeb AppSDKCLIMarketplace AppUsage BasedHybridCredit Based
03

Tools in this category

AI21 Studio

AI21 Studio API platform for agent-building workflows and model API access.

Model & InfrastructureApi PlatformAPIWeb App
pricing model
Usage Based
primary route
Model Api
platforms
Web

Anthropic API

Anthropic API platform for Claude model access, Messages API, Managed Agents, SDKs, and usage-based pricing.

Model & InfrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Model Api
platforms
Web

AssemblyAI

Speech AI API platform for transcription, audio intelligence, and speech models.

model-infrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Api Usage
platforms
Not applicable

AWS Bedrock

AWS platform for building generative AI applications and agents with managed model access, agents, knowledge bases, guardrails, and usage-based pricing.

Model & InfrastructurePlatformWeb AppAPISDK
pricing model
Usage Based
primary route
Model Api
platforms
Web

Baseten

Baseten platform for AI evaluation and monitoring and model gateway management.

Model & InfrastructurePlatformAPISDKCLI
pricing model
Hybrid
primary route
Model Api
platforms
macOS, Windows, Linux

Braintrust

AI observability and evaluation platform for production AI products.

Model & InfrastructurePlatformWeb AppAPISDK
pricing model
Hybrid
primary route
App Subscription
platforms
Web

Cartesia

Real-time voice AI API platform for text-to-speech and speech model workflows.

model-infrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Api Usage
platforms
Not applicable

Cerebras Inference

Cerebras Inference API platform for model hosting and model API access.

Model & InfrastructureApi PlatformAPISDK
pricing model
Hybrid
primary route
Api Usage
platforms
Not applicable

Cohere API

Enterprise model API platform for generation, reranking, embedding, and retrieval workflows.

Model & InfrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Model Api
platforms
Web

Databricks Mosaic AI

Databricks Mosaic AI platform for AI data preparation and RAG and vector search.

Model & InfrastructurePlatformWeb AppAPI
pricing model
Usage Based
primary route
Model Api
platforms
Web

Deepgram

Voice AI API platform for speech recognition, speech synthesis, and audio intelligence.

model-infrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Api Usage
platforms
Not applicable

DeepInfra

Serverless inference platform for model APIs, embeddings, and reranking.

model-infrastructureApi PlatformAPI
pricing model
Usage Based
primary route
Model Api
platforms
Not applicable

DeepSeek API

DeepSeek developer API platform for model access and usage-based pricing.

Model & InfrastructureApi PlatformAPI
pricing model
Usage Based
primary route
Model Api
platforms
Web

Exa

Neural search and content API infrastructure for AI applications.

model-infrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Api Usage
platforms
Not applicable

fal.ai

fal.ai API platform for music and audio generation and voice and speech workflows.

Model & InfrastructureApi PlatformAPISDKCLI
pricing model
Hybrid
primary route
Model Api
platforms
macOS, Windows, Linux

Fireworks AI

Fireworks AI platform for RAG and vector search and voice and speech workflows.

Model & InfrastructurePlatformAPISDKCLI
pricing model
Hybrid
primary route
Model Api
platforms
macOS, Windows, Linux

Gladia

Audio intelligence API for transcription, translation, and speech workflows.

model-infrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Api Usage
platforms
Not applicable

Google AI Studio

Google developer surface for Gemini API prototyping, API keys, model access, SDKs, and Gemini API pricing.

Model & InfrastructurePlatformWeb AppAPISDK
pricing model
Hybrid
primary route
Model Api
platforms
Web

GroqCloud

Groq inference API platform for fast model serving and developer workflows.

Model & InfrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Model Api
platforms
Web

Hugging Face Inference Endpoints

Managed infrastructure for deploying Hugging Face models as dedicated inference endpoints.

Model & InfrastructurePlatformWeb AppAPISDK
pricing model
Usage Based
primary route
Api Usage
platforms
Web

IBM watsonx.ai

IBM watsonx.ai platform for AI evaluation and monitoring and agent-building workflows.

Model & InfrastructurePlatformWeb AppAPI
pricing model
Hybrid
primary route
Model Api
platforms
Web

Langfuse

Open-source LLM engineering platform for observability, prompt management, evaluations, metrics, and experiments.

Model & InfrastructurePlatformWeb AppAPISDK
pricing model
Hybrid
primary route
App Subscription
platforms
Web

LangSmith

LangChain platform for observing, evaluating, and deploying LLM applications and agents.

Model & InfrastructurePlatformWeb AppAPICLI
pricing model
Hybrid
primary route
Team Workspace
platforms
Web

Lepton AI

Lepton AI provides AI tool capabilities.

Model & InfrastructureApi PlatformAPISDKWeb App
pricing model
Usage Based
primary route
Model Api
platforms
Web

LlamaIndex

Document-agent and RAG platform with LlamaParse, LiteParse, workflows, and open-source framework documentation.

Model & InfrastructurePlatformWeb AppSDKAPI
pricing model
Hybrid
primary route
Credits
platforms
Web

Microsoft Foundry

Azure platform for building, optimizing, deploying, and governing AI apps and agents.

Model & InfrastructurePlatformWeb AppAPISDK
pricing model
Usage Based
primary route
Api Usage
platforms
Web

Mistral La Plateforme

Mistral AI developer platform for model API access and deployment workflows.

Model & InfrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Model Api
platforms
Web

Modal

Serverless compute platform for AI, batch, and inference workloads.

model-infrastructurePlatformAPISDKCLI
pricing model
Usage Based
primary route
Api Usage
platforms
Source detail

NVIDIA NIM

NVIDIA NIM platform for model hosting and model API access.

Model & InfrastructurePlatformAPISDK
pricing model
Hybrid
primary route
Model Api
platforms
Web, Linux

OpenAI API Platform

OpenAI API Platform platform for model API access.

Model & InfrastructureApi PlatformAPI
pricing model
Usage Based
primary route
Model Api
platforms
Web

OpenRouter

API platform for accessing AI models through a unified routing layer.

Model & InfrastructureApi PlatformAPISDKWeb App
pricing model
Credit Based
primary route
Credits
platforms
Web

Perplexity Sonar API

API access to Perplexity web-grounded AI responses, search, research, and embeddings.

Model & InfrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Api Usage
platforms
Web

Pinecone

Managed vector database and inference platform for retrieval and RAG workflows.

Model & InfrastructurePlatformWeb AppAPISDK
pricing model
Hybrid
primary route
Api Usage
platforms
Web

Portkey AI

AI gateway platform for model routing, observability, access, and guardrails.

model-infrastructurePlatformAPISDK
pricing model
Hybrid
primary route
Api Usage
platforms
Source detail

Qdrant

Vector database and cloud service for retrieval, semantic search, and AI applications.

Model & InfrastructureSoftwareAPISDKCLI
pricing model
Hybrid
primary route
Open Source
platforms
Linux, macOS, Windows

Replicate

Cloud API platform for running, fine-tuning, and deploying AI models.

Model & InfrastructurePlatformAPISDKWeb App
pricing model
Usage Based
primary route
Model Api
platforms
Web

RunPod

GPU cloud and serverless infrastructure for AI workloads.

model-infrastructurePlatformAPISDKWeb App
pricing model
Usage Based
primary route
Api Usage
platforms
Source detail

SambaNova Cloud

Cloud API access to SambaNova-hosted open-source and reasoning models.

Model & InfrastructureApi PlatformWeb AppAPISDK
pricing model
Hybrid
primary route
Model Api
platforms
Web

Snowflake Cortex AI

Snowflake Cortex AI platform for RAG and vector search and data analysis workflows.

Model & InfrastructurePlatformWeb AppAPI
pricing model
Usage Based
primary route
Model Api
platforms
Web

Stability AI API

Developer API platform for Stability AI image, 3D, and audio generation services.

Model & InfrastructureApi PlatformAPI
pricing model
Hybrid
primary route
Credits
platforms
Web

Tavily

Search and extraction API for AI agents and retrieval workflows.

model-infrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Api Usage
platforms
Not applicable

Together AI

AI cloud platform for model APIs, inference, fine-tuning, evaluations, GPU clusters, and sandboxes.

Model & InfrastructurePlatformAPISDKWeb App
pricing model
Usage Based
primary route
Model Api
platforms
Web

Unstructured

Platform and API for preparing unstructured documents for AI and RAG workflows.

Model & InfrastructurePlatformWeb AppAPI
pricing model
Usage Based
primary route
Api Usage
platforms
Web

Vertex AI

Google Cloud platform for model building, model APIs, hosting, and generative AI workflows.

Model & InfrastructurePlatformWeb AppAPISDK
pricing model
Usage Based
primary route
Model Api
platforms
Web

Weaviate

Open-source AI database and vector-search platform with cloud, marketplace, and partner deployment routes.

Model & InfrastructurePlatformMarketplace AppWeb App
pricing model
Hybrid
primary route
Marketplace
platforms
Web

xAI API

xAI API platform for RAG and vector search and AI search and answer workflows.

Model & InfrastructureApi PlatformAPISDK
pricing model
Hybrid
primary route
Model Api
platforms
Not applicable

You.com

Web search, contents, and research APIs for AI systems.

Model & InfrastructureApi PlatformAPISDK
pricing model
Usage Based
primary route
Api Usage
platforms
Not applicable