Capability: Model Hosting
Platforms for hosting, deploying, or running model inference services.
Related input types
Related output types
Tools with Model Hosting
AWS Bedrock
AWS platform for building generative AI applications and agents with managed model access, agents, knowledge bases, guardrails, and usage-based pricing.
Baseten
Baseten platform for AI evaluation and monitoring and model gateway management.
Cerebras Inference
Cerebras Inference API platform for model hosting and model API access.
Databricks Mosaic AI
Databricks Mosaic AI platform for AI data preparation and RAG and vector search.
fal.ai
fal.ai API platform for music and audio generation and voice and speech workflows.
Fireworks AI
Fireworks AI platform for RAG and vector search and voice and speech workflows.
GroqCloud
Groq inference API platform for fast model serving and developer workflows.
Hugging Face Inference Endpoints
Managed infrastructure for deploying Hugging Face models as dedicated inference endpoints.
IBM watsonx.ai
IBM watsonx.ai platform for AI evaluation and monitoring and agent-building workflows.
Lepton AI
Lepton AI provides AI tool capabilities.
Microsoft Foundry
Azure platform for building, optimizing, deploying, and governing AI apps and agents.
Mistral La Plateforme
Mistral AI developer platform for model API access and deployment workflows.
Modal
Serverless compute platform for AI, batch, and inference workloads.
NVIDIA NIM
NVIDIA NIM platform for model hosting and model API access.
Replicate
Cloud API platform for running, fine-tuning, and deploying AI models.
RunPod
GPU cloud and serverless infrastructure for AI workloads.
SambaNova Cloud
Cloud API access to SambaNova-hosted open-source and reasoning models.
Together AI
AI cloud platform for model APIs, inference, fine-tuning, evaluations, GPU clusters, and sandboxes.
Vertex AI
Google Cloud platform for model building, model APIs, hosting, and generative AI workflows.
Weaviate
Open-source AI database and vector-search platform with cloud, marketplace, and partner deployment routes.