Registry / Tools / Hugging Face Inference Endpoints
Hugging Face Inference Endpoints facts
H
Platform
Hugging Face Inference Endpoints
Managed infrastructure for deploying Hugging Face models as dedicated inference endpoints.
last verified: 2026-05-06pricing checked: 2026-05-06
01
Fact summary
- Name
- Hugging Face Inference Endpoints
- Company / maker
- Hugging Face
- Category
- Model & Infrastructure
- Tags
- Model Hosting, Inference, Deployment
- Entity type
- Platform
- Availability
- Public
- Pricing model
- Usage Based
- Pricing unit
- compute time
- Primary pricing route
- Api Usage
- Free plan
- No
- Delivery modes
- Web AppAPISDK
- Supported platforms
- Web
- Data state
- source backed
02
Formal source rows with URL, source type, and freshness.Official links
| Link | URL | source type | checked |
|---|---|---|---|
| Official website | huggingface.co/docs/inference-endpoints/en/index | official-site | 2026-05-06 |
| Pricing page | huggingface.co/docs/inference-endpoints/en/pricing | official-pricing-page | 2026-05-06 |
| Documentation | huggingface.co/docs/inference-endpoints/en/index | official-documentation | 2026-05-06 |
| API docs | huggingface.co/docs/inference-endpoints/en/api_reference | official-documentation | 2026-05-06 |
03
Supported platforms and delivery modes are intentionally separate.Access and delivery
Supported platforms
Web
Delivery modes
Web AppAPISDK
04
Capabilities
Capability tags
Model HostingModel ApisEvaluation Monitoring
Input types
Structured DataFunction CallTextImageAudioVideo
Output types
Structured JsonTextImageAudioVideoEmbeddingsDocument
05
Pricing summary
Pricing summary
Derived from official pricing routes where available- pricingModel
- Usage Based
- pricingUnit
- compute time
- currency
- USD
- hasFreePlan
- No
- pricingLastChecked
- 2026-05-06
06
Each route is a first-class record with source URL and checked date.Pricing routes
Request a quote
routeType: enterprise-sales
The pricing page includes a request-a-quote path and notes quota requests for unavailable instance types.
No numeric price asserted
Dedicated endpoint compute
routeType: api-usage
Dedicated endpoints are priced by selected instance type; the pricing page states hourly rates are shown and actual cost is calculated by the minute while deployed endpoints are initializing or running.
No numeric price asserted / compute time
07
Pricing plans
08
Evidence and sources
| source URL | source type | used for | checked |
|---|---|---|---|
| huggingface.co/docs/inference-endpoints/en/pricing | official-pricing-page | pricing-model, pricing-routes | 2026-05-06 |
| huggingface.co/docs/inference-endpoints/en/api_reference | official-documentation | api-access, delivery-modes, capability-claims | 2026-05-06 |
| huggingface.co/docs/inference-endpoints/index | official-documentation | identity, capabilities, delivery-modes | 2026-05-06 |
09
Freshness and recent changes
Facts verified
2026-05-06
Identity and core facts
Pricing checked
2026-05-06
Pricing routes and plans
Record updated
2026-05-06
Public record freshness
10
One-hop registry pages derived from this record's modeled facts. This section does not infer peer tools.