Reserved GPUs

Request bulk GPU allocations, specific locations, or preferential pricing for long-term commitments.

What Are Reserved GPUs

Reserved GPUs are for requests requiring:

Bulk quantities - Multiple GPUs (8, 16, 32, 64+)
Specific locations - Regional compliance or data proximity requirements
Long-term commitments - Multi-months with preferential pricing
Custom configurations - Specialized hardware or network requirements

How It Works

Submit Request - Fill out the reservation form with your requirements
Team Review - Spheron team reviews your request within 24 hours
Receive Quotes - Multiple providers compete to offer best pricing
Choose Option - Select the quote that fits your needs
Book Meeting (Optional) - Schedule consultation for complex requirements

Competitive bidding across providers ensures optimal pricing.

Benefits

Cost Savings:

30-50% lower than on-demand hourly rates
Bulk discounts for multiple GPUs
Long-term commitment pricing advantages

Guaranteed Availability:

Reserved capacity ensures GPU access
No competition with spot market
Predictable resource allocation

Provider Competition:

Multiple quotes from different providers
Compare pricing and terms
Choose best value for your requirements

Submitting a Request

Visit app.spheron.ai → Reserved GPU to access the request form.

Reserved GPU Request Form

Fill Out Request Form

GPU Model

Select GPU type: H100, H200, A100, B200, RTX 4090, RTX 5090, L40S, A40, L4, V100

H100/H200: Highest performance, large-scale training
A100/B200: Production-grade training and inference
RTX 4090/5090: Development and medium workloads
L40S/A40: Balanced price-performance
L4/V100: Cost-effective for inference

Unsure? Book a consultation with the team.

Quantity

Enter number of GPUs needed (8 to 64+)

Duration

Specify reservation length:

Enter value (e.g., 6)
Select unit (Months or Years)
Choose start time (ASAP or Within 12 Months)

Longer commitments typically receive better pricing.

Location

Select region: North America, Europe, Asia Pacific, South America, Middle East, Africa, or Any Location

Any Location: Maximum provider competition, best pricing
Specific Region: Required for data compliance or proximity

Start Date

Calendar selection for specific deployment timing (optional)

Additional Requirements

Specify custom needs (optional):

Network requirements (e.g., InfiniBand, NVLink)
Compliance needs (e.g., GDPR, HIPAA)
Storage requirements
Special configurations

Contact Information

Provide your contact details so the team can deliver quotes:

Name: your full name
Email: where quotes and follow-ups will be sent
Phone (required): include country code (e.g. +1 555-123-4567). Phone is validated and mandatory; the form will not advance to the review step without a valid number.

Review & Submit

Review all details before submission. Edit any field if needed.

Click Submit Request to send to team.

Receive Quotes

Within 24 hours:

Multiple provider quotes via email
Pricing, hardware specs, and availability
Terms and conditions

No obligation to accept. Compare and choose best option.

Support & Consultation

For complex requirements, schedule a 30-minute consultation with the Spheron team to discuss GPU selection, quantity, and received quotes. For general questions, submit a request via the platform; responses are typically within 24 hours.

Common Use Cases

LLM Training - Large language models requiring days/weeks of GPU time

Research Projects - Academic and lab projects needing predictable long-term costs

Production Inference - AI services requiring guaranteed GPU availability

Data Processing - Video processing, simulations, large-scale data analysis

Multi-GPU Workloads - Distributed training requiring 8+ GPUs with high-speed interconnects

Best Practices

Optimize Costs:

Request exact quantity needed (can submit additional requests later)
Choose "Any Location" for competitive bidding unless region-specific required
Longer commitments (6-12+ months) typically offer better per-month pricing
Select appropriate GPU tier (L40S vs H100) based on actual workload needs

Improve Quote Quality:

Provide detailed requirements in additional notes
Specify network/storage/compliance needs upfront
Include realistic timelines
Book consultation for complex configurations

Frequently Asked Questions

Can I modify my request after submission? Contact the team with your updated requirements and a revised quote will be provided.

What if I need additional GPUs later? Submit a new request. Multiple concurrent reservations are supported.

Am I obligated to accept a quote?
No. Quotes are non-binding offers. Choose only if terms meet your needs.

What are cancellation policies?
Policies vary by provider. Review specific terms included with each quote.

What if my preferred GPU is unavailable?
Providers will suggest equivalent alternatives. The large provider network ensures options.

How much cheaper are reserved vs on-demand?
Typically 30-50% savings depending on duration, quantity, and GPU type.

Can I get a quote without committing?
Yes. Request quotes freely with no commitment. Consultation is also free.

Additional Resources

Getting Started - Deploy on-demand instances
Quick Start - Fast deployment guide
Billing - Credit management and pricing
API Reference - Programmatic deployments
General Info - Support and official channels

Visit app.spheron.ai and go to Reserved GPU to submit a request.