Reserved GPUs
Request bulk GPU allocations, specific locations, or preferential pricing for long-term commitments.
What Are Reserved GPUs
Reserved GPUs are for requests requiring:
- Bulk quantities - Multiple GPUs (8, 16, 32, 64+)
- Specific locations - Regional compliance or data proximity requirements
- Long-term commitments - Multi-months with preferential pricing
- Custom configurations - Specialized hardware or network requirements
How It Works
- Submit Request - Fill out the reservation form with your requirements
- Team Review - Spheron team reviews your request within 24 hours
- Receive Quotes - Multiple providers compete to offer best pricing
- Choose Option - Select the quote that fits your needs
- Book Meeting (Optional) - Schedule consultation for complex requirements
Competitive bidding across providers ensures optimal pricing.
Benefits
Cost Savings:- 30-50% lower than on-demand hourly rates
- Bulk discounts for multiple GPUs
- Long-term commitment pricing advantages
- Reserved capacity ensures GPU access
- No competition with spot market
- Predictable resource allocation
- Multiple quotes from different providers
- Compare pricing and terms
- Choose best value for your requirements
Submitting a Request
Visit app.spheron.ai → Reserved GPU to access the request form.

Fill Out Request Form
GPU Model
Select GPU type: H100, H200, A100, B200, RTX 4090, RTX 5090, L40S, A40, L4, V100
- H100/H200: Highest performance, large-scale training
- A100/B200: Production-grade training and inference
- RTX 4090/5090: Development and medium workloads
- L40S/A40: Balanced price-performance
- L4/V100: Cost-effective for inference
Unsure? Book a consultation with the team.
Quantity
Enter number of GPUs needed (8 to 64+)
Duration
Specify reservation length:
- Enter value (e.g., 6)
- Select unit (Months or Years)
- Choose start time (ASAP or Within 12 Months)
Longer commitments typically receive better pricing.
Location
Select region: North America, Europe, Asia Pacific, South America, Middle East, Africa, or Any Location
- Any Location: Maximum provider competition, best pricing
- Specific Region: Required for data compliance or proximity
Start Date
Calendar selection for specific deployment timing (optional)
Additional Requirements
Specify custom needs (optional):
- Network requirements (e.g., InfiniBand, NVLink)
- Compliance needs (e.g., GDPR, HIPAA)
- Storage requirements
- Special configurations
Contact Information
Provide your contact details so the team can deliver quotes:
- Name: your full name
- Email: where quotes and follow-ups will be sent
- Phone (required): include country code (e.g.
+1 555-123-4567). Phone is validated and mandatory; the form will not advance to the review step without a valid number.
Review & Submit
Review all details before submission. Edit any field if needed.
Click Submit Request to send to team.
Receive Quotes
Within 24 hours:
- Multiple provider quotes via email
- Pricing, hardware specs, and availability
- Terms and conditions
No obligation to accept. Compare and choose best option.
Support & Consultation
For complex requirements, schedule a 30-minute consultation with the Spheron team to discuss GPU selection, quantity, and received quotes. For general questions, submit a request via the platform; responses are typically within 24 hours.
Common Use Cases
LLM Training - Large language models requiring days/weeks of GPU time
Research Projects - Academic and lab projects needing predictable long-term costs
Production Inference - AI services requiring guaranteed GPU availability
Data Processing - Video processing, simulations, large-scale data analysis
Multi-GPU Workloads - Distributed training requiring 8+ GPUs with high-speed interconnects
Best Practices
Optimize Costs:- Request exact quantity needed (can submit additional requests later)
- Choose "Any Location" for competitive bidding unless region-specific required
- Longer commitments (6-12+ months) typically offer better per-month pricing
- Select appropriate GPU tier (L40S vs H100) based on actual workload needs
- Provide detailed requirements in additional notes
- Specify network/storage/compliance needs upfront
- Include realistic timelines
- Book consultation for complex configurations
Frequently Asked Questions
Can I modify my request after submission? Contact the team with your updated requirements and a revised quote will be provided.
What if I need additional GPUs later? Submit a new request. Multiple concurrent reservations are supported.
Am I obligated to accept a quote?
No. Quotes are non-binding offers. Choose only if terms meet your needs.
What are cancellation policies?
Policies vary by provider. Review specific terms included with each quote.
What if my preferred GPU is unavailable?
Providers will suggest equivalent alternatives. The large provider network ensures options.
How much cheaper are reserved vs on-demand?
Typically 30-50% savings depending on duration, quantity, and GPU type.
Can I get a quote without committing?
Yes. Request quotes freely with no commitment. Consultation is also free.
Additional Resources
- Getting Started - Deploy on-demand instances
- Quick Start - Fast deployment guide
- Billing - Credit management and pricing
- API Reference - Programmatic deployments
- General Info - Support and official channels
Visit app.spheron.ai and go to Reserved GPU to submit a request.