Skip to content

Reserved GPUs

Request bulk GPU allocations, specific locations, or preferential pricing for long-term commitments.

What Are Reserved GPUs

Reserved GPUs are for requests requiring:

  • Bulk quantities - Multiple GPUs (8, 16, 32, 64+)
  • Specific locations - Regional compliance or data proximity requirements
  • Long-term commitments - Multi-months with preferential pricing
  • Custom configurations - Specialized hardware or network requirements

How It Works

  1. Submit Request - Fill out the reservation form with your requirements
  2. Team Review - Spheron team reviews your request within 24 hours
  3. Receive Quotes - Multiple providers compete to offer best pricing
  4. Choose Option - Select the quote that fits your needs
  5. Book Meeting (Optional) - Schedule consultation for complex requirements

Competitive bidding across providers ensures optimal pricing.

Benefits

Cost Savings:
  • 30-50% lower than on-demand hourly rates
  • Bulk discounts for multiple GPUs
  • Long-term commitment pricing advantages
Guaranteed Availability:
  • Reserved capacity ensures GPU access
  • No competition with spot market
  • Predictable resource allocation
Provider Competition:
  • Multiple quotes from different providers
  • Compare pricing and terms
  • Choose best value for your requirements

Submitting a Request

Visit app.spheron.aiReserved GPU to access the request form.

Reserved GPU Request Form

Fill Out Request Form

GPU Model

Select GPU type: H100, H200, A100, B200, RTX 4090, RTX 5090, L40S, A40, L4, V100

  • H100/H200: Highest performance, large-scale training
  • A100/B200: Production-grade training and inference
  • RTX 4090/5090: Development and medium workloads
  • L40S/A40: Balanced price-performance
  • L4/V100: Cost-effective for inference

Unsure? Book a consultation with the team.

Quantity

Enter number of GPUs needed (8 to 64+)

Duration

Specify reservation length:

  • Enter value (e.g., 6)
  • Select unit (Months or Years)
  • Choose start time (ASAP or Within 12 Months)

Longer commitments typically receive better pricing.

Location

Select region: North America, Europe, Asia Pacific, South America, Middle East, Africa, or Any Location

  • Any Location: Maximum provider competition, best pricing
  • Specific Region: Required for data compliance or proximity

Start Date

Calendar selection for specific deployment timing (optional)

Additional Requirements

Specify custom needs (optional):

  • Network requirements (e.g., InfiniBand, NVLink)
  • Compliance needs (e.g., GDPR, HIPAA)
  • Storage requirements
  • Special configurations

Review & Submit

Review all details before submission. Edit any field if needed.

Click Submit Request to send to team.

Receive Quotes

Within 24 hours:

  • Multiple provider quotes via email
  • Pricing, hardware specs, and availability
  • Terms and conditions

No obligation to accept. Compare and choose best option.

Support & Consultation

Need help deciding? The Spheron team provides free consultation.

Book a Meeting: Email Support:
  • Submit questions via platform
  • Response within 24 hours
  • No commitment required

Available 24/7 for guidance and quote requests.

Common Use Cases

LLM Training - Large language models requiring days/weeks of GPU time

Research Projects - Academic and lab projects needing predictable long-term costs

Production Inference - AI services requiring guaranteed GPU availability

Data Processing - Video processing, simulations, large-scale data analysis

Multi-GPU Workloads - Distributed training requiring 8+ GPUs with high-speed interconnects

Best Practices

Optimize Costs:
  • Request exact quantity needed (can submit additional requests later)
  • Choose "Any Location" for competitive bidding unless region-specific required
  • Longer commitments (6-12+ months) typically offer better per-month pricing
  • Select appropriate GPU tier (L40S vs H100) based on actual workload needs
Improve Quote Quality:
  • Provide detailed requirements in additional notes
  • Specify network/storage/compliance needs upfront
  • Include realistic timelines
  • Book consultation for complex configurations

Frequently Asked Questions

Can I modify my request after submission?
No worries, we are flexible and you can share your new requirements with us and we will try to get you the best quote possible.

What if I need additional GPUs later?
Submit new requests anytime. Multiple concurrent reservations are supported. We will try to get you the best quote possible.

Am I obligated to accept a quote?
No. Quotes are non-binding offers. Choose only if terms meet your needs.

What are cancellation policies?
Policies vary by provider. Review specific terms included with each quote.

What if my preferred GPU is unavailable?
Providers will suggest equivalent alternatives. The large provider network ensures options.

How much cheaper are reserved vs on-demand?
Typically 30-50% savings depending on duration, quantity, and GPU type.

Can I get a quote without committing?
Yes. Request quotes freely with no commitment. Consultation is also free.

Additional Resources

Ready to request? Visit app.spheron.ai → Reserved GPU to submit your requirements.