Autoscale

Spheron offers Autoscale compute that dynamically adjusts the allocation of computational resources based on real-time changes in application demands. This automated process allows the system to seamlessly scale resources up or down to match varying workloads, ensuring optimal performance and efficient resource utilization. By continuously monitoring key performance metrics, Autoscale ensures that the right amount of resources is provisioned at any given time, minimizing underutilization during low demand and preventing performance bottlenecks during peak usage.

How to use Autoscale Compute?

With Docker

To use Autoscale compute with a custom Docker image on Spheron:

Click "New Cluster" in the top-right corner.
Select "Import from Docker Hub".
Enter a name for your cluster and the name of your Docker image.
Add the image tag and click "Next".
Select "Autoscale" under Compute Type.
Select a region. We recommend deploying first in a region close to you, which can improve performance and reduce latency. If you encounter any issues, consider switching to another region. Click here to know more.
Select the instance plan that suits your needs. Use the "Create Custom Plan" toggle to create custom plans for your instance.
Configure the Storage (SSD) plan for your instance. Use the "Add Persistent Storage" toggle to add persistent storage for your instance.
Create a new Port Policy Mapping. Add the container port and select the exposed port you want to map it to. Click here to know more.
Add environment variables, if any.
Add a Secret Environment Variable if the value is a secret key. It will not be saved in the database. Click here to know more.
Under Auto Scaling, choose an instance plan that best suits your needs.
Set the maximum number of replicas allowed for your application. Autoscale will not exceed this limit.
Set the minimum number of replicas required for your application. Autoscale will ensure that your application always has at least this many replicas running.
You can create custom auto scaling plans if required. Click here to know more.
You can add advanced configuration if required. Click here to know more.
You can add a health check if required. Click here to know more.
Click "Deploy" to initiate deployment.
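
The minimum and maximum replica settings in the steps above act as bounds on whatever replica count Autoscale would otherwise choose. As a rough illustration only, this can be sketched in Python. The function name `clamp_replicas` is hypothetical and is not part of any Spheron API; it simply shows how a desired count is kept inside the configured range.

```python
def clamp_replicas(desired: int, min_replicas: int, max_replicas: int) -> int:
    """Keep a desired replica count inside the configured bounds.

    Hypothetical helper for illustration; not a Spheron API.
    """
    if min_replicas > max_replicas:
        raise ValueError("min_replicas must not exceed max_replicas")
    # Never go below the minimum, never go above the maximum.
    return max(min_replicas, min(desired, max_replicas))


if __name__ == "__main__":
    # With bounds of 1 and 5, a request for 10 replicas is capped at 5.
    print(clamp_replicas(10, min_replicas=1, max_replicas=5))
```

In this sketch, a minimum greater than the maximum is rejected up front, since no replica count could satisfy both settings.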

With Marketplace App

To use Autoscale compute with a marketplace app on Spheron:

Click "New Cluster" in the top-right corner.
Select "Start from marketplace app".
Pick your desired template from the marketplace.
Select "Autoscale" under Compute Type.
Select a region. We recommend deploying first in a region close to you, which can improve performance and reduce latency. If you encounter any issues, consider switching to another region. Click here to know more.
Select the instance plan that suits your needs. Use the "Create Custom Plan" toggle to create custom plans for your instance.
Configure the Storage (SSD) plan for your instance. Use the "Add Persistent Storage" toggle to add persistent storage for your instance.
Under Auto Scaling, choose an instance plan that best suits your needs.
Set the maximum number of replicas allowed for your application. Autoscale will not exceed this limit.
Set the minimum number of replicas required for your application. Autoscale will ensure that your application always has at least this many replicas running.
You can create custom auto scaling plans if required. Click here to know more.
You can add advanced configuration if required. Click here to know more.
Click "Deploy" to initiate deployment.

How to update the configuration of your instance?

To update the configuration of your instance:

Select your instance and go to the Settings tab, below the cluster information card.
Click "Update Instance" under the Instance Plan section.
Update the instance plan to suit your needs. Use the "Create Custom Plan" toggle to create custom plans for your instance.
Configure the Storage (SSD) plan for your instance. Use the "Add Persistent Storage" toggle to add persistent storage for your instance.
You can update the advanced configuration if required. Click here to know more.
Under the Auto Scaling tab, you can update the instance plan to suit your needs.
You can update the maximum and minimum number of replicas allowed for your application.
You can switch to custom auto scaling plans if required. Click here to know more.
Click "Save" to update the deployment.

Custom Plans in Auto Scaling

You can use the "Custom Plan / Specification" toggle to create custom plans.

Max Replica: Set the maximum number of replicas allowed for your application. Autoscale will not exceed this limit.
Min Replica: Set the minimum number of replicas required for your application. Autoscale will ensure that your application always has at least this many replicas running.
Time Window: Define the time window within which Autoscale will consider scaling actions. For example, if you set it to 5 minutes, Autoscale will evaluate performance metrics over the past 5 minutes to decide whether to scale.
Cooldown: Set a cooldown period between scaling actions. During this period, Autoscale will not perform any additional scaling actions to allow the system to stabilize.
No. of Windows (Before Scaling Evaluation): Specify how many time windows Autoscale considers before evaluating whether scaling is needed.
Threshold % (No. of checks in Alarm): Set the percentage of checks that must be in an alarm state to trigger a scaling action.
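
To make the interaction between these settings more concrete, here is a minimal Python sketch of one way such an evaluation could work. It is an assumption-laden illustration, not Spheron's actual implementation: the `CustomPlan` class, the `should_scale` function, and all field names are hypothetical, and each entry in `alarm_checks` is assumed to summarize one time window as either in alarm (`True`) or not (`False`).

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class CustomPlan:
    """Hypothetical container mirroring the settings listed above."""
    time_window_s: int    # length of one time window, in seconds
    cooldown_s: int       # pause after a scaling action, in seconds
    num_windows: int      # windows considered before evaluating
    threshold_pct: float  # % of checks that must be in alarm


def should_scale(plan: CustomPlan,
                 alarm_checks: Sequence[bool],
                 now_s: float,
                 last_scaled_s: Optional[float]) -> bool:
    """Return True if a scaling action would be triggered (assumed logic)."""
    if plan.num_windows <= 0:
        raise ValueError("num_windows must be positive")
    # Cooldown: no further action until the system has had time to stabilize.
    if last_scaled_s is not None and now_s - last_scaled_s < plan.cooldown_s:
        return False
    # Not enough windows observed yet to evaluate.
    if len(alarm_checks) < plan.num_windows:
        return False
    # Look only at the most recent windows and compute the share in alarm.
    recent = alarm_checks[-plan.num_windows:]
    alarm_pct = 100.0 * sum(recent) / len(recent)
    return alarm_pct >= plan.threshold_pct


if __name__ == "__main__":
    plan = CustomPlan(time_window_s=300, cooldown_s=600,
                      num_windows=4, threshold_pct=75.0)
    # 3 of the last 4 windows in alarm, and no recent scaling action.
    print(should_scale(plan, [True, True, True, False], 1000.0, None))
```

The sketch treats the cooldown as taking priority over everything else, so a scaling action is never considered while the previous one is still settling.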

Scale Up Policy

CPU: Define the CPU utilization threshold that, when exceeded, triggers a scaling action to increase the number of replicas. You can add the number of steps to gradually scale up your application based on CPU utilization.
RAM: Specify the RAM utilization threshold that, when exceeded, triggers a scaling action to increase the number of replicas. You can add the number of steps to gradually scale up your application based on RAM utilization.

Scale Down Policy

CPU: Define the CPU utilization threshold that triggers a scaling action to decrease the number of replicas when utilization falls below it. You can add the number of steps to gradually scale down your application based on CPU utilization.
RAM: Specify the RAM utilization threshold that triggers a scaling action to decrease the number of replicas when utilization falls below it. You can add the number of steps to gradually scale down your application based on RAM utilization.
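
The scale up and scale down policies can be pictured together as a single decision. The Python sketch below is illustrative only and rests on several assumptions that this page does not confirm: "steps" is read as the number of replicas added or removed per scaling action, scale up fires when either metric exceeds its threshold, and scale down fires only when both metrics are below theirs. The names `Rule`, `Policy`, and `replica_delta` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Rule:
    threshold_pct: float  # utilization threshold, in percent
    steps: int            # replicas to add or remove (assumed meaning)


@dataclass
class Policy:
    cpu: Rule
    ram: Rule


def replica_delta(cpu_pct: float, ram_pct: float,
                  up: Policy, down: Policy) -> int:
    """Return the change in replica count under the assumed rules."""
    # Scale up if either metric exceeds its threshold (assumption);
    # when both do, take the larger step.
    up_steps = [rule.steps
                for rule, value in ((up.cpu, cpu_pct), (up.ram, ram_pct))
                if value > rule.threshold_pct]
    if up_steps:
        return max(up_steps)
    # Scale down only if both metrics are below their thresholds
    # (a conservative assumption); take the smaller step.
    if cpu_pct < down.cpu.threshold_pct and ram_pct < down.ram.threshold_pct:
        return -min(down.cpu.steps, down.ram.steps)
    return 0


if __name__ == "__main__":
    up = Policy(cpu=Rule(80.0, 2), ram=Rule(75.0, 1))
    down = Policy(cpu=Rule(30.0, 1), ram=Rule(40.0, 1))
    print(replica_delta(90.0, 50.0, up, down))  # CPU above its scale-up threshold
```

Whatever delta such a policy produces would still be subject to the Max Replica and Min Replica limits described earlier.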
