1. Define the service config
Copy the following YAML into asimple-http.yaml
file:
Scaling to Zero
You can setnum_replicas
to 0 in order to enable scale-to-zero. When there are no replicas running, and you try
to make a request to the service, you will get back a response from the load balancer with a message that says ‘No ready replicas’.
If you are enabling scale-to-zero, you should add error-handling logic in your code to handle the case when there are no ready replicas.
As soon as the first request is made when there are zero replicas, a replica will be immediately provisioned. The time it takes for the replica
to be ready depends largely on the boot time of your cloud instances, as well as the time it takes to run your setup/run scripts.
2. Launch the service
Launch the service by simply running:Machine names are required to be unique, because that is how you reference them.
Listing services
You can list all the services you have running using the CLI:Service logs
You can stream the logs for the replicas right from the web dashboard.4. Make a request
When you launch the service, the CLI will print out the endpoint for the service. Alternatively, you can find it on the web dashboard as well. It will look like:https://services.komodo.io/service-id
You can use this endpoint as you would any other URL. This points to our load balancer, which will redirect you to one of the replicas,
so if you’re using curl, make sure to specify the -L
flag to follow redirects:
3. Terminate the service
When interacting with services using the CLI, you reference them using their unique name rather than their ID.