Challenge 2: Model Deployment as an Azure Container Instance

Deploy the model for real-time inference

Once your model has been registered, you can deploy your machine learning model as an Azure Container Instance in the Azure cloud . In this challenge you will learn how to deploy a model so that an application can consume (inference) the model over REST.

Create deployment configuration

The code cell gets a curated environment, which specifies all the dependencies required to host the model (for example, the packages like scikit-learn).

Moreover, you create a deployment configuration, which specifies the amount of compute required to host the model. In this case, the compute will have 1CPU and 1GB memory.

# create environment for the deploy
from azureml.core.environment import Environment
from azureml.core.conda_dependencies import CondaDependencies
from azureml.core.webservice import AciWebservice

# get a curated environment
env = Environment.get(
    workspace=ws, 
    name="AzureML-sklearn-1.0-ubuntu20.04-py38-cpu",
    version=1
)
env.inferencing_stack_version='latest'

# create deployment config i.e. compute resources
aciconfig = AciWebservice.deploy_configuration(
    cpu_cores=1,
    memory_gb=1,
    tags={"data": "MNIST", "method": "sklearn"},
    description="Predict MNIST with sklearn",
)

Deploy model

This next code cell deploys the model to Azure Container Instance.

The deployment takes approximately 3 minutes to complete.

%%time
import uuid
from azureml.core.model import InferenceConfig
from azureml.core.environment import Environment
from azureml.core.model import Model

# get the registered model
model = Model(ws, "sklearn_mnist_model")

# create an inference config i.e. the scoring script and environment
inference_config = InferenceConfig(entry_script="score.py", environment=env)

# deploy the service
service_name = "sklearn-mnist-svc-" + str(uuid.uuid4())[:4]
service = Model.deploy(
    workspace=ws,
    name=service_name,
    models=[model],
    inference_config=inference_config,
    deployment_config=aciconfig,
)

service.wait_for_deployment(show_output=True)

The scoring script file referenced in the code above can be found in the same folder as this notebook, and has two functions:

An init function that executes once when the service starts - in this function you normally get the model from the registry and set global variables
A run(data) function that executes each time a call is made to the service. In this function, you normally format the input data, run a prediction, and output the predicted result.

View endpoint

Once the model has been successfully deployed, you can view the endpoint by navigating to Endpoints in the left-hand menu in Azure Machine Learning studio. You will be able to see the state of the endpoint (healthy/unhealthy), logs, and consume (how applications can consume the model).

Test the model service

You can test the model by sending a raw HTTP request to test the web service.

# send raw HTTP request to test the web service.
import requests

# send a random row from the test set to score
random_index = np.random.randint(0, len(X_test) - 1)
input_data = '{"data": [' + str(list(X_test[random_index])) + "]}"

headers = {"Content-Type": "application/json"}

resp = requests.post(service.scoring_uri, input_data, headers=headers)

print("POST to url", service.scoring_uri)
print("label:", y_test[random_index])
print("prediction:", resp.text)

Clean up resources

If you're not going to continue to use this model, delete the Model service using:

# if you want to keep workspace and only delete endpoint (it will incur cost while running)
service.delete()

If you want to control cost further, stop the compute instance by selecting the "Stop compute" button next to the Compute dropdown. Then start the compute instance again the next time you need it. Please don't stop the compute right now, as you will be reuising it in the following challenges.

What we have learned so far

We've trained a Machine Learning model using scikit-learn inside a Compute Instance running Jupyter
We achieved ~92% accuracy (not very good for this data set)
Azure ML knows about our experiment and our initial run and tracked metrics
We have registered our initial model as a Azure ML Model in our Workspace

In the next challenge, we'll build an MLOps pipeline and use Github Actions to train and deploy a model automatically.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

challenge_02.md

challenge_02.md

Challenge 2: Model Deployment as an Azure Container Instance

Deploy the model for real-time inference

Create deployment configuration

Deploy model

View endpoint

Test the model service

Clean up resources

What we have learned so far

Files

challenge_02.md

Latest commit

History

challenge_02.md

File metadata and controls

Challenge 2: Model Deployment as an Azure Container Instance

Deploy the model for real-time inference

Create deployment configuration

Deploy model

View endpoint

Test the model service

Clean up resources

What we have learned so far