Wallaroo ML Workload Orchestration Comprehensive Tutorial
This can be downloaded as part of the Wallaroo Tutorials repository.
ML Workload Orchestration Comprehensive Tutorial
This tutorial provides a complete set of methods and examples regarding Wallaroo Connections and Wallaroo ML Workload Orchestration.
Wallaroo provides data connections, orchestrations, and tasks to provide organizations with a method of creating and managing automated tasks that can either be run on demand, on a regular schedule, or as a service so they respond to requests.
Object | Description |
---|---|
Orchestration | A set of instructions written as a python script with a requirements library. Orchestrations are uploaded to the Wallaroo instance |
Task | An implementation of an orchestration. Tasks are run either once when requested, on a repeating schedule, or as a service. |
Connection | Definitions set by MLOps engineers that are used by other Wallaroo users for connection information to a data source. Usually paired with orchestrations. |
A typical flow in the orchestration, task and connection life cycle is:
- (Optional) A connection is defined with information such as username, connection URL, tokens, etc.
- One or more connections are applied to a workspace for users to implement in their code or orchestrations.
- An orchestration is created to perform some set instructions. For example:
- Deploy a pipeline, request data from an external service, store the results in an external database, then undeploy the pipeline.
- Download a ML Model then replace a current pipeline step with the new version.
- Collect log files from a deployed pipeline once every hour and submit it to a Kafka or other service.
- A task is created that specifies the orchestration to perform and the schedule:
- Run once.
- Run on a schedule (based on
cron
like settings). - Run as a service to be run whenever requested.
- Once the use for a task is complete, it is killed and its schedule or service removed.
Tutorial Goals
The tutorial will demonstrate the following:
- Create a simple connection to retrieve an Apache Arrow table file from a GitHub registry.
- Create an orchestration that retrieves the Apache Arrow table file from the location defined by the connection, deploy a pipeline, perform an inference, then undeploys the pipeline.
- Implement the orchestration as a task that runs every minute.
- Display the logs from the pipeline after 5 minutes to verify the task is running.
Tutorial Required Libraries
The following libraries are required for this tutorial, and included by default in a Wallaroo instance’s JupyterHub service.
IMPORTANT NOTE: These libraries are already installed in the Wallaroo JupyterHub service. Do not uninstall and reinstall the Wallaroo SDK with the command below.
wallaroo: The Wallaroo SDK.
pandas: The pandas data analysis library.
pyarrow: The Apache Arrow Python library.
The specific versions used are set in the file ./resources/requirements.txt
. Supported libraries are automatically installed with the pypi
or conda
commands. For example, from the root of this tutorials folder:
pip install -r ./resources/requirements.txt
Initialization
The first step is to connect to a Wallaroo instance. We’ll load the libraries and set our client connection settings
Workspace, Model and Pipeline Setup
For this tutorial, we’ll create a workspace, upload our sample model and deploy a pipeline. We’ll perform some quick sample inferences to verify that everything it working.
import wallaroo
from wallaroo.object import EntityNotFoundError
# to display dataframe tables
from IPython.display import display
# used to display dataframe information without truncating
import pandas as pd
pd.set_option('display.max_colwidth', None)
pd.set_option('display.max_columns', None)
import pyarrow as pa
import requests
Connect to the Wallaroo Instance
The first step is to connect to Wallaroo through the Wallaroo client. The Python library is included in the Wallaroo install and available through the Jupyter Hub interface provided with your Wallaroo environment.
This is accomplished using the wallaroo.Client()
command, which provides a URL to grant the SDK permission to your specific Wallaroo environment. When displayed, enter the URL into a browser and confirm permissions. Store the connection into a variable that can be referenced later.
If logging into the Wallaroo instance through the internal JupyterHub service, use wl = wallaroo.Client()
. For more information on Wallaroo Client settings, see the Client Connection guide.
# Login through local Wallaroo instance
wl = wallaroo.Client()
# Setting variables for later steps
workspace_name = f'orchestrationworkspace'
pipeline_name = f'orchestrationpipeline'
model_name = f'orchestrationmodel'
model_file_name = './models/rf_model.onnx'
connection_name = f'houseprice_arrow_table'
Create the Workspace and Pipeline
We’ll now create our workspace and pipeline for the tutorial. If this tutorial has been run previously, then this will retrieve the existing ones with the assumption they’re for us with this tutorial.
We’ll set the retrieved workspace as the current workspace in the SDK, so all commands will default to that workspace.
workspace = wl.get_workspace(name=workspace_name, create_if_not_exist=True)
wl.set_current_workspace(workspace)
pipeline = wl.build_pipeline(pipeline_name)
Upload the Model and Deploy Pipeline
We’ll upload our model into our sample workspace, then add it as a pipeline step before deploying the pipeline to it’s ready to accept inference requests.
# Upload the model
housing_model_control = (wl.upload_model(model_name,
model_file_name,
framework=wallaroo.framework.Framework.ONNX)
.configure(tensor_fields=["tensor"])
)
# Add the model as a pipeline step
pipeline.add_model_step(housing_model_control)
name | orchestrationpipeline |
---|---|
created | 2024-12-09 21:09:36.107470+00:00 |
last_updated | 2024-12-09 21:10:01.573281+00:00 |
deployed | True |
workspace_id | 12 |
workspace_name | orchestrationworkspace |
arch | x86 |
accel | none |
tags | |
versions | 0492779b-3b18-4d29-a87c-b4cfaae53c5a, 67aed035-88b0-42a7-9a30-3d85c2617d37, 7f62cc3a-2f7f-41fc-bdc8-f598fe6a1926 |
steps | orchestrationmodel |
published | False |
#deploy the pipeline
pipeline.deploy(wait_for_status=False)
Deployment initiated for orchestrationpipeline. Please check pipeline status.
name | orchestrationpipeline |
---|---|
created | 2024-12-09 21:09:36.107470+00:00 |
last_updated | 2024-12-09 21:10:04.259743+00:00 |
deployed | True |
workspace_id | 12 |
workspace_name | orchestrationworkspace |
arch | x86 |
accel | none |
tags | |
versions | ff669947-df11-4971-acd8-9f71c31e51a0, 0492779b-3b18-4d29-a87c-b4cfaae53c5a, 67aed035-88b0-42a7-9a30-3d85c2617d37, 7f62cc3a-2f7f-41fc-bdc8-f598fe6a1926 |
steps | orchestrationmodel |
published | False |
# check the pipeline status before performing an inference
import time
while pipeline.status()['status'] != 'Running':
time.sleep(15)
pipeline.status()
{'status': 'Running',
'details': [],
'engines': [{'ip': '10.28.0.8',
'name': 'engine-8574cf699d-4nwg8',
'status': 'Running',
'reason': None,
'details': [],
'pipeline_statuses': {'pipelines': [{'id': 'orchestrationpipeline',
'status': 'Running',
'version': 'ff669947-df11-4971-acd8-9f71c31e51a0'}]},
'model_statuses': {'models': [{'model_version_id': 7,
'name': 'orchestrationmodel',
'sha': 'e22a0831aafd9917f3cc87a15ed267797f80e2afa12ad7d8810ca58f173b8cc6',
'status': 'Running',
'version': 'd2ff6835-e736-489e-8b4b-c3aa3684a96a'}]}}],
'engine_lbs': [{'ip': '10.28.0.7',
'name': 'engine-lb-6676794678-g6j6q',
'status': 'Running',
'reason': None,
'details': []}],
'sidekicks': []}
Sample Inferences
We’ll perform some quick sample inferences using an Apache Arrow table as the input. Once that’s finished, we’ll undeploy the pipeline and return the resources back to the Wallaroo instance.
# sample inferences
batch_inferences = pipeline.infer_from_file('./data/xtest-1k.arrow')
large_inference_result = batch_inferences.to_pandas()
display(large_inference_result.head(20))
time | in.tensor | out.variable | anomaly.count | |
---|---|---|---|---|
0 | 2024-12-09 21:10:09.099 | [4.0, 2.5, 2900.0, 5505.0, 2.0, 0.0, 0.0, 3.0, 8.0, 2900.0, 0.0, 47.6063, -122.02, 2970.0, 5251.0, 12.0, 0.0, 0.0] | [718013.75] | 0 |
1 | 2024-12-09 21:10:09.099 | [2.0, 2.5, 2170.0, 6361.0, 1.0, 0.0, 2.0, 3.0, 8.0, 2170.0, 0.0, 47.7109, -122.017, 2310.0, 7419.0, 6.0, 0.0, 0.0] | [615094.56] | 0 |
2 | 2024-12-09 21:10:09.099 | [3.0, 2.5, 1300.0, 812.0, 2.0, 0.0, 0.0, 3.0, 8.0, 880.0, 420.0, 47.5893, -122.317, 1300.0, 824.0, 6.0, 0.0, 0.0] | [448627.72] | 0 |
3 | 2024-12-09 21:10:09.099 | [4.0, 2.5, 2500.0, 8540.0, 2.0, 0.0, 0.0, 3.0, 9.0, 2500.0, 0.0, 47.5759, -121.994, 2560.0, 8475.0, 24.0, 0.0, 0.0] | [758714.2] | 0 |
4 | 2024-12-09 21:10:09.099 | [3.0, 1.75, 2200.0, 11520.0, 1.0, 0.0, 0.0, 4.0, 7.0, 2200.0, 0.0, 47.7659, -122.341, 1690.0, 8038.0, 62.0, 0.0, 0.0] | [513264.7] | 0 |
5 | 2024-12-09 21:10:09.099 | [3.0, 2.0, 2140.0, 4923.0, 1.0, 0.0, 0.0, 4.0, 8.0, 1070.0, 1070.0, 47.6902, -122.339, 1470.0, 4923.0, 86.0, 0.0, 0.0] | [668288.0] | 0 |
6 | 2024-12-09 21:10:09.099 | [4.0, 3.5, 3590.0, 5334.0, 2.0, 0.0, 2.0, 3.0, 9.0, 3140.0, 450.0, 47.6763, -122.267, 2100.0, 6250.0, 9.0, 0.0, 0.0] | [1004846.5] | 0 |
7 | 2024-12-09 21:10:09.099 | [3.0, 2.0, 1280.0, 960.0, 2.0, 0.0, 0.0, 3.0, 9.0, 1040.0, 240.0, 47.602, -122.311, 1280.0, 1173.0, 0.0, 0.0, 0.0] | [684577.2] | 0 |
8 | 2024-12-09 21:10:09.099 | [4.0, 2.5, 2820.0, 15000.0, 2.0, 0.0, 0.0, 4.0, 9.0, 2820.0, 0.0, 47.7255, -122.101, 2440.0, 15000.0, 29.0, 0.0, 0.0] | [727898.1] | 0 |
9 | 2024-12-09 21:10:09.099 | [3.0, 2.25, 1790.0, 11393.0, 1.0, 0.0, 0.0, 3.0, 8.0, 1790.0, 0.0, 47.6297, -122.099, 2290.0, 11894.0, 36.0, 0.0, 0.0] | [559631.1] | 0 |
10 | 2024-12-09 21:10:09.099 | [3.0, 1.5, 1010.0, 7683.0, 1.5, 0.0, 0.0, 5.0, 7.0, 1010.0, 0.0, 47.72, -122.318, 1550.0, 7271.0, 61.0, 0.0, 0.0] | [340764.53] | 0 |
11 | 2024-12-09 21:10:09.099 | [3.0, 2.0, 1270.0, 1323.0, 3.0, 0.0, 0.0, 3.0, 8.0, 1270.0, 0.0, 47.6934, -122.342, 1330.0, 1323.0, 8.0, 0.0, 0.0] | [442168.06] | 0 |
12 | 2024-12-09 21:10:09.099 | [4.0, 1.75, 2070.0, 9120.0, 1.0, 0.0, 0.0, 4.0, 7.0, 1250.0, 820.0, 47.6045, -122.123, 1650.0, 8400.0, 57.0, 0.0, 0.0] | [630865.6] | 0 |
13 | 2024-12-09 21:10:09.099 | [4.0, 1.0, 1620.0, 4080.0, 1.5, 0.0, 0.0, 3.0, 7.0, 1620.0, 0.0, 47.6696, -122.324, 1760.0, 4080.0, 91.0, 0.0, 0.0] | [559631.1] | 0 |
14 | 2024-12-09 21:10:09.099 | [4.0, 3.25, 3990.0, 9786.0, 2.0, 0.0, 0.0, 3.0, 9.0, 3990.0, 0.0, 47.6784, -122.026, 3920.0, 8200.0, 10.0, 0.0, 0.0] | [909441.1] | 0 |
15 | 2024-12-09 21:10:09.099 | [4.0, 2.0, 1780.0, 19843.0, 1.0, 0.0, 0.0, 3.0, 7.0, 1780.0, 0.0, 47.4414, -122.154, 2210.0, 13500.0, 52.0, 0.0, 0.0] | [313096.0] | 0 |
16 | 2024-12-09 21:10:09.099 | [4.0, 2.5, 2130.0, 6003.0, 2.0, 0.0, 0.0, 3.0, 8.0, 2130.0, 0.0, 47.4518, -122.12, 1940.0, 4529.0, 11.0, 0.0, 0.0] | [404040.8] | 0 |
17 | 2024-12-09 21:10:09.099 | [3.0, 1.75, 1660.0, 10440.0, 1.0, 0.0, 0.0, 3.0, 7.0, 1040.0, 620.0, 47.4448, -121.77, 1240.0, 10380.0, 36.0, 0.0, 0.0] | [292859.5] | 0 |
18 | 2024-12-09 21:10:09.099 | [3.0, 2.5, 2110.0, 4118.0, 2.0, 0.0, 0.0, 3.0, 8.0, 2110.0, 0.0, 47.3878, -122.153, 2110.0, 4044.0, 25.0, 0.0, 0.0] | [338357.88] | 0 |
19 | 2024-12-09 21:10:09.099 | [4.0, 2.25, 2200.0, 11250.0, 1.5, 0.0, 0.0, 5.0, 7.0, 1300.0, 900.0, 47.6845, -122.201, 2320.0, 10814.0, 94.0, 0.0, 0.0] | [682284.6] | 0 |
Create Wallaroo Connection
Connections are created at the Wallaroo instance level, typically by a MLOps or DevOps engineer, then applied to a workspace.
For this section:
- We will create a sample connection that just has a URL to the same Arrow table file we used in the previous step.
- We’ll apply the data connection to the workspace above.
- For a quick demonstration, we’ll use the connection to retrieve the Arrow table file and use it for a quick sample inference.
Create Connection
Connections are created with the Wallaroo client command create_connection
with the following parameters.
Parameter | Type | Description |
---|---|---|
name | string (Required) | The name of the connection. This must be unique - if submitting the name of an existing connection it will return an error. |
type | string (Required) | The user defined type of connection. |
details | Dict (Requires) | User defined configuration details for the data connection. These can be {'username':'dataperson', 'password':'datapassword', 'port': 3339} , or {'token':'abcde123==', 'host':'example.com', 'port:1234'} , or other user defined combinations. |
We’ll create the connection named houseprice_arrow_table
, set it to the type HTTPFILE
, and provide the details as 'host':'https://github.com/WallarooLabs/Wallaroo_Tutorials/raw/main/wallaroo-testing-tutorials/houseprice-saga/data/xtest-1k.arrow'
- the location for our sample Arrow table inference input.
wl.create_connection(connection_name,
"HTTPFILE",
{'host':'https://github.com/WallarooLabs/Wallaroo_Tutorials/raw/refs/heads/wallaroo2024.4_tutorials/wallaroo-automate/orchestration_sdk_comprehensive_tutorial/data/xtest-1k.arrow'}
)
Field | Value |
---|---|
Name | houseprice_arrow_table |
Connection Type | HTTPFILE |
Details | ***** |
Created At | 2024-12-09T21:20:26.268422+00:00 |
Linked Workspaces | [] |
List Data Connections
The Wallaroo Client list_connections()
method lists all connections for the Wallaroo instance.
wl.list_connections()
name | connection type | details | created at | linked workspaces |
---|---|---|---|---|
external_inference_connection_sample_2 | HTTP | ***** | 2024-12-09T19:48:44.311264+00:00 | ['simpleorchestrationworkspace2'] |
external_inference_connection_sample | HTTP | ***** | 2024-12-09T20:58:43.896309+00:00 | ['simpleorchestrationworkspace'] |
houseprice_arrow_table | HTTPFILE | ***** | 2024-12-09T21:20:26.268422+00:00 | [] |
Add Connection to Workspace
The method Workspace add_connection(connection_name)
adds a Data Connection to a workspace, and takes the following parameters.
Parameter | Type | Description |
---|---|---|
name | string (Required) | The name of the Data Connection |
We’ll add this connection to our sample workspace.
workspace.add_connection(connection_name)
Get Connection
Connections are retrieved by the Wallaroo Client get_connection(name)
method.
connection = wl.get_connection(connection_name)
Connection Details
The Connection method details()
retrieves a the connection details()
as a dict
.
display(connection.details())
{'host': 'https://github.com/WallarooLabs/Wallaroo_Tutorials/raw/refs/heads/wallaroo2024.4_tutorials/wallaroo-automate/orchestration_sdk_comprehensive_tutorial/data/xtest-1k.arrow'}
Using a Connection Example
For this example, the connection will be used to retrieve the Apache Arrow file referenced in the connection, and use that to turn it into an Apache Arrow table, then use that for a sample inference.
connection.details()['host']
'https://github.com/WallarooLabs/Wallaroo_Tutorials/raw/refs/heads/wallaroo2024.4_tutorials/wallaroo-automate/orchestration_sdk_comprehensive_tutorial/data/xtest-1k.arrow'
# Deploy the pipeline
pipeline.deploy()
# Retrieve the file
# set accept as apache arrow table
headers = {
'Accept': 'application/vnd.apache.arrow.file'
}
response = requests.get(
connection.details()['host'],
headers=headers
)
# Arrow table is retrieved
with pa.ipc.open_file(response.content) as reader:
arrow_table = reader.read_all()
results = pipeline.infer(arrow_table)
result_table = results.to_pandas()
display(result_table.head(20))
time | in.tensor | out.variable | anomaly.count | |
---|---|---|---|---|
0 | 2024-12-09 21:20:35.324 | [4.0, 2.5, 2900.0, 5505.0, 2.0, 0.0, 0.0, 3.0, 8.0, 2900.0, 0.0, 47.6063, -122.02, 2970.0, 5251.0, 12.0, 0.0, 0.0] | [718013.75] | 0 |
1 | 2024-12-09 21:20:35.324 | [2.0, 2.5, 2170.0, 6361.0, 1.0, 0.0, 2.0, 3.0, 8.0, 2170.0, 0.0, 47.7109, -122.017, 2310.0, 7419.0, 6.0, 0.0, 0.0] | [615094.56] | 0 |
2 | 2024-12-09 21:20:35.324 | [3.0, 2.5, 1300.0, 812.0, 2.0, 0.0, 0.0, 3.0, 8.0, 880.0, 420.0, 47.5893, -122.317, 1300.0, 824.0, 6.0, 0.0, 0.0] | [448627.72] | 0 |
3 | 2024-12-09 21:20:35.324 | [4.0, 2.5, 2500.0, 8540.0, 2.0, 0.0, 0.0, 3.0, 9.0, 2500.0, 0.0, 47.5759, -121.994, 2560.0, 8475.0, 24.0, 0.0, 0.0] | [758714.2] | 0 |
4 | 2024-12-09 21:20:35.324 | [3.0, 1.75, 2200.0, 11520.0, 1.0, 0.0, 0.0, 4.0, 7.0, 2200.0, 0.0, 47.7659, -122.341, 1690.0, 8038.0, 62.0, 0.0, 0.0] | [513264.7] | 0 |
5 | 2024-12-09 21:20:35.324 | [3.0, 2.0, 2140.0, 4923.0, 1.0, 0.0, 0.0, 4.0, 8.0, 1070.0, 1070.0, 47.6902, -122.339, 1470.0, 4923.0, 86.0, 0.0, 0.0] | [668288.0] | 0 |
6 | 2024-12-09 21:20:35.324 | [4.0, 3.5, 3590.0, 5334.0, 2.0, 0.0, 2.0, 3.0, 9.0, 3140.0, 450.0, 47.6763, -122.267, 2100.0, 6250.0, 9.0, 0.0, 0.0] | [1004846.5] | 0 |
7 | 2024-12-09 21:20:35.324 | [3.0, 2.0, 1280.0, 960.0, 2.0, 0.0, 0.0, 3.0, 9.0, 1040.0, 240.0, 47.602, -122.311, 1280.0, 1173.0, 0.0, 0.0, 0.0] | [684577.2] | 0 |
8 | 2024-12-09 21:20:35.324 | [4.0, 2.5, 2820.0, 15000.0, 2.0, 0.0, 0.0, 4.0, 9.0, 2820.0, 0.0, 47.7255, -122.101, 2440.0, 15000.0, 29.0, 0.0, 0.0] | [727898.1] | 0 |
9 | 2024-12-09 21:20:35.324 | [3.0, 2.25, 1790.0, 11393.0, 1.0, 0.0, 0.0, 3.0, 8.0, 1790.0, 0.0, 47.6297, -122.099, 2290.0, 11894.0, 36.0, 0.0, 0.0] | [559631.1] | 0 |
10 | 2024-12-09 21:20:35.324 | [3.0, 1.5, 1010.0, 7683.0, 1.5, 0.0, 0.0, 5.0, 7.0, 1010.0, 0.0, 47.72, -122.318, 1550.0, 7271.0, 61.0, 0.0, 0.0] | [340764.53] | 0 |
11 | 2024-12-09 21:20:35.324 | [3.0, 2.0, 1270.0, 1323.0, 3.0, 0.0, 0.0, 3.0, 8.0, 1270.0, 0.0, 47.6934, -122.342, 1330.0, 1323.0, 8.0, 0.0, 0.0] | [442168.06] | 0 |
12 | 2024-12-09 21:20:35.324 | [4.0, 1.75, 2070.0, 9120.0, 1.0, 0.0, 0.0, 4.0, 7.0, 1250.0, 820.0, 47.6045, -122.123, 1650.0, 8400.0, 57.0, 0.0, 0.0] | [630865.6] | 0 |
13 | 2024-12-09 21:20:35.324 | [4.0, 1.0, 1620.0, 4080.0, 1.5, 0.0, 0.0, 3.0, 7.0, 1620.0, 0.0, 47.6696, -122.324, 1760.0, 4080.0, 91.0, 0.0, 0.0] | [559631.1] | 0 |
14 | 2024-12-09 21:20:35.324 | [4.0, 3.25, 3990.0, 9786.0, 2.0, 0.0, 0.0, 3.0, 9.0, 3990.0, 0.0, 47.6784, -122.026, 3920.0, 8200.0, 10.0, 0.0, 0.0] | [909441.1] | 0 |
15 | 2024-12-09 21:20:35.324 | [4.0, 2.0, 1780.0, 19843.0, 1.0, 0.0, 0.0, 3.0, 7.0, 1780.0, 0.0, 47.4414, -122.154, 2210.0, 13500.0, 52.0, 0.0, 0.0] | [313096.0] | 0 |
16 | 2024-12-09 21:20:35.324 | [4.0, 2.5, 2130.0, 6003.0, 2.0, 0.0, 0.0, 3.0, 8.0, 2130.0, 0.0, 47.4518, -122.12, 1940.0, 4529.0, 11.0, 0.0, 0.0] | [404040.8] | 0 |
17 | 2024-12-09 21:20:35.324 | [3.0, 1.75, 1660.0, 10440.0, 1.0, 0.0, 0.0, 3.0, 7.0, 1040.0, 620.0, 47.4448, -121.77, 1240.0, 10380.0, 36.0, 0.0, 0.0] | [292859.5] | 0 |
18 | 2024-12-09 21:20:35.324 | [3.0, 2.5, 2110.0, 4118.0, 2.0, 0.0, 0.0, 3.0, 8.0, 2110.0, 0.0, 47.3878, -122.153, 2110.0, 4044.0, 25.0, 0.0, 0.0] | [338357.88] | 0 |
19 | 2024-12-09 21:20:35.324 | [4.0, 2.25, 2200.0, 11250.0, 1.5, 0.0, 0.0, 5.0, 7.0, 1300.0, 900.0, 47.6845, -122.201, 2320.0, 10814.0, 94.0, 0.0, 0.0] | [682284.6] | 0 |
Remove Connection from Workspace
The Workspace method remove_connection(connection_name)
removes the connection from the workspace, but does not delete the connection from the Wallaroo instance. This method takes the following parameters.
Parameter | Type | Description |
---|---|---|
name | String (Required) | The name of the connection to be removed |
The previous connection will be removed from the workspace, then the workspace connections displayed to verify it has been removed.
workspace.remove_connection(connection_name)
display(workspace.list_connections())
(no connections)
Delete Connection
The Connection method delete_connection()
removes the connection from the Wallaroo instance, and all attachments in workspaces they were connected to.
connection.delete_connection()
wl.list_connections()
name | connection type | details | created at | linked workspaces |
---|---|---|---|---|
external_inference_connection_sample_2 | HTTP | ***** | 2024-12-09T19:48:44.311264+00:00 | ['simpleorchestrationworkspace2'] |
external_inference_connection_sample | HTTP | ***** | 2024-12-09T20:58:43.896309+00:00 | ['simpleorchestrationworkspace'] |
Orchestration Tutorial
The next series of examples will build on what we just did. So far we have:
- Deployed a pipeline, performed sample inferences with a local Apache Arrow file, displayed the results, then undeployed the pipeline.
- Deployed a pipeline, use a Wallaroo connection details to retrieve a remote Apache Arrow file, performed inferences and displayed the results, then undeployed the pipeline.
For the orchestration tutorial, we’ll do the same thing only package it into a separate python script and upload it to the Wallaroo instance, then create a task from that orchestration and perform our sample inferences again.
Orchestration Requirements
Orchestrations are uploaded to the Wallaroo instance as a ZIP file with the following requirements:
- The ZIP file should not contain any directories - only files at the top level.
Parameter | Type | Description |
---|---|---|
User Code | (Required) Python script as .py files | Python scripts for the orchestration to run. If the file main.py exists, that will be the entrypoint. Otherwise, if only one .py exists, then that will be the entrypoint. |
Python Library Requirements | (Required) requirements.txt file in the requirements file format. This is in the root of the zip file, and there can only be one requirements.txt file for the orchestration. | |
Other artifacts | Other artifacts such as files, data, or code to support the orchestration. |
Zip Instructions
In a terminal with the zip
command, assemble artifacts as above and then create the archive. The zip
command is included by default with the Wallaroo JupyterHub service.
zip
commands take the following format, with {zipfilename}.zip
as the zip file to save the artifacts to, and each file thereafter as the files to add to the archive.
zip {zipfilename}.zip file1, file2, file3....
For example, the following command will add the files main.py
and requirements.txt
into the file hello.zip
.
$ zip hello.zip main.py requirements.txt
adding: main.py (deflated 47%)
adding: requirements.txt (deflated 52%)
Orchestration Recommendations
The following recommendations will make using Wallaroo orchestrations
- The version of Python used should match the same version as in the Wallaroo JupyterHub service.
- The same version of the Wallaroo SDK should match the server. For a 2023.2 Wallaroo instance, use the Wallaroo SDK version 2023.2.
- Specify the version of
pip
dependencies. - The
wallaroo.Client
constructorauth_type
argument is ignored. Usingwallaroo.Client()
is sufficient. - The following methods will assist with orchestrations:
wallaroo.in_task()
: ReturnsTrue
if the code is running within an Orchestrator task.wallaroo.task_args()
: Returns aDict
of invocation-specific arguments passed to therun_
calls.
- Use
print
commands so outputs are saved to the task’s log files.
Example requirements.txt file
dbt-bigquery==1.4.3
dbt-core==1.4.5
dbt-extractor==0.4.1
dbt-postgres==1.4.5
google-api-core==2.8.2
google-auth==2.11.0
google-auth-oauthlib==0.4.6
google-cloud-bigquery==3.3.2
google-cloud-bigquery-storage==2.15.0
google-cloud-core==2.3.2
google-cloud-storage==2.5.0
google-crc32c==1.5.0
google-pasta==0.2.0
google-resumable-media==2.3.3
googleapis-common-protos==1.56.4
Sample Orchestrator
The following orchestrator artifacts are in the directory ./remote_inference
and includes the file main.py
with the following code:
import wallaroo
from wallaroo.object import EntityNotFoundError
import pandas as pd
import pyarrow as pa
import requests
wl = wallaroo.Client()
# Setting variables for later steps
# get the arguments
arguments = wl.task_args()
if "workspace_name" in arguments:
workspace_name = arguments['workspace_name']
else:
workspace_name="orchestrationworkspace"
if "pipeline_name" in arguments:
pipeline_name = arguments['pipeline_name']
else:
pipeline_name="orchestrationpipeline"
if "connection_name" in arguments:
connection_name = arguments['connection_name']
else:
connection_name = "houseprice_arrow_table"
# helper methods to retrieve workspaces and pipelines
def get_workspace(name):
workspace = None
for ws in wl.list_workspaces():
if ws.name() == name:
workspace= ws
if(workspace == None):
workspace = wl.create_workspace(name)
return workspace
def get_pipeline(name):
try:
pipeline = wl.pipelines_by_name(name)[0]
except EntityNotFoundError:
pipeline = wl.build_pipeline(name)
return pipeline
print(f"Getting the workspace {workspace_name}")
workspace = wl.get_workspace(name=workspace_name, create_if_not_exist=True)
wl.set_current_workspace(workspace)
print(f"Getting the pipeline {pipeline_name}")
pipeline = wl.build_pipeline(pipeline_name)
pipeline.deploy()
# Get the connection - assuming it will be the only one
inference_source_connection = wl.get_connection(name=connection_name)
print(f"Getting arrow table file")
# Retrieve the file
# set accept as apache arrow table
headers = {
'Accept': 'application/vnd.apache.arrow.file'
}
response = requests.get(
inference_source_connection.details()['host'],
headers=headers
)
# Arrow table is retrieved
with pa.ipc.open_file(response.content) as reader:
arrow_table = reader.read_all()
print("Inference time. Displaying results after.")
# Perform the inference
result = pipeline.infer(arrow_table)
print(result)
pipeline.undeploy()
This is saved to the file ./remote_inference/remote_inference.zip
.
Preparing the Wallaroo Instance
To prepare the Wallaroo instance, we’ll once again create the Wallaroo connection houseprice_arrow_table
and apply it to the workspace.
wl.create_connection(connection_name,
"HTTPFILE",
{'host':'https://github.com/WallarooLabs/Wallaroo_Tutorials/raw/main/wallaroo-testing-tutorials/houseprice-saga/data/xtest-1k.arrow'}
)
workspace.add_connection(connection_name)
Upload the Orchestration
Orchestrations are uploaded with the Wallaroo client upload_orchestration(path)
method with the following parameters.
Parameter | Type | Description |
---|---|---|
path | string (Required) | The path to the ZIP file to be uploaded. |
Once uploaded, the deployment will be prepared and any requirements will be downloaded and installed.
For this example, the orchestration ./remote_inference/remote_inference.zip
will be uploaded and saved to the variable orchestration
.
orchestration = wl.upload_orchestration(name="comprehensive sample",
path="./remote_inference/remote_inference.zip")
Orchestration Status
The Orchestration method status()
displays the current status of the uploaded orchestration.
Status | Description |
---|---|
— | — |
pending_packaging | The orchestration is uploaded, but packaging hasn’t started yet. |
packaging | The orchestration is being packaged for use with the Wallaroo instance. |
ready | The orchestration is ready for use. |
For this example, the status of the orchestration will be displayed then looped until it has reached status ready
.
import time
while orchestration.status() != 'ready' and orchestration.status() != 'error':
print(orchestration.status())
time.sleep(5)
pending_packaging
packaging
packaging
packaging
packaging
packaging
packaging
packaging
packaging
List Orchestrations
Orchestrations are listed with the Wallaroo Client list_orchestrations()
which returns a list of available orchestrations.
wl.list_orchestrations()
id | name | status | filename | sha | created at | updated at | workspace id | workspace name |
---|---|---|---|---|---|---|---|---|
26de20c7-4068-4f03-9ea1-d5bb534491e4 | None | ready | remote_inference.zip | 4dd816...df4e12 | 2024-09-Dec 19:48:45 | 2024-09-Dec 19:50:04 | 10 | simpleorchestrationworkspace2 |
5182f394-bf3f-4c8d-9ccc-f03ee56cbf1a | uploadedbytesdemo | ready | inferencetest.zip | 4dd816...df4e12 | 2024-09-Dec 19:50:09 | 2024-09-Dec 19:51:08 | 10 | simpleorchestrationworkspace2 |
35eaff98-9683-4f76-bcd5-0b8f07f1651b | None | ready | remote_inference.zip | 4dd816...df4e12 | 2024-09-Dec 20:58:45 | 2024-09-Dec 20:59:31 | 11 | simpleorchestrationworkspace |
962b2ba9-498c-4802-a04f-22438dcd99a0 | uploadedbytesdemo | ready | inferencetest.zip | 4dd816...df4e12 | 2024-09-Dec 20:59:37 | 2024-09-Dec 21:00:30 | 11 | simpleorchestrationworkspace |
3fa7a100-5de5-46d6-bc89-c648dcbd491f | comprehensive sample | ready | remote_inference.zip | 0cb3c3...4768cc | 2024-09-Dec 21:20:38 | 2024-09-Dec 21:21:25 | 12 | orchestrationworkspace |
Task Management Tutorial
Once an Orchestration has the status ready
, it can be run as a task. Tasks have three run options.
Type | SDK Call | How triggered | Purpose |
---|---|---|---|
Once | orchestration.run_once(name, json_args, timeout) | Task runs once and exits. | Single batch, experimentation |
Scheduled | orchestration.run_scheduled() | User provides schedule. Task runs exits whenever schedule dictates. | Recurrent batch ETL. |
Task Run Once
Tasks are generated and run once with the Orchestration run_once(name, json_args, timeout)
method. Any arguments for the orchestration are passed in as a Dict
. If there are no arguments, then an empty set {}
is passed.
For our example, we will pass the workspace, pipeline, and connection into our task.
# Example: run once
import datetime
task_start = datetime.datetime.now()
task = orchestration.run_once(name="house price run once 2", json_args={"workspace_name": workspace_name,
"pipeline_name":pipeline_name,
"connection_name": connection_name
}
)
task
Field | Value |
---|---|
ID | bfe8a389-f72f-4687-9d7d-b037e6600d5b |
Name | house price run once 2 |
Last Run Status | unknown |
Type | Temporary Run |
Active | True |
Schedule | - |
Created At | 2024-09-Dec 21:21:31 |
Updated At | 2024-09-Dec 21:21:31 |
Workspace ID | 12 |
Workspace Name | orchestrationworkspace |
List Tasks
The list of tasks in the Wallaroo instance is retrieved through the Wallaroo Client list_tasks()
method that accepts the following parameters.
Parameter | Type | Description |
---|---|---|
killed | Boolean (Optional Default: False ) | Returns tasks depending on whether they have been issued the kill command. False returns all tasks whether killed or not. True only returns killed tasks. |
This returns an array list of the following in reverse chronological order from updated at
.
Parameter | Type | Description |
---|---|---|
id | string | The UUID identifier for the task. |
last run status | string | The last reported status the task. Values are:
|
type | string | The type of the task. Values are:
|
active | Boolean | True : The task is scheduled or running. False : The task has completed or has been issued the kill command. |
schedule | string | The cron style schedule for the task. If the task is not a scheduled one, then the schedule will be - . |
created at | DateTime | The date and time the task was started. |
updated at | DateTime | The date and time the task was updated. |
wl.list_tasks()
id | name | last run status | type | active | schedule | created at | updated at | workspace id | workspace name |
---|---|---|---|---|---|---|---|---|---|
bfe8a389-f72f-4687-9d7d-b037e6600d5b | house price run once 2 | unknown | Temporary Run | True | - | 2024-09-Dec 21:21:31 | 2024-09-Dec 21:21:31 | 12 | orchestrationworkspace |
5b95265f-52b5-4324-9d77-8004f0d9da4e | simpletaskdemo | success | Temporary Run | True | - | 2024-09-Dec 20:59:39 | 2024-09-Dec 20:59:51 | 11 | simpleorchestrationworkspace |
f7aa8740-d104-4eb9-a3b8-09f79b300d9f | simpletaskdemo | success | Temporary Run | True | - | 2024-09-Dec 19:50:11 | 2024-09-Dec 19:50:44 | 10 | simpleorchestrationworkspace2 |
Task Status
The status of the task is returned with the Task status()
method that returned the tasks status. Tasks can have the following status.
pending
: The task has not been started or is being prepared.started
: The task has started to execute.
while task.status() != "started":
display(task.status())
time.sleep(5)
'pending'
Task Last Runs History
The history of a task, which each deployment of the task is known as a task run is retrieved with the Task last_runs
method that takes the following arguments.
Parameter | Type | Description |
---|---|---|
status | String (Optional *Default: all ) | Filters the task history by the status . If all , returns all statuses. Status values are:
|
limit | Integer (Optional) | Limits the number of task runs returned. |
This returns the following in reverse chronological order by updated at
.
Parameter | Type | Description |
---|---|---|
task id | string | Task id in UUID format. |
pod id | string | Pod id in UUID format. |
status | string | Status of the task. Status values are:
|
created at | DateTime | Date and time the task was created at. |
updated at | DateTime | Date and time the task was updated. |
task.last_runs()
task id | pod id | status | created at | updated at |
---|---|---|---|---|
bfe8a389-f72f-4687-9d7d-b037e6600d5b | 13c5a449-6f58-4839-9fbc-64ab8e98a657 | running | 2024-09-Dec 21:21:34 | 2024-09-Dec 21:21:34 |
Task Run Logs
The output of a task is displayed with the Task Run logs()
method that takes the following parameters.
Parameter | Type | Description |
---|---|---|
limit | Integer (Optional) | Limits the lines returned from the task run log. The limit parameter is based on the log tail - starting from the last line of the log file, then working up until the limit of lines is reached. This is useful for viewing final outputs, exceptions, etc. |
The Task Run logs()
returns the log entries as a string list, with each entry as an item in the list.
- IMPORTANT NOTE: It may take around a minute for task run logs to be integrated into the Wallaroo log database.
# give time for the task to complete and the log files entered
time.sleep(60)
recent_run = task.last_runs()[0]
display(recent_run.logs())
2024-09-Dec 21:21:34 WALLAROO: Setting up task run...
2024-09-Dec 21:21:36 WALLAROO: *** If you don't see any output, Python is probably buffering stdout. ***
2024-09-Dec 21:21:36 WALLAROO: *** Consider `print('...', flush=True)` or `sys.stdout.flush()` in your orchestration code. ***
2024-09-Dec 21:21:36 WALLAROO: Task environment ready. Running...
2024-09-Dec 21:21:40 Getting the pipeline orchestrationpipeline
2024-09-Dec 21:21:40 Getting the workspace orchestrationworkspace
2024-09-Dec 21:21:40 Deployment initiated for orchestrationpipeline. Please check pipeline status.
2024-09-Dec 21:21:40 {'status': 'Running', 'details': [], 'engines': [{'ip': '10.28.0.8', 'name': 'engine-8574cf699d-4nwg8', 'status': 'Running', 'reason': None, 'details': [], 'pipeline_statuses': {'pipelines': [{'id': 'orchestrationpipeline', 'status': 'Running', 'version': 'fb0022ec-0162-4a7f-abbf-d3ef2e128d1c'}]}, 'model_statuses': {'models': [{'model_version_id': 7, 'name': 'orchestrationmodel', 'sha': 'e22a0831aafd9917f3cc87a15ed267797f80e2afa12ad7d8810ca58f173b8cc6', 'status': 'Running', 'version': 'd2ff6835-e736-489e-8b4b-c3aa3684a96a'}]}}], 'engine_lbs': [{'ip': '10.28.0.7', 'name': 'engine-lb-6676794678-g6j6q', 'status': 'Running', 'reason': None, 'details': []}], 'sidekicks': []}
2024-09-Dec 21:21:40 Getting arrow table file
2024-09-Dec 21:21:40 Traceback (most recent call last):
2024-09-Dec 21:21:40 File "/home/jovyan/main.py", line 63, in
2024-09-Dec 21:21:40 with pa.ipc.open_file(response.content) as reader:
2024-09-Dec 21:21:40 File "/home/jovyan/venv/lib/python3.10/site-packages/pyarrow/ipc.py", line 234, in open_file
2024-09-Dec 21:21:40 return RecordBatchFileReader(
2024-09-Dec 21:21:40 File "/home/jovyan/venv/lib/python3.10/site-packages/pyarrow/ipc.py", line 110, in __init__
2024-09-Dec 21:21:40 File "pyarrow/ipc.pxi", line 1013, in pyarrow.lib._RecordBatchFileReader._open
2024-09-Dec 21:21:40 self._open(source, footer_offset=footer_offset,
2024-09-Dec 21:21:40 File "pyarrow/error.pxi", line 154, in pyarrow.lib.pyarrow_internal_check_status
2024-09-Dec 21:21:40 File "pyarrow/error.pxi", line 91, in pyarrow.lib.check_status
2024-09-Dec 21:21:40 pyarrow.lib.ArrowInvalid: Not an Arrow file
2024-09-Dec 21:21:41 WALLAROO: Task run complete. Status: failure
Failed Task Logs
We can create a task that fails and show it in the last_runs
list, then retrieve the logs to display why it failed.
# Example: run once
import datetime
task_start = datetime.datetime.now()
taskfail = orchestration.run_once(name="house price run once 2", json_args={"workspace_name": "bob",
"pipeline_name":"does not exist",
"connection_name": connection_name
}
)
while taskfail.status() != "started":
display(taskfail.status())
time.sleep(5)
'pending'
# sleep to give time for the task to complete
time.sleep(60)
taskfail.last_runs()
task id | pod id | status | created at | updated at |
---|---|---|---|---|
ee77704b-4695-4e30-8e14-43c4f0947959 | b1359154-9cbf-4b3e-a778-2f6069e93a1d | failure | 2024-09-Dec 21:22:42 | 2024-09-Dec 21:22:42 |
# sleep to give time for the logs to be available
time.sleep(60)
taskfaillogs = taskfail.last_runs()[0].logs()
display(taskfaillogs)
2024-09-Dec 21:22:42 WALLAROO: Setting up task run...
2024-09-Dec 21:22:45 WALLAROO: *** If you don't see any output, Python is probably buffering stdout. ***
2024-09-Dec 21:22:45 WALLAROO: *** Consider `print('...', flush=True)` or `sys.stdout.flush()` in your orchestration code. ***
2024-09-Dec 21:22:45 WALLAROO: Task environment ready. Running...
2024-09-Dec 21:22:48 Getting the workspace bob
2024-09-Dec 21:22:48 Traceback (most recent call last):
2024-09-Dec 21:22:48 File "/home/jovyan/main.py", line 31, in
2024-09-Dec 21:22:48 workspace = wl.get_workspace(workspace_name)
2024-09-Dec 21:22:48 return Workspace.get_workspace(
2024-09-Dec 21:22:48 File "/home/jovyan/venv/lib/python3.10/site-packages/wallaroo/client.py", line 2595, in get_workspace
2024-09-Dec 21:22:48 File "/home/jovyan/venv/lib/python3.10/site-packages/wallaroo/workspace.py", line 230, in get_workspace
2024-09-Dec 21:22:48 raise Exception(
2024-09-Dec 21:22:48 Exception: Error: Workspace with name bob does not exist. If you would like to create one, send in the request with `create_if_not_exist` flag set to True.
2024-09-Dec 21:22:48 WALLAROO: Task run complete. Status: failure
Task Results
We can view the inferences from our logs and verify that new entries were added from our task. In our case, we’ll assume the task once started takes about 1 minute to run (deploy the pipeline, run the inference, undeploy the pipeline). We’ll add in a wait of 1 minute, then display the logs during the time period the task was running.
task_end = datetime.datetime.now()
display(task_end)
pipeline.logs(start_datetime = task_start, end_datetime = task_end)
datetime.datetime(2024, 12, 9, 14, 24, 47, 794992)
Scheduled Tasks
Scheduled tasks are run with the Orchestration run_scheduled
method. We’ll set it up to run every 5 minutes, then check the results.
It is recommended that orchestrations that have pipeline deploy or undeploy commands be spaced out no less than 5 minutes to prevent colliding with other tasks that use the same pipeline.
task_start = datetime.datetime.now()
schedule = "*/5 * * * *"
task_scheduled = orchestration.run_scheduled(name="schedule example",
timeout=600,
schedule=schedule,
json_args={"workspace_name": workspace_name,
"pipeline_name": pipeline_name,
"connection_name": connection_name
})
while task_scheduled.status() != "started":
display(task_scheduled.status())
time.sleep(5)
task_scheduled
Field | Value |
---|---|
ID | c3ff008d-5057-4437-8d93-369a732b5d52 |
Name | schedule example |
Last Run Status | unknown |
Type | Scheduled Run |
Active | True |
Schedule | */5 * * * * |
Created At | 2024-09-Dec 21:24:48 |
Updated At | 2024-09-Dec 21:24:48 |
Workspace ID | 12 |
Workspace Name | orchestrationworkspace |
# time for the task to complete in the next 5 minutes
time.sleep(420)
recent_run = task_scheduled.last_runs()[0]
display(recent_run.logs())
2024-09-Dec 21:30:01 WALLAROO: Setting up task run...
2024-09-Dec 21:30:04 WALLAROO: *** If you don't see any output, Python is probably buffering stdout. ***
2024-09-Dec 21:30:04 WALLAROO: *** Consider `print('...', flush=True)` or `sys.stdout.flush()` in your orchestration code. ***
2024-09-Dec 21:30:04 WALLAROO: Task environment ready. Running...
2024-09-Dec 21:30:08 Getting the workspace orchestrationworkspace
2024-09-Dec 21:30:08 Getting the pipeline orchestrationpipeline
2024-09-Dec 21:30:08 Deployment initiated for orchestrationpipeline. Please check pipeline status.
2024-09-Dec 21:30:08 {'status': 'Running', 'details': [], 'engines': [{'ip': '10.28.0.8', 'name': 'engine-8574cf699d-4nwg8', 'status': 'Running', 'reason': None, 'details': [], 'pipeline_statuses': {'pipelines': [{'id': 'orchestrationpipeline', 'status': 'Running', 'version': '8dee7040-f443-44a9-906f-c53329799d17'}]}, 'model_statuses': {'models': [{'model_version_id': 7, 'name': 'orchestrationmodel', 'sha': 'e22a0831aafd9917f3cc87a15ed267797f80e2afa12ad7d8810ca58f173b8cc6', 'status': 'Running', 'version': 'd2ff6835-e736-489e-8b4b-c3aa3684a96a'}]}}], 'engine_lbs': [{'ip': '10.28.0.7', 'name': 'engine-lb-6676794678-g6j6q', 'status': 'Running', 'reason': None, 'details': []}], 'sidekicks': []}
2024-09-Dec 21:30:08 Getting arrow table file
2024-09-Dec 21:30:08 Traceback (most recent call last):
2024-09-Dec 21:30:08 File "/home/jovyan/main.py", line 63, in
2024-09-Dec 21:30:08 with pa.ipc.open_file(response.content) as reader:
2024-09-Dec 21:30:08 return RecordBatchFileReader(
2024-09-Dec 21:30:08 File "/home/jovyan/venv/lib/python3.10/site-packages/pyarrow/ipc.py", line 234, in open_file
2024-09-Dec 21:30:08 File "/home/jovyan/venv/lib/python3.10/site-packages/pyarrow/ipc.py", line 110, in __init__
2024-09-Dec 21:30:08 self._open(source, footer_offset=footer_offset,
2024-09-Dec 21:30:08 File "pyarrow/error.pxi", line 154, in pyarrow.lib.pyarrow_internal_check_status
2024-09-Dec 21:30:08 File "pyarrow/ipc.pxi", line 1013, in pyarrow.lib._RecordBatchFileReader._open
2024-09-Dec 21:30:08 File "pyarrow/error.pxi", line 91, in pyarrow.lib.check_status
2024-09-Dec 21:30:08 pyarrow.lib.ArrowInvalid: Not an Arrow file
2024-09-Dec 21:30:08 WALLAROO: Task run complete. Status: failure
Kill a Task
Killing a task removes the schedule or removes it from a service. Tasks are killed with the Task kill()
method, and returns a message with the status of the kill procedure.
If necessary, all tasks can be killed through the following script.
- IMPORTANT NOTE: This command will kill all running tasks - scheduled or otherwise. Only use this if required.
# Kill all tasks
for t in wl.list_tasks(): t.kill()
When listed with Wallaroo client task_list(killed=True)
, the field active
displays tasks that are killed (False
) or either completed running or still scheduled to run (True
).
task_scheduled.kill()
<ArbexStatus.PENDING_KILL: ‘pending_kill’>
wl.list_tasks()
id | name | last run status | type | active | schedule | created at | updated at | workspace id | workspace name |
---|---|---|---|---|---|---|---|---|---|
c3ff008d-5057-4437-8d93-369a732b5d52 | schedule example | failure | Scheduled Run | True | */5 * * * * | 2024-09-Dec 21:24:48 | 2024-09-Dec 21:31:51 | 12 | orchestrationworkspace |
ee77704b-4695-4e30-8e14-43c4f0947959 | house price run once 2 | failure | Temporary Run | True | - | 2024-09-Dec 21:22:40 | 2024-09-Dec 21:22:48 | 12 | orchestrationworkspace |
bfe8a389-f72f-4687-9d7d-b037e6600d5b | house price run once 2 | failure | Temporary Run | True | - | 2024-09-Dec 21:21:31 | 2024-09-Dec 21:21:41 | 12 | orchestrationworkspace |
5b95265f-52b5-4324-9d77-8004f0d9da4e | simpletaskdemo | success | Temporary Run | True | - | 2024-09-Dec 20:59:39 | 2024-09-Dec 20:59:51 | 11 | simpleorchestrationworkspace |
f7aa8740-d104-4eb9-a3b8-09f79b300d9f | simpletaskdemo | success | Temporary Run | True | - | 2024-09-Dec 19:50:11 | 2024-09-Dec 19:50:44 | 10 | simpleorchestrationworkspace2 |
Cleaning Up
With the tutorial complete we will undeploy the pipeline and ensure the resources are returned back to the Wallaroo instance.
pipeline.undeploy()
name | orchestrationpipeline |
---|---|
created | 2024-12-09 21:09:36.107470+00:00 |
last_updated | 2024-12-09 21:20:33.470614+00:00 |
deployed | False |
workspace_id | 12 |
workspace_name | orchestrationworkspace |
arch | x86 |
accel | none |
tags | |
versions | d507b058-ca47-41a0-a934-eb6fb401308a, 64337279-4d6a-450d-9ddb-cb2d2299c5ce, 7d9d4dc7-aa29-4e44-a349-a5e2a6d975f7, ff669947-df11-4971-acd8-9f71c31e51a0, 0492779b-3b18-4d29-a87c-b4cfaae53c5a, 67aed035-88b0-42a7-9a30-3d85c2617d37, 7f62cc3a-2f7f-41fc-bdc8-f598fe6a1926 |
steps | orchestrationmodel |
published | False |