Deployment Configuration with the Wallaroo SDK
How to manage deployment configurations using the Wallaroo SDK
Organizations manage model deployments through pipelines.
Pipelines represent how data is submitted to your uploaded Machine Learning (ML) models. Pipelines allow you to:
The Wallaroo Dashboard provides users methods to create, deploy, and other pipeline operations through the Wallaroo Dashboard user interface.
Pipeline names are not forced to be unique. You can have 50 pipelines all named my-pipeline
, which can cause confusion in determining which pipeline to use.
It is recommended that organizations agree on a naming convention and select pipeline to use rather than creating a new one each time. See the SDK guides for more information on how to select an existing pipeline.
To create a pipeline:
From the Wallaroo Dashboard, set the current workspace from the top left dropdown list.
Select View Pipelines from the pipeline’s row.
From the upper right hand corner, select Create Pipeline.
Enter the following:
When finished, select Next.
Review the name of the pipeline and the steps. If any adjustments need to be made, select either Back to rename the pipeline or Add Step(s) to change the pipeline’s steps.
When finished, select Build to create the pipeline in this workspace. The pipeline will be built and be ready for deployment within a minute.
Deployed pipelines create new namespaces in the Kubernetes environment where the Wallaroo instance is deployed, and allocate resources from the Kubernetes environment to run the pipeline and its steps.
To deploy a pipeline:
From the Wallaroo Dashboard, set the current workspace from the top left dropdown list.
Select View Pipelines from the pipeline’s row.
Select the pipeline to deploy.
From the right navigation panel, select Deploy.
A popup module will request verification to deploy the pipeline. Select Deploy again to deploy the pipeline.
Undeploying a pipeline returns resources back to the Kubernetes environment and removes the namespaces created when the pipeline was deployed.
To undeploy a pipeline:
From the Wallaroo Dashboard, set the current workspace from the top left dropdown list.
Select View Pipelines from the pipeline’s row.
Select the pipeline to deploy.
From the right navigation panel, select Undeploy.
A popup module will request verification to undeploy the pipeline. Select Undeploy again to undeploy the pipeline.
Pipeline deployment configurations are updated through the Wallaroo Dashboard for previously deployed pipelines and currently undeployed pipelines. For more details, see Deployment Configuration via the Wallaroo Dashboard.
To view a pipeline’s details:
To view a pipeline’s metrics:
The following is available from the Pipeline Details page:
Pipelines are the method of taking submitting data and processing that data through the models. Each pipeline can have one or more steps that submit the data from the previous step to the next one. Information can be submitted to a pipeline as a file, or through the pipeline’s URL.
A pipeline’s metrics can be viewed through the Wallaroo Dashboard Pipeline Details and Metrics page.
Pipeline names map onto Kubernetes objects, and must be DNS compliant. Pipeline names must be ASCII alpha-numeric characters or dash (-) only. .
and _
are not allowed.
New pipelines are created in the current workspace.
Pipeline names are not forced to be unique. You can have 50 pipelines all named my-pipeline
, which can cause confusion in determining which workspace to use.
It is recommended that organizations agree on a naming convention and select pipeline to use rather than creating a new one each time.
To create a new pipeline, use the Wallaroo Client build_pipeline("{Pipeline Name}")
command.
The following example creates a new pipeline imdb-pipeline
through a Wallaroo Client connection wl
:
imdb_pipeline = wl.build_pipeline("imdb-pipeline")
imdb_pipeline.status()
{'status': 'Pipeline imdb-pipeline is not deployed'}
Pipelines are retrieved via the method wallaroo.client.get_pipeline
. This retrieves the most recent version of the specified pipeline in the current workspace that that matches the requested parameters.
For more details on workspaces, see How to Set the Current Workspace.
wallaroo.client.get_pipeline
takes the following parameters.
Parameter | Type | Description |
---|---|---|
name | (String) (Required) | The name of the pipeline in the current workspace. |
version | (String) (Optional) (Default: None ) | The specific version of the pipeline to retrieve. If no version is provided, the most recent pipeline version is returned. |
wallaroo.client.get_pipeline
returns wallaroo.pipeline_version.PipelineVersion
for the specified pipeline and version in the current workspace. This includes the following fields:
Field | Type | Description |
---|---|---|
name | String | The pipeline name. |
created | DateTime | The pipeline creation date and time. |
last_updated | DateTime | The date and time of the last pipeline update. |
deployed | Bool | If the pipeline has been deployed or not. If None , then the pipeline has never been deployed before. |
workspace_id | Int | The workspace id for the workspace the pipeline is assigned to. |
workspace_name | String | The workspace name for the workspace the pipeline is assigned to. |
arch | String | The hardware architecture inherited from the ML model added as pipeline step. |
accel | String | The AI accelerator inherited from the ML model added as pipeline steps. |
tags | List[String] | A list of the pipeline’s tags. |
versions | List[String] | A list of the pipeline versions. |
steps | List[String] | A list of pipeline steps. |
published | Bool | If the pipeline is published to an OCI Registry (True) or not (False). |
If the pipeline or pipeline version do not match the requested parameters, an error is returned.
The following demonstrates retrieving the most recent pipeline version of the requested pipeline in the current workspace.
wl.get_pipeline(name=pipeline_name)
name | helper-demo-pipeline-1 |
---|---|
created | 2024-07-15 21:44:19.342687+00:00 |
last_updated | 2024-07-15 21:45:36.935273+00:00 |
deployed | True |
workspace_id | 8 |
workspace_name | john.hummel@wallaroo.ai - Default Workspace |
arch | x86 |
accel | none |
tags | |
versions | ad133bce-198d-4cf1-be3d-8f553a8baf03, 9153adea-72de-4925-938a-44366c0f4ef7 |
steps | helper-demo-model-1 |
published | False |
The following demonstrates requesting a pipeline when no pipeline matching the name exists in the current workspace.
wl.get_pipeline(name="no-such-pipeline")
---------------------------------------------------------------------------
Exception Traceback (most recent call last)
/tmp/ipykernel_208/1835144440.py in <module>
----> 1 wl.get_pipeline(name="no-such-pipeline")
~/.local/lib/python3.9/site-packages/wallaroo/client.py in get_pipeline(self, name, version)
1357 )
1358 if pipeline is None:
-> 1359 raise Exception(f"Pipeline {name} not found in this workspace.")
1360 if version is not None:
1361 pipeline_version = next(
Exception: Pipeline no-such-pipeline not found in this workspace.
wl.get_pipeline(name=pipeline_name, version="62e0236f-e040-40c1-8666-7b703fd5d2f0")
name | helper-demo-pipeline-1 |
---|---|
created | 2024-07-15 18:15:32.873521+00:00 |
last_updated | 2024-07-15 18:15:32.873521+00:00 |
deployed | (none) |
workspace_id | 25 |
workspace_name | helper-demo-workspace-1 |
arch | None |
accel | None |
tags | |
versions | 62e0236f-e040-40c1-8666-7b703fd5d2f0 |
steps | |
published | False |
The following demonstrates requesting a pipeline version when no pipeline version matching the requested version
parameter exists in the current workspace and pipeline.
pipeline_name="helper-demo-pipeline-1"
wl.get_pipeline(name=pipeline_name, version="no-such-version")
---------------------------------------------------------------------------
Exception Traceback (most recent call last)
/tmp/ipykernel_208/2171664880.py in <module>
----> 1 wl.get_pipeline(name=pipeline_name, version="no-such-version")
~/.local/lib/python3.9/site-packages/wallaroo/client.py in get_pipeline(self, name, version)
1366 return pipeline
1367 else:
-> 1368 raise Exception(
1369 f"Pipeline version {version} not found in this workspace."
1370 )
Exception: Pipeline version no-such-version not found in this workspace.
The Wallaroo Client method list_pipelines()
lists all pipelines in Wallaroo in workspaces the SDK user is a member of. The list is filterable by the workspace id or the workspace name, listed in reverse chronological order.
Admin users have unrestricted access to all workspaces. For more details, see Wallaroo Enterprise User Management.
Parameter | Type | Description |
---|---|---|
workspace_id | (Int) (Optional) | The numerical identifier of the workspace to filter by. |
workspace_name | (String) (Optional) | The name of the workspace to filter by. |
The following fields are returned from the list_pipeline
method.
Field | Type | Description |
---|---|---|
name | String | The assigned name of the pipeline. |
deployed | Bool | Whether the pipeline is currently deployed or not. If None, the pipeline has never been deployed. |
workspace_id | Int | The numerical identifier of the workspace the pipeline is assigned to. |
workspace_name | String | The workspace name of the workspace the pipeline is assigned to. |
arch | String | The hardware architecture inherited from the models assigned as pipeline steps. |
accel | String | The AI accelerator hardware inherited from the models assigned as pipeline steps. |
created | DateTime | The date and time the pipeline was created. |
last_updated | DateTime | The date and time the pipeline was updated. |
tags | List | The list of tags applied to the pipeline. For more details, see Wallaroo SDK Essentials Guide: Tag Management. |
versions | List | The list of pipeline versions, each version ID in UUID format. |
steps | List | The list of pipeline steps. |
published | Bool | If the pipeline was published to the Edge Deployment Registry. See Wallaroo SDK Essentials Guide: Pipeline Edge Publication for more details. |
The following retrieves all pipelines in workspaces the SDK user is a member of.
wl.list_pipelines()
name | created | last_updated | deployed | workspace_id | workspace_name | arch | accel | tags | versions | steps | published |
---|---|---|---|---|---|---|---|---|---|---|---|
helper-demo-pipeline-1 | 2024-15-Jul 21:44:19 | 2024-15-Jul 21:45:56 | True | 8 | john.hummel@wallaroo.ai - Default Workspace | x86 | none | f5266b8b-07fd-4797-af2b-23a96208fb78, ad133bce-198d-4cf1-be3d-8f553a8baf03, 9153adea-72de-4925-938a-44366c0f4ef7 | helper-demo-model-1 | False | |
helper-demo-pipeline-1 | 2024-15-Jul 18:43:50 | 2024-15-Jul 18:44:09 | True | 26 | helper-demo-workspace-2 | x86 | none | 023ddae5-56d6-4574-8e59-eb02c004457c, 772dd02f-d7e7-4986-98df-b5cdbccbd2a7 | helper-model-replace | False | |
helper-demo-pipeline-1 | 2024-15-Jul 18:15:32 | 2024-15-Jul 18:25:07 | True | 25 | helper-demo-workspace-1 | x86 | none | 5c173333-87bd-4bc5-b3fa-7dd651d76d1d, a1b4337a-3379-403c-8d03-bed291092feb, 62e0236f-e040-40c1-8666-7b703fd5d2f0 | helper-model-replace | False | |
api-pipeline-with-models | 2024-11-Jul 17:48:00 | 2024-11-Jul 17:48:00 | True | 24 | sample-api-workspace-assays | x86 | none | 3fe1411e-b2e8-4a28-a0df-860267b7ad3d | api-sample-model | False | |
apiinferenceexamplepipeline | 2024-11-Jul 17:41:23 | 2024-11-Jul 17:41:23 | False | 22 | apiinferenceexampleworkspace | x86 | none | f410e6e0-59d2-488e-9ec9-11013078721e | ccfraud | False | |
sdkinferenceexamplepipeline | 2024-11-Jul 17:27:18 | 2024-11-Jul 17:38:49 | False | 21 | sdkinferenceexampleworkspace | x86 | none | a003db88-d68c-4ee0-b5be-9dd9b44195d7, ef24ecb7-1e4f-4926-b4d9-1e15ab8d6650, b4606f90-70ce-40f7-854f-dab5763ffa89, a6350b91-3fec-4c50-9e7d-b34feb759a1c, 55dc0f1f-5970-4169-af07-35ff54953b00, ccb2c84b-579d-436f-ab07-cccb28628b6d, 893920f4-dd25-4d43-906d-4261b9adf860, cc572647-c758-4b3c-b725-8fff9976a34d | ccfraud | False | |
api-pipeline-with-models | 2024-11-Jul 17:17:42 | 2024-11-Jul 17:17:42 | False | 20 | test-api-workspace-assays | x86 | none | 385f5383-c800-4ff8-9274-5f51c93465e3 | api-sample-model | False | |
ccfraudpipeline | 2024-11-Jul 16:51:54 | 2024-11-Jul 17:40:24 | True | 14 | ccfraudworkspace | x86 | none | ff45b328-9d7c-41bb-aa7c-5b5c7e9cbcc4, 004d69fa-67e1-448b-bd38-8fe5299be206, 7e1cec77-d24f-41ae-b9c1-7c4450194e53, c838a411-8515-4633-8a0c-45bc83d3dcde | ccfraudmodel | False |
Filter by workspace id:
wl.list_pipelines(workspace_id=25)
name | created | last_updated | deployed | workspace_id | workspace_name | arch | accel | tags | versions | steps | published |
---|---|---|---|---|---|---|---|---|---|---|---|
helper-demo-pipeline-1 | 2024-15-Jul 18:15:32 | 2024-15-Jul 18:25:07 | True | 25 | helper-demo-workspace-1 | x86 | none | 5c173333-87bd-4bc5-b3fa-7dd651d76d1d, a1b4337a-3379-403c-8d03-bed291092feb, 62e0236f-e040-40c1-8666-7b703fd5d2f0 | helper-model-replace | False |
Filter by workspace name:
wl.list_pipelines(workspace_name=workspace_name)
name | created | last_updated | deployed | workspace_id | workspace_name | arch | accel | tags | versions | steps | published |
---|---|---|---|---|---|---|---|---|---|---|---|
helper-demo-pipeline-1 | 2024-15-Jul 18:15:32 | 2024-15-Jul 18:25:07 | True | 25 | helper-demo-workspace-1 | x86 | none | 5c173333-87bd-4bc5-b3fa-7dd651d76d1d, a1b4337a-3379-403c-8d03-bed291092feb, 62e0236f-e040-40c1-8666-7b703fd5d2f0 | helper-model-replace | False |
The method wallaroo.client.Client.search_pipelines
returns a List of pipeline versions that match the pipeline name for workspaces the requester is a member of in reverse chronological order.
Admin users have unrestricted access to all workspaces. For more details, see Wallaroo Enterprise User Management.
This search is filtered by the other parameters, including:
workspace_id
: The numerical identifier of the workspace to filter by.workspace_name
: The name of the workspace to filter by.Parameter | Type | Description |
---|---|---|
search_term | (String) (Optional) | The name or tags of the pipeline to search for. |
deployed | (Bool) (Optional) | If the pipeline is deployed (True) or not (False). |
created_start | (Datetime) (Optional) | When the pipeline was created. |
updated_start | (Datetime) (Optional) | When the pipeline was last updated. |
workspace_id | (Int) (Optional) | The numerical identifier of the workspace to filter by. |
workspace_name | (String) (Optional) | The name of the workspace to filter by. |
A List of the most recent Pipeline Versions with the following fields in reverse chronological order:
Field | Type | Description |
---|---|---|
name | (String) | The name of the pipeline version. |
version | (String) | The UUID pipeline version identifier. |
last_updated_time | (DateTime) | The name of the pipeline to search for. |
deployed | (Bool) | Whether the pipeline is deployed (True) or not (False). If (unknown) , the pipeline has never been deployed. |
workspace_id | (String) | The name of the pipeline to search for. |
workspace_name | (String) | The name of the pipeline to search for. |
tags | (String) | The name of the pipeline to search for. |
steps | (String) | The name of the pipeline to search for. |
wl.search_pipelines()
name | version | creation_time | last_updated_time | deployed | workspace_id | workspace_name | tags | steps |
---|---|---|---|---|---|---|---|---|
helper-demo-pipeline-1 | 023ddae5-56d6-4574-8e59-eb02c004457c | 2024-15-Jul 18:44:09 | 2024-15-Jul 18:44:09 | True | 26 | helper-demo-workspace-2 | helper-model-replace | |
helper-demo-pipeline-1 | 772dd02f-d7e7-4986-98df-b5cdbccbd2a7 | 2024-15-Jul 18:43:50 | 2024-15-Jul 18:43:50 | (unknown) | 26 | helper-demo-workspace-2 | ||
helper-demo-pipeline-1 | 5c173333-87bd-4bc5-b3fa-7dd651d76d1d | 2024-15-Jul 18:25:07 | 2024-15-Jul 18:25:07 | True | 25 | helper-demo-workspace-1 | helper-model-replace | |
helper-demo-pipeline-1 | a1b4337a-3379-403c-8d03-bed291092feb | 2024-15-Jul 18:20:02 | 2024-15-Jul 18:20:02 | (unknown) | 25 | helper-demo-workspace-1 | ||
helper-demo-pipeline-1 | 62e0236f-e040-40c1-8666-7b703fd5d2f0 | 2024-15-Jul 18:15:32 | 2024-15-Jul 18:15:32 | (unknown) | 25 | helper-demo-workspace-1 | ||
api-pipeline-with-models | 3fe1411e-b2e8-4a28-a0df-860267b7ad3d | 2024-11-Jul 17:48:00 | 2024-11-Jul 17:48:00 | True | 24 | sample-api-workspace-assays | api-sample-model | |
apiinferenceexamplepipeline | f410e6e0-59d2-488e-9ec9-11013078721e | 2024-11-Jul 17:41:23 | 2024-11-Jul 17:41:23 | False | 22 | apiinferenceexampleworkspace | ccfraud | |
ccfraudpipeline | ff45b328-9d7c-41bb-aa7c-5b5c7e9cbcc4 | 2024-11-Jul 17:40:24 | 2024-11-Jul 17:40:24 | True | 14 | ccfraudworkspace | ccfraudmodel | |
ccfraudpipeline | 004d69fa-67e1-448b-bd38-8fe5299be206 | 2024-11-Jul 17:40:22 | 2024-11-Jul 17:40:22 | (unknown) | 14 | ccfraudworkspace | ||
sdkinferenceexamplepipeline | a003db88-d68c-4ee0-b5be-9dd9b44195d7 | 2024-11-Jul 17:38:49 | 2024-11-Jul 17:38:49 | False | 21 | sdkinferenceexampleworkspace | ccfraud | |
sdkinferenceexamplepipeline | ef24ecb7-1e4f-4926-b4d9-1e15ab8d6650 | 2024-11-Jul 17:38:49 | 2024-11-Jul 17:38:49 | (unknown) | 21 | sdkinferenceexampleworkspace | ||
sdkinferenceexamplepipeline | b4606f90-70ce-40f7-854f-dab5763ffa89 | 2024-11-Jul 17:31:35 | 2024-11-Jul 17:31:35 | (unknown) | 21 | sdkinferenceexampleworkspace | ccfraud | |
sdkinferenceexamplepipeline | a6350b91-3fec-4c50-9e7d-b34feb759a1c | 2024-11-Jul 17:31:34 | 2024-11-Jul 17:31:34 | (unknown) | 21 | sdkinferenceexampleworkspace | ||
sdkinferenceexamplepipeline | 55dc0f1f-5970-4169-af07-35ff54953b00 | 2024-11-Jul 17:28:24 | 2024-11-Jul 17:28:24 | (unknown) | 21 | sdkinferenceexampleworkspace | ccfraud | |
sdkinferenceexamplepipeline | ccb2c84b-579d-436f-ab07-cccb28628b6d | 2024-11-Jul 17:28:23 | 2024-11-Jul 17:28:23 | (unknown) | 21 | sdkinferenceexampleworkspace | ||
sdkinferenceexamplepipeline | 893920f4-dd25-4d43-906d-4261b9adf860 | 2024-11-Jul 17:27:20 | 2024-11-Jul 17:27:20 | (unknown) | 21 | sdkinferenceexampleworkspace | ccfraud | |
sdkinferenceexamplepipeline | cc572647-c758-4b3c-b725-8fff9976a34d | 2024-11-Jul 17:27:18 | 2024-11-Jul 17:27:18 | (unknown) | 21 | sdkinferenceexampleworkspace | ||
api-pipeline-with-models | 385f5383-c800-4ff8-9274-5f51c93465e3 | 2024-11-Jul 17:17:42 | 2024-11-Jul 17:17:42 | False | 20 | test-api-workspace-assays | api-sample-model | |
ccfraudpipeline | 7e1cec77-d24f-41ae-b9c1-7c4450194e53 | 2024-11-Jul 16:51:56 | 2024-11-Jul 16:51:56 | (unknown) | 14 | ccfraudworkspace | ccfraudmodel | |
ccfraudpipeline | c838a411-8515-4633-8a0c-45bc83d3dcde | 2024-11-Jul 16:51:54 | 2024-11-Jul 16:51:54 | (unknown) | 14 | ccfraudworkspace |
Search pipeline with workspace name:
wl.search_pipelines(search_term=pipeline_name, workspace_name=workspace_name)
name | version | creation_time | last_updated_time | deployed | workspace_id | workspace_name | tags | steps |
---|---|---|---|---|---|---|---|---|
helper-demo-pipeline-1 | 023ddae5-56d6-4574-8e59-eb02c004457c | 2024-15-Jul 18:44:09 | 2024-15-Jul 18:44:09 | True | 26 | helper-demo-workspace-2 | helper-model-replace | |
helper-demo-pipeline-1 | 772dd02f-d7e7-4986-98df-b5cdbccbd2a7 | 2024-15-Jul 18:43:50 | 2024-15-Jul 18:43:50 | (unknown) | 26 | helper-demo-workspace-2 |
Search pipeline with workspace id:
wl.search_pipelines(search_term=pipeline_name, workspace_id=25)
name | version | creation_time | last_updated_time | deployed | workspace_id | workspace_name | tags | steps |
---|---|---|---|---|---|---|---|---|
helper-demo-pipeline-1 | 5c173333-87bd-4bc5-b3fa-7dd651d76d1d | 2024-15-Jul 18:25:07 | 2024-15-Jul 18:25:07 | True | 25 | helper-demo-workspace-1 | helper-model-replace | |
helper-demo-pipeline-1 | a1b4337a-3379-403c-8d03-bed291092feb | 2024-15-Jul 18:20:02 | 2024-15-Jul 18:20:02 | (unknown) | 25 | helper-demo-workspace-1 | ||
helper-demo-pipeline-1 | 62e0236f-e040-40c1-8666-7b703fd5d2f0 | 2024-15-Jul 18:15:32 | 2024-15-Jul 18:15:32 | (unknown) | 25 | helper-demo-workspace-1 |
The method wallaroo.client.Client.pipelines_by_name
returns a List of pipelines that match the pipeline name and other filtered parameters, in workspaces the user is a member of, in reverse chronological order.
Admin users have unrestricted access to all workspaces. For more details, see Wallaroo Enterprise User Management.
Parameter | Type | Description |
---|---|---|
pipeline_name | (String) (Optional) | The name or tags of the pipeline to search for. |
workspace_id | (Int) (Optional) | The numerical identifier of the workspace to filter by. |
workspace_name | (String) (Optional) | The name of the workspace to filter by. |
A List of the pipelines in reverse chronological order:
Field | Type | Description |
---|---|---|
name | (String) | The name of the pipeline version. |
version | (String) | The UUID pipeline version identifier. |
last_updated_time | (DateTime) | The name of the pipeline to search for. |
deployed | (Bool) | Whether the pipeline is deployed (True) or not (False). If (unknown) , the pipeline has never been deployed. |
workspace_id | (String) | The name of the pipeline to search for. |
workspace_name | (String) | The name of the pipeline to search for. |
tags | (String) | The name of the pipeline to search for. |
steps | (String) | The name of the pipeline to search for. |
Searching by pipeline name:
wl.pipelines_by_name(pipeline_name=pipeline_name)
name | created | last_updated | deployed | workspace_id | workspace_name | arch | accel | tags | versions | steps | published |
---|---|---|---|---|---|---|---|---|---|---|---|
helper-demo-pipeline-1 | 2024-15-Jul 18:43:50 | 2024-15-Jul 18:44:09 | True | 26 | helper-demo-workspace-2 | x86 | none | 023ddae5-56d6-4574-8e59-eb02c004457c, 772dd02f-d7e7-4986-98df-b5cdbccbd2a7 | helper-model-replace | False | |
helper-demo-pipeline-1 | 2024-15-Jul 18:15:32 | 2024-15-Jul 18:25:07 | True | 25 | helper-demo-workspace-1 | x86 | none | 5c173333-87bd-4bc5-b3fa-7dd651d76d1d, a1b4337a-3379-403c-8d03-bed291092feb, 62e0236f-e040-40c1-8666-7b703fd5d2f0 | helper-model-replace | False |
Search by pipeline name and workspace id:
wl.pipelines_by_name(pipeline_name=pipeline_name, workspace_id=25)
name | created | last_updated | deployed | workspace_id | workspace_name | arch | accel | tags | versions | steps | published |
---|---|---|---|---|---|---|---|---|---|---|---|
helper-demo-pipeline-1 | 2024-15-Jul 18:15:32 | 2024-15-Jul 18:25:07 | True | 25 | helper-demo-workspace-1 | x86 | none | 5c173333-87bd-4bc5-b3fa-7dd651d76d1d, a1b4337a-3379-403c-8d03-bed291092feb, 62e0236f-e040-40c1-8666-7b703fd5d2f0 | helper-model-replace | False |
Search by pipeline name and workspace name:
wl.pipelines_by_name(pipeline_name=pipeline_name, workspace_name=workspace_name)
name | created | last_updated | deployed | workspace_id | workspace_name | arch | accel | tags | versions | steps | published |
---|---|---|---|---|---|---|---|---|---|---|---|
helper-demo-pipeline-1 | 2024-15-Jul 18:43:50 | 2024-15-Jul 18:44:09 | True | 26 | helper-demo-workspace-2 | x86 | none | 023ddae5-56d6-4574-8e59-eb02c004457c, 772dd02f-d7e7-4986-98df-b5cdbccbd2a7 | helper-model-replace | False |
Once a pipeline has been created, or during its creation process, a pipeline step can be added. The pipeline step refers to the model that will perform an inference off of the data submitted to it. Each time a step is added, it is added to the pipeline’s models
array.
Pipeline steps are not saved until the pipeline is deployed. Until then, pipeline steps are stored in local memory as a potential pipeline configuration until the pipeline is deployed.
A pipeline step is added through the pipeline add_model_step({Model})
command.
In the following example, two models uploaded to the workspace are added as pipeline step:
imdb_pipeline.add_model_step(embedder)
imdb_pipeline.add_model_step(smodel)
imdb_pipeline.status()
{'name': 'imdb-pipeline', 'create_time': datetime.datetime(2022, 3, 30, 21, 21, 31, 127756, tzinfo=tzutc()), 'definition': "[{'ModelInference': {'models': [{'name': 'embedder-o', 'version': '1c16d21d-fe4c-4081-98bc-65fefa465f7d', 'sha': 'd083fd87fa84451904f71ab8b9adfa88580beb92ca77c046800f79780a20b7e4'}]}}, {'ModelInference': {'models': [{'name': 'smodel-o', 'version': '8d311ba3-c336-48d3-99cd-85d95baa6f19', 'sha': '3473ea8700fbf1a1a8bfb112554a0dde8aab36758030dcde94a9357a83fd5650'}]}}]"}
The model specified in a pipeline step can be replaced with the pipeline method replace_with_model_step(index, model)
.
The following parameters are used for replacing a pipeline step:
Parameter | Default Value | Purpose |
---|---|---|
index | null | The pipeline step to be replaced. Pipeline steps follow array numbering, where the first step is 0 , etc. |
model | null | The new model to be used in the pipeline step. |
In the following example, a deployed pipeline will have the initial model step replaced with a new one. A status of the pipeline will be displayed after deployment and after the pipeline swap to show the model has been replaced from ccfraudoriginal
to ccfraudreplacement
, each with their own versions.
pipeline.deploy()
pipeline.status()
{'status': 'Running',
'details': [],
'engines': [{'ip': '10.244.2.145',
'name': 'engine-75bfd7dc9d-7p9qk',
'status': 'Running',
'reason': None,
'details': [],
'pipeline_statuses': {'pipelines': [{'id': 'hotswappipeline',
'status': 'Running'}]},
'model_statuses': {'models': [{'name': 'ccfraudoriginal',
'version': '3a03dc94-716e-46bb-84c8-91bc99ceb2c3',
'sha': 'bc85ce596945f876256f41515c7501c399fd97ebcb9ab3dd41bf03f8937b4507',
'status': 'Running'}]}}],
'engine_lbs': [{'ip': '10.244.2.144',
'name': 'engine-lb-55dcdff64c-vf74s',
'status': 'Running',
'reason': None,
'details': []}],
'sidekicks': []}
pipeline.replace_with_model_step(0, replacement_model).deploy()
pipeline.status()
{'status': 'Running',
'details': [],
'engines': [{'ip': '10.244.2.153',
'name': 'engine-96486c95d-zfchr',
'status': 'Running',
'reason': None,
'details': [],
'pipeline_statuses': {'pipelines': [{'id': 'hotswappipeline',
'status': 'Running'}]},
'model_statuses': {'models': [{'name': 'ccfraudreplacement',
'version': '714efd19-5c83-42a8-aece-24b4ba530925',
'sha': 'bc85ce596945f876256f41515c7501c399fd97ebcb9ab3dd41bf03f8937b4507',
'status': 'Running'}]}}],
'engine_lbs': [{'ip': '10.244.2.154',
'name': 'engine-lb-55dcdff64c-9np9k',
'status': 'Running',
'reason': None,
'details': []}],
'sidekicks': []}
A Pipeline Step can be more than models - they can also be pre processing and post processing steps. For example, this preprocessing step uses the following code:
import numpy
import pandas
import json
# add interaction terms for the model
def actual_preprocess(pdata):
pd = pdata.copy()
# convert boolean cust_known to 0/1
pd.cust_known = numpy.where(pd.cust_known, 1, 0)
# interact UnitPrice and cust_known
pd['UnitPriceXcust_known'] = pd.UnitPrice * pd.cust_known
return pd.loc[:, ['UnitPrice', 'cust_known', 'UnitPriceXcust_known']]
# If the data is a json string, call this wrapper instead
# Expected input:
# a dictionary with fields 'colnames', 'data'
# test that the code works here
def wallaroo_json(data):
obj = json.loads(data)
pdata = pandas.DataFrame(obj['query'],
columns=obj['colnames'])
pprocessed = actual_preprocess(pdata)
# return a dictionary, with the fields the model expect
return {
'tensor_fields': ['model_input'],
'model_input': pprocessed.to_numpy().tolist()
}
It is added as a Python module by uploading it as a model:
# load the preprocess module
module_pre = wl.upload_model("preprocess", "./preprocess.py").configure('python')
And then added to the pipeline as a step:
# now make a pipeline
demandcurve_pipeline = (wl.build_pipeline("demand-curve-pipeline")
.add_model_step(module_pre)
.add_model_step(demand_curve_model)
.add_model_step(module_post))
To remove a step from the pipeline, use the Pipeline remove_step(index)
command, where the index
is the array index for the pipeline’s steps.
In the following example the pipeline imdb_pipeline
will have the step with the model smodel-o
removed.
imdb_pipeline.status
<bound method Pipeline.status of {'name': 'imdb-pipeline', 'create_time': datetime.datetime(2022, 3, 30, 21, 21, 31, 127756, tzinfo=tzutc()), 'definition': "[{'ModelInference': {'models': [{'name': 'embedder-o', 'version': '1c16d21d-fe4c-4081-98bc-65fefa465f7d', 'sha': 'd083fd87fa84451904f71ab8b9adfa88580beb92ca77c046800f79780a20b7e4'}]}}, {'ModelInference': {'models': [{'name': 'smodel-o', 'version': '8d311ba3-c336-48d3-99cd-85d95baa6f19', 'sha': '3473ea8700fbf1a1a8bfb112554a0dde8aab36758030dcde94a9357a83fd5650'}]}}]"}>
imdb_pipeline.remove_step(1)
{'name': 'imdb-pipeline', 'create_time': datetime.datetime(2022, 3, 30, 21, 21, 31, 127756, tzinfo=tzutc()), 'definition': "[{'ModelInference': {'models': [{'name': 'embedder-o', 'version': '1c16d21d-fe4c-4081-98bc-65fefa465f7d', 'sha': 'd083fd87fa84451904f71ab8b9adfa88580beb92ca77c046800f79780a20b7e4'}]}}]"}
The Pipeline clear()
method removes all pipeline steps from a pipeline. Note that pipeline steps are not saved until the pipeline is deployed.
Each time the pipeline steps are updated and the pipeline is either deployed or a pipeline version is manually created, a new pipeline version is created. This each Pipeline version includes the following:
Field | Type | Description |
---|---|---|
name | String | The assigned name of the pipeline. |
version | String | The UUID version identifier. |
creation_time | DateTime | The date and time the pipeline was created. |
last_updated_time | DateTime | The date and time the pipeline was updated. |
deployed | Bool | Whether the pipeline is currently deployed or not. |
tags | List | The list of tags applied to the pipeline. For more details, see Wallaroo SDK Essentials Guide: Tag Management. |
steps | List | The list of pipeline steps. |
Pipeline versions are saved automatically when the method wallaroo.pipeline.deploy(deployment_configuration)
is called. Pipeline versions are stored manually with the wallaroo.pipeline.create_versions()
method, which stores the current pipeline steps and other details stored in the local SDK session into the Wallaroo pipeline database.
The following example demonstrates setting a model as a pipeline step, then saving the pipeline configuration in the local SDK session to Wallaroo as a new pipeline version.
display("Current pipeline.")
display(pipeline) # display the current pipeline
pipeline.clear() #clear the current steps
pipeline.add_model_step(houseprice_rf_model_version) #set a different model as a pipeline step
new_pipeline_version = pipeline.create_version()
display("New pipeline version.")
display(new_pipeline_version)
Current pipeline.
name | houseprice-estimator |
version | 342f4605-9467-460e-866f-1b74e6e863d1 |
creation_time | 2023-11-Sep 21:27:00 |
last_updated_time | 2023-11-Sep 21:27:00 |
deployed | False |
tags | |
steps | house-price-prime |
New pipeline version.
name | houseprice-estimator |
version | 937fd68d-2eaa-4b30-80b2-e66ea7be3086 |
creation_time | 2023-26-Sep 16:36:13 |
last_updated_time | 2023-26-Sep 16:36:13 |
deployed | False |
tags | |
steps | house-price-rf-model |
Pipeline versions are retrieved with the method wallaroo.pipeline.versions()
and returns a List of the versions of the pipelines listed in descending creation order, with the most recent version stored in position 0.
The following example demonstrates retrieving the list of pipeline version, then storing a specific version to a variable and displaying its version id and other details.
for pipeline_version in pipeline.versions():
display(pipeline_version)
pipeline_version_01 = pipeline.versions()[7]
display(pipeline_version_01.name())
display(pipeline_version_01)
name | houseprice-estimator |
version | 342f4605-9467-460e-866f-1b74e6e863d1 |
creation_time | 2023-11-Sep 21:27:00 |
last_updated_time | 2023-11-Sep 21:27:00 |
deployed | False |
tags | |
steps | house-price-prime |
name | houseprice-estimator |
— | — |
version | 47a5fb9f-2456-4132-abea-88a3147c4446 |
creation_time | 2023-11-Sep 21:15:00 |
last_updated_time | 2023-11-Sep 21:15:00 |
deployed | False |
tags | |
steps | house-price-prime Check Check Check |
…
‘92f2b4f3-494b-4d69-895f-9e767ac1869d’
name | houseprice-estimator |
version | 92f2b4f3-494b-4d69-895f-9e767ac1869d |
creation_time | 2023-11-Sep 20:49:17 |
last_updated_time | 2023-11-Sep 20:49:17 |
deployed | False |
tags | |
steps | house-price-rf-model |
When a pipeline step is added or removed, the pipeline must be deployed through the pipeline wallaroo.pipeline.Pipeline.deploy(deployment_config, wait_for_status)
. This allocates resources to the pipeline from the Kubernetes environment and make it available to submit information to perform inferences. For full details on pipeline deployment configurations, see Wallaroo SDK Essentials Guide: Pipeline Deployment Configuration.
The method wallaroo.pipeline.Pipeline.deploy
takes the following parameters.
Parameter | Type | Description |
---|---|---|
deployment_config | wallaroo.deployment_config.DeploymentConfig (Optional) | The number of resources to allocate to the cluster such as the number of cpus, amount of ram, etc, along with how many replicas, autoscaling, and other settings. For complete details, see Wallaroo SDK Essentials Guide: Pipeline Deployment Configuration. |
wait_for_status | Boolean (Optional Default: True ) | If True , the Python script will wait until the pipeline is finished successfully deploying before continuing or reaches the timeout. If False , continue on with the Python script. |
By default, wallaroo.pipeline.Pipeline.deploy
sets wait_for_status=True
. In this situation, the Python code pauses and display the deployment process until either:
If wait_for_status=False
, then Python code will not wait for the success or failure of the pipeline to deploy, but proceed to the next instruction without pause.
In these situations, before performing an inference request, use pipeline status to verify that the pipeline status
is Running
.
For example, the following example:
wait_for_status=False
.sample_pipeline_1
status is status='Running'
.for pipe, deploy_config in [(sample_pipeline_1, deployment_config_1), (sample_pipeline_2, deployment_config_2), (sample_pipeline_2, deployment_config_3)]:
pipe.deploy(deployment_config=deploy_config, wait_for_status=False)
# perform other code
# check the pipeline status before performing an inference
if sample_pipeline_1.status()['status'] == 'Running':
sample_pipeline_1.infer(sample_data)
Deployment configurations default to the following*.
Runtime | CPUs | Memory | GPUs |
---|---|---|---|
Wallaroo Native Runtime** | 4 | 3 Gi | 0 |
Wallaroo Containerized Runtime*** | 2 | 1 Gi | 0 |
*: For Kubernetes limits and requests.
**: Resources are always allocated for the Wallaroo Native Runtime engine even if there are no Wallaroo Native Runtimes included in the deployment, so it is recommended to decrease these resources when pipelines use Containerized Runtimes.
***: Resources for Wallaroo Containerized Runtimes only apply with a Wallaroo Containerized Runtime is part of the deployment.
Pipelines do not need to be undeployed to deploy new pipeline versions or pipeline deployment configurations. For example, the following pipeline is deployed, new pipeline steps are set, and the pipeline deploy command is issues again. This creates a new version of the pipeline and updates the deployed pipeline with the new configuration.
# clear all steps
pipeline.clear()
# set modelA as the step
pipeline.add_model_step(modelA)
# deploy the pipeline - the version is saved and the resources allocated to the pipeline
pipeline.deploy()
# clear the steps - this configuration is only stored in the local SDK session until the deploy or create_version command is given
pipeline.clear()
# set modelB as the step
pipeline.add_model_step(modelB)
# deploy the pipeline - the pipeline configuration is saved and the pipeline deployment updated without significant downtime
pipeline.deploy()
Deployment configurations inherit the model’s architecture setting. This is set during model upload by specifying the arch
parameter. By default, models uploaded to Wallaroo default to the x86 architecture.
The following model operations inherit the model’s architecture setting.
The following example shows uploading a model set with the architecture set to ARM
, and how the deployment inherits that architecture without additional deployment configuration changes. For this example, an ONNX model is uploaded.
import wallaroo
housing_model_control_arm = (wl.upload_model(model_name_arm,
model_file_name,
framework=Framework.ONNX,
arch=wallaroo.engine_config.Architecture.ARM)
.configure(tensor_fields=["tensor"])
)
display(housing_model_control_arm)
Name | house-price-estimator-arm |
Version | 163ff0a9-0f1a-4229-bbf2-a19e4385f10f |
File Name | rf_model.onnx |
SHA | e22a0831aafd9917f3cc87a15ed267797f80e2afa12ad7d8810ca58f173b8cc6 |
Status | ready |
Image Path | None |
Architecture | arm |
Acceleration | None |
Updated At | 2024-04-Mar 20:34:00 |
Note that the deployment configuration settings, no architecture is specified. When pipeline_arm
is displayed, we see the arch
setting inherited the model’s arch
setting.
pipeline_arm = wl.build_pipeline(arm_pipeline_name)
# set the model step with the ARM targeted model
pipeline_arm.add_model_step(housing_model_control_arm)
#minimum deployment config for this model
deploy_config = wallaroo.DeploymentConfigBuilder().replica_count(1).cpus(1).memory("1Gi").build()
pipeline_arm.deploy(deployment_config = deploy_config)
Waiting for deployment - this will take up to 45s .......... ok
display(pipeline_arm)
name | architecture-demonstration-arm |
---|---|
created | 2024-03-04 20:34:08.895396+00:00 |
last_updated | 2024-03-04 21:52:01.894671+00:00 |
deployed | True |
arch | arm |
accel | None |
tags | |
versions | 55d834b4-92c8-4a93-b78b-6a224f17f9c1, 98821b85-401a-4ab5-af8e-1b3126727069, 74571863-9eb0-47aa-8b5a-3bdaa7aa9f03, b72fb0db-e4b4-4936-a7cb-3d0fb7827a6f, 3ae70818-10f3-4f61-a998-dee5e2f00daf |
steps | house-price-estimator-arm |
published | True |
By default, deploying a Wallaroo pipeline will deploy the most current version. For example:
sample_pipeline = wl.build_pipeline("test-pipeline")
sample_pipeline.add_model_step(model)
sample_pipeline.deploy()
sample_pipeline.status()
{'status': 'Running',
'details': None,
'engines': [{'ip': '10.12.1.65',
'name': 'engine-778b65459-f9mt5',
'status': 'Running',
'reason': None,
'pipeline_statuses': {'pipelines': [{'id': 'imdb-pipeline',
'status': 'Running'}]},
'model_statuses': {'models': [{'name': 'embedder-o',
'version': '1c16d21d-fe4c-4081-98bc-65fefa465f7d',
'sha': 'd083fd87fa84451904f71ab8b9adfa88580beb92ca77c046800f79780a20b7e4',
'status': 'Running'},
{'name': 'smodel-o',
'version': '8d311ba3-c336-48d3-99cd-85d95baa6f19',
'sha': '3473ea8700fbf1a1a8bfb112554a0dde8aab36758030dcde94a9357a83fd5650',
'status': 'Running'}]}}],
'engine_lbs': [{'ip': '10.12.1.66',
'name': 'engine-lb-85846c64f8-ggg2t',
'status': 'Running',
'reason': None}]}
Pipeline versions are deployed with the method wallaroo.pipeline_variant.deploy(deployment_name, model_configs, config: Optional[wallaroo.deployment_config.DeploymentConfig])
. Note that the deployment_name
and model_configs
are required. The model_configs
are retrieved with the wallaroo.pipeline_variant.model_configs()
method.
The following demonstrates retrieving a previous version of a pipeline, deploying it, and retrieving the deployment status.
pipeline_version = pipeline.versions()[7]
display(pipeline_version.name())
display(pipeline_version)
pipeline_version.deploy("houseprice-estimator", pipeline_version.model_configs())
display(pipeline.status())
name | houseprice-estimator |
version | 92f2b4f3-494b-4d69-895f-9e767ac1869d |
creation_time | 2023-11-Sep 20:49:17 |
last_updated_time | 2023-11-Sep 20:49:17 |
deployed | False |
tags | |
steps | house-price-rf-model |
{'status': 'Running',
'details': [],
'engines': [{'ip': '10.244.3.211',
'name': 'engine-578dc7cdcf-qkx5n',
'status': 'Running',
'reason': None,
'details': [],
'pipeline_statuses': {'pipelines': [{'id': 'houseprice-estimator',
'status': 'Running'}]},
'model_statuses': {'models': [{'name': 'house-price-rf-model',
'version': '616c2306-bf93-417b-9656-37bee6f14379',
'sha': 'e22a0831aafd9917f3cc87a15ed267797f80e2afa12ad7d8810ca58f173b8cc6',
'status': 'Running'}]}}],
'engine_lbs': [{'ip': '10.244.4.243',
'name': 'engine-lb-584f54c899-2rtvg',
'status': 'Running',
'reason': None,
'details': []}],
'sidekicks': []}
Once complete, the pipeline status()
command will show 'status':'Running'
.
Pipeline deployments can be modified to enable auto-scaling to allow pipelines to allocate more or fewer resources based on need by setting the pipeline’s This will then be applied to the deployment of the pipeline
ccfraudPipelineby specifying it's
deployment_config` optional parameter. If this optional parameter is not passed, then the deployment will defer to default values. For more information, see Manage Pipeline Deployment Configuration.
In the following example, the pipeline imdb-pipeline
that contains two steps will be deployed with default deployment configuration:
imdb_pipeline.status
<bound method Pipeline.status of {'name': 'imdb-pipeline', 'create_time': datetime.datetime(2022, 3, 30, 21, 21, 31, 127756, tzinfo=tzutc()), 'definition': "[{'ModelInference': {'models': [{'name': 'embedder-o', 'version': '1c16d21d-fe4c-4081-98bc-65fefa465f7d', 'sha': 'd083fd87fa84451904f71ab8b9adfa88580beb92ca77c046800f79780a20b7e4'}]}}, {'ModelInference': {'models': [{'name': 'smodel-o', 'version': '8d311ba3-c336-48d3-99cd-85d95baa6f19', 'sha': '3473ea8700fbf1a1a8bfb112554a0dde8aab36758030dcde94a9357a83fd5650'}]}}]"}>
imdb_pipeline.deploy()
Waiting for deployment - this will take up to 45s ...... ok
imdb_pipeline.status()
{'status': 'Running',
'details': None,
'engines': [{'ip': '10.12.1.65',
'name': 'engine-778b65459-f9mt5',
'status': 'Running',
'reason': None,
'pipeline_statuses': {'pipelines': [{'id': 'imdb-pipeline',
'status': 'Running'}]},
'model_statuses': {'models': [{'name': 'embedder-o',
'version': '1c16d21d-fe4c-4081-98bc-65fefa465f7d',
'sha': 'd083fd87fa84451904f71ab8b9adfa88580beb92ca77c046800f79780a20b7e4',
'status': 'Running'},
{'name': 'smodel-o',
'version': '8d311ba3-c336-48d3-99cd-85d95baa6f19',
'sha': '3473ea8700fbf1a1a8bfb112554a0dde8aab36758030dcde94a9357a83fd5650',
'status': 'Running'}]}}],
'engine_lbs': [{'ip': '10.12.1.66',
'name': 'engine-lb-85846c64f8-ggg2t',
'status': 'Running',
'reason': None}]}
For full details on pipeline deployment configurations, see Wallaroo SDK Essentials Guide: Pipeline Deployment Configuration.
If you deploy more pipelines than your environment can handle, or if you deploy more pipelines than your license allows, you may see an error like the following:
LimitError: You have reached a license limit in your Wallaroo instance. In order to add additional resources, you can remove some of your existing resources. If you have any questions contact us at community@wallaroo.ai: MAX_PIPELINES_LIMIT_EXCEEDED
Undeploy any unnecessary pipelines either through the SDK or through the Wallaroo Pipeline Dashboard, then attempt to redeploy the pipeline in question again.
When a pipeline is not currently needed, it can be undeployed and its resources turned back to the Kubernetes environment. To undeploy a pipeline, use the pipeline undeploy()
command.
In this example, the aloha_pipeline
will be undeployed:
aloha_pipeline.undeploy()
{'name': 'aloha-test-demo', 'create_time': datetime.datetime(2022, 3, 29, 20, 34, 3, 960957, tzinfo=tzutc()), 'definition': "[{'ModelInference': {'models': [{'name': 'aloha-2', 'version': 'a8e8abdc-c22f-416c-a13c-5fe162357430', 'sha': 'fd998cd5e4964bbbb4f8d29d245a8ac67df81b62be767afbceb96a03d1a01520'}]}}]"}
The pipeline status()
command shows the current status, models, and other information on a pipeline.
The following example shows the pipeline imdb_pipeline
status before and after it is deployed:
imdb_pipeline.status
<bound method Pipeline.status of {'name': 'imdb-pipeline', 'create_time': datetime.datetime(2022, 3, 30, 21, 21, 31, 127756, tzinfo=tzutc()), 'definition': "[{'ModelInference': {'models': [{'name': 'embedder-o', 'version': '1c16d21d-fe4c-4081-98bc-65fefa465f7d', 'sha': 'd083fd87fa84451904f71ab8b9adfa88580beb92ca77c046800f79780a20b7e4'}]}}, {'ModelInference': {'models': [{'name': 'smodel-o', 'version': '8d311ba3-c336-48d3-99cd-85d95baa6f19', 'sha': '3473ea8700fbf1a1a8bfb112554a0dde8aab36758030dcde94a9357a83fd5650'}]}}]"}>
imdb_pipeline.deploy()
Waiting for deployment - this will take up to 45s ...... ok
imdb_pipeline.status()
{'status': 'Running',
'details': None,
'engines': [{'ip': '10.12.1.65',
'name': 'engine-778b65459-f9mt5',
'status': 'Running',
'reason': None,
'pipeline_statuses': {'pipelines': [{'id': 'imdb-pipeline',
'status': 'Running'}]},
'model_statuses': {'models': [{'name': 'embedder-o',
'version': '1c16d21d-fe4c-4081-98bc-65fefa465f7d',
'sha': 'd083fd87fa84451904f71ab8b9adfa88580beb92ca77c046800f79780a20b7e4',
'status': 'Running'},
{'name': 'smodel-o',
'version': '8d311ba3-c336-48d3-99cd-85d95baa6f19',
'sha': '3473ea8700fbf1a1a8bfb112554a0dde8aab36758030dcde94a9357a83fd5650',
'status': 'Running'}]}}],
'engine_lbs': [{'ip': '10.12.1.66',
'name': 'engine-lb-85846c64f8-ggg2t',
'status': 'Running',
'reason': None}]}
A/B testing is a method that provides the ability to test competing ML models for performance, accuracy or other useful benchmarks. Different models are added to the same pipeline steps as follows:
A/B testing splits a portion of the inference requests between the champion model and the one or more challengers through the add_random_split
method. This method splits the inferences submitted to the model through a randomly weighted step.
Each model receives inputs that are approximately proportional to the weight it is assigned. For example, with two models having weights 1 and 1, each will receive roughly equal amounts of inference inputs. If the weights were changed to 1 and 2, the models would receive roughly 33% and 66% respectively instead.
When choosing the model to use, a random number between 0.0 and 1.0 is generated. The weighted inputs are mapped to that range, and the random input is then used to select the model to use. For example, for the two-models equal-weight case, a random key of 0.4 would route to the first model, 0.6 would route to the second.
A random split step can be added to a pipeline through the add_random_split
method.
The following parameters are used when adding a random split step to a pipeline:
Parameter | Type | Description |
---|---|---|
champion_weight | Float (Required) | The weight for the champion model. |
champion_model | Wallaroo.Model (Required) | The uploaded champion model. |
challenger_weight | Float (Required) | The weight of the challenger model. |
challenger_model | Wallaroo.Model (Required) | The uploaded challenger model. |
hash_key | String(Optional) | A key used instead of a random number for model selection. This must be between 0.0 and 1.0. |
Note that multiple challenger models with different weights can be added as the random split step.
add_random_split([(champion_weight, champion_model), (challenger_weight, challenger_model), (challenger_weight2, challenger_model2),...], hash_key)
In this example, a pipeline will be built with a 2:1 weighted ratio between the champion and a single challenger model.
pipeline = (wl.build_pipeline("randomsplitpipeline-demo")
.add_random_split([(2, control), (1, challenger)]))
The results for a series of single are displayed to show the random weighted split between the two models in action:
results = []
results.append(experiment_pipeline.infer_from_file("data/data-1.json"))
results.append(experiment_pipeline.infer_from_file("data/data-1.json"))
results.append(experiment_pipeline.infer_from_file("data/data-1.json"))
results.append(experiment_pipeline.infer_from_file("data/data-1.json"))
results.append(experiment_pipeline.infer_from_file("data/data-1.json"))
for result in results:
print(result[0].model())
print(result[0].data())
('aloha-control', 'ff81f634-8fb4-4a62-b873-93b02eb86ab4')
[array([[0.00151959]]), array([[0.98291481]]), array([[0.01209957]]), array([[4.75912966e-05]]), array([[2.02893716e-05]]), array([[0.00031977]]), array([[0.01102928]]), array([[0.99756402]]), array([[0.01034162]]), array([[0.00803896]]), array([[0.01615506]]), array([[0.00623623]]), array([[0.00099858]]), array([[1.79337805e-26]]), array([[1.38899512e-27]])]
('aloha-control', 'ff81f634-8fb4-4a62-b873-93b02eb86ab4')
[array([[0.00151959]]), array([[0.98291481]]), array([[0.01209957]]), array([[4.75912966e-05]]), array([[2.02893716e-05]]), array([[0.00031977]]), array([[0.01102928]]), array([[0.99756402]]), array([[0.01034162]]), array([[0.00803896]]), array([[0.01615506]]), array([[0.00623623]]), array([[0.00099858]]), array([[1.79337805e-26]]), array([[1.38899512e-27]])]
('aloha-challenger', '87fdfe08-170e-4231-a0b9-543728d6fc57')
[array([[0.00151959]]), array([[0.98291481]]), array([[0.01209957]]), array([[4.75912966e-05]]), array([[2.02893716e-05]]), array([[0.00031977]]), array([[0.01102928]]), array([[0.99756402]]), array([[0.01034162]]), array([[0.00803896]]), array([[0.01615506]]), array([[0.00623623]]), array([[0.00099858]]), array([[1.79337805e-26]]), array([[1.38899512e-27]])]
('aloha-challenger', '87fdfe08-170e-4231-a0b9-543728d6fc57')
[array([[0.00151959]]), array([[0.98291481]]), array([[0.01209957]]), array([[4.75912966e-05]]), array([[2.02893716e-05]]), array([[0.00031977]]), array([[0.01102928]]), array([[0.99756402]]), array([[0.01034162]]), array([[0.00803896]]), array([[0.01615506]]), array([[0.00623623]]), array([[0.00099858]]), array([[1.79337805e-26]]), array([[1.38899512e-27]])]
('aloha-challenger', '87fdfe08-170e-4231-a0b9-543728d6fc57')
[array([[0.00151959]]), array([[0.98291481]]), array([[0.01209957]]), array([[4.75912966e-05]]), array([[2.02893716e-05]]), array([[0.00031977]]), array([[0.01102928]]), array([[0.99756402]]), array([[0.01034162]]), array([[0.00803896]]), array([[0.01615506]]), array([[0.00623623]]), array([[0.00099858]]), array([[1.79337805e-26]]), array([[1.38899512e-27]])]
If a pipeline already had steps as detailed in Add a Step to a Pipeline, this step can be replaced with a random split with the replace_with_random_split
method.
The following parameters are used when adding a random split step to a pipeline:
Parameter | Type | Description |
---|---|---|
index | Integer (Required) | The pipeline step being replaced. |
champion_weight | Float (Required) | The weight for the champion model. |
champion_model | Wallaroo.Model (Required) | The uploaded champion model. |
**challenger_weight | Float (Required) | The weight of the challenger model. |
challenger_model | Wallaroo.Model (Required) | The uploaded challenger model. |
hash_key | String(Optional) | A key used instead of a random number for model selection. This must be between 0.0 and 1.0. |
Note that one or more challenger models can be added for the random split step:
replace_with_random_split(index, [(champion_weight, champion_model), (challenger_weight, challenger_model)], (challenger_weight2, challenger_model2),...], hash_key)
A/B Testing logs entries contain the model used for the inferences in the column out._model_split
.
logs = experiment_pipeline.logs(limit=5)
display(logs.loc[:,['time', 'out._model_split', 'out.main']])
time | out._model_split | out.main | |
---|---|---|---|
0 | 2023-03-03 19:08:35.653 | [{“name”:“aloha-control”,“version”:“89389786-0c17-4214-938c-aa22dd28359f”,“sha”:“fd998cd5e4964bbbb4f8d29d245a8ac67df81b62be767afbceb96a03d1a01520”}] | [0.9999754] |
1 | 2023-03-03 19:08:35.702 | [{“name”:“aloha-challenger”,“version”:“3acd3835-be72-42c4-bcae-84368f416998”,“sha”:“223d26869d24976942f53ccb40b432e8b7c39f9ffcf1f719f3929d7595bceaf3”}] | [0.9999727] |
2 | 2023-03-03 19:08:35.753 | [{“name”:“aloha-challenger”,“version”:“3acd3835-be72-42c4-bcae-84368f416998”,“sha”:“223d26869d24976942f53ccb40b432e8b7c39f9ffcf1f719f3929d7595bceaf3”}] | [0.6606688] |
3 | 2023-03-03 19:08:35.799 | [{“name”:“aloha-control”,“version”:“89389786-0c17-4214-938c-aa22dd28359f”,“sha”:“fd998cd5e4964bbbb4f8d29d245a8ac67df81b62be767afbceb96a03d1a01520”}] | [0.9998954] |
4 | 2023-03-03 19:08:35.846 | [{“name”:“aloha-control”,“version”:“89389786-0c17-4214-938c-aa22dd28359f”,“sha”:“fd998cd5e4964bbbb4f8d29d245a8ac67df81b62be767afbceb96a03d1a01520”}] | [0.99999803] |
Wallaroo provides a method of testing the same data against two different models or sets of models at the same time through shadow deployments otherwise known as parallel deployments or A/B test. This allows data to be submitted to a pipeline with inferences running on several different sets of models. Typically this is performed on a model that is known to provide accurate results - the champion - and a model or set of models that is being tested to see if it provides more accurate or faster responses depending on the criteria known as the challenger(s). Multiple challengers can be tested against a single champion to determine which is “better” based on the organization’s criteria.
As described in the Wallaroo blog post The What, Why, and How of Model A/B Testing:
In data science, A/B tests can also be used to choose between two models in production, by measuring which model performs better in the real world. In this formulation, the control is often an existing model that is currently in production, sometimes called the champion. The treatment is a new model being considered to replace the old one. This new model is sometimes called the challenger….
Keep in mind that in machine learning, the terms experiments and trials also often refer to the process of finding a training configuration that works best for the problem at hand (this is sometimes called hyperparameter optimization).
When a shadow deployment is created, only the inference from the champion is returned in the InferenceResult Object data
, while the result data for the shadow deployments is stored in the InferenceResult Object shadow_data
.
Create a parallel or shadow deployment for a pipeline with the pipeline.add_shadow_deploy(champion, challengers[])
method, where the champion
is a Wallaroo Model object, and challengers[]
is one or more Wallaroo Model objects.
Each inference request sent to the pipeline is sent to all the models. The prediction from the champion is returned by the pipeline, while the predictions from the challengers are not part of the standard output, but are kept stored in the shadow_data
attribute and in the logs for later comparison.
In this example, a shadow deployment is created with the champion versus two challenger models.
champion = wl.upload_model(champion_model_name, champion_model_file).configure()
model2 = wl.upload_model(shadow_model_01_name, shadow_model_01_file).configure()
model3 = wl.upload_model(shadow_model_02_name, shadow_model_02_file).configure()
pipeline.add_shadow_deploy(champion, [model2, model3])
pipeline.deploy()
name | cc-shadow |
created | 2022-08-04 20:06:55.102203+00:00 |
last_updated | 2022-08-04 20:37:28.785947+00:00 |
deployed | True |
tags | |
steps | ccfraud-lstm |
An alternate method is with the pipeline.replace_with_shadow_deploy(index, champion, challengers[])
method, where the index
is the pipeline step to replace.
Model outputs are listed by column based on the model’s outputs. The output data is set by the term out
, followed by the name of the model. For the default model, this is out.{variable_name}
, while the shadow deployed models are in the format out_{model name}.variable
, where {model name}
is the name of the shadow deployed model.
sample_data_file = './smoke_test.df.json'
response = pipeline.infer_from_file(sample_data_file)
time | in.tensor | out.dense_1 | anomaly.count | out_ccfraudrf.variable | out_ccfraudxgb.variable | |
---|---|---|---|---|---|---|
0 | 2023-03-03 17:35:28.859 | [1.0678324729, 0.2177810266, -1.7115145262, 0.682285721, 1.0138553067, -0.4335000013, 0.7395859437, -0.2882839595, -0.447262688, 0.5146124988, 0.3791316964, 0.5190619748, -0.4904593222, 1.1656456469, -0.9776307444, -0.6322198963, -0.6891477694, 0.1783317857, 0.1397992467, -0.3554220649, 0.4394217877, 1.4588397512, -0.3886829615, 0.4353492889, 1.7420053483, -0.4434654615, -0.1515747891, -0.2668451725, -1.4549617756] | [0.0014974177] | 0 | [1.0] | [0.0005066991] |
Shadow deploy results are part of the Pipeline.logs()
method. The output data is set by the term out
, followed by the name of the model. For the default model, this is out.dense_1
, while the shadow deployed models are in the format out_{model name}.variable
, where {model name}
is the name of the shadow deployed model.
logs = pipeline.logs()
display(logs)
time | in.tensor | out.dense_1 | anomaly.count | out_ccfraudrf.variable | out_ccfraudxgb.variable | |
---|---|---|---|---|---|---|
0 | 2023-03-03 17:35:28.859 | [1.0678324729, 0.2177810266, -1.7115145262, 0.682285721, 1.0138553067, -0.4335000013, 0.7395859437, -0.2882839595, -0.447262688, 0.5146124988, 0.3791316964, 0.5190619748, -0.4904593222, 1.1656456469, -0.9776307444, -0.6322198963, -0.6891477694, 0.1783317857, 0.1397992467, -0.3554220649, 0.4394217877, 1.4588397512, -0.3886829615, 0.4353492889, 1.7420053483, -0.4434654615, -0.1515747891, -0.2668451725, -1.4549617756] | [0.0014974177] | 0 | [1.0] | [0.0005066991] |
The Pipeline URL Endpoint or the Pipeline Deploy URL is used to submit data to a pipeline to use for an inference. This is done through the pipeline _deployment._url()
method.
In this example, the pipeline URL endpoint for the pipeline ccfraud_pipeline
will be displayed:
ccfraud_pipeline._deployment._url()
'http://engine-lb.ccfraud-pipeline-1:29502/pipelines/ccfraud-pipeline'
How to manage deployment configurations using the Wallaroo SDK
How to manage deployment configurations using the Wallaroo Dashboard