Wallaroo Enterprise Comprehensive Install Guide: Oracle Cloud Infrastructure (OCI) OKE Support

How to set up Wallaroo Enterprise in the Oracle Cloud Infrastructure Container Engine for Kubernetes (OKE)

Table of Contents

The following guide describes how to set up Wallaroo in an Oracle Cloud Infrastructure Container Engine for Kubernetes (OKE).

Before starting, verify that the cluster hosting the Wallaroo service meets the installation prerequisites.

Create OKE Cluster

Taints and Labels Requirements

Node pools created in Wallaroo require the following taints and labels. For node pools with AI Accelerators, GPUs, ARM support, etc, see the Custom Taints and Labels Guide for how to set up node pools with the proper taints and labels.

NodepoolTaintsLabelsDescription
generalN/Awallaroo.ai/node-purpose: generalFor general Wallaroo services. No taints are applied to this nodepool to allow any process not assigned with a deployment label to run in this space.
persistentwallaroo.ai/persistent=true:NoSchedulewallaroo.ai/node-purpose: persistentFor Wallaroo services with a persistentVolume settings, including JupyterHub, Minio, etc.
pipelines-x86wallaroo.ai/pipelines=true:NoSchedulewallaroo.ai/node-purpose: pipelinesFor deploying pipelines for default x86 architectures. The taints and label must be applied to any nodepool used for model deployments.

The specific nodepool names may differ based on your cloud services naming requirements; check with the cloud services provider for the nodepool name requirements and adjust as needed.

Node Pool Updates

During the procedures, updates to a node pool must wait until the node pool status is no longer Updating.

Once updates are complete, make edits to the node pool. For example, updating the taints and labels as describe in Taints and Labels Requirements, before making updates to a node pool.

Once the node pool edits are made, select Cycle Nodes for the edits to take effect.

Cluster Creation Procedure

The following procedure details how to create a cluster in OKE that meets the Wallaroo prerequisites.

  1. Log into Oracle Cloud.

  2. In the search bar type in OKE and select Kubernetes Clusters (OKE).

    Select OKE
  3. Select Create Cluster, then Quick Create, then select Submit.

  4. Create the cluster steps.

    Standard cluster details
    1. Name your cluster.

    2. Enter the name of the compartment.

    3. Select the desired Kubernetes version. Verify it meets the Kubernetes prerequisites above.

    4. Select “Public endpoint” as Kubernetes API endpoint.

    5. Select “Managed” as the Node type.

    6. Select “Private workers”.

    7. Choose the appropriate Shape and image, such as VM.Standard.E3.Flex. At least one pool must be an x86-64 pool; it’s best to make that the first/default pool. Additional pools are created later.

    8. Set the following. Note that to meet the prerequisites for all Wallaroo services (JupyterHub, etc), a total of 32 cores is required, so a typical install of 4 node pools of 8 cores each would suffice.

      1. Select the desired number of OCPUs per each node, for example, 8.
      2. Select the desired RAM per node, typically 32GB.
      3. Keep the default image.
      4. Specify the node count, typically 1.
    9. Select Show advanced options to expand the section, then set the following.

      Advanced cluster details
      1. Boot volume
        1. Select Specify a custom boot volume size and enter 1000 as the Boot volume size (in GB).
        2. Verify the Boot volume size is 1000 GB.
    10. Select Next.

    11. Review the details, then select Create cluster.

      1. Do NOT check the Create a Basic cluster checkbox!

Next we will update the Cluster Details. Before starting this process, wait until the cluster has completed provisioning.

  1. Select the cluster to update.

  2. In Cluster details, Resources->Node pools.

  3. Select pool1, then Edit.

  4. Under Edit node pool:

    1. Scroll to the very bottom and select the last Show advanced options link to expand the section. Update the following:
      1. Initialization script: Select Paste Cloud-Init Script and paste the following text:

        #!/bin/bash
        curl --fail -H "Authorization: Bearer Oracle" -L0 http://169.254.169.254/opc/v2/instance/metadata/oke_init_script | base64 --decode \>/var/run/oke-init.sh
        bash /var/run/oke-init.sh
        
        sudo /usr/libexec/oci-growfs -y
        
      2. Kubernetes labels: Add the following labels:

        1. Key: wallaroo.ai/node-purpose, Value: general.
  5. Select Save changes.

  6. Back in Node pool details, select Cycle nodes.

  7. Specify 1 as Maximum surge and Maximum unavailable.

  8. Select Cycle nodes.

Create Required Node Pools

With the cluster created, the following node pools are created to fulfill the requirements from Taints and Labels Requirements.

  1. Navigate to your cluster’s Cluster details page.

  2. Under Resources, select Node pools.

  3. Repeat the following process for the additional required nodepools, once for persistent, then for pipelines-x86.

  4. Select the Add node pool button.

  5. Set the following:

    1. Name your node pool, persistent the first node pool, then pipelines-x86 for the second node pool.
    2. Select the same compartment as the cluster.
    3. Select Managed as the Node type.
    4. Select the same Kubernetes version as the primary node pool. This should be pre-selected.
    5. In the Node Placement Configuration section, create a row for every availability domain in the dropdown. For example:
      1. Select the first Availability domain that doesn’t already have a row in this section.
      2. Select a Worker node subnet that is in the Private subnets. Typically these start with the name oke-nodesubnet.
      3. Select the +Another row button until you have exhausted all Availability domains in the first dropdown.
    6. Select the desired Shape and image. For standard setups, the following settings are recommended:
      1. Shape: VM.Standard.E3.Flex.
      2. Select the desired number of OCPUs per each node, for example, 8.
      3. Select the desired RAM per node, typically 32GB.
      4. Image: Keep the default image.
    7. Under Node pool options, specify the desired Node count with the following recommended values.
      1. persistent: 1
      2. pipelines-x86: 2
    8. Under Boot volume, select Specify a custom boot volume size and set 1000 as the Boot volume size (in GB).
    9. Under Pod communication, select the same subnet as the individual availability domains (i.e., one whose name starts with oke-nodesubnet).
    10. Select the last Show advanced options link to expand the section. Set the following based on the node pool type as described in Taints and Labels Requirements.
      1. Initialization script: Select Paste Cloud-Init Script and paste the following text into the text box. Adjust the following based on the node pool type:
        1. --register-with-taints: Set to the taints used for the cluster. For persistent the following settings are used:

          #!/bin/bash
          curl --fail -H "Authorization: Bearer Oracle" -L0 http://169.254.169.254/opc/v2/instance/metadata/oke_init_script | base64 --decode >/var/run/oke-init.sh
          bash /var/run/oke-init.sh --kubelet-extra-args "--register-with-taints=wallaroo.ai/persistent=true:NoSchedule"
          
          sudo /usr/libexec/oci-growfs -y
          

          For pipelines-x86 use the following:

          #!/bin/bash
          curl --fail -H "Authorization: Bearer Oracle" -L0 http://169.254.169.254/opc/v2/instance/metadata/oke_init_script | base64 --decode >/var/run/oke-init.sh
          bash /var/run/oke-init.sh --kubelet-extra-args "--register-with-taints=wallaroo.ai/pipelines=true:NoSchedule"
          
          sudo /usr/libexec/oci-growfs -y
          
  6. Select the Add button. The new node pool will begin provisioning. Note that provisioning errors may occur from rate limits, resource quote limits, or other issues. Consult the Oracle OKE documentation for further details.

  7. Once the node pool is fully provisioned, select the new node pool and select Edit.

  8. Under Edit node pool:

    1. Scroll to the very bottom and select the last Show advanced options link to expand the section. Update the following:
      1. Kubernetes labels: Add the labels as required based on Taints and Labels Requirements.
        1. For the persistent node pool, the deployment label: Key: wallaroo.ai/node-purpose, Value: persistent.
        2. For the persistent node pool, the deployment label: Key: wallaroo.ai/node-purpose, Value: pipelines.

Create Additional OKE Node Pools

The following process creates an additional pool for the cluster. This is useful for ARM or GPU node pools.

  1. Log into Oracle Cloud.

  2. In the search bar type in OKE and select Kubernetes Clusters (OKE).

    Select OKE
  3. Navigate to your cluster’s Cluster details page.

  4. Under Resources, select Node pools.

  5. Select the Add node pool button.

  6. Set the following:

    1. Name your node pool.
    2. Select the same compartment as the cluster.
    3. Select Managed as the Node type.
    4. Select the same Kubernetes version as the primary node pool. This should be pre-selected.
    5. In the Node Placement Configuration section, create a row for every availability domain in the dropdown. For example:
      1. Select the first Availability domain that doesn’t already have a row in this section.
      2. Select a Worker node subnet that is in the Private subnets. Typically these start with the name oke-nodesubnet.
      3. Select the +Another row button until you have exhausted all Availability domains in the first dropdown.
    6. Select the desired Shape and image. For example:
      1. For ARM, select VM.Standard.A1.Flex.
      2. For GPU, select VM.GPU.A10.1.
      3. Select the desired number of CPUs and RAM size if applicable. Some node pools, such as GPU nodes, typically don’t allow modifications.
    7. Keep the suggested Image.
    8. Under Node pool options, specify the desired Node count (e.g., 1 for GPU node pools) .
    9. Under Boot volume, select Specify a custom boot volume size and set 1000 as the Boot volume size (in GB).
    10. Under Pod communication, select the same subnet as the individual availability domains (i.e., one whose name starts with oke-nodesubnet).
    11. Select the last Show advanced options link to expand the section. Set the following based on the node pool type as described in Taints and Labels Requirements.
      1. Initialization script: Select Paste Cloud-Init Script and paste the following text into the text box. Adjust the following based on the node pool type:

        1. --register-with-taints: Set to the taints used for the cluster. By default, this is wallaroo.ai/pipelines=true:NoSchedule for node pools used for model deployment and for any GPU nodepool, also include a GPU-specific taint such as nvidia.com/gpu=a10:NoSchedule.
        #!/bin/bash
         curl --fail -H "Authorization: Bearer Oracle" -L0 http://169.254.169.254/opc/v2/instance/metadata/oke_init_script | base64 --decode >/var/run/oke-init.sh
         bash /var/run/oke-init.sh --kubelet-extra-args "--register-with-taints=wallaroo.ai/pipelines=true:NoSchedule,nvidia.com/gpu=a10:NoSchedule"
        
         sudo /usr/libexec/oci-growfs -y
        
  7. Select the Add button. The new node pool will begin provisioning. Note that provisioning errors may occur from rate limits, resource quote limits, or other issues. Consult the Oracle OKE documentation for further details.

  8. Once the node pool is fully provisioned, select the new node pool and select Edit.

  9. Under Edit node pool:

    1. Scroll to the very bottom and select the last Show advanced options link to expand the section. Update the following:
      1. Kubernetes labels: Add the labels as required based on Taints and Labels Requirements. For example, for model deployments with GPUs, the labels would be:
        1. The pipeline deployment label: Key: wallaroo.ai/node-purpose, Value: pipelines.
        2. GPU custom label: Key: wallaroo.ai/accelerator, Value: a10. This value can be adjusted based on the organization’s requirements.

Oracle OKE Cluster Access

The following process retrieves the Kubernetes Cluster credentials used for the install Wallaroo process.

This requires the Oracle OCI command line tool to perform the Kubernetes cluster configuration on the local system.

  1. Log into Oracle Cloud.

  2. In the search bar type in “OKE” and select “Kubernetes Clusters (OKE)”.

    Select OKE
  3. From the List Scope options, select the Compartment associated with the cluster.

  4. Select the cluster used for the Wallaroo installation.

  5. From the Cluster Details page, select Access Cluster and select Local Access.

    OKE Local Access Instructions
    1. Follow the instructions provided to access the Kubernetes cluster from a local machine meeting the Prerequisites for either a kots or helm based installation of Wallaroo.

Install Wallaroo

Wallaroo installations are available via two methods:

  • kots based installations.
  • helm based installations.

Kubernetes Client Software Prerequisites

Before installing or upgrading Wallaroo, the administrative node managing the Kubernetes cluster will require these tools.

  • kubectl
    • For Kots based installs:
      • kots Version 1.112.2

    • For Helm installs:
      • helm: Install Helm
      • krew: Install Krew
      • krew preflight and krew support-bundle. Install with the following commands:
        • kubectl krew install support-bundle
        • kubectl krew install preflight

The following are quick guides for installing kubectl for macOS.

To install kubectl on a macOS system using Homebrew:

  1. Issue the brew install command:

    brew install kubectl
    
  2. Verify the installation:

    kubectl version --client
    

Kots Based Install of Wallaroo

Wallaroo kots based installations are broken up into two formats:

  • Automated Install: Variables are passed to the installation process to auto-set the password, license file, and other settings before beginning the installation configuration process.
  • Interactive Install: All parts of the installation process are performed through the browser interface.

Note that in either instance, the default installation timeout is 10 minutes. For more details, see kots CLI flags documentation.

Kots Install Guides

kots Quick Install Guide

The following are quick guides for installing kots for different operating systems. For more details, see the instructions for your specific environment.

  • IMPORTANT NOTE

    As of this time, Wallaroo requires kots version 1.112.2. Please verify that version is installed before starting the Wallaroo installation process.

  1. Install kots by downloading the script and piping it into the bash shell, assuming curl is installed:

    curl https://kots.io/install/1.112.2 | REPL_USE_SUDO=y bash
    

For instructions on updating the kots version for a Wallaroo Ops installation, see Updating KOTS.

Manual Kots Install

A manual method to install KOTS is:

  1. Download from https://github.com/replicatedhq/kots/releases/tag/v1.112.2. Linux and MacOS are supported.

  2. Unpack the release

  3. Rename the kots executable to kubectl kots.

  4. Copy the renamed kubectl kots to anywhere on the PATH.

  5. Next, verify successful installation.

    ~ kubectl kots version
    
    Replicated KOTS 1.112.2
    

Automated Kots Wallaroo Install

To automatically install Wallaroo into the namespace wallaroo, specify the administrative password and the license file during the installation as in the following format with the following variables:

  • NAMESPACE: The namespace for the Wallaroo Enterprise install, typically wallaroo.
  • LICENSEFILE: The location of the Wallaroo Enterprise license file.
  • SHAREDPASSWORD: The password of for the Kots Administrative Dashboard.
kubectl kots install wallaroo/2024-4 -n $NAMESPACE --license-file $LICENSEFILE --shared-password $SHAREDPASSWORD

For example, the following settings translate to the following install command:

  • NAMESPACE: wallaroo.
  • LICENSEFILE: myWallaroolicense.yaml
  • SHAREDPASSWORD: snugglebunnies
kubectl kots install wallaroo/2024-4 -n wallaroo --license-file myWallaroolicense.yaml --shared-password wallaroo

Interactive Kots Wallaroo Install

The Interactive Install process allows users to adjust the configuration settings before Wallaroo is deployed. It requires users be able to access the Kots Administrative Dashboard through a browser, typically on port 8800.

  • IMPORTANT NOTE: Users who install Wallaroo through another node can use SSH tunneling to access the Kots Administrative Dashboard. For example:

    ssh IP -L8800:localhost:8800
    
  1. Install the Wallaroo Enterprise Edition using kots install wallaroo/2024-4, specifying the namespace to install Wallaroo into. For example, if wallaroo is the namespace, then the command is:

    kubectl kots install wallaroo/2024-1 --namespace wallaroo
    
  2. Wallaroo Enterprise Edition will be downloaded and installed into your Kubernetes environment in the namespace specified. When prompted, set the default password for the Wallaroo environment. When complete, Wallaroo Enterprise Edition will display the URL for the Admin Console, and how to end the Admin Console from running.

    • Deploying Admin Console
    • Creating namespace ✓
    • Waiting for datastore to be ready ✓
        Enter a new password to be used for the Admin Console: •••••••••••••
      • Waiting for Admin Console to be ready ✓
    
    • Press Ctrl+C to exit
    • Go to http://localhost:8800 to access the Admin Console
    

To relaunch the Kots Administrative Dashboard and make changes or updates, use the following command:

kubectl kots admin-console --namespace wallaroo
Initial Wallaroo Config

Once Wallaroo has been installed for the first time, we can perform initial configuration.

  1. If Wallaroo Edition has not started, launch it with the following command:

    ❯ kubectl kots admin-console --namespace wallaroo
      • Press Ctrl+C to exit
      • Go to http://localhost:8800 to access the Admin Console
    
  2. Enter the Kots Administrative Console address into a browser. You will be prompted for the default password as set in the step above. Enter it and select Log in.

    Kots Administrative Console Initial Login
  3. Upload your license file. If the license file was already selected from the Automated installation, this step is skipped.

    Wallaroo Admin Upload License
  4. The Kots Administrative Console will run the preflight checks to verify that all of the minimum requirements are met. This may take a few minutes. If there are any issues, Wallaroo can still be launched but may not function properly. When ready, select Continue.

    Wallaroo Admin Preflight Successful
  5. The Configure Wallaroo page will be displayed which allows you to customize your Wallaroo environment. The following are the minimum required settings.

    1. Networking Configuration: Set whether the Wallaroo instance is available from outside the Kubernetes cluster, or only from within it.

      1. Ingress Mode for Wallaroo Endpoints:
        1. None: Services are local to the Kubernetes cluster. kubectl-port forward or some other means is required to access them. If all work will be done in-cluster, select this option.
        2. Internal: An internal cloud load balancer and associated resources are created. Network users outside the Kubernetes cluster – but on the same internal network – can connect directly using DNS names, and do not need to use port forward or related configurations.
        3. External: An external, Internet-facing cloud load balancer, public IP, and associated resources are created. This is highly discouraged. Public DNS is also required. This is the default for Wallaroo Community Edition.
          1. Enable external URL inference endpoints: Creates pipeline inference endpoints. For more information, see Model Endpoints Guide.
    2. From the Wallaroo Dashboard, select Config and set the following:

    3. DNS Settings

      Kots DNS Settings
      1. Wallaroo domain name (Required): The domain for the Wallaroo instance.
    4. TLS Certificates

      1. Use custom TLS Certs: Checked
      2. TLS Certificate: Enter your TLS Certificate (.crt file).
      3. TLS Private Key: Enter your TLS private key (.key file).
    Wallaroo DNS Records
  6. Once complete, scroll to the bottom of the Config page and select Deploy.

  7. At this point, continue to Required Installation Configurations for the required configuration settings.

Edge Deployment Configuration

The following optional configurations to enable OCI Registry integration with Wallaroo for deploying models in edge and multi-cloud environments through Wallaroo. For more details, see Inference Anywhere.

To set the Edge Registry Settings through the Kots Administrative Dashboard:

  1. Launch the Kots Administrative Dashboard using the following command, replacing the --namespace parameter with the Kubernetes namespace for the Wallaroo instance:

    kubectl kots admin-console --namespace wallaroo
    
  2. Open a browser at the URL detailed in the step above and authenticate using the console password set as described in the as detailed in the Wallaroo Install Guides.

  3. From the top menu, select Config then scroll to Edge Deployment.

  4. Enable Provide OCI registry credentials for pipelines.

  5. Enter the following:

    Edge deployment registry service details

    1. Registry URL: The address of the registry service. For example: us-west1-docker.pkg.dev.
    2. email: The email address of the user account used to authenticate to the service.
    3. username: The account used to authenticate to the registry service.
    4. password: The password or token used to authenticate to the registry service.
    5. To enable edge observability, enable Enable pipelines deployed on the edge to send data back to the OpsCenter.
  6. Save the updated configuration, then deploy it. Once complete, the edge registry settings will be available.

Set Assay Version

To enable Wallaroo assays via kots:

  1. From the Config page, scroll to Config -> Observability -> Enable Assays.

  2. Select from one of the following options:

    1. Disable: Disable assays.
    2. v1: Enable Wallaroo Assays V1.
    3. v2: Enable Wallaroo Assays V2 (Default).
  3. Complete the deployment as needed.

Custom Tolerations and Node Selectors Configuration

By default, the following taints and labels are applied to nodepools used for Wallaroo installations.

NodepoolTaintsLabelsDescription
generalN/Awallaroo.ai/node-purpose: generalFor general Wallaroo services. No taints are applied to this nodepool to allow any process not assigned with a deployment label to run in this space.
persistentwallaroo.ai/persistent=true:NoSchedulewallaroo.ai/node-purpose: persistentFor Wallaroo services with a persistentVolume settings, including JupyterHub, Minio, etc.
pipelines-x86wallaroo.ai/pipelines=true:NoSchedulewallaroo.ai/node-purpose: pipelinesFor deploying pipelines for default x86 architectures. The taints and label must be applied to any nodepool used for model deployments.

The specific nodepool names may differ based on your cloud services naming requirements; check with the cloud services provider for the nodepool name requirements and adjust as needed.

For organizations that use custom Kubernetes taints and labels for their nodepools, the tolerations and node selectors for Wallaroo services will have to be modified to match. For full details, see the Custom Taints and Labels Guide.

To modify the tolerations and node selector labels to match the taints and labels set for the nodepools:

  1. From the Kots Administrative Dashboard, select Config
  2. Update each of the following as needed:
    1. Node Affinities:

      1. Node affinity type key: Verify that the node affinity key matches the label for the nodepools.

      2. Engine affinity value: Set the engine affinity - the affinity used for pipeline deployment - to match the label.

        Node Affinity
    2. Taints and Tolerations. Set the custom tolerations to match the taints applied to the nodepools, and the node selectors to match the labels used for the nodepools.

    3. Node Selectors: Update the node selector to match the custom nodepools labels for each service.

      Kots Custom Tolerances and Node Selectors

Helm Based Installation of Wallaroo

Installation Client Helm Requirements

The following software is required for the client with administrative access to the Kubernetes cluster that will host Wallaroo via Helm.

  • For Helm installs:
    • helm: Install Helm
    • krew: Install Krew
    • krew preflight and krew support-bundle. Install with the following commands:
      • kubectl krew install support-bundle
      • kubectl krew install preflight

The following details how to install Wallaroo via helm.

Wallaroo Provided Data

Members of the Wallaroo support staff will provide the following information:

  • Wallaroo Container Registration Login: Commands to login to the Wallaroo container registry.
  • Preflight verification command: The commands to verify that the Kubernetes environment meets the requirements for the Wallaroo install.
  • Install Wallaroo Command: Instructions on installations into the Kubernetes environment using Helm through the Wallaroo container registry.

The following steps are used with these command and configuration files to install Wallaroo Enterprise via Helm.

Registration Login

The first step in the Wallaroo installation process via Helm is to connect to the Kubernetes environment that will host the Wallaroo Enterprise instance and login to the Wallaroo container registry through the command provided by the Wallaroo support staff. The command will take the following format, replacing $YOURUSERNAME and $YOURPASSWORD with the respective username and password provided.

helm registry login registry.replicated.com --username $YOURUSERNAME --password $YOURPASSWORD

Preflight Verification

Preflight verification is performed with the following command format. The variables $LICENSE_CHANNEL and $VERSION is supplied by your Wallaroo support representative.

helm template oci://registry.replicated.com/wallaroo/$LICENSE_CHANNEL/wallaroo --version $VERSION | kubectl preflight -

For example, the $LICENSE_CHANNEL=2024-4 of 2024-4 and the $VERSION=2024.4.0-5879

helm template oci://registry.replicated.com/wallaroo/2024-4/wallaroo --version 2024.4.0-5879 | kubectl preflight -

This displays the Preflight Checks report.

Preflight check example

The following commands are available:

  • s: Save the report to a text file as the file preflight-checks-results-DATETIME.txt. For example: preflight-checks-results-2024-03-19T13_30_41.txt.
  • q: Exit the preflight report.
  • Up Arrow or Down Arrow: Scroll through the preflight elements and view the report details.

The following example shows a successful preflight test.

Preflight Checks Preflight Checks

Check PASS
Title: Required Kubernetes Version
Message: Your cluster meets the recommended and required versions of Kubernetes.

------------
Check PASS
Title: Container Runtime
Message: Containerd container runtime was found.

------------
Check PASS
Title: Check Kubernetes environment.
Message: GKE is a supported distribution

------------
Title: Cluster CPU Resources
Message: 

------------
Check PASS
Title: Cluster Resources
Message: Cluster resources are satisfactory

------------
Check PASS
Title: Every node in the cluster must have at least 12Gi of memory
Message: All nodes have at least 12 GB of memory capacity

------------
Check PASS
Title: Every node in the cluster must have at least 8 cpus allocatable.
Message: All nodes have at least 8 CPU capacity

------------

Prepare Helm Installation

The following instructions detail how to install Wallaroo Enterprise via Helm for Kubernetes cloud environments such as Microsoft Azure, Amazon Web Service, and Google Cloud Platform.

Helm Network Configuration

apilb.serviceType settings have the following effects.

SettingCloud Kubernetes
Internal Only ConnectionsClusterIP
External ConnectionsLoadBalancer

Refer to the instructions for environment host for details on IP address allocation and support.

With the preflight checks and prerequisites met, Wallaroo can be installed via Helm through the following process:

  1. Create namespace. By default, the namespace wallaroo is used:

    kubectl create namespace wallaroo
    
  2. Set the new namespace as the current namespace:

    kubectl config set-context --current --namespace wallaroo
    
  3. Set the TLS certificate secret in the Kubernetes environment:

    1. Create the certificate and private key. It is recommended to name it after the domain name of your Wallaroo instance. For example: wallaroo.example.com. For production environments, organizations are recommended to use certificates from their certificate authority. Note that the Wallaroo SDK will not connect from an external connection without valid certificates. For more information on using DNS settings and certificates, see the Wallaroo DNS Integration Guide.

    2. Create the Kubernetes secret from the certificates created in the previous step, replacing $TLSCONFIG with the name of the Kubernetes secret, with the certificate file $TLSCERT and the private key $TLSKEY. Store the secret name for a the step Configure local values file.

      kubectl create secret tls $TLSCONFIG --cert=$TLSCERT --key=$TLSKEY
      

      For example, if $TLSCONFIG is cust-cert-secret with the certificate $TLSCERT is fullchain.pem and key $TLSKEY is privkey.pem, then the command would be translated as

      kubectl create secret tls cust-cert-secret --cert=fullchain.pem --key=privkey.pem
      

Default Helm Installation Settings

A default Helm install of Wallaroo contains various default settings. The local values file overwrites values based on the organization needs. The following represents the minimum mandatory values for a Wallaroo installation using certificates and the default LoadBalancer for a cloud Kubernetes cluster. The configuration details below is saved as values.yaml for these examples.

Note the following required settings:

  • wallarooDomain: Used to set the DNS domain name for the Wallaroo instance. For more information, see the Wallaroo DNS Integration Guide.
  • custTlsSecretName: Specify the Kubernetes secret created in the previous step. External connections through the Wallaroo SDK require valid certificates.
  • apilb: Sets the apilb service options including the following:
    • serviceType: LoadBalancer: Uses the default LoadBalancer setting for the Kubernetes cloud service the Wallaroo instance is installed into. Replace with the specific service connection settings as required.
    • external_inference_endpoints_enabled: true: This setting is required for performing external SDK inferences to a Wallaroo instance. For more information, see the Wallaroo Model Endpoints Guide
    • kubernetes_distribution: The cloud provider used. Each cloud provider has their own methods of handling storage classes, LoadBalancers (LB), etc. The following values are available; if your selected provider or distribution is not on the list, contact your Wallaroo Support representative for more options.
      • aks: Microsoft Azure Kubernetes Service
      • eks: Amazon Elastic Kubernetes Service
      • gke: Google Kubernetes Engine
      • oke: Oracle Cloud Infrastructure Container Engine for Kubernetes
      • kurl: On premise aka stand alone Linux installations.
    • ingress_mode: How the Wallaroo instance is reached through the Kubernetes network settings. Options include:
      • internal (Default): An internal cloud load balancer and associated resources are created. Network users outside the Kubernetes cluster – but on the same internal network – can connect directly using DNS names, and do not need to use port forward or related configurations.
      • external: An external, Internet-facing cloud load balancer, public IP, and associated resources are created. This is highly discouraged. Public DNS is also required. This is the default for Wallaroo Community Edition.
      • none: Services are local to the Kubernetes cluster. kubectl-port forward or some other means is required to access them. If all work will be done in-cluster, select this option.
  • dashboard: The name displayed when users login to the Wallaroo Ops center Dashboard. For example, “My Company” or “Sales Division”, etc.

The following example shows the minimum required options.

This example shows the uncommented keys for the minimum required settings and additional commented optional settings. For full details on helm values for Wallaroo installations, see the Wallaroo Helm Reference Guides.

wallarooDomain: "wallaroo.example.com" # change to match the actual domain name

custTlsSecretName: cust-cert-secret

apilb:
  serviceType: LoadBalancer
  external_inference_endpoints_enabled: true
  ingress_mode: internal # internal (Default), external,or none

dashboard:
  clientName: "Wallaroo Helm Example" # Insert the name displayed in the Wallaroo Dashboard

kubernetes_distribution: ""   # Required. One of: aks, eks, gke, oke, or kurl.

# enable Wallaroo assays.  Select **one** of the following: `v1` or `v2`, or leave commented out to disable assays.
# v2 is enabled by default.
#assays:
#  enabled: true
#  v1: false
#  v2: true


# Enable edge deployment
#ociRegistry: 
#  enabled: true # true enables the Edge Server registry information, false disables it.
#  registry: ""# The registry url. For example: reg.big.corp:3579.
#  repository: ""# The repository within the registry. This may include the cloud account, or the full path where the Wallaroo published pipelines should be kept. For example: account123/wallaroo/pipelines.
#  email: "" # Optional field to track the email address of the registry credential.
#  username: "" # The username to the registry. This may vary based on the provider. For example, GCP Artifact Registry with service accounts uses the username _json_key_base64 with the p`ass`word as a base64 processed token of the credential information.
#  password: "" # The password or token for the registry service.

# Enable edge deployment observability
# edgelb:
#     enabled: true

# The nodeSelector and tolerations for all components
# This does not apply to nats, fluent-bit, or minio so needs to be applied separately
# nodeSelector:
#   wallaroo.ai/reserved: true

# tolerations:
# - key: "wallaroo.ai/reserved"
#   operator: "Exists"
#   effect: "NoSchedule"

# To change the pipeline taint or nodeSelector, 
# best practice is to change engine, enginelb, and engineAux 
# together unless they will be in different pools.
# engine:
#   nodeSelector:
#     wallaroo.ai/node-purpose: pipelines
#   tolerations:
#     - key: "wallaroo.ai/pipelines"
#       operator: "Exists"
#       effect: "NoSchedule"

# enginelb:
#   nodeSelector:
#     wallaroo.ai/node-purpose: pipelines
#   tolerations:
#     - key: "wallaroo.ai/pipelines"
#       operator: "Exists"
#       effect: "NoSchedule"

# engineAux:
#   nodeSelector:
#     wallaroo.ai/node-purpose: pipelines
#   tolerations:
#     - key: "wallaroo.ai/pipelines"
#       operator: "Exists"
#       effect: "NoSchedule"

# For each service below, adjust the disk size and resources as required.
# If the nodeSelector or tolerations are changed for one service, 
# the other services nodeSelector and tolerations **must** be changed to match
#
#
# plateau:
#   diskSize: 100Gi
#   resources:
#     limits:
#       memory: 4Gi
#       cpu: 1000m
#     requests:
#       memory: 128Mi
#       cpu: 100m
#   nodeSelector:
#     wallaroo.ai/node-purpose: persistent
#   tolerations:
#     - key: "wallaroo.ai/persistent"
#       operator: "Exists"
#       effect: "NoSchedule"

# Jupyter has both hub and lab nodeSelectors and tolerations
# They default to the same persistent pool, but can be assigned to different ones
# jupyter:
#   nodeSelector:                 # Node placement for Hub administrative pods
#     wallaroo.ai/node-purpose: persistent
#   tolerations:
#     - key: "wallaroo.ai/persistent"
#       operator: "Exists"
#       effect: "NoSchedule"
#   labNodeSelector:              # Node placement for Hub-spawned jupyterlab pods
#     wallaroo.ai/node-purpose: persistent
#   labTolerations:
#     - key: "wallaroo.ai/persistent"
#       operator: "Exists"
#       effect: "NoSchedule"
#   memory:
#     limit: "4"                  # Each Lab - memory limit in GB
#     guarantee: "2"              # Each Lab - lemory guarantee in GB
#   cpu:
#     limit: "2.0"                # Each Lab - fractional CPU limit
#     guarantee: "1.0"            # Each Lab - fractional CPU guarantee
#   storage:
#     capacity: "50"              # Each Lab - disk storage capacity in GB

# minio:
#   persistence:
#     size: 25Gi
#   nodeSelector:
#     wallaroo.ai/node-purpose: persistent
#   tolerations:
#   - key: wallaroo.ai/persistent
#     operator: "Exists"
#     effect: "NoSchedule"
#   resources:
#     requests:
#       memory: 1Gi

# postgres:
#   diskSize: 10Gi
#   nodeSelector:
#     wallaroo.ai/node-purpose: persistent
#   tolerations:
#     - key: "wallaroo.ai/persistent"
#       operator: "Exists"
#       effect: "NoSchedule"
#   resources:
#     limits:
#       memory: 2Gi
#       cpu: 500m
#     requests:
#       memory: 512Mi
#       cpu: 100m

# Prometheus has the usual persistent options, but also a retention size
# The the size on disk and time can be configured before removing it.
# prometheus:
#   storageRetentionSizeGb: "10"        # Retain this much data, in GB.
#   storageRetentionTimeDays: "15"     # When to remove old data. In days.
#   nodeSelector:
#     wallaroo.ai/node-purpose: persistent
#   tolerations:
#     - key: "wallaroo.ai/persistent"
#       operator: "Exists"
#       effect: "NoSchedule" 
#   resources:
#     limits:
#       memory: 6Gi
#       cpu: 2000m
#     requests:
#       memory: 512Mi
#       cpu: 100m

# nats:
#   podTemplate:
#     merge:
#       spec:
#         nodeSelector: 
#           wallaroo.ai/node-purpose: persistent
#         tolerations:
#         - key: "wallaroo.ai/persistent"
#           operator: "Exists"
#           effect: NoSchedule

# wallsvc:
#   nodeSelector:
#     wallaroo.ai/node-purpose: persistent
#   tolerations:
#     - key: "wallaroo.ai/persistent"
#       operator: "Exists"
#       effect: "NoSchedule"

persistentVolume Settings

Wallaroo services that have a persistentVolume have the following default nodeSelector label and tolerations:

  • nodeSelector
    • Label: wallaroo.ai/node-purpose
    • Value: persistent
  • tolerations
    • Key: wallaroo.ai/persistent
    • Operator: “Exists”
    • Effect: “NoSchedule”

For example:

nodeSelector:
  wallaroo.ai/node-purpose: persistent
tolerations:
  - key: wallaroo.ai/persistent
    operator: "Exists"
    effect: "NoSchedule"

If the nodeSelector or tolerations are changed for any service with a persistentVolume, all other services must be edited to match.

For additional information on taints and tolerations settings, see the Taints and Labels Guide.

Install Wallaroo with Helm

  1. Install Wallaroo: The Wallaroo support representative will provide the installation command for the Helm install that will use the Wallaroo container registry. This assumes that the preflight checks were successful. This command uses the following format:

    helm install $RELEASE $REGISTRYURL --version $VERSION --values $LOCALVALUES.yaml --timeout 10m
    

    Where:

    1. $RELEASE: The name of the Helm release. By default, wallaroo.
    2. $REGISTRYURL: The URl for the Wallaroo container registry service.
    3. $VERSION: The version of Wallaroo to install. For this example, 2024.4.0-5879.
    4. $LOCALVALUES: The .yaml file containing the local values overrides. For this example, values.yaml.

    For example, for the registration wallaroo the command would be:

    helm install wallaroo oci://registry.replicated.com/wallaroo/2024-4/wallaroo --version 2024.4.0-5879 --values values.yaml --timeout 10m
    

    If any required elements are missing from the values.yaml file, an error is displayed. For example, leaving out the kubernetes_distribution field returns the following:

    Warning - kubernetes_distribution must be set in user provided values.yaml
    

    Upon successful installation, notes are published indicating the installed version, where to find documentation, etc.:

    NOTES:
    .
    
    Welcome to Wallaroo 2024.4.0
    
    1. Deployment Information:
      Name:             2024.4.0
      Release notes:    https://docs.wallaroo.ai/wallaroo-release-notes/wallaroo-release-202401
      Version:          v2024.4.0-5187
    
    2. Accessing Wallaroo
      Documentation:    https://docs.wallaroo.ai
      Dashboard:        https://sample.wallaroocommunity.ninja
    
    3. Useful Commands:
    
      - Helm tests are available by using: `helm test wallaroo`.
    
      - External load balancer hostname can be found by using:
    
          kubectl get svc api-lb-ext -n wallaroo  -o jsonpath='{.status.loadBalancer.ingress[0].hostname}'
    
      - List Wallaroo namespaces, including pipeline deployments, but not including the main `wallaroo` namespace:
    
          kubectl get namespaces -l wallaroo-managed=true
    
      - In order to change any helm values:
    
          helm upgrade --install wallaroo oci://registry.replicated.com/wallaroo/uat-latest/wallaroo --version v2024.4.0-5187 --values $LOCALVALUES_YAML --timeout 10m
    
     4. Uninstall:
    
       1. To uninstall/delete the Wallaroo deployment, run:
    
         kubectl delete ns wallaroo && kubectl delete \
           all,secret,configmap,clusterroles,clusterrolebindings,storageclass,crd \
           --selector app.kubernetes.io/part-of=wallaroo --selector kots.io/app-slug=wallaroo
    
       2. To delete all pipelines, run:
    
         kubectl delete ns -l wallaroo-managed=true
    .
    
  2. Verify the Installation: Once the installation is complete, verify the installation with the helm test $RELEASE command. A condensed display uses egrep to show only the test suite and phase status as follows. Replace wallaroo with the name of the helm release used.

    helm test wallaroo | egrep 'SUITE:|Phase:'
    TEST SUITE:     wallaroo-fluent-bit-test-connection
    Phase:          Succeeded
    TEST SUITE:     nats-test-request-reply
    Phase:          Succeeded
    TEST SUITE:     wallaroo-wallaroo-test-connections-hook
    Phase:          Succeeded
    TEST SUITE:     wallaroo-test-objects-hook
    Phase:          Succeeded
    

    The following will show the full helm test output with notes.

    helm test wallaroo
    

    which displays the following:

    NAME: wallaroo
    LAST DEPLOYED: Fri May 17 14:00:04 2024
    NAMESPACE: wallaroo
    STATUS: deployed
    REVISION: 2
    TEST SUITE:     wallaroo-fluent-bit-test-connection
    Last Started:   Fri May 17 14:04:48 2024
    Last Completed: Fri May 17 14:04:51 2024
    Phase:          Succeeded
    TEST SUITE:     nats-test-request-reply
    Last Started:   Fri May 17 14:04:43 2024
    Last Completed: Fri May 17 14:04:48 2024
    Phase:          Succeeded
    TEST SUITE:     sample-wallaroo-test-connections-hook
    Last Started:   Fri May 17 14:04:24 2024
    Last Completed: Fri May 17 14:04:31 2024
    Phase:          Succeeded
    TEST SUITE:     sample-wallaroo-test-objects-hook
    Last Started:   Fri May 17 14:04:31 2024
    Last Completed: Fri May 17 14:04:43 2024
    Phase:          Succeeded
    NOTES:
    .
    
    Welcome to Wallaroo 2024.4.0
    
    1. Deployment Information:
      Name:             2024.4.0
      Release notes:    https://docs.wallaroo.ai/wallaroo-release-notes/wallaroo-release-202401
      Version:          v2024.4.0-5187
    
    2. Accessing Wallaroo
      Documentation:    https://docs.wallaroo.ai
      Dashboard:        https://sample.wallaroocommunity.ninja
    
    3. Useful Commands:
    
      - Helm tests are available by using: `helm test wallaroo`.
    
      - External load balancer hostname can be found by using:
    
          kubectl get svc api-lb-ext -n wallaroo  -o jsonpath='{.status.loadBalancer.ingress[0].hostname}'
    
      - List Wallaroo namespaces, including pipeline deployments, but not including the main `wallaroo` namespace:
    
          kubectl get namespaces -l wallaroo-managed=true
    
      - In order to change any helm values:
    
          helm upgrade --install wallaroo oci://registry.replicated.com/wallaroo/2024-4/wallaroo --version 2024.4.0-5879 --values $LOCALVALUES_YAML --timeout 10m
    
     4. Uninstall:
    
      1. To uninstall/delete the Wallaroo deployment, run:
    
        kubectl delete ns wallaroo && kubectl delete \
          all,secret,configmap,clusterroles,clusterrolebindings,storageclass,crd \
          --selector app.kubernetes.io/part-of=wallaroo --selector kots.io/app-slug=wallaroo
    
      2. To delete all pipelines, run:
    
        kubectl delete ns -l wallaroo-managed=true
    .
    

At this point, the installation is complete and can be accessed through the fully qualified domain names set in the installation process above. Verify that the DNS settings are accurate before attempting to connect to the Wallaroo instance. For more information, see the Wallaroo DNS Integration Guide.

To add the initial users if they were not set up through Helm values, see the Wallaroo Enterprise User Management guide.

Required Installation Configurations

Once installed, the following actions are required settings are required to complete the setup process process.

DNS Services Integration

DNS services integration is required for Wallaroo Enterprise to provide access to the various supporting services that are part of the Wallaroo instance. These include:

  • Simplified user authentication and management.
  • Centralized services for accessing the Wallaroo Dashboard, Wallaroo SDK and Authentication.
  • Collaboration features allowing teams to work together.
  • Managed security, auditing and traceability.

The following guide is for standard DNS services.

Once integrated, users can access the following services directly from a URL starting with the Wallaroo Domain - this is the domain name the DNS enetry is set to. For example, if the Wallaroo Domain is wallaroo.example.com, then the Wallaroo Ops instance is accessed from https://wallaroo.example.com/{service}.

Note that even when accessing specific Wallaroo services directly that the user must authenticate through Wallaroo or another authorized authentication service.

ServicePathDescription
Wallaroo Dashboardwallaroo domainProvides access to a user interface for updating workspaces, pipelines, and models. Also provides access to the integrated JupyterHub service.
APIv1/apiProvides access to the Wallaroo API. For example, wallaroo.example.com/v1/api.
AuthenticationauthAccess to authentication management including SSO configurations. For example: wallaroo.example.com/auth. For details on user management, see Wallaroo User Management. For details on user management services, see Wallaroo Authentication Configuration Guides.
Edge ObservabilityedgeOptional service for edge and multicloud deployments to report inference results back to the Wallaroo Ops instance. For example: wallaroo.example.com/edge.

Prerequisites

  • Install Wallaroo Enterprise into a qualified environment. For more details, see the Wallaroo Install Guides and the Wallaroo Enterprise Install Guides.
  • Generate or update the the SSL Certificates
  • Have access to internal corporate DNS configurations that can be updated. A subdomain for the Wallaroo instance will be created through this process.
  • Have the IP address for the Wallaroo instance.
  • Install kubectl into the Kubernetes cluster administrative node.

Wallaroo IP Address Retrieval Methods

The first step is to retrieve the IP address that is set to the DNS services.

Retrieve LoadBalancer IP with kubectl

For most organizations that install Wallaroo into a cloud based Kubernetes cluster such as Microsoft Azure, AWS, etc the external IP address is tied to Wallaroo LoadBalancer service. This can be retrieved with the kubectl command as follows:

  1. Retrieve the external IP address for your Wallaroo instance LoadBalancer. For example, this can be performed through the following kubectl command:

    kubectl get svc api-lb-ext -n wallaroo -o jsonpath='{.status.loadBalancer.ingress[0].ip}'
    34.83.140.98
    

    Verify that the IP address is the one that is used either for clusters in an internal or external mode.

    An alternative to this command to show all services is the following:

    kubectl get svc  -A
    

    Example Result:

    kubectl get svc  -A
    NAMESPACE     NAME                      TYPE           CLUSTER-IP     EXTERNAL-IP     PORT(S)                         AGE
    default       kubernetes                ClusterIP      10.25.64.1     <none>          443/TCP                         50d
    gmp-system    alertmanager              ClusterIP      None           <none>          9093/TCP                        50d
    gmp-system    gmp-operator              ClusterIP      10.25.70.18    <none>          8443/TCP,443/TCP                50d
    kube-system   default-http-backend      NodePort       10.25.67.38    <none>          80:31378/TCP                    50d
    kube-system   kube-dns                  ClusterIP      10.25.64.10    <none>          53/UDP,53/TCP                   50d
    kube-system   metrics-server            ClusterIP      10.25.67.8     <none>          443/TCP                         50d
    wallaroo      alertconfig               ClusterIP      10.25.67.141   <none>          9094/TCP                        2m30s
    wallaroo      alertmanager              ClusterIP      10.25.77.167   <none>          9093/TCP                        2m30s
    wallaroo      api-lb                    ClusterIP      10.25.79.188   <none>          8080/TCP                        2m31s
    wallaroo      api-lb-admin              ClusterIP      10.25.70.0     <none>          9901/TCP                        2m31s
    wallaroo      api-lb-ext                LoadBalancer   10.25.69.9     34.83.140.98    80:30715/TCP,443:32664/TCP      2m31s
    wallaroo      dashboard                 ClusterIP      10.25.72.14    <none>          4567/TCP                        2m31s
    wallaroo      wallsvc                   ClusterIP      10.25.75.74    <none>          3040/TCP                        2m30s
    

    Some cloud providers may provide multiple external IP addresses for the api-lb service. For some services such as Oracle, this is typically a 10 net address, with the second IP address being the actual external address. Verify the IP address before changing DNS settings.

Other Methods

For organizations that install Wallaroo other methods may find the kubectl get svc api-lb-ext command only returns the internal IP address.

Depending on the instance, there are different methods of acquiring that IP address. The links below reference difference sources.

Refer to your Wallaroo support representative if further assistance is needed.

DNS Entries

Create DNS the following entries based on the list above for the Wallaroo instance’s IP address. Select the cloud environment used for DNS services.

Edge Observability Enablement

For organizations that deploy Wallaroo pipelines on edge devices as Wallaroo Servers, see the DNS settings from the Edge Deployment Registry Guide.

Setup Users

User management is managed through the Wallaroo Dashboard, via the Platform Admin Dashboard page. See the Wallaroo User Management for full guides on setting up users, identity providers, and other user configuration options.

The following is an abbreviated guide on setting up new Wallaroo users.

The process includes the following steps:

Obtain the User Admin Credentials

Obtaining the admin User Credentials

The standard Wallaroo installation creates the user admin by default and assigns them a randomly generated password. The admin user credentials are obtained which may be obtained directly from Kubernetes with the following commands, assuming the Wallaroo instance namespace is wallaroo.

  • Retrieve Admin Username

    kubectl -n wallaroo \
    get secret keycloak-admin-secret \
    -o go-template='{{.data.KEYCLOAK_ADMIN_USER | base64decode }}'
    
  • Retrieve Admin Password

    kubectl -n wallaroo \
    get secret keycloak-admin-secret \
    -o go-template='{{.data.KEYCLOAK_ADMIN_PASSWORD | base64decode }}'
    

Create a New User with the Admin Role

Creating users is managed through the Platform Admin Dashboard. The following steps are used to create an initial user with the role admin.

  1. Access the Wallaroo Dashboard through the DNS name set up in the DNS Services Integration step. For example, if the DNS name of the Wallaroo Ops center is wallaroo.example.com, the Wallaroo Dashboard is available at https://wallaroo.example.com.
  2. Login with the username admin and the password retrieved in Obtaining the admin User Credentials.
  3. Select Create Users and enter the following:
    1. User Email: The email address for the user. This must be in the valid email address format.
    2. Assign Type: Select Admin.
    3. Password: Enter the user’s password. This user password be sent to the new user.
    4. Temporary or Permanent:
      1. Temporary: The user will be forced to change their login password upon their next login (Recommended).
      2. Permanent: The user will keep their password.
  4. Create any additional users as needed. When finished, select the Wallaroo icon drop down and select Logout.

At this point, users can log in to Wallaroo Dashboard with their provided identities. For guides on setting up Single Sign-On (SSO) and other features, see Wallaroo User Management for full guides on setting up users, identity providers, and other user configuration options.

Next Steps

Ready for more? Time to deploy models and start inferencing.

For access to these sample models and a demonstration on using LLMs with Wallaroo: