Kubeflow Pipelines

The Basics and a Quick Tutorial

What are Kubeflow Pipelines?

Kubeflow Pipelines is a platform designed to help you build and deploy container-based machine learning (ML) workflows that are portable and scalable. Each pipeline represents an ML workflow, and includes the specifications of all inputs needed to run the pipeline, as well as the outputs of all components.

This is part of our series of articles about Kubernetes architecture.


Common Kubeflow Use Cases

Here are three common use cases for implementing Kubeflow Pipelines.

Deploying Models to Production

Trained models are usually compiled into a single file that sits on a server host or laptop. Next, you copy the file to a machine hosting the application, and load the model into a server process that accepts network requests for model inference.

This process becomes complex when there are multiple applications requiring model inference output from a single model, especially when you need to deploy updates and initiate rollbacks.

Kubeflow lets you run updates and rollbacks across multiple applications or servers. You can update your model in one place, and ensure all client applications quickly get the updates, once the update transaction is complete.

Shared Multi-Tenant ML Environment

Machine learning environments and resources often need to be shared. To enable simple and effective sharing, you need a multi-tenant machine learning environment. You can create one with Kubeflow Pipelines.

You should aim to provide each collaborator with an isolated environment. Kubernetes, which schedules and manages containers, can help you isolate workflows and keep track of pending and running jobs for each collaborator.

Running Jupyter Notebooks on GPUs

ML algorithms need a lot of compute power to run through linear algebra operations quickly. Graphics processing units (GPUs) can meet this demand, but are not usually found in regular laptops and desktops.

To gain access to GPUs, data scientists often use Jupyter Notebooks together with Python code, managing dependencies through container platforms like Docker. However, this process often creates security issues, because data ends up distributed across unauthorized platforms and services.

Kubeflow Pipelines, on the other hand, enables data scientists to build their workflows into containers and execute them in an environment authorized by the security team.
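For example, with the Kubeflow Pipelines Python SDK (described in the architecture section below), a containerized pipeline step can request a GPU directly from the Kubernetes scheduler. The following is a minimal sketch using the v1-style kfp SDK; the image, command, and GPU count are illustrative placeholders, not part of any official sample:

    # Minimal sketch (kfp v1-style SDK): a containerized training step that
    # requests a GPU. The image and command are hypothetical placeholders.
    import kfp
    from kfp import dsl

    @dsl.pipeline(
        name="gpu-training-step",
        description="Runs a containerized training script on a GPU node.",
    )
    def gpu_pipeline():
        train = dsl.ContainerOp(
            name="train",
            image="gcr.io/my-project/train:latest",  # hypothetical image
            command=["python", "train.py"],
        )
        # Ask the Kubernetes scheduler for one NVIDIA GPU for this step.
        train.set_gpu_limit(1)

    if __name__ == "__main__":
        # Compile to a static configuration that can be uploaded to Kubeflow.
        kfp.compiler.Compiler().compile(gpu_pipeline, "gpu_pipeline.yaml")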

Kubeflow Pipelines Architecture

The following diagram illustrates how the Kubeflow Pipelines platform is structured.


Image Source: Kubeflow

The Kubeflow architecture is composed of the following main components and elements:

  • Python SDK—lets you use the Kubeflow Pipelines domain-specific language (DSL) to build a component or specify a pipeline (see the sketch after this list).
  • DSL compiler—converts the pipeline’s Python code into a static configuration in a YAML file.
  • Pipeline Service—creates a pipeline run from the static configuration.
  • Kubernetes resources—the Pipeline Service calls the Kubernetes API server to create the custom resource definitions (CRDs) required to run the pipeline.
  • Artifact storage—pods store metadata, such as pipeline runs, single scalar metrics, jobs, and experiments. They also hold artifacts, which may include large-scale (time series) metrics, views, and pipeline packages. Kubeflow stores metadata in a MySQL database and artifacts in an artifact store. You can use the metadata for sorting and filtering runs, and the large-scale metrics for investigating the performance of a specific run or debugging a pipeline run.
  • Orchestration controllers—controllers, such as the Argo Workflow controller for task-driven workflows, launch the containers required to complete the pipeline. These containers run in Kubernetes pods on virtual machines.
  • Persistence Agent and ML metadata—the Persistence Agent watches the Kubernetes resources created by the Pipeline Service and persists their state in the ML Metadata Service. It also records the sets of containers that executed, along with their inputs and outputs. I/O can be either data artifact URIs or container parameters.
  • Pipeline web server—after gathering data from multiple services, the pipeline’s web server can display live information about pipelines, such as running pipelines, pipeline run execution status, pipeline execution history, an artifacts list, and pipeline run debugging data.
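To make the Python SDK and DSL compiler concrete, here is a minimal sketch of defining a small pipeline and compiling it into the static YAML configuration that the Pipeline Service consumes. It assumes the v1-style kfp SDK; the pipeline name, images, and output file are illustrative placeholders:

    # Minimal sketch (kfp v1-style SDK): define a two-step pipeline in the DSL
    # and compile it into a static YAML configuration.
    import kfp
    from kfp import dsl

    @dsl.pipeline(
        name="echo-parallel",
        description="Two independent echo steps that can run in parallel.",
    )
    def echo_pipeline(message: str = "hello"):
        # Each step runs as a container in its own Kubernetes pod.
        dsl.ContainerOp(
            name="echo-a",
            image="alpine:3.18",
            command=["echo"],
            arguments=[message],
        )
        dsl.ContainerOp(
            name="echo-b",
            image="alpine:3.18",
            command=["echo"],
            arguments=[message],
        )

    if __name__ == "__main__":
        # The DSL compiler produces the static configuration described above.
        kfp.compiler.Compiler().compile(echo_pipeline, "echo_pipeline.yaml")

Running this script produces echo_pipeline.yaml, which can be uploaded through the pipelines web UI or submitted with the SDK client shown later in the tutorial.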

Tutorial: Getting started with Kubeflow Pipelines

This quick walkthrough can help you learn how to get started with Kubeflow Pipelines. This process uses a sample that comes with Kubeflow Pipelines.

Step 1:

Deploy Kubeflow on GCP.

Step 2:

Once Kubeflow is running, you need to access the Kubeflow UI.

To access the UI, use the following URL, replacing <KF_NAME> with the name of your Kubeflow deployment and <project-id> with your GCP project ID:

https://<KF_NAME>.endpoints.<project-id>.cloud.goog/

Once you access the UI, you should see this dashboard:

Image Source: Kubeflow

Step 3:

Choose Pipelines.

Image Source: Kubeflow

Run a Basic Pipeline

In the pipelines UI, you can find several samples to use as a baseline for quickly launching pipelines. The walkthrough below explains how to run a basic sample that includes Python operations but no ML workload.

Step 1:

In the pipeline UI, locate a sample and choose its name. For example, [Sample] Basic—Parallel Execution.

Step 2:

Choose the Create experiment option.

Step 3:

You will be shown a series of prompts. Follow the instructions to create an experiment. When you complete the process, you can create a run.

Note that each sample provides default values for all required parameters. The following screenshots assume you have already created an experiment and named it “My experiment”. From this step forward, you are creating a run named “My first run”.

Image Source: Kubeflow

Step 4:

To create your run, choose the Start option.

Step 5:

Choose the name of the run under Experiments.

Step 6:

You can now view information about the run and drill down into elements of the compute graph.

Image Source: Kubeflow
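If you prefer to work from code instead of the UI, the experiment and run from the steps above can also be created with the SDK client. This is a hedged sketch that assumes a compiled pipeline package such as the echo_pipeline.yaml produced earlier; the host URL is the endpoint placeholder from Step 2 of the previous section, and authentication details (which depend on your deployment) are omitted:

    # Sketch: create an experiment and launch a run with the kfp SDK client.
    # The host, names, and package path are placeholders; authentication
    # arguments for an IAP-protected GCP deployment are omitted here.
    import kfp

    client = kfp.Client(host="https://<KF_NAME>.endpoints.<project-id>.cloud.goog/pipeline")
    experiment = client.create_experiment(name="My experiment")
    run = client.run_pipeline(
        experiment_id=experiment.id,
        job_name="My first run",
        pipeline_package_path="echo_pipeline.yaml",
        params={"message": "hello"},
    )
    print("Started run:", run.id)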

Run an ML Pipeline

The walkthrough below explains how to run the XGBoost sample; its source code is in the Kubeflow Pipelines repo.

Prerequisites:

Create GCP services for the sample.

Step 1:

Enable the standard GCP APIs for Kubeflow, as well as the APIs for Cloud Storage and Dataproc.

Step 2:

To store pipeline results, create a bucket in Google Cloud Storage.
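If you would rather create the bucket from Python than from the Cloud Console or gsutil, a minimal sketch using the google-cloud-storage client library might look like this; the project ID, bucket name, and location are placeholders:

    # Sketch: create a Cloud Storage bucket to hold pipeline results using the
    # google-cloud-storage client library. All values below are placeholders.
    from google.cloud import storage

    client = storage.Client(project="my-gcp-project")
    bucket = client.create_bucket("my-kfp-results-bucket", location="us-central1")
    print("Created bucket:", bucket.name)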

Step 3:

In the pipeline UI, choose the name of the sample: [Sample] ML - XGBoost - Training with Confusion Matrix.

Image Source: Kubeflow

Step 4:

Choose the Create experiment option.

Step 5:

You will be shown a series of prompts. Follow the instructions to create an experiment, and include the following run parameters:

  • Output—here you need to specify the Cloud Storage bucket you have previously created.
  • Project—here you need to specify the ID of your GCP project.

Step 6:

Choose the Start option to create your run.
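Steps 3 through 6 can also be performed from the SDK client. Below is a hedged sketch that looks up the preinstalled sample by name and starts a run with the two parameters described above; the bucket, project ID, and experiment/run names are placeholders, and the exact parameter names should be verified against the sample’s run form in the UI:

    # Sketch: launch the preinstalled XGBoost sample from the kfp SDK client.
    # Bucket, project, and names are placeholders; check the parameter names
    # against the sample's run form before submitting.
    import kfp

    client = kfp.Client(host="https://<KF_NAME>.endpoints.<project-id>.cloud.goog/pipeline")
    pipeline_id = client.get_pipeline_id(
        "[Sample] ML - XGBoost - Training with Confusion Matrix"
    )
    experiment = client.create_experiment(name="xgboost-sample")
    run = client.run_pipeline(
        experiment_id=experiment.id,
        job_name="xgboost-sample-run",
        pipeline_id=pipeline_id,
        params={
            "output": "gs://my-kfp-results-bucket",  # bucket created in Step 2
            "project": "my-gcp-project",             # your GCP project ID
        },
    )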

Step 7:

In the experiments dashboard, choose the name of your run.

Step 8:

You can now explore the graph and other aspects of the run by clicking on different components of the graph and UI. Here is how your pipeline should look once the run is complete:

Image Source: Kubeflow

Automate Kubernetes Job Scheduling with Run:AI

Run:AI’s Scheduler is a simple plug-in for Kubernetes clusters that enables optimized, high-performance orchestration of containerized AI workloads. The Run:AI platform includes:

  • High-performance for scale-up infrastructures—pool resources and enable large workloads that require considerable resources to coexist efficiently with small workloads requiring fewer resources.
  • Batch scheduling—workloads can start, pause, restart, end, and then shut down, all without any manual intervention. Plus, when the container terminates, the resources are released and can be allocated to other workloads for greater system efficiency.
  • Topology awareness—inter-resource and inter-node communication enable consistent high performance of containerized workloads.
  • Gang scheduling—containers can be launched together, start together, and end together for distributed workloads that need considerable resources.

Run:AI simplifies Kubernetes scheduling for AI and HPC workloads, helping researchers accelerate their productivity and the quality of their work.

Learn more about the Run:AI Kubernetes Scheduler, or explore Kubernetes vs Slurm schedulers.