SC25 Poster: Mixed Compute Environments with OpenCHAMI

OpenCHAMI is featured in a research poster at SC25 in St. Louis, showcasing innovative work on managing mixed compute environments across HPC and Kubernetes workloads.

Mixed Compute Environments with OpenCHAMI

Authors: Sean Gibson, Richard Kim, Samuel Quan, Travis Cotton, Thomas Mackell (Los Alamos National Laboratory)

Supervisor: Travis Cotton (Los Alamos National Laboratory)

Abstract

There is a growing need for workloads that don’t follow a traditional HPC workflow. Many of these workloads are developed with Kubernetes as the workload manager rather than an HPC-focused one such as Slurm. Mixing different workloads presents a challenge for a few reasons: The demand for either type of resource may fluctuate, so static assignments of Kubernetes or Slurm as the WLM may result in idle resources; the desire for one WLM or another may increase, so extra resources will need to be assigned and moved.

To address this demand, we utilized OpenCHAMI, an open-source system management platform for deploying, managing, and scaling HPC clusters. With OpenCHAMI, we created “spread”: a command line tool that configures nodes’ workload environments across the cluster. We support fast node booting using kexec and a dynamic base of workload environments to swap between, including Slurm and Kubernetes.

Resources

Read more about OpenCHAMI here and try it yourself through the install guide.