Loading…
CNCF-hosted Co-located Events North America 2023 are taking place November 6. This event is happening in person at McCormick Place West in Chicago, Illinois.

The Sched app allows you to build your schedule, but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon North America 2023, and have an All-Access pass in order to participate in the sessions.

Please note: This schedule is automatically displayed in Central Standard Time Zone. (UTC-6). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."


To view the full event schedule for a specific CNCF-hosted Co-located event, you can use the right-hand navigation bar to sort and filter.

The schedule is subject to change.


*Cloud Native Telco Day + CiliumCon will be available via live stream on our virtual platform,  all other co-located event recordings will be available 48-72 hours post-event on the CNCF YouTube channel.

Monday, November 6 • 10:25am - 10:50am
Batch Systems in Production with Kueue: Multi-Tenancy and Fungibility - Yuki Iwai, CyberAgent, Inc. & Aldo Culquicondor, Google

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Kueue is a could-native job scheduler with which you can build a multi-tenant batch system on a Kubernetes cluster. Kueue implements job queueing, deciding when jobs should wait and when they should start, based on quotas, priority and a hierarchy for sharing heterogeneous resources among teams. Kueue works on prem and in autoscaled environments in the cloud. In this talk, you will learn about Kueue’s architecture and extensibility to support a variety of workloads. You will also learn how Kueue is used in production in self-managed clusters, serving multiple machine-learning researchers, MLOps Engineers and data scientists. Kueue provides fair use while maximizing the utilization of accelerators and other resources, through its borrowing and preemption mechanisms. Kueue is used with frameworks like DeepSpeed, PyTorch, the kubernetes Job, RayJob, Jupyter, etc.

Speakers
avatar for Aldo Culquicondor

Aldo Culquicondor

Sr. Software Engineer, Google
Aldo is a Senior Software Engineer at Google. He works on Kubernetes and Google Kubernetes Engine, where he contributes to kube-scheduler, the Job API and other features to support batch, AI/ML and HPC workloads. He is currently a TL at SIG Scheduling and an Organizer of the WG Batch... Read More →
avatar for Yuki Iwai

Yuki Iwai

Software Engineer, CyberAgent, Inc.
Yuki is a Software Engineer at CyberAgent, Inc. He works on an internal platform for machine-learning applications and high-performance computing. He is currently a maintainer of some Kubeflow WG AutoML / Training sub-projects. He is also a WG Batch member and a Kubernetes' Kueue... Read More →



Monday November 6, 2023 10:25am - 10:50am CST
W194ab