Loading…
May 10-12, 2023
Vancouver, British Columbia, Canada + Virtual
View More Details & Registration
Note: The schedule is subject to change.

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for Open Source Summit North America 2023 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is automatically displayed in Pacific Daylight Time (UTC/GMT -8). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Thursday, May 11 • 2:00pm - 2:40pm
5 Steps to Deploy Cloud Native Sustainable Foundation AI Models - Chen Wang, IBM & Huamin Chen, Red Hat

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
In recent years, huge foundation models have exhibited new impressive capabilities like suggesting writings, helping codings, and enlightening painters. While everyone is amazed by these incredible use cases, the tradeoff between these models' performance and energy cost on Container Platforms has yet to be studied. In this talk, we will show 5 steps on how to run high-performance and energy-efficient foundation models on Kubernetes: - containerize foundation models - deploy foundation model on Kubernetes - measure the energy consumption of serving the foundation model - reduce the energy consumption of the model via tuning the GPU frequencies - study the tradeoffs between the performance of the model inference requests and the energy cost of running the model.

Speakers
avatar for Huamin Chen

Huamin Chen

Sr. Principal Software Engineer, RedHat
Dr. Huamin Chen is a passionate developer at Red Hat' CTO office. He is one of the founding members of Kubernetes SIG Storage, member of Ceph, Knative, and Rook. He previously spoke at KubeCon, OpenStack Summits, and other technical conferences.
avatar for Chen Wang

Chen Wang

Research Staff Member, IBM Research
Chen Wang is a Research Staff Member at the IBM T.J. Watson Research Center. Her interests lie in Kubernetes, Container Cloud Resource Management, Cloud Native AI systems, and applying AI in Cloud system management. She is an open-source advocate, a Kubernetes contributor, and a KubeCon... Read More →



Thursday May 11, 2023 2:00pm - 2:40pm PDT
206 (Level 2)
  Open AI & Data Forum, Model