Gå frakoblet med Player FM -appen!
Deploy and fine-tune LLM models on Kubernetes using KAITO
Manage episode 433011321 series 3332465
In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Sachi Desai, Product Manager and Paul Yu, Sr. Cloud Advocate at Microsoft to talk about the open source KAITO project. KAITO is the Kubernetes AI Toolchain Operator that enables AKS users to deploy open source LLM models on their Kubernetes clusters. They discuss how KAITO helps with running AI-enabled applications alongside the LLM models, how it helps users bring their own LLM models and run them as containers, and how KAITO helps them fine-tune open source LLMs on their Kubernetes clusters.
Check out our website at https://kubernetesbytes.com/
Cloud Native News:
- https://azure.github.io/AKS/2024/07/30/azure-container-storage-ga
- https://github.blog/news-insights/product-news/introducing-github-models/
Show links:
- Azure/kaito: Kubernetes AI Toolchain Operator - https://github.com/Azure/kaito/tree/main
- https://www.youtube.com/watch?v=3cGmHDjR_3I&list=PLc3Ep462vVYtgN4rP1ThTJd2UlsBc2sou&index=2
- https://aka.ms/cloudnative/learnlive/intelligent-apps-on-aks/episode-2
- Jumpstart AI Workflows With Kubernetes AI Toolchain Operator - The New Stack - https://thenewstack.io/jumpstart-ai-workflows-with-kubernetes-ai-toolchain-operator
- https://paulyu.dev/article/soaring-with-kaito/
- Concepts - Fine-tuning language models for AI and machine learning workflows - Azure Kubernetes Service | Microsoft Learn - https://learn.microsoft.com/en-us/azure/aks/concepts-fine-tune-language-models
- Keep up to date on the most recent announcements by following some of the KAITO engineers on LinkedIn:
- Fei Guo - https://www.linkedin.com/in/fei-guo-a48319a/
- Ishaan Sehgal - https://www.linkedin.com/in/ishaan-sehgal/
Timestamps:
- 00:02:15 Cloud Native News
- 00:05:34 Interview with Sachi and Paul
- 00:42:08 Key takeaways
84 episoder
Manage episode 433011321 series 3332465
In this episode of the Kubernetes Bytes podcast, Bhavin sits down with Sachi Desai, Product Manager and Paul Yu, Sr. Cloud Advocate at Microsoft to talk about the open source KAITO project. KAITO is the Kubernetes AI Toolchain Operator that enables AKS users to deploy open source LLM models on their Kubernetes clusters. They discuss how KAITO helps with running AI-enabled applications alongside the LLM models, how it helps users bring their own LLM models and run them as containers, and how KAITO helps them fine-tune open source LLMs on their Kubernetes clusters.
Check out our website at https://kubernetesbytes.com/
Cloud Native News:
- https://azure.github.io/AKS/2024/07/30/azure-container-storage-ga
- https://github.blog/news-insights/product-news/introducing-github-models/
Show links:
- Azure/kaito: Kubernetes AI Toolchain Operator - https://github.com/Azure/kaito/tree/main
- https://www.youtube.com/watch?v=3cGmHDjR_3I&list=PLc3Ep462vVYtgN4rP1ThTJd2UlsBc2sou&index=2
- https://aka.ms/cloudnative/learnlive/intelligent-apps-on-aks/episode-2
- Jumpstart AI Workflows With Kubernetes AI Toolchain Operator - The New Stack - https://thenewstack.io/jumpstart-ai-workflows-with-kubernetes-ai-toolchain-operator
- https://paulyu.dev/article/soaring-with-kaito/
- Concepts - Fine-tuning language models for AI and machine learning workflows - Azure Kubernetes Service | Microsoft Learn - https://learn.microsoft.com/en-us/azure/aks/concepts-fine-tune-language-models
- Keep up to date on the most recent announcements by following some of the KAITO engineers on LinkedIn:
- Fei Guo - https://www.linkedin.com/in/fei-guo-a48319a/
- Ishaan Sehgal - https://www.linkedin.com/in/ishaan-sehgal/
Timestamps:
- 00:02:15 Cloud Native News
- 00:05:34 Interview with Sachi and Paul
- 00:42:08 Key takeaways
84 episoder
Alle episoder
×Velkommen til Player FM!
Player FM scanner netter for høykvalitets podcaster som du kan nyte nå. Det er den beste podcastappen og fungerer på Android, iPhone og internett. Registrer deg for å synkronisere abonnement på flere enheter.