Kubernetes for data scientists

Automated deployments - Kubernetes provides a simple way of deploying a docker image using just a few lines of yaml. This gets us a declarative way of managing what we deploy and how we serve it. It also makes it super easy to make a robust CI pipeline on top of this abstraction.

Service discovery - Services are natively allowed to call each other using static domain names. This makes it possible to not worry about the location of the other service and enables autoscaling.

Load Balancing - Load balancing of requests intended for a particular service to all its instances distributed across multiple nodes is also taken care of. You can also attach an external load balancer from the cloud provider for internet facing services. This enables dynamic autoscaling, graceful failover and zero-downtime rollouts

Self-healing - The built-in health checking mechanism keeps a watch over a running service and attempts to recover on failure by restarting the workload. This, combined with native load balancing support, is a huge win for the reliability aspect for any service.

Autoscaling - Autoscaling the workload size up or down on the basis of actual resource consumption. This translates into critical cost savings for workloads that face a variable load.

Developer Platform - Kubernetes has been built with the intention of serving as a platform for collaboration across multiple teams. This is enabled by native support for user and role management and multi-tenancy. These features become absolutely critical beyond a certain scale.

Extensible - With a system of custom resources and controllers, kubernetes acts as a platform for other tooling that massively enhance the feature set. This effectively unlocks a whole host of use cases that are addressed by other tools. Ex - ArgoCD for GitOps, Istio for Service mesh, Kubeflow for ML pipeline orchestration etc

Open Source - Kubernetes being open source means you get a huge range of options from running on bare metal servers to using one of the managed options from any major cloud provider. Aligning close to kubernetes means you get an interface that provides enough portability for the workload to actually be moved around according to business needs.

Kubernetes for data scientists

Introduction

What is kubernetes?

Data scientist workflow overview

Feature store

Model Development

Model training

Model management

Model Serving

Model monitoring

Conclusion

Subscribe to our newsletter

SSH Server Containers For Development on Kubernetes

Prompting, RAG or Fine-tuning - the right choice?

Large Language Models for Commercial Use

Adding OAuth2 to Jupyter Notebooks on Kubernetes

Blazingly fast way to build, track and deploy your models!

Product

Resources

Company

Goodreads

Kubernetes for data scientists

Introduction

What is kubernetes?

Data scientist workflow overview

Feature store

Model Development

Model training

Model management

Model Serving

Model monitoring

Conclusion

Subscribe to our Newsletter

Subscribe to our newsletter

Discover More

SSH Server Containers For Development on Kubernetes

Prompting, RAG or Fine-tuning - the right choice?

Large Language Models for Commercial Use

Adding OAuth2 to Jupyter Notebooks on Kubernetes

Related Blogs

Blazingly fast way to build, track and deploy your models!

Product

Resources

Company

Goodreads

Subscribe to our newsletter