How to deploy big models

Category: Kubernetes, Docker


How to deploy big models
I recently deployed a 500MB Pytorch model. It was surprisingly hard! In this post, I document the pitfalls and tradeoffs I made. Running on CPU was nearly as fast as GPU for non-batch processing, so I recommend starting with that if you can.
Read More

Get the latest DevOps jobs, events and curated articles straight to your inbox, once a week