Job Description
The successful candidate will design, build and deploy AI-ready infrastructure on OpenShift, delivering robust and scalable platforms capable of supporting AI/ML workloads.
Key Responsibilities:
* Design and deploy AI-ready OpenShift clusters to support AI/ML workloads at scale.
* Built, operate and maintain complex distributed systems for deploying machine learning models in production environments using Red Hat's Kubernetes-based container platform.
* Deployed, managed and run various machine learning pipelines across diverse data sources with varying formats through effective use of GPUs within the confines of a highly efficient hybrid cloud environment that scales dynamically according to user demand while maintaining optimal availability metrics throughout operations lifecycle.