**About the Role:**
We are seeking an experienced Machine Learning Operations Engineer to join our team. The ideal candidate will have a solid background in Cloud Engineering, focusing on platform engineering. You will develop machine learning platforms and ensure their continuous deployment and operation through effective infrastructure and tooling.
**Responsibilities:
• Lead the design, build, and maintenance of scalable, reliable, and efficient machine learning platforms, ensuring robust performance and operational excellence.
• Collaborate with infrastructure and DevOps teams to integrate machine learning models into the broader platform for continuous operation and scaling.
• Develop processes for model monitoring and governance, ensuring successful ML model operationalization and compliance with industry standards.
• Extend existing machine learning libraries and frameworks to enhance performance and integration within the platform.
• Define objectives for the machine learning platform, own the technical roadmap, and ensure alignment with platform engineering goals.
• Establish and uphold standards for platform engineering and operational excellence, striving for best-in-class ML platforms.
• Design and implement architectural best practices for efficient and scalable deployment of ML solutions.
• Implement model training, validation, and deployment pipelines using CI/CD tools.