Job Title: Machine Learning Operations Engineer - Cloud Platform Specialist
Job Description:
Our client is seeking an experienced Machine Learning Operations Engineer to join their team.
The Role
Design, build, and maintain scalable, reliable, and efficient Machine Learning platforms, ensuring robust performance and operational excellence.
Main Responsibilities:
* Lead the design, build, and maintenance of scalable, reliable, and efficient Machine Learning platforms.
* Collaborate with infrastructure and DevOps teams to integrate Machine Learning models into the broader platform for continuous operation and scaling.
* Improve architecture, scalability, stability, and performance of the Machine Learning platform, focusing on AWS cloud engineering solutions and platform services.
* Develop processes for model monitoring and governance, ensuring successful ML model operationalization and compliance with industry standards.
Requirements:
* Hands-on experience in Cloud Engineering, especially with AWS services such as EC2, S3, Lambda, and SageMaker.
* Proficiency in platform engineering practices and frameworks for MLOps.
* Development skills in Python or relevant languages.
* Experience with DevOps tools such as Git, Jenkins, GitHub Actions, or similar.
* Experience with data pipeline tools like Apache Airflow, AWS Glue, or similar.
Benefits:
* A competitive salary package including bonus and pension.
* Extensive training resources.
* Company discounts.
* On-site parking.