Overview
Our Clients Cloud Operations team is a group of talented engineers passionate about building highly reliable, scalable and secure solutions in public/private cloud environments. We are looking to hire a highly motivated Cloud Operations engineer with strong working experience in production operation, as well as cloud infrastructure design and implementation. Together, we will design, develop and implement the best public / private / local cloud solutions for our customers. You will also be expected to participate in continuous cloud service operation, troubleshoot, and resolve complex issues in production.
Responsibilities
Manage and maintain the clients cloud infrastructure in AWS, GCP & Azure
Provide technical leadership in cloud infrastructure design and implementation
Ensure secure and reliable communication across different regions and cloud service providers
Deploy and configure middleware services, such as SQL, NoSQL databases, and messaging queue systems
Evaluate, recommend, and implement CloudOps / DevOps technology and solutions
Participate in continuous cloud service operations with the US and remote teams
Troubleshoot and follow up on production infrastructure / application related issues
Driving root cause analysis and resolution
Communicate with Dev/QA as well as external carriers to resolve and prevent issues
Design and implement deployment automation platform for Kubernetes based microservices
Improve service availability and scalability through tuning, automation, tools, and process
Analyze service performance, identify bottleneck and provide actionable improvement plans
Requirements
BS level technical degree required; Computer Science or Engineering background preferred
8+ years of experience in a CloudOps / DevOps role
Hands on experience with AWS or any public cloud (Azure, GCP etc.)
Knowledge of Linux, security and networking fundamentals
Working knowledge of container-based architecture and deployment (Docker, Kubernetes)
Working knowledge of deployment automation development (Terraform, Helm, ArgoCD)
Experience in diagnosing and resolving complex application problems
Working knowledge of Elasticsearch, PostgreSQL, Redis, Ignite, Flink, Kafka, and RabbitMQ
Experience with monitoring tools (Nagios, Grafana, Prometheus)
Experience with cloud security and compliance implementation is a plus
Strong follow-through and initiative to stay with issues until they are resolved
Comfortable working within a distributed team located in multiple time zones
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Other
Industries
IT Services and IT Consulting
#J-18808-Ljbffr