Operational Readiness Architect Lead application health, performance, and capacity planning.
Collaborate with development teams for launch reviews and monitoring strategies.
Drive zero-downtime deployment frameworks.
Site Reliability Engineering (SRE)Ensure scalability and resilience of applications.
Conduct blameless post-mortems and optimize incident response.
Automate alerts and establish SLOs with development teams.
Dev Ops & Automation Enhance CI/CD pipelines and operational gating.
Promote automation to reduce manual toil.
Apply best practices in Chef Infra, Jenkins, and scripting.
ITSM Practices Analyze platform ITSM activities and provide feedback to improve resiliency.
Skills & Qualifications BS in Computer Science or related technical field(miust)Experience with Linux(must), Chef, Jenkins.
Java(good to have)Strong scripting and automation mindset.(good to have)Familiarity with distributed systems and incident management.(must)Effective communicator with a problem-solving approach.(must)Knowledge of enterprise monitoring tools splunk(good to have)