SRE
This is a site reliability engineering (SRE) position focused on ensuring the availability, scalability, and performance of cloud-based systems.
Key Responsibilities:
* Design, implement, and maintain monitoring tools to detect and respond to system issues in real time.
* Develop and deploy containerized applications using Docker and Kubernetes.
* Collaborate with cross-functional teams to identify and resolve infrastructure and application issues.
* Implement automation scripts to improve efficiency and reduce downtime.
* Analyze system logs and metrics to troubleshoot issues and optimize system performance.
Requirements:
* At least 3 years of experience in an SRE role or a similar field.