Social network you want to login/join with:
T-Pro is a cutting-edge provider of medical speech recognition and workflow solutions, helping healthcare organisations streamline documentation, reduce clinician burnout, and improve patient care. We leverage AI-driven solutions to enhance efficiency and accuracy in clinical documentation.
Reporting to the Software Engineering Manager / DevOps Manager, the primary duties and
responsibilities of the DevOps Engineer will be:
* Managing Kubernetes clusters in EKS, implementing and optimising scaling tools like Karpenter and KEDA.
* Ensuring efficient and cost-effective usage of AWS resources.
* Enhancing and maintaining CI/CD workflows to support automated deployments.
* Managing GitOps workflows with ArgoCD across all environments.
* Centralising monitoring solutions using tools like Grafana and Prometheus.
* Creating and maintaining dashboards for real-time insights and setting up alerting mechanisms.
* Managing AWS security tools such as WAF, GuardDuty, and Security Hub.
* Ensuring compliance with security best practices and automating security audits.
* Coordinating application deployments across multiple environments.
* Streamlining integrations with third-party tools and platforms.
* Monitoring and optimising AWS costs to ensure budget efficiency.
* Providing cost breakdowns and supporting Accounts and leadership teams with financial insights.
* Implementing redundancy strategies and disaster recovery plans.
* Participating in calls to resolve critical issues and minimise downtime.
* Driving automation initiatives to reduce manual operational overhead.
* Innovating new solutions for scalability, reliability, and performance optimisation.
Key Skills and Competencies
* Expertise in managing Kubernetes clusters, particularly in EKS, with tools like Karpenter and KEDA.
* In-depth knowledge of cloud architecture, scaling, and AWS services.
* Experience in setting up and optimising CI/CD pipelines using tools like GitHub Actions.
* Knowledge of GitOps practices with tools like ArgoCD for automated application deployment. Proficient in tools like Grafana, Prometheus, and CloudWatch for monitoring and visualization.
* Ability to set up alerting and dashboards to ensure real-time insights and issue detection.
* Hands-on experience with infrastructure-as-code (IaC) tools like Terraform, AWS CloudFormation, or CDK.
* Skilled in automating workflows to enhance efficiency and reduce manual intervention.
* Familiarity with AWS security tools like WAF, GuardDuty, and Security Hub.
* Knowledge of best practices for securing cloud environments and ensuring compliance.
* Ability to troubleshoot and resolve infrastructure issues efficiently, including scaling, deployment, and performance problems.
* Expertise in identifying and mitigating AWS cost inefficiencies.
* Excellent communication skills to work effectively with developers, accounts teams, and leadership.
* Capable of enabling self-service workflows for developers, such as automated deployments and monitoring setups.
* Eagerness to learn and implement emerging technologies to improve infrastructure and processes.
* Proactive in identifying opportunities for optimisation and scalability.
Qualifications:
* Bachelor's Degree in Computer Science, Information Systems, or a related field.
* 3-5 Years of Experience in DevOps engineering, cloud infrastructure management, and Kubernetes operations.
* Proficiency in AWS Services, and networking tools.
* Experience with Kubernetes, including EKS, scaling tools like Karpenter, and workload automation with KEDA.
* Hands-on Knowledge of CI/CD Pipelines, GitOps workflows, and deployment tools like ArgoCD and GitHub Actions.
* Strong Skills in Monitoring and Observability, with experience using Grafana, Prometheus, CloudWatch, or similar tools.
* Familiarity with IaC Tools, such as Terraform, AWS CloudFormation, or CDK.
* Security Experience, including implementing WAF rules, managing GuardDuty alerts, and ensuring compliance best practices.
* Strong Problem-Solving Abilities, with experience handling scaling, performance optimisation, and cost management.
* Excellent Communication Skills for collaboration with cross-functional teams and enabling developer self-service workflows.
* AWS Certifications (Preferred), such as AWS Certified Solutions Architect - Associate, DevOps Engineer, or SysOps Administrator.
Reporting Structure:
The DevOps Engineer will report directly to the Software Engineering Manager / DevOps Manager, working closely with both the Development and Integration teams. The role will also serve as a key liaison with the Accounts team to ensure accurate and efficient data management, reporting, and troubleshooting.
Onboarding Process
For the onboarding process, the following could be outlined:
Initial Phase (First 30 Days):
* Familiarisation with T-Pro's AWS cloud environment, Kubernetes clusters, and DevOps
* workflows.
* Shadowing the existing DevOps Engineer to understand current infrastructure,
* pipelines, and pain points.
* Reviewing existing CI/CD pipelines, monitoring tools, and cost management practices.
* Beginning low-risk improvements, such as optimising monitoring dashboards or
* automating small operational tasks.
Mid-Phase (Next 30-60 Days):
* Taking ownership of routine tasks, including pipeline maintenance, scaling
* configurations, and basic troubleshooting.
* Implementing and enhancing monitoring solutions (e.g., Grafana, Prometheus) for all
* Collaborating with the development team to streamline workflows and address
* Starting to refine and document infrastructure-as-code (IaC) practices to improve
* consistency and scalability.
Final Phase (Day 60-90):
* Assuming full responsibility for managing Kubernetes clusters, CI/CD pipelines, and cloud infrastructure optimisations.
* Developing and executing a long-term strategy for cost optimisation, scalability, and security.
* Proactively delivering measurable improvements, such as enhanced deployment speeds, reduced downtime, or streamlined resource usage.
* Establishing automation for recurring tasks to free up capacity for strategic initiatives.
#J-18808-Ljbffr