We are seeking an experienced DevOps Team Lead to guide our DevOps team in managing cloud infrastructure, ensuring system performance at scale, and driving reliability across our environments. This role will oversee critical areas such as AWS cloud infrastructure, CI/CD pipelines, security, cost optimization, and system scalability. You will collaborate with cross-functional teams to deliver high-quality, scalable, and secure solutions while optimizing for performance and cost.
Main Responsibilities:
Lead, mentor, and grow a high-performing DevOps team. Foster a collaborative, agile, and DevOps-centric culture focused on continuous improvement and enablement.
Design, build, and maintain scalable, secure, and highly available cloud environments. Leverage AWS services and Kubernetes (k8s) extensively for optimal performance.
Monitor, troubleshoot, and optimize system performance at scale. Implement best practices for high availability, disaster recovery, and performance tuning.
Automate provisioning, deployment, and management of AWS environments using IaC tools like Terraform and Terraform pull-request automation tools like Atlantis.
Develop and enhance CI/CD pipelines using tools like Azure DevOps, GitHub Actions, and ArgoCD to ensure efficient, automated, and reliable software delivery.
Implement and maintain security best practices for cloud environments, including encryption, IAM policies, and compliance with industry standards such as SOC 2.
Monitor cloud costs and implement strategies for optimization, ensuring efficient use of resources while minimizing waste.
Monitor system performance, troubleshoot issues, and implement solutions to ensure high availability and reliability.
Stay informed about the latest industry trends and best practices in DevOps.
Requirements:
Five years of experience in a DevOps or Cloud Engineering role, with at least two years in a leadership position.
Proven experience with AWS architecture and cloud services (EKS, EC2, S3, SQS, RDS, ElastiCache, Lambda, and Route 53).
Proficiency with application lifecycle and system monitoring and alerting tools such as Datadog.
Proven experience in cost management and optimization within cloud environments.
Strong knowledge of security best practices for cloud infrastructure.
Strong knowledge of CI/CD pipelines and hands-on experience with tools like Azure DevOps and GitHub Actions.
Strong skills in containerization and orchestration tools (e.g., Docker, Kubernetes) and related management tools like Helm.
Preferred experience with GitOps methodologies and tools like ArgoCD.
Proficiency in IaC principles and tools like Terraform, Pulumi, Atlantis, and Crossplane.
Advanced scripting and automation skills (Python, Bash).
Excellent leadership and communication skills.
Proven problem-solving skills and the ability to work both independently and collaboratively.
A passion for continuous learning and innovation.
Willingness to participate in on-call duty.
This position is open to all candidates.