Senior Cloud Operations Engineer
Caseware
Bogotá, DC, CO
Remote
Compensation: Competitive salary and comprehensive benefits package.
Job Description
About the Role
We are looking for a highly skilled and experienced Cloud Operations Engineer to join our dynamic Cloud Operations team. In this role, you will play a critical part in maintaining, optimizing, and evolving our cloud applications and infrastructure. You’ll work closely with cross-functional teams to ensure our cloud environments are secure, reliable, scalable, and at the forefront of cloud technology.
What you will do:
- Design, deploy, and maintain secure, scalable, and high-performing cloud infrastructure using AWS services such as Organizations, Control Tower, Landing Zone, and IAM Identity Center, while ensuring compliance with organizational policies.
- Lead cloud migration initiatives, including detailed planning, execution, and post-migration support for application and infrastructure transitions from on-premises or other cloud environments to AWS.
- Implement and manage Infrastructure as Code (IaC) using tools such as AWS CDK (preferred), CloudFormation, and Terraform to ensure consistent, automated, and repeatable cloud deployments.
- Provision and manage cloud resources including compute (EC2), storage (S3), databases (RDS/Aurora, DocumentDB), filesystems (FSx), container orchestration (EKS), and machine images (AMI Factory) to support the requirements of applications and services.
- Define and enforce configuration management standards, ensuring consistency, reliability, and compliance across all cloud environments.
- Develop, maintain, and optimize automation scripts and tools to streamline operational processes, reduce manual intervention, and enhance system reliability.
- Establish and uphold observability best practices, including monitoring, logging, and alerting, to ensure ongoing system health and optimal performance.
- Respond promptly to incidents, troubleshoot complex technical issues, and work collaboratively with development and IT teams to minimize downtime and maintain high service quality.
- Design, implement, and maintain robust backup and disaster recovery strategies to meet Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO), utilizing tools such as AWS Backup and Vaults.
- Ensure cloud security and compliance by applying best practices, conducting regular audits, and enforcing policies to protect sensitive data and maintain the integrity of cloud infrastructure.
- Monitor cloud resource usage, analyze cost trends, and implement cost-optimization strategies to maximize cloud investment.
- Design and manage secure and efficient networking solutions using Transit Gateway, IPAM, and VPC to support seamless connectivity and data flow across environments.
- Collaborate with cross-functional teams to align cloud strategies with business goals and provide technical guidance on cloud best practices.
- Participate in a 24/7 on-call rotation to support critical infrastructure, respond to operational incidents, and ensure high availability.
- Document infrastructure designs, configurations, and operational procedures comprehensively to support knowledge sharing and facilitate team collaboration.
- Stay current with emerging cloud technologies and proactively recommend improvements or new approaches to enhance cloud operations and platform capabilities.
- Leverage prior experience in an Enterprise SaaS environment, placing a strong emphasis on quality, high availability, and attention to detail in every aspect of cloud operations.
What you will bring:
- 5+ years of experience in cloud operations, with a strong focus on AWS.
- Extensive experience with and understanding of AWS Organizations, AWS Backup, AMI Factory, NAT Gateways, WAF, CloudTrail, Lambda, S3, RDS, EKS, ECR, and EC2.
- Hands-on experience implementing zero-trust security models and industry best practices.
- Excellent problem-solving and analytical abilities, with a proactive approach to troubleshooting.
- Strong communication and collaboration skills, with experience working in cross-functional teams.
- Experience with automation tools such as CloudFormation and CDK.
- Knowledge of containerization and orchestration technologies, including Docker and Kubernetes.
- Nice to have: experience managing north-south traffic (HAProxy, Istio Gateway, API Gateways), experience with GitOps using Flux, and knowledge of Kubernetes core concepts and scaling patterns (HPA, KEDA).
Added on 11/17/2025