About the role
Senior-level cloud infrastructure engineer responsible for managing and supporting multi-cloud environments (AWS, Azure, OpenShift) with focus on high availability, disaster recovery, security, and 24×7 production support for a major Asian bank.
BankingOnsite
Key Responsibilities
- Manage and support AWS production environments, ensuring high availability, reliability, and strict compliance
- Perform incident management, RCA, and problem resolution for production issues
- Monitor AWS resources using tools such as CloudWatch, CloudTrail, and third-party monitoring platforms
- Operate and support AWS core services: EC2, EBS, ELB/ALB, Auto Scaling VPC, Subnets, Security Groups, NACLs, IAM S3, RDS (basic operational support)
- Implement backup, restore, and disaster recovery (DR) strategies
- Provide basic support for Azure components: Azure VMs, VNets, NSGs, Storage, Load Balancers
- Monitor Azure services using Azure Monitor and Alerts
- Support hybrid cloud or multi-cloud integration and connectivity scenarios
- Manage and support OpenShift clusters in production environments
- Perform cluster health checks, upgrades, lifecycle management, and troubleshooting
Requirements
- Hands-on experience in managing multi-cloud environments, specifically AWS, Azure, and OpenShift infrastructure
- Strong expertise in administration and troubleshooting across distributed platforms (Unix/Linux/Windows)
- Solid understanding of high availability, performance tuning, resilience engineering, and security implementation
- Experience supporting service recovery, including disaster recovery (DR) testing, environment hardening, and root cause analysis (RCA) of production incidents
- Willingness to participate in a 24×7 on-call rotation
- Solid understanding of container technologies (Docker/OCI) and Kubernetes fundamentals