Responsibilities
Operate and maintain AWS-native services in production, including Lambda, ECS, EKS, FSx, Redshift, Glue, Neptune, SES, GuardDuty, WAF, Shield Advanced, and Security Hub.
Ensure uptime, availability, and secure operations.
Monitor infrastructure, manage alerts, and support production incidents.
Design and maintain infrastructure deployment pipelines using Terraform, CloudFormation, and Ansible.
Troubleshoot environment drift and pipeline failures.
Promote automation in cloud operations.
Manage patching across RHEL (v8→v9) and Windows Server (2016→2025) using AWS Patch Manager, WSUS, and YUM / DNF.
Schedule, automate, and track patches; coordinate approvals and ensure compliance.
Identify and remediate end-of-life components such as OS versions and Lambda runtimes.
Integrate tools like NGINX into the observability stack.
Work with SRE teams to enhance infrastructure monitoring.
Maintain infrastructure runbooks, patch / change logs, post-mortem reports, and audit documentation.
Qualifications
Bachelor's degree in computer science, information systems, or a related field.
Minimum six years' experience in DevOps / SRE roles, with at least four years in a public sector or regulated cloud environment.
Application Information
Interested candidates, please send your CV to Regret to inform that only shortlisted candidates will be notified.
Seniority Level
Mid-Senior level
Employment Type
Full-time
Job Function
Information Technology
Industries
Information Services
#J-18808-Ljbffr
Cloud Engineer • Singapore, Singapore, Singapore