Job Duties
- Manage Multi-Cloud Operations – Operate, maintain, and troubleshoot cloud-native services across AWS, Azure, and GCP, ensuring uptime, performance, and scalability in production environments.
- Lead Infrastructure as Code (IaC) Practices – Maintain and enhance IaC pipelines using Terraform, Ansible, or ARM templates, resolving drift and deployment issues while promoting automation and GitOps practices.
- Oversee OS Lifecycle & Patch Management – Lead Windows and Linux patching operations using AWS Patch Manager, Azure Update Management, WSUS, SCCM, and YUM / DNF, ensuring compliance and audit readiness.
- Support Application Deployment & Troubleshooting – Deploy, monitor, and troubleshoot applications on Windows and Linux servers; optimize OS-level performance and collaborate with development teams on infrastructure-related issues.
- Enforce Security & Compliance – Implement CIS hardening, remediate vulnerabilities using tools such as Trend Micro Vision One, Qualys, and Tenable, and ensure adherence to government security and audit standards.
- Drive Containerization & DevSecOps Integration – Support containerized environments (Docker, Kubernetes, ECS, EKS, AKS, GKE) and CI / CD pipelines, aligning with SHIP-HATS and other government DevSecOps frameworks.
- Maintain ITIL & Service Management Processes – Manage incidents, problems, and changes through ITSM tools (ServiceNow, Jira), coordinate CAB reviews, and ensure SLAs / OLAs are consistently met.
- Implement Monitoring & Observability Tools – Integrate monitoring and log analysis solutions using CloudWatch, Azure Monitor, and GCP Cloud Logging to enhance infrastructure visibility and reliability.
- Lead Documentation & Knowledge Management – Develop and maintain detailed runbooks, SOPs, architecture diagrams, and CMDB entries to ensure operational consistency and audit readiness.
- Provide Technical Leadership & Mentorship – Guide Level 2 and junior engineers through technical escalations, training sessions, and best practice adoption, fostering a culture of operational excellence.
Job Requirements
Education : Bachelor's degree in Computer Science, Information Systems, or a related field.Experience : Minimum 5 years in cloud engineering, with at least 3 years in AWS / Azure / GCP environments and 2 years in regulated or public-sector settings.Cloud Expertise : Proven experience managing production workloads across AWS, Azure, and GCP, including core services such as EC2, EKS, AKS, and GKE.IaC & Automation : Hands-on proficiency in Terraform, Ansible, or ARM templates; scripting skills using PowerShell, Bash, or Python.OS Management : Strong understanding of Windows Server administration and patching, with working knowledge of Linux (RHEL).Security Knowledge : Experience with CIS Benchmarks, IAM best practices, vulnerability remediation, and SSL certificate management.DevSecOps & Containers : Familiarity with CI / CD pipelines, container orchestration tools, and Singapore Government's SHIP-HATS or IM8 frameworks.ITIL & ITSM : Strong understanding of ITIL processes and experience using ITSM platforms such as ServiceNow or Jira.Leadership Skills : Proven ability to mentor, lead technical teams, and handle escalations in complex multi-cloud environments.Certifications (Preferred) : AWS Solutions Architect / SysOps Administrator, Azure Administrator / Architect Expert, RHCE or LPIC, and ITIL v4 Foundation.To Apply, please kindly email your updated resume to
Regret to inform that only shortlisted candidates will be notified.
CEI : R
EA License : 14C7275