Roles & Responsibilities
The Platform Operations Engineer will be responsible for maintaining and supporting critical on-premises platforms and infrastructure. This role ensures the reliability, security, and efficiency of enterprise IT environments while enabling infrastructure modernisation initiatives. The engineer will work closely with cross-functional teams and stakeholders to uphold operational excellence and deliver stable, scalable, and secure infrastructure solutions.
Key Responsibilities
- Maintain and support enterprise infrastructure platforms including compute, storage, virtualisation, and supporting systems across development, staging, and production environments.
- Implement and enforce platform standards, leveraging automation and modern operational practices to improve efficiency, consistency, and reliability.
- Support infrastructure modernisation initiatives and new solution implementations in line with enterprise architecture standards.
- Manage virtualisation platforms (e.g., VMware, Hyper-V), covering capacity monitoring, performance optimisation, and lifecycle management.
- Implement and maintain monitoring and observability solutions (e.g., Prometheus, Grafana, ELK stack) across platform components.
- Execute patching and upgrade strategies using automation, ensuring platform security and stability with minimal disruption.
- Provide L2/L3 technical support for platform-related incidents, including troubleshooting, root cause analysis, and resolution.
- Support containerisation efforts and maintain container orchestration platforms where applicable.
- Apply Infrastructure as Code (IaC) practices to automate provisioning, configuration, and lifecycle management of infrastructure.
- Manage backup, disaster recovery (DR), and high-availability solutions for mission-critical systems.
- Implement security best practices including access management, security hardening, and compliance monitoring.
- Collaborate with application and security teams to ensure performance, scalability, and compliance requirements are met.
- Create and maintain platform documentation, including runbooks, SOPs, and technical guides.
- Mentor and support team members in platform operations and modern infrastructure practices.
Requirements
Technical Requirements
- Strong experience with enterprise virtualisation platforms (VMware vSphere, Hyper-V).
- Hands-on expertise with enterprise storage solutions (SAN, NAS) and backup platforms.
- Proficiency in Linux and Windows Server administration.
- Experience with infrastructure automation tools (e.g., Ansible, Puppet, Chef).
- Knowledge of containerisation technologies (Docker, Kubernetes).
- Familiarity with observability and monitoring platforms.
- Practical experience with Infrastructure as Code practices.
- Strong understanding of networking concepts and technologies.
- Proficiency in scripting (Python, PowerShell, Bash).
- Experience with high-availability and disaster recovery solutions.
Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- Proven experience in infrastructure operations and engineering roles.
- Strong knowledge of enterprise infrastructure components and interdependencies.
- Experience in supporting infrastructure modernisation and transformation projects.
- Excellent problem-solving and analytical skills.
- Strong documentation skills and attention to detail.
- Effective communication skills with both technical and non-technical stakeholders.
Desired Certifications
- VMware Certified Professional (VCP).
- Microsoft Certified: Windows Server.
- Red Hat Certified Engineer (RHCE).
- ITIL v4 Foundation.
Tell employers what skills you have
Puppet
Scalability
Kubernetes
Analytical Skills
VMware
Root Cause Analysis
Scripting
Reliability
Networking
Python
Docker
ITIL
Ansible
Orchestration
Linux
Technical Support