Roles & Responsibilities
Hi,
Immediate Hiring
Job Description
The Platform Operations Engineer will work with government agencies to maintain and support their critical on-premises platforms and infrastructure. The role focuses on ensuring platform reliability, implementing modern operational practices, and supporting infrastructure modernisation initiatives while maintaining robust support for existing agency systems. You will work closely with stakeholders to ensure the stability and efficiency of essential government IT infrastructure.
Key Responsibilities:
- Maintain critical infrastructure platforms including compute, storage, virtualisation, and supporting systems across development, staging and production environments.
- Follow and implement platform standards, applying infrastructure automation and modern operational practices to improve efficiency and reliability.
- Support platform enhancement initiatives and implementation of new infrastructure solutions, ensuring alignment with enterprise architecture standards.
- Manage virtualisation platforms (e.g., VMware, Hyper-V), including capacity monitoring, performance optimisation, and lifecycle management.
- Implement and maintain robust monitoring and observability solutions for all platform components using modern tooling (e.g., Prometheus, Grafana, ELK stack).
- Execute platform patching strategies, leveraging automation to maintain security and stability while minimising service disruption.
- Provide L2/L3 technical support for platform-related incidents, conducting problem determination and resolution.
- Support containerisation initiatives and maintain container orchestration platforms for traditional workloads where applicable.
- Implement Infrastructure as Code (IaC) practices to automate platform provisioning and configuration management.
- Maintain backup, disaster recovery (DR), and high-availability solutions for critical platform components.
- Implement and follow security controls, including access management, security hardening, and compliance monitoring.
- Collaborate with application teams to support platform stability, performance, and scalability requirements.
- Create and maintain platform documentation, runbooks, and standard operating procedures.
- Support team members on platform operations and modern infrastructure practices.
Regards
Kshama
91 9833964181
kshama.raj@blueocean.systems
Skills
Scalability
PostgreSQL
VMware
Reliability
Staging
Distributed Systems
Enterprise Architecture
Configuration Management
Python
Hardening
Virtualisation
Orchestration
API
Linux
Technical Support