Talent.com
This job offer is not available in your country.
Site Reliability Engineer (Linux Kernel, Kubernetes, Cloud, Automation, Networking). - Islandwide, SG

Site Reliability Engineer (Linux Kernel, Kubernetes, Cloud, Automation, Networking). - Islandwide, SG

EXASOFT CONSULTING PTE. LTD.Islandwide, SG
11 days ago
Job description

Roles & Responsibilities

Responsibilities

  • Develop and oversee performance-critical infrastructure for financial markets, ensuring maximum throughput, high resiliency, and minimal operational risk.
  • Leverage deep Linux kernel expertise to fine-tune scheduling policies, interrupt routing, and NUMA resource allocation, ensuring predictable performance at scale.
  • Build and maintain high-availability containerized environments using Kubernetes, Docker, and advanced orchestration tools with a strong focus on scalability and security.
  • Lead automation initiatives with Ansible, Bash, and Python, eliminating manual intervention and improving system efficiency.
  • Manage hybrid cloud infrastructure (AWS, Azure, GCP) with strict performance SLAs, security compliance, and cost-optimized deployments.
  • Oversee infrastructure monitoring and observability using ELK Stack, Grafana, Site24x7, Splunk, and other enterprise-grade tools, ensuring proactive incident detection and resolution.
  • Administer and troubleshoot enterprise storage and networking stacks like RAID, NFS, SAN / NAS, TCP / IP networking,VMware / vCenter, BigIP load balancers.
  • Collaborate with development, DevOps, and security teams to design fault-tolerant systems and enforce infrastructure governance policies.
  • Execute predictive capacity modeling, OS hardening and patch compliance, coupled with benchmark-driven performance optimization for trading and real-time compute platforms.
  • Provide expert-level outage resolution, coordinating cross-functional teams to deliver sustainable remediation and operational resilience.

Requirements

  • 10+ years of progressive experience in system administration, performance engineering, and reliability operations across enterprise and financial domains.
  • Advanced proficiency in Linux internals with specialization in kernel performance tuning, NUMA-aware optimizations, and real-time workload handling.
  • Proven hands-on experience with Kubernetes, Docker, and Ansible for large-scale automation and orchestration.
  • Strong scripting / programming in Bash, Python, and experience with perf / eBPF for system analysis.
  • Demonstrated expertise in cloud operations across AWS, Azure, and GCP.
  • Strong background in networking protocols (TCP / IP, FIX) and high-performance trading environments.
  • Familiarity with storage systems (SAN, NAS, RAID) and database tuning (MySQL optimization).
  • Experience implementing observability and monitoring solutions like ELK, Grafana, Splunk, Corvil.
  • Tell employers what skills you have

    Remediation

    Scalability

    Kubernetes

    Modeling

    MySQL

    Throughput

    Routing

    Tuning

    System Administration

    Hardening

    Performance Tuning

    Operational Risk

    Docker

    Ansible

    Orchestration

    Create a job alert for this search

    Site Reliability Engineer • Islandwide, SG

    Related jobs
    • Promoted
    Cloud Site Reliability Engineer (AWS) - Islandwide, SG

    Cloud Site Reliability Engineer (AWS) - Islandwide, SG

    PERSOLKELLY SINGAPORE PTE. LTD.Islandwide, SG
    Cloud Site Reliability Engineer (AWS).An excellent opportunity has just arisen for a Cloud Site Reliability Engineer (AWS) to join a global technology leader supporting secure, mission-critical clo...Show moreLast updated: 3 days ago
    • Promoted
    Cloud Site Reliability Engineer (AWS)

    Cloud Site Reliability Engineer (AWS)

    PERSOLKELLY SINGAPORE PTE. LTD.Islandwide, SG
    Cloud Site Reliability Engineer (AWS).An excellent opportunity has just arisen for a Cloud Site Reliability Engineer (AWS) to join a global technology leader supporting secure, mission-critical clo...Show moreLast updated: 3 days ago
    • Promoted
    Cloud Site Reliability Engineer

    Cloud Site Reliability Engineer

    PERSOLKELLY SINGAPORE PTE. LTD.Islandwide, SG
    Cloud Site Reliability Engineer.An excellent Cloud Site Reliability Engineer (AWS) opportunity has just arisen in a global brand supporting mission-critical government systems.Ensure reliable, secu...Show moreLast updated: 3 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer - WECHAT INTERNATIONAL PTE. LTD.

    Senior Site Reliability Engineer - WECHAT INTERNATIONAL PTE. LTD.

    WECHAT INTERNATIONAL PTE. LTD.D01 Cecil, Marina, People’s Park, Raffles Place, SG
    Responsible for the operation and maintenance of overseas model services at Hunyuan, ensuring stable, reliable, and efficient service operations. .Responsible for capacity management and planning, r...Show moreLast updated: 21 hours ago
    • Promoted
    Site Reliability Engineer (SRE) - Islandwide, SG

    Site Reliability Engineer (SRE) - Islandwide, SG

    PERSOLKELLY SINGAPORE PTE. LTD.Islandwide, SG
    Site Reliability Engineer (SRE).An excellent Site Reliability Engineer (SRE) opportunity is available in a cutting-edge, fast-growing cloud environment. Deliver reliable, secure, and scalable cloud ...Show moreLast updated: 1 day ago
    • Promoted
    Cloud Site Reliability Engineer - PERSOLKELLY SINGAPORE PTE. LTD.

    Cloud Site Reliability Engineer - PERSOLKELLY SINGAPORE PTE. LTD.

    PERSOLKELLY SINGAPORE PTE. LTD.Islandwide, SG
    Cloud Site Reliability Engineer.An excellent Cloud Site Reliability Engineer (AWS) opportunity has just arisen in a global brand supporting mission-critical government systems.Ensure reliable, secu...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CAREER INTERNATIONAL - FOS PTE. LTD.D11 Novena, Thomson, Watten Estate, SG
    Ensure the stability, reliability, and efficient operation of the Company's global business, maintaining high availability of services at all times. Responsible for core operational tasks such as re...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer (MCS) - D05 Clementi New Town, Hong Leong Garden, Pasir Panjang, SG

    Site Reliability Engineer (MCS) - D05 Clementi New Town, Hong Leong Garden, Pasir Panjang, SG

    THALES DIS (SINGAPORE) PTE. LTD.D05 Clementi New Town, Hong Leong Garden, Pasir Panjang, SG
    You will work in a Devops team managing ODC products in GCP Cloud, following the SRE approach.You will develop and maintain IAC code and automation tools. You will be responsible to provide technica...Show moreLast updated: 8 days ago
    • Promoted
    Site Reliability Engineer (MCS) - THALES DIS (SINGAPORE) PTE. LTD.

    Site Reliability Engineer (MCS) - THALES DIS (SINGAPORE) PTE. LTD.

    THALES DIS (SINGAPORE) PTE. LTD.D05 Clementi New Town, Hong Leong Garden, Pasir Panjang, SG
    You will work in a Devops team managing ODC products in GCP Cloud, following the SRE approach.You will develop and maintain IAC code and automation tools. You will be responsible to provide technica...Show moreLast updated: 8 days ago
    • Promoted
    Site Reliability Engineer (MCS)

    Site Reliability Engineer (MCS)

    THALES DIS (SINGAPORE) PTE. LTD.D05 Clementi New Town, Hong Leong Garden, Pasir Panjang, SG
    You will work in a Devops team managing ODC products in GCP Cloud, following the SRE approach.You will develop and maintain IAC code and automation tools. You will be responsible to provide technica...Show moreLast updated: 30+ days ago
    • Promoted
    DevSecOps - TOTAL EBIZ SOLUTIONS PTE. LTD.

    DevSecOps - TOTAL EBIZ SOLUTIONS PTE. LTD.

    TOTAL EBIZ SOLUTIONS PTE. LTD.D14 Geylang, Eunos, SG
    Job Title : Site Reliability Engineer (SRE).Bachelor's degree or Diploma in Computer Science, Engineering, or a related field (or equivalent experience). Proven experience as a Site Reliability Engin...Show moreLast updated: 11 days ago
    • Promoted
    Site Reliability Engineer - D11 Novena, Thomson, Watten Estate, SG

    Site Reliability Engineer - D11 Novena, Thomson, Watten Estate, SG

    CAREER INTERNATIONAL - FOS PTE. LTD.D11 Novena, Thomson, Watten Estate, SG
    Ensure the stability, reliability, and efficient operation of the Company's global business, maintaining high availability of services at all times. Responsible for core operational tasks such as re...Show moreLast updated: 7 days ago
    • Promoted
    Cloud Site Reliability Engineer - Islandwide, SG

    Cloud Site Reliability Engineer - Islandwide, SG

    PERSOLKELLY SINGAPORE PTE. LTD.Islandwide, SG
    Cloud Site Reliability Engineer.An excellent Cloud Site Reliability Engineer (AWS) opportunity has just arisen in a global brand supporting mission-critical government systems.Ensure reliable, secu...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    PERSOLKELLY SINGAPORE PTE. LTD.Islandwide, SG
    Site Reliability Engineer (SRE).An excellent Site Reliability Engineer (SRE) opportunity is available in a cutting-edge, fast-growing cloud environment. Deliver reliable, secure, and scalable cloud ...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer (SRE) - PERSOLKELLY SINGAPORE PTE. LTD.

    Site Reliability Engineer (SRE) - PERSOLKELLY SINGAPORE PTE. LTD.

    PERSOLKELLY SINGAPORE PTE. LTD.Islandwide, SG
    Site Reliability Engineer (SRE).An excellent Site Reliability Engineer (SRE) opportunity is available in a cutting-edge, fast-growing cloud environment. Deliver reliable, secure, and scalable cloud ...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Senior Site Reliability Engineer - D01 Cecil, Marina, People’s Park, Raffles Place, SG

    Senior Site Reliability Engineer - D01 Cecil, Marina, People’s Park, Raffles Place, SG

    WECHAT INTERNATIONAL PTE. LTD.D01 Cecil, Marina, People’s Park, Raffles Place, SG
    Responsible for the operation and maintenance of overseas model services at Hunyuan, ensuring stable, reliable, and efficient service operations. .Responsible for capacity management and planning, r...Show moreLast updated: 21 hours ago
    • Promoted
    Senior Engineer - Reliability - GLOBALFOUNDRIES SINGAPORE PTE. LTD.

    Senior Engineer - Reliability - GLOBALFOUNDRIES SINGAPORE PTE. LTD.

    GLOBALFOUNDRIES SINGAPORE PTE. LTD.D25 Kranji, Woodgrove, Woodlands, SG
    GlobalFoundries is a leading full-service semiconductor foundry providing a unique combination of design, development, and fabrication services to some of the world’s most inspired technology compa...Show moreLast updated: 1 day ago
    • Promoted
    Cloud Site Reliability Engineer (AWS) - PERSOLKELLY SINGAPORE PTE. LTD.

    Cloud Site Reliability Engineer (AWS) - PERSOLKELLY SINGAPORE PTE. LTD.

    PERSOLKELLY SINGAPORE PTE. LTD.Islandwide, SG
    Cloud Site Reliability Engineer (AWS).An excellent opportunity has just arisen for a Cloud Site Reliability Engineer (AWS) to join a global technology leader supporting secure, mission-critical clo...Show moreLast updated: 3 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WECHAT INTERNATIONAL PTE. LTD.D01 Cecil, Marina, People’s Park, Raffles Place, SG
    Responsible for the operation and maintenance of overseas model services at Hunyuan, ensuring stable, reliable, and efficient service operations. .Responsible for capacity management and planning, r...Show moreLast updated: 21 hours ago
    • Promoted
    Site Reliability Engineer - CAREER INTERNATIONAL - FOS PTE. LTD.

    Site Reliability Engineer - CAREER INTERNATIONAL - FOS PTE. LTD.

    CAREER INTERNATIONAL - FOS PTE. LTD.D11 Novena, Thomson, Watten Estate, SG
    Ensure the stability, reliability, and efficient operation of the Company's global business, maintaining high availability of services at all times. Responsible for core operational tasks such as re...Show moreLast updated: 7 days ago