Network Operation Engineer

HORIZON GLOBAL SERVICES PTE. LTD., Singapore
12 days ago
Job description

Requirement: Singaporean candidates only (Cat 1 or Cat 2A).

Key Responsibilities:

Infrastructure Management:

  • Administer and manage HPC infrastructure with 700+ compute nodes and 50+ AWS cloud instances.
  • Ensure smooth operation and integration of HPC systems, storage subsystems, and networking components.

Linux Systems Administration:

  • Perform administration on Red Hat and CentOS servers.
  • Handle patching, compiling, securing, and troubleshooting in a heterogeneous environment.

Monitoring & Automation:

  • Implement and maintain system monitoring, configuration management, and automation using tools like Puppet, Splunk, BigFix, Ganglia, and Nagios.

Job Scheduling:

  • Manage job scheduling environments with PBS or equivalent workload schedulers.

Technical Support:

  • Provide advanced troubleshooting for researchers and developers in HPC environments.

Performance Optimization:

  • Contribute to system performance, reliability, and scalability enhancements.

Change Management:

  • Coordinate and implement changes across development, testing, and production environments.

Collaboration:

  • Work closely with internal IT teams and research staff to meet infrastructure demands.

Disaster Recovery & Documentation:

  • Participate in disaster recovery planning and maintain system documentation.

Required Skills & Tools:

Operating Systems:

  • Red Hat
  • CentOS

HPC Tools & Technologies:

  • xCAT
  • PBS Scheduler
  • InfiniBand
  • Lustre

Scripting & Automation:

  • Bash
  • Python

Compilers:

  • Intel
  • CUDA

Cloud Technologies:

  • AWS (Certified Solutions Architect - Associate preferred)

Monitoring & Configuration Tools:

  • Puppet
  • Splunk
  • BigFix
  • Ganglia
  • Nagios

Cluster Management:

  • Red Hat PCS
  • Parallel File Systems

Soft Skills:

  • Strong communication skills.
  • Analytical thinking.
  • Advanced troubleshooting capabilities.