Talent.com
This job offer is not available in your country.
HPC AI Infrastructure Hardware Manager

HPC AI Infrastructure Hardware Manager

3160 KLA-Tencor (Singapore)Singapore, Singapore
18 days ago
Job description

Description

Preferred Qualifications

The ideal candidate will have a strong understanding of HPC infrastructure, Experience in deriving Hardware Specs based requirements, and proficiency in product lifecycle management. They will engage with teams to understand their requirements, drive development for our HPC platforms, and collaborate with other teams for integration. The candidate should also have expertise in Hardware System Design, Linux Systems Administration, container orchestration, networking, security, diagnostics tooling and performance tuning. Experience integrating, testing, and optimizing the integration of HPC with storage and data platforms is also essential.

Principal Responsibilities :

Drive team growth and development, providing mentorship and support to team members.

Ensure the successful execution of projects, meeting deadlines and delivering high-quality results.

Work with various OEMs to understand their Product offerings and Roadmaps to create optimal HPC Solution Offerings.

Collaborate with other sub-system teams on developing HPC Cluster Roadmaps that meet Product Requirements.

Collaborate within a customer-focused teams to design, develop, test, and deploy Embedded HPC infrastructure in alignment with business needs.

Foster strong relationships with Product and Program Management, Software engineering, Mfg and Service teams to ensure the HPC Platforms effectively meet their requirements.

Qualifications / Skills :

3+ years’ experience in managing, and mentoring teams.

Knowledge of Linux Hardware Ecosystem centered around CPU, GPU and PCIE Architecture.

Deep understanding of Linux Operating systems, Networking with practical experience in tuning HPC workloads.

Experience with configuration management and automation tools, such as Chef, Ansible, Salt, Packer

Experience with building monitoring and alerting on logs and metrics with excellent troubleshooting and analytical skills.

Experience with and a strong understanding of containers (docker / singularity). Container orchestration with Kubernetes a Plus.

Maintain a grounded approach, making decisions based on data and strategic goals rather than emotions and clearly articulate the decisions.

International traveling couple times a year will be required.

Minimum Qualifications

Engineering degree (Preferably CS, CE)

Experience working with HPC Technologies.

Be aware of potentially fraudulent job postings or suspicious recruiting activity by persons that are currently posing as KLA employees. KLA never asks for any financial compensation to be considered for an interview, to become an employee, or for equipment. Further, KLA does not work with any recruiters or third parties who charge such fees either directly or on behalf of KLA. Please ensure that you have searched for legitimate job postings. KLA follows a recruiting process that involves multiple interviews in person or on video conferencing with our hiring managers. If you are concerned that a communication, an interview, an offer of employment, or that an employee is not legitimate, please send an email to to confirm the person you are communicating with is an employee. We take your privacy very seriously and confidentially handle your information.

Create a job alert for this search

Manager Infrastructure • Singapore, Singapore

Related jobs
Project Lead, Hardware Cybersecurity AI Solutions

Project Lead, Hardware Cybersecurity AI Solutions

X-PHYsingapore, North East, SG
Quick Apply
We are seeking a highly motivated and experienced Project Lead to spearhead the development of cutting-edge hardware cybersecurity solutions that leverage the power of Artificial Intelligence.As th...Show moreLast updated: 30+ days ago
Staff Platform Engineer - High Performance Computing Infrastructure Platform Management

Staff Platform Engineer - High Performance Computing Infrastructure Platform Management

Centre for Strategic Infocomm TechnologiesSingapore, Singapore
We are seeking an experienced HPC Staff Engineer to join our team, responsible for managing and optimizing our HPC infrastructure platform. The successful candidate will have a deep understanding of...Show moreLast updated: 30+ days ago
  • Promoted
Senior HPC Deployment Project Manager

Senior HPC Deployment Project Manager

Hewlett Packard Enterprise Development LPSingapore, Pedra Branca, Singapore
Senior HPC Deployment Project Manager.This role has been designed as ‘’Onsite’ with an expectation that you will primarily work from an HPE office. Hewlett Packard Enterprise is the global edge-to-c...Show moreLast updated: 1 day ago
  • Promoted
Solutions Architect, Cloud Services

Solutions Architect, Cloud Services

Borr DrillingSingapore, Pedra Branca, Singapore
Join us at NVIDIA as a Cloud Services Solutions Architect and help our customers and partners adopt end-to-end AI solutions in the cloud! As a Solutions Architect, you’ll design, configure and depl...Show moreLast updated: 24 days ago
Manager HPC Resource Management (PDB), NSCC

Manager HPC Resource Management (PDB), NSCC

A •STAR RESEARCH ENTITIESSingapore
As Manager for HPC Resource Management (PDB), you will be a part of a high performing, dynamic and important team of people that work cross-functionally across strategy development and industry eng...Show moreLast updated: 14 days ago
  • Promoted
Solutions Architect, Cloud Services

Solutions Architect, Cloud Services

NVIDIA CorporationSingapore, Pedra Branca, Singapore
Solutions Architect, Cloud Services page is loaded.Solutions Architect, Cloud Services.Apply locations Singapore, Singapore-Suntec Tower time type Full time posted on Posted Yesterday job requisiti...Show moreLast updated: 25 days ago
Solutions Engineer - AI Infrastructure.

Solutions Engineer - AI Infrastructure.

CiscoSingapore, Singapore
We are seeking a Solutions Engineer - Artificial Intelligence (AI) to join our dynamic sales team.As an SE (AI), you will drive the adoption of our AI solutions across various industries.You will i...Show moreLast updated: 18 days ago
Principal Cloud Architect – HPC / GPU & AI Platform Solutions

Principal Cloud Architect – HPC / GPU & AI Platform Solutions

OracleSingapore
Architect and deploy large-scale GPU / HPC infrastructure on OCI using tools like Terraform, Ansible, Slurm and Kubernetes. Build automated solutions for cluster provisioning, software deployment, and...Show moreLast updated: 6 days ago
  • Promoted
Engineering Manager - Hardware

Engineering Manager - Hardware

RapsodoSingapore, Pedra Branca, Singapore
Be among the first 25 applicants.Are you a problem solver at heart with a passion for building high-performance embedded systems? Do you thrive leading talented engineering teams to develop innovat...Show moreLast updated: 9 days ago
Business Development Manager APAC (infrastructure / hardware

Business Development Manager APAC (infrastructure / hardware

HudsonSingapore
Industry : Infrastructure & Hardware Solutions.Reports to : VP Business Development based in US.Are you a strategic business development leader with a strong track record in growing infrastructure an...Show moreLast updated: 18 days ago
Quantum Integration System Engineer (Hardware), NSCC

Quantum Integration System Engineer (Hardware), NSCC

A •STARSingapur
We are looking for a talented Quantum Integration Systems Engineer to lead the integration of quantum technologies into classical HPC systems. The role involves bridging the gap between quantum hard...Show moreLast updated: 13 days ago
  • Promoted
Lead Engineer, AI Infrastructure

Lead Engineer, AI Infrastructure

HTX (Home Team Science & Technology Agency)Singapore, Pedra Branca, Singapore
HTX is the world’s first Science and Technology agency for Public Safety and Security.Home Team, our shared mission is to amplify, augment and accelerate the Home Team’s advantage in securing Singa...Show moreLast updated: 2 days ago
Software Engineer, AI Infrastructure

Software Engineer, AI Infrastructure

ByteDanceSingapore
ResponsibilitiesTeam IntroductionOur team is dedicated to building a highly available and scalable general-purpose Serverless platform that embodies the philosophy of Function-as-a-Service (FaaS).B...Show moreLast updated: 18 days ago
  • Promoted
Solutions Architect, Cloud Services

Solutions Architect, Cloud Services

NVIDIASingapore, Pedra Branca, Singapore
Solutions Architect, Cloud Services.Solutions Architect, Cloud Services.Join us at NVIDIA as a Cloud Services Solutions Architect and help our customers and partners adopt end-to-end AI solutions i...Show moreLast updated: 6 days ago
AI Infrastructure Platform Engineer

AI Infrastructure Platform Engineer

NTTSingapore, South East, Singapore
Join a company that is pushing the boundaries of what is possible.We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society.Our wo...Show moreLast updated: 18 days ago
  • Promoted
Change And Release Manager

Change And Release Manager

RiDiK (a Subsidiary of CLPS. Nasdaq : CLPS)Singapore, Pedra Branca, Singapore
Get AI-powered advice on this job and more exclusive features.Direct message the job poster from RiDiK (a Subsidiary of CLPS. APAC Senior Talent Acquisition Specialist @Shell Infotech RiDiK | Hiring...Show moreLast updated: 25 days ago
  • Promoted
Engineering Manager - Hardware

Engineering Manager - Hardware

Rapsodo Inc.Singapore, Pedra Branca, Singapore
Be among the first 25 applicants.Get AI-powered advice on this job and more exclusive features.Are you a problem solver at heart with a passion for building high-performance embedded systems? Do yo...Show moreLast updated: 21 days ago
MTS Test Engineer

MTS Test Engineer

Advanced Micro Devices, IncSingapore, Singapore
WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that ...Show moreLast updated: 18 days ago