Back to jobs
Quest Global

HPC Hardware/Network Engineer– Linux Systems

Milpitas, CAPosted 4 weeks ago
Temporary

Job Description

Job Requirements

Quest Global delivers world-class end-to-end engineering solutions by leveraging our deep industry knowledge and digital expertise. By bringing together technologies and industries, alongside the contributions of diverse individuals and their areas of expertise, we are able to solve problems better, faster. This multi-dimensional approach enables us to solve the most critical and large-scale challenges across the aerospace & defense, automotive, energy, hi-tech, healthcare, medical devices, rail and semiconductor industries.

 

We are looking for humble geniuses, who believe that engineering has the potential to make the impossible possible; innovators, who are not only inspired by technology and innovation, but also perpetually driven to design, develop, and test as a trusted partner for Fortune 500 customers. As a team of remarkably diverse engineers, we recognize that what we are really engineering is a brighter future for us all. If you want to contribute to meaningful work and be part of an organization that truly believes when you win, we all win, and when you fail, we all learn, then we’re eager to hear from you.

 

The achievers and courageous challenge-crushers we seek, have the following characteristics and skills:


We’re looking for a hands-on HPC Engineer to assist with the setup, maintenance, and operation of our high-performance computing cluster.


This role is ideal for someone with practical experience in Linux systems, containers, and basic scripting, who enjoys working in a fast-paced technical environment.


Key Responsibilities:


· Develop and maintain customized SUSE Linux OS images aligned with Client hardware and software requirements.

· Use configuration management tools such as Salt or Ansible to automate and streamline system configuration.

· Implement a test-driven development approach to ensure reliability and maintainability of system configurations and scripts.

· Create and maintain comprehensive documentation for all developed processes, configurations, and tools.

· Develop diagnostic scripts to integrate with existing diagnostic suites, improving system troubleshooting capabilities for both configuration and hardware-related issues.

· Collaborate closely with multi-functional teams including hardware engineering, software development, and system integration to ensure seamless deployment and support of Linux-based systems.

· Participate in regular team meetings, design reviews, and code walkthroughs to share progress, gather feedback, and align on project goals.

· Assist with Rack stacking, Cabling and maintain the AI data center and lab.

· Perform routine maintenance and troubleshooting on Linux servers.

· Use Bash and Python scripts to automate basic tasks and system checks.

· Setup a Monitoring stack and alerts for all system issues

· Work closely with engineers and developers to ensure smooth operation of infrastructure.

 



Work Experience

· Strong experience in computer hardware design, particularly in compute cluster or server environments.

· Experience in networking design, including InfiniBand, Ethernet switches, with expertise in port mapping and configuration.

· Familiarity with modern memory technologies (e.g., DDR4/DDR5, DIMM, LPDDR, HBM).

· Proven experience with Linux operating system customization and image creation.

· Proficiency in SaltStack, Ansible, or similar configuration management tools is a plus.

· Strong scripting skills (e.g., Bash, Python) for automation and diagnostics.

· Familiarity with test-driven development practices and tools.

· Excellent documentation skills with attention to detail.

· Ability to work independently and collaboratively in a fast-paced environment.

· Exposure to Git, Jenkins, or similar tools is a plus.


Preferred Attributes:


· Strong problem-solving and analytical skills.

· Effective communication and collaboration abilities.

· Self-motivated with a proactive approach to identifying and resolving issues.

· Experience in hardware troubleshooting and integration with diagnostic tools.

· Comfortable working in a team-oriented environment with shared responsibilities and goals.



See Your Match Score

Sign up and Renata will show you how this job matches your skills and experience.

10001+ employees
Singapore, SG
Website
HPC Hardware/Network Engineer– Linux Systems at Quest Global | Renata