Senior Hpc Cluster Systems Jobs

33 jobs (page 1)

Categories

All Categories

Engineering (10)

Software/IT (7)

Senior HPC Cluster Engineer…

NVIDIA (Santa Clara, CA)

…a lasting impact on the world. We are seeking a highly skilled and experienced HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters for EDA ... doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking, and storage. +… more

NVIDIA (12/10/25)
- Save Job - Related Jobs - Block Source
Senior AI and ML HPC Cluster…

NVIDIA (Santa Clara, CA)

…doing: + Provide leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... Centos/RHEL and/or Ubuntu Linux distributions + Solid understanding of cluster configuration managements tools such as Ansible, Puppet, Salt...IPoIB and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/ HPC … more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior AI- HPC Cluster…

NVIDIA (Santa Clara, CA)

…doing: + Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + ... ). + Experience analyzing and tuning performance for a variety of AI/ HPC workloads. Excellent problem-solving to analyze complex systems , identify bottlenecks,… more

NVIDIA (10/30/25)
- Save Job - Related Jobs - Block Source
Senior HPC Systems Engineer

Massachusetts Institute of Technology (Cambridge, MA)

Senior HPC Systems Engineer + Job Number: 25342 + Functional Area: Information Technology + Department: MA Green High Performance Computing Ctr + School ... Email a Friend Save Save Apply Now Posting Description SENIOR HPC SYSTEMS ENGINEER, The...Requirements REQUIRED: Solid track record in HPC systems administration/engineering including hands-on experience with Linux-based cluster… more

Massachusetts Institute of Technology (12/04/25)
- Save Job - Related Jobs - Block Source
Senior HPC Engineer

Texas A&M University System (College Station, TX)

Job Title Senior HPC Engineer Agency Texas A&M...expertise and consultation for the design and deployment of HPC systems . Get in on the ground ... sensitive requiring US Citizenship. Opportunities to Contribute * Manage large-scale HPC cluster operations, including OS upgrades, firmware patching, and… more

Texas A&M University System (01/07/26)
- Save Job - Related Jobs - Block Source
Senior HPC Administrator (High…

Mount Sinai Health System (New York, NY)

…research clinical data warehouse team and a research data services team. The ** Senior HPC Administrator, High Performance Computational and Data Ecosystem** , is ... a strong focus on customer service for researchers. The HPC Senior Administrator is an expert troubleshooter...and operations (preferably in a Redhat/CentOS Linux administration, Batch HPC cluster environment) + Must be an… more

Mount Sinai Health System (11/15/25)
- Save Job - Related Jobs - Block Source
Senior Solutions Architect, Cluster…

NVIDIA (Santa Clara, CA)

…to stand out from the crowd: + Experience leading large-scale AI Factory or HPC cluster bring-ups or builds + Hands-on experience with NVIDIA networking products ... world's fastest supercomputers. We are seeing a highly motivated Senior Solutions Architect to join the Cluster ...and troubleshooting + Proven expertise in designing large-scale distributed systems , AI clusters, or HPC infrastructure +… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior GPU and HPC Infrastructure…

NVIDIA (Santa Clara, CA)

…level knowledge of Linux system administration and management. + Understanding of cluster management systems (Kubernetes, SLURM) + Understanding of performance, ... with RDMA networking. + Advanced hands-on experience and deep understanding of cluster management systems (Kubernetes, SLURM.) Hands-on experience in Machine… more

NVIDIA (01/08/26)
- Save Job - Related Jobs - Block Source
Senior Systems Engineer…

NVIDIA (Santa Clara, CA)

Join the NVIDIA Deep Learning Frameworks Infrastructure team as a Senior Systems Engineer focusing on High-Performance AI & Networking Applications, committed to ... or AI clusters in production environments. + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/ HPC workload. + Experience with… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior Systems Administrator…

Mount Sinai Health System (New York, NY)

…and implements backup policies. + Assist in the management and maintenance of HPC cluster and data center work, including troubleshooting for resolving system ... Sinai. The Administrator is the principal technology expert for Windows and Linux systems , and help support high-performance computing ( HPC ) environment in the… more

Mount Sinai Health System (12/22/25)
- Save Job - Related Jobs - Block Source
Research Systems Administrator

Boston College (Chestnut Hill, MA)

…departments and institutes. This position primarily provides Linux server administration for the HPC cluster . Additionally this role sets up and manages physical ... hardware (particularly GPU), job scheduling, and trends that improve performance of HPC systems . Helps promote technologies for teaching and research, and… more

Boston College (12/30/25)
- Save Job - Related Jobs - Block Source
Senior Systems Integration Engineer

Insight Global (Louisville, CO)

…space economy. In this position, you will support a handful of programs that use HPC 's cluster to do their runs. You will troubleshoot schedulers, Monte Carlo ... the commercial space industry is looking for a Sr Systems Integration Engineer to join their growing corporate IT...experience within Linux 5+ years of experience working with HPC (deploying and maintaining on prem HPC … more

Insight Global (01/09/26)
- Save Job - Related Jobs - Block Source
Senior Network Development Engineer

Oracle (Annapolis, MD)

…Description** The AI2NE Org strives to be global leaders in the RDMA cluster networking domain and enable seamless, accelerated High-Performance Compute ( HPC ), ... of state-of-the-art RDMA clusters tailored specifically for AI, ML, HPC workloads. We strive to be the go-to experts...We strive to be the go-to experts in RDMA cluster architecture, leveraging our deep understanding of the unique… more

Oracle (12/13/25)
- Save Job - Related Jobs - Block Source
Senior Solutions Architect, NVIDIA Cloud…

NVIDIA (Santa Clara, CA)

…GPUs, ETH/IB networking components, storage, etc.) within extensive AI and HPC cluster settings. + Practical knowledge of NVIDIA systems technology such as ... with partners and customers. + Experience crafting and deploying large-scale cluster environments. + Practical expertise in data center design, development and… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior Linux Systems Administrator

SAIC (Princeton, NJ)

**Description** SAIC is recruiting for a ** Senior Linux Systems Administrator** to join our program as a general technologist to support our NOAA GFDL program in ... cluster . + Experience managing software scalable network file systems and centralized storage appliances. + Experience supporting enterprise logon with… more

SAIC (01/08/26)
- Save Job - Related Jobs - Block Source
Senior Software Architect - Data Center…

NVIDIA (Santa Clara, CA)

…team architects and develops the end to end software and firmware stack for these systems . We are looking for a Senior Software Architect who has deep expertise ... , particularly at the SW/HW interface. + Understanding of HPC or Deep learning workloads and use of accelerated...out from the crowd: + Knowledge of cloud and cluster level deployment and management systems . +… more

NVIDIA (12/25/25)
- Save Job - Related Jobs - Block Source
Senior Systems Software Security…

NVIDIA (CO)

…server architecture. In-depth understanding of the different deployment models for GPUs (eg, HPC , AI cluster , single- or multi-GPU servers). + Experience in Data ... in system software security to join the Data Center Systems Software team. In this role, you will focus...role, you will focus on securing NVIDIA's Data Center Systems . NVIDIA is leading the way in groundbreaking developments… more

NVIDIA (01/10/26)
- Save Job - Related Jobs - Block Source
Senior Software Engineer, AI Resiliency

NVIDIA (Santa Clara, CA)

…crucial in driving down cluster downtime towards zero, ensuring that our AI systems remain robust and reliable at all times. What You'll Be Doing: + Develop AI ... We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA,...+ Hands-On Coding & Optimization: Contribute to large-scale distributed systems with high-quality, production-level C++ and Python code. Enhance… more

NVIDIA (10/15/25)
- Save Job - Related Jobs - Block Source
Senior Manager, CSP Engagements - System…

NVIDIA (Santa Clara, CA)

…and telemetry frameworks. + Familiarity with GPU computing (CUDA), large-scale AI/ HPC workloads, NVLink, Grace, and cluster -level deployment/management. + ... NVIDIA is seeking a Senior Manager to lead our System Software SWAT...on data center platforms such as GB200/GB300 and next‑generation systems . This elite, cross‑functional group is the rapid‑response hub… more

NVIDIA (12/28/25)
- Save Job - Related Jobs - Block Source
Senior Research Engineer, Foundation Model…

NVIDIA (Santa Clara, CA)

NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in the ... What you will be doing: + Design and maintain large-scale distributed training systems to support multi-modal foundation models for robotics. + Optimize GPU and … more

NVIDIA (12/05/25)
- Save Job - Related Jobs - Block Source

"Juju

Account Login

Sign Up

Forgot your password?

Advanced Search