AI HPC Systems Performance Jobs in San Jose, CA

110 jobs (page 1)

Categories

All Categories

Engineering (51)

Software/IT (17)

Staff Product Manager, HBM

Micron Technology, Inc. (San Jose, CA)

…position in the Artificial Intelligence ( AI ), Machine Learning (ML) and High Performance Computing ( HPC ) business segments. You will be working on innovative ... you will be charged with defining and accomplishing the strategy for a High Performance Memory product portfolio that will further fortify Micron's leadership… more

job goal (12/12/25)
- Save Job - Related Jobs - Block Source
Principal Product Manager, HBM

Micron Technology, Inc. (San Jose, CA)

…in growing the Artificial Intelligence ( AI ), Machine Learning (ML) and High- Performance Computing ( HPC ) business segments. You will be working on innovative ... Work (SOWs), business term sheets, and other customer-facing documents for high- performance memory products. Represent the Product Management team in Product… more

job goal (12/12/25)
- Save Job - Related Jobs - Block Source
System Software Engineer

Broadcom (San Jose, CA)

…you apply. Job Description: Ethernet NIC product portfolio is designed for high performance computing and networking applications including AI and ML. This is ... of the next generation of Ethernet NIC solutions for AI /ML and High performance computing applications. We...networking is an added advantage. Experience analyzing and tuning performance for a variety of HPC workloads.… more

job goal (12/12/25)
- Save Job - Related Jobs - Block Source
Cloud Performance Engineer

Intel (Santa Clara, CA)

…ensuring tailored innovation for diverse needs across general-purpose compute, web services, HPC , and AI -accelerated systems . Our charter encompasses ... you will be a member of the Data Center Performance Engineering team. This team is responsible for ...Business acumen on key market segments - cloud, enterprise, HPC , Edge Knowledge of key benchmarks in datacenter Ability… more

job goal (12/12/25)
- Save Job - Related Jobs - Block Source
Member of Technical Staff, AI - Multimodal

Microsoft Corporation (Mountain View, CA)

…the boundaries of scale, performance , and deployment, creating frontier AI systems that power transformative experiences across Microsoft. The Multimodal ... data libraries (Pandas, NumPy, etc.) OR equivalent experience. Experience with large-scale AI systems - design and deployment of distributed architectures,… more

job goal (12/12/25)
- Save Job - Related Jobs - Block Source
Member of Technical Staff, Hardware Health

Microsoft Corporation (Mountain View, CA)

…high- performance GPUs, ultra-low-latency NVLink/NVSwitch networks, and innovative liquid-cooling systems . Our team is seeking a Member of Technical Staff, ... Hardware Health, to ensure these systems deliver sustained reliability, performance , and availability...predictive health models, failure detection frameworks, and autonomous remediation systems that keep our AI clusters operating… more

job goal (12/12/25)
- Save Job - Related Jobs - Block Source
Software Engineer, SystemML - Scaling…

Meta (Menlo Park, CA)

…following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, ... large-scale GPU training and inference fleet through an observable, reliable and high- performance distributed AI /GPU communication stack. Currently, one of the… more

job goal (12/12/25)
- Save Job - Related Jobs - Block Source
AI / HPC Systems…

Meta (Menlo Park, CA)

…fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...workloads that expects a loss-less fabric interconnect. To improve performance of these systems we constantly look… more

Meta (11/18/25)
- Save Job - Related Jobs - Block Source
Senior AI and ML HPC Cluster…

NVIDIA (Santa Clara, CA)

…with AI / HPC workflows that use MPI + Experience analyzing and tuning performance for a variety of AI / HPC workloads. + Passion for continual learning ... GPU compute clusters that run demanding deep learning, high performance computing, and computationally intensive workloads. We seek a...storage systems like Lustre and GPFS for AI / HPC workloads + Familiarity with deep learning… more

NVIDIA (10/19/25)
- Save Job - Related Jobs - Block Source
Senior AI - HPC Cluster Engineer…

NVIDIA (Santa Clara, CA)

…analyzing and tuning performance for a variety of AI / HPC workloads. Excellent problem-solving to analyze complex systems , identify bottlenecks, and ... and implement GPU compute clusters for deep learning and high- performance computing. What you'll be doing: + Provide leadership...storage systems like Lustre and GPFS for AI / HPC workload. Experience working with deep learning… more

NVIDIA (10/30/25)
- Save Job - Related Jobs - Block Source
Senior Engineer - AI and HPC…

NVIDIA (Santa Clara, CA)

…, time-series databases, and large-scale monitoring systems . + Familiarity with AI /ML pipelines, GPU-based workloads , and HPC environments. + Experience ... teams to optimize observability for model training, inference workloads, and HPC performance . + Leverage machine learning and statistical techniques… more

NVIDIA (10/22/25)
- Save Job - Related Jobs - Block Source
AI / HPC System Performance…

Meta (Menlo Park, CA)

…and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look… more

Meta (11/06/25)
- Save Job - Related Jobs - Block Source
Senior HPC and AI Networking…

NVIDIA (Santa Clara, CA)

…fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing ( HPC ) and AI Networking Performance Research and Analysis ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems . You will develop performance analysis tools… more

NVIDIA (12/03/25)
- Save Job - Related Jobs - Block Source
AI / HPC Network Engineering Manager

Meta (Menlo Park, CA)

…These workloads expect a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineering Manager Responsibilities: 1. Manage engineers… more

Meta (10/16/25)
- Save Job - Related Jobs - Block Source
Senior Software Architect, AI…

NVIDIA (Santa Clara, CA)

…group at NVIDIA has openings for software architects in the field of AI and high- performance networking and system software. We research, develop, and ... and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new… more

NVIDIA (10/30/25)
- Save Job - Related Jobs - Block Source
Senior Solution Architect, HPC…

NVIDIA (Santa Clara, CA)

…Be Doing: + Primary responsibilities will include building and enabling robust AI / HPC infrastructure for customers + Support operational and reliability aspects ... of large-scale AI clusters, focusing on performance at scale,...in working with customers + Expertise with parallel file systems (eg Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects… more

NVIDIA (09/17/25)
- Save Job - Related Jobs - Block Source
Performance Engineering Intern, Deep…

NVIDIA (Santa Clara, CA)

…looking for a Performance Engineer Intern focused on Deep Learning (DL) & High- Performance Computing ( HPC ) applications to join our diverse team. Our team is ... What you'll be doing: + Plan and execute GPU performance benchmarking across a wide range of HPC...quality, and want to be at the forefront of AI & HPC , we would love for… more

NVIDIA (12/12/25)
- Save Job - Related Jobs - Block Source
Sr. Worldwide Specialist Solutions Architect,…

Amazon (Santa Clara, CA)

…computing and its potential to overcome some of the biggest challenges in High Performance Computing ( HPC )? Do you have a unique combination of deep technical ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI /ML frameworks. Preferred Qualifications -...life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems , underlying… more

Amazon (12/11/25)
- Save Job - Related Jobs - Block Source
Senior Manager, Google Cloud, HPC

Google (Sunnyvale, CA)

…cycles, building tools, architecting and developing software for scalable distributed systems , including data platform, AI /ML, and infrastructure. + Experience ... products, and different customer segments/use cases of the emerging AI compute tech stack. **About the job** The Google...of our customers and helping shape the future of HPC . As the Senior Manager in High Performance… more

Google (12/03/25)
- Save Job - Related Jobs - Block Source
Senior HPC Cluster Engineer - EDA

NVIDIA (Santa Clara, CA)

…analyzing and tuning performance for a variety of AI / HPC workloads. Excellent problem-solving to analyze complex systems , identify bottlenecks, and ... deploy, and operate GPU Compute Clusters for EDA and high- performance computing workloads used across multiple teams and projects.... systems such as Lustre and GPFS for AI / HPC workload. + Familiarity with metrics collection… more

NVIDIA (12/10/25)
- Save Job - Related Jobs - Block Source

"Juju

Account Login

Sign Up

Forgot your password?

Advanced Search