- NVIDIA (Santa Clara, CA)
- …cluster and network telemetry. + Work on software that manages NVLINK topography across GPU clusters. + Build automated test infrastructure that we use to ... NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, knowledge of datacenter hardware, operations,…
- NVIDIA (Westford, MA)
- …You will help develop an innovative hybrid HPC + Quantum computing infrastructure featuring a substantial NVIDIA GB200 NVL72 GPU cluster (572 GPUs) that ... Boston area, you'll be the linchpin in transforming next-generation HPC + Quantum infrastructure into robust, scalable...primary systems administrator and reliability owner for the GB200 GPU infrastructure. What We Need to See:…
- NVIDIA (Santa Clara, CA)
- …imagination and intelligence. Make the choice to join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design ... reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC...years of experience designing and operating large scale compute infrastructure + Experience with AI/HPC advanced job…
- NVIDIA (Santa Clara, CA)
- …scalable telemetry pipelines, AI-driven insights, and intelligent monitoring across NVIDIA's world-class GPU infrastructure. What You Will Be Doing: + Design and ... heart of this transformation. We are looking for a Senior AI & HPC Observability Engineer to...operators with actionable insights. + Collaborate with AI platform, GPU, and cloud infrastructure teams to optimize…
- NVIDIA (Westford, MA)
- …Quantum Engineer, you'll play a pivotal role in integrating quantum systems with HPC infrastructure and application workflows. Your work will be instrumental in ... Rust for API integration and workflow automation. + Strong understanding of HPC systems, Slurm orchestration, and GPU-accelerated computing environments. +…
- NVIDIA (Santa Clara, CA)
- …+ Minimum of 6 years of experience crafting and operating large scale compute infrastructure. + Experience with AI/HPC job schedulers and orchestrators, such as ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and...ahead of new technologies and effective approaches in the HPC and AI/ML infrastructure fields. Ways to…
- NVIDIA (Santa Clara, CA)
- …developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers ... from artificial intelligence to autonomous cars. We are the GPU Communications Libraries and Networking team at NVIDIA. We...libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer…
- NVIDIA (Santa Clara, CA)
- …generative machine learning models in digital biology and beyond + Collaborate with multiple HPC, AI infrastructure, and research teams + Develop tools to assist ... upon which every new AI-powered application is built. We are seeking a Sr. HPC Performance engineer to join our team of scientists and engineers passionate about…
- Lilly (Indianapolis, IN)
- …capabilities. + **Be Your Best** - You will learn about new technologies, AI/ML based HPC, large scale GPU clustering, Infrastructure as Code, and Enterprise ... Advanced Intelligence and Data science teams through implementing advancements across our AI/HPC infrastructure tooling and operational excellence You will work…
- Amazon (Herndon, VA)
- …large analytical problems at massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... multi-user environment. - High level understanding of the underlying infrastructure platform and resources to run HPC...following programming languages: C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI/ML frameworks.…
- Amazon (Arlington, VA)
- …large analytical problems at massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... multi-user environment. - High level understanding of the underlying infrastructure platform and resources to run HPC...following programming languages: C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI/ML frameworks.…
- Johns Hopkins University (Baltimore, MD)
- …the direction of senior engineers. The position collaborates closely with senior HPC staff to deliver stable, efficient, and well-documented systems that ... and maintenance of Johns Hopkins University's high-performance computing and AI infrastructure. This role ensures the reliability, availability, and security of…
- NVIDIA (Santa Clara, CA)
- …of GPU/accelerated compute architectures and their contributions to AI, HPC, and distributed storage systems is necessary. + Experience with storage, security, ... make a lasting impact on the world! As a Senior Product Manager - Data Center at NVIDIA, you'll...AI model training/inference and secure workflows. + Experience defining GPU, servers, data center infrastructure, or disaggregated…
- NVIDIA (Santa Clara, CA)
- …are seeking an expert Solutions Architect to assist customers in building AI/ML and HPC software solutions at scale. As a member of our Solutions Architecture team, ... of contact for customers engaged in the development of intricate AI infrastructure, while also offering support in understanding performance aspects related to tasks…
- Sandia National Laboratories (Albuquerque, NM)
- …secure enclaves for CUI and RD applications + Coordinating public-private partnerships on HPC and AI infrastructure deployments + Working in cross-lab federated ... missions. The Platform is based on three pillars: Models, Infrastructure, and Data. You will join the Infrastructure...and implement the hybrid compute fabric + Integrate exascale HPC systems with elastic cloud resources and specialized AI…
- Deloitte (Baltimore, MD)
- …This role is ideal for a hands-on technologist with deep expertise in HPC systems, GPU-accelerated infrastructure, and large-scale AI deployments, combined ... interconnect fabrics (InfiniBand NDR/800G, RoCEv2, 100-400 GbE) for large-scale GPU and HPC clusters. + Deploy and...and utilities in Python, Golang, and Bash. + Integrate HPC infrastructure with ML frameworks, container runtimes,…
- NVIDIA (Santa Clara, CA)
- …as C++ for efficient system development. + Strong experience with large-scale GPU clusters, HPC environments, and job scheduling/orchestration tools (e.g., SLURM, ... NVIDIA is searching for a senior or principal engineer who specializes in building cutting-edge infrastructure for large-scale foundation model training in the Generalist…
- NVIDIA (Santa Clara, CA)
- …of groundbreaking projects. What You'll Be Doing: + Design and implement an on-prem HPC infrastructure supplemented with cloud computing to support the growing IT ... artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC...HPC and AI solution technologies from CPUs and GPUs to high speed interconnects and supporting software +…
- NVIDIA (Santa Clara, CA)
- …out from the crowd: + Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, esp ... in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of...runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner…
- NVIDIA (Santa Clara, CA)
- …improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop ... reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC...We are looking for an outstanding architect for a Senior System Engineer role for system bringup and datacenter…