- NVIDIA (Santa Clara, CA)
- …Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads + Familiarity with deep learning frameworks like PyTorch and ... join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership...leadership and strategic guidance on the management of large-scale HPC systems including the deployment of compute,… more
- NVIDIA (Santa Clara, CA)
- …+ Experience analyzing and tuning performance for a variety of AI/HPC workloads. Excellent problem-solving to analyze complex systems, identify bottlenecks, ... fast, distributed storage systems like Lustre and GPFS for AI/HPC workload. Experience working with deep learning frameworks including PyTorch, MegatronLM… more
- NVIDIA (Santa Clara, CA)
- …workloads. Observability is at the heart of this transformation. We are looking for a Senior AI & HPC Observability Engineer to design and build the ... and artificial intelligence. Our technology powers everything from generative AI to autonomous systems, and we continue...and large-scale monitoring systems. + Familiarity with AI/ML pipelines, GPU-based workloads, and HPC… more
- Lilly (Indianapolis, IN)
- …the engineering and operations of advanced Linux platforms supporting AI and HPC workloads, managing NVIDIA DGX systems using Mission Control, Base Command ... the world. Come help us unlock the power of HPC and AI based POGPU and Accelerated...engineering and development of Advanced Linux computing capabilities for AI/ML. Additionally, you would advise with our senior… more
- NVIDIA (Santa Clara, CA)
- …and usable. + Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new ... and new network hardware features. + Research, design and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX, UCC, NVSHMEM),… more
- NVIDIA (Westford, MA)
- …team and see how you can make a lasting impact on the world. We are seeking a Senior HPC & Quantum Systems Engineer to help architect, deploy, and operate a ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...computation, quantum execution, and data movement across tightly coupled systems. HPC Systems & Operations… more
- NVIDIA (Santa Clara, CA)
- …to hear from you! NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research and Analysis Engineer to join our ... In this exciting role, you will profile and analyze AI workloads on large GPUs and CPUs scale clusters...and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems. You will develop performance analysis tools and methodologies… more
- NVIDIA (Santa Clara, CA)
- …Be Doing: + Primary responsibilities will include building and enabling robust AI/HPC infrastructure for customers + Support operational and reliability aspects ... be part of the team that brings Artificial Intelligence (AI) emerging technology to the field? We are looking...in working with customers + Expertise with parallel file systems (e.g., Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects… more
- Massachusetts Institute of Technology (Cambridge, MA)
- Senior HPC Systems Engineer +...deploying, maintaining, and optimizing HPC clusters, storage systems, and networking for AI/ML workloads. Join a ... SENIOR HPC SYSTEMS ENGINEER, The...and container orchestration tools like Docker and Kubernetes; and experience in cloud-based HPC or AI/ML workloads.… more
- Texas A&M University System (College Station, TX)
- Job Title Senior HPC Engineer Agency Texas A&M...expertise and consultation for the design and deployment of HPC systems. Get in on the ground ... firmware patching, and performance tuning.* Oversee networking, security, and infrastructure for HPC systems.* Lead the development of specialized HPC… more
- NVIDIA (Santa Clara, CA)
- …+ Experience analyzing and tuning performance for a variety of AI/HPC workloads. Excellent problem-solving to analyze complex systems, identify bottlenecks, ... leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute,... systems such as Lustre and GPFS for AI/HPC workload. + Familiarity with metrics collection… more
- NVIDIA (Santa Clara, CA)
- …high-performance environments. + Published work, patents, or advanced certifications in networking or HPC systems. NVIDIA is widely considered to be one of the ... team, you'll design and shape the architectures that connect the world's most powerful AI clusters. As an HPC Networking Product Architect at NVIDIA, you'll be… more
- NVIDIA (Santa Clara, CA)
- …like NCCL, NVSHMEM, and UCX that are crucial for scaling Deep Learning and HPC. We're seeking a Senior Software Architect to help co-design next-gen data ... to grow with the increasing scale of next generation systems. This is an outstanding opportunity to advance the...+ Design and implement new communication technologies to accelerate AI and HPC workloads. + Explore innovative… more
- NVIDIA (Santa Clara, CA)
- …at NVIDIA. We deliver libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer to influence the roadmap of ... our communication libraries. The DL and HPC applications of today have a huge compute demand...understanding of computer system architecture, HW-SW interactions and operating systems principles (aka systems software fundamentals) +… more
- NVIDIA (Santa Clara, CA)
- …team today! We are looking for an outstanding hands-on architect/engineer for a Senior HPC architect role to support deployment and bringup of large-scale ... parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is...develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialist… more
- Amazon (Arlington, VA)
- …large analytical problems at massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... HPC technologies in a multi-user environment. - Deep GPU knowledge in HPC and/or AI/ML frameworks. Preferred Qualifications - Advanced degree in Engineering,… more
- Amazon (Austin, TX)
- …large analytical problems at massive scale? Amazon Web Services (AWS) is seeking a Senior Worldwide Specialist Solutions Architect focused on HPC to work with ... C++, Python, CUDA, Bash - Deep GPU knowledge in HPC and/or AI/ML frameworks. Preferred Qualifications -...life sciences or related discipline. - Working knowledge of HPC schedulers and distributed/parallel file systems, underlying… more
- General Dynamics Information Technology (Vicksburg, MS)
- …HPC capabilities and complete technical projects. + Occasionally present to senior leadership on HPC best practices, new and interesting cloud technologies, ... other relevant advances (such as AI & ML), and ways you approached and solved challenging...time and managing customer expectations. Familiarity with commonly used HPC services (i.e., high performance file systems,… more
- Federal Reserve Bank (Kansas City, MO)
- …and accelerator technologies (CUDA, OpenACC). + Experience supporting machine learning and AI workloads on HPC systems. **Additional Information** How ... Operations + Design, deploy, configure, and administer medium scale HPC clusters and associated storage systems. +...Relocation Assistance: Yes Salary + $110,300 - $155,700 / Senior Level + $125,200 - $176,700 / Advanced Level… more
- University of Rochester (Rochester, NY)
- …Energetics seeks group leader for the High Performance Computing (HPC) group. The HPC group provides production HPC systems services to the Laboratory. ... HPC software tools and applications support + HPC systems and environmental monitoring + ...unit. + Experience collaborating with more than one of senior management, internal customers, stakeholders, and peer organizations to… more