- NVIDIA (Santa Clara, CA)
- …a lasting impact on the world. We are looking for an outstanding engineer for a Senior HPC Systems Engineer role for at scale AI system performance ... develop new, leading differentiated solutions. You will interact with HPC, OS, CPU and GPU compute, and systems specialist to architect, develop and bring up large…
- Stanford University (Stanford, CA)
- Senior HPC Cloud Engineer **Doerr School of Sustainability, Stanford, California, United States** Information Technology Services Post Date Sep 28, 2024 ... help our school expand and scale our research computing (HPC) resource portfolio in the cloud to meet the...including distributed filesystems and an emphasis on object storage systems and data lifecycle management. + **Infrastructure as Code…
- NVIDIA (Santa Clara, CA)
- …Make the choice to join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... implementation of distributed storage services. + Design, implement an on-prem AI/HPC infrastructure supplemented with cloud computing to support the growing needs…
- NVIDIA (Santa Clara, CA)
- …UCX for Deep Learning and HPC. We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are…
- NVIDIA (Santa Clara, CA)
- …Familiarity with InfiniBand with IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads + Familiarity with ... join us today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to…
- NVIDIA (Santa Clara, CA)
- …one else can solve. Make the choice to join us today. We are looking for a Senior Software Engineer to join our mission to continue improving our HPC ... What you'll be doing: + Design highly available and scalable systems to meet the demands of our HPC clusters + Evaluate new and innovative technologies as the…
- Amazon (Cupertino, CA)
- …HPC network fabric or machine learning accelerator cluster systems. Also applicable is experience high-frequency trading networking, high-speed wireless ... Description We are seeking an experienced software engineer with low-level latency networking or interconnect expertise to optimize customer experience by designing…
- NVIDIA (Santa Clara, CA)
- …the choice, join our diverse team today! We are looking for an outstanding hands-on architect/engineer for a Senior HPC architect role to support deployment ... develop new, leading differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialist...are growing fast. If you're a creative and autonomous engineer with real passion for technology, we want to…
- NVIDIA (Santa Clara, CA)
- …storage, and complex network topologies + Extensive experience with benchmarking systems and analyzing performance bottlenecks in large-scale AI/HPC ... to create reference designs for the world's most powerful AI clusters. As an AI/HPC Product Architect at NVIDIA, you'll be the linchpin in transforming ideas into…
- Amazon (Sunnyvale, CA)
- …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...line. The Nitro Team is looking for engineers with systems knowledge and experience in area such as Linux…
- NVIDIA (Santa Clara, CA)
- …Familiarity with InfiniBand with IBOP and RDMA + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads + Familiarity with ... Now, GPU deep learning is driving modern AI forward. Join our GPU AI/HPC Infrastructure team and lead the design of groundbreaking GPU compute clusters for…
- NVIDIA (Santa Clara, CA)
- …environments, and system software to make current and future high-end computer systems more performant, scalable, and usable. As an NVIDIAN, you'll be immersed ... proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new runtime designs, and new…
- NVIDIA (Santa Clara, CA)
- …human imagination and intelligence. Join us today! As a member of the GPU/HPC Infrastructure team, you will provide leadership in the design and implementation of ... to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads…
- NVIDIA (Santa Clara, CA)
- …artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial role in designing, ... implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for…
- Cadence Design Systems, Inc. (San Jose, CA)
- …to solve routine support issues, all while delivering world-class customer support. Role: Senior Cloud Solutions Engineer Location: Atlanta, GA Must Haves: + 3+ ... You will use your skills and knowledge of Linux, HPC on public cloud infrastructure, infrastructure-as-code, and shell and...preferably in EDA, or semiconductor industry, or for a systems design company + Certified AWS Solutions Architect or…
- NVIDIA (Santa Clara, CA)
- …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/HPC applications run on large clusters with high-speed networking...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,…
- NVIDIA (Santa Clara, CA)
- …Experience with Cloud Deployment, BCM, Terraform. + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC workloads. + Familiarity ... diverse team today! As a member of the GPU AI/HPC Infrastructure team, you will provide leadership in the...automation to improve researchers productivity. As a Site Reliability Engineer, you are responsible for the big picture of…
- NVIDIA (Santa Clara, CA)
- …implement, test, deploy, release, and support large scale distributed systems infrastructure with monitoring, logging, visualization, and alerting capabilities with ... profiling tools for real world ML/DL applications running on HPC GPU clusters for failure and efficiency analysis to...with distributed system software architecture + Basic understanding of HPC GPU cluster, slurm + Basic understanding of Machine…
- NVIDIA (Santa Clara, CA)
- …impact on the world. We are looking for a Senior Linux Software Engineer to join the NVIDIA Applied Systems Engineering group. The work environment is ... network, storage security, and management software elements in support of HPC environments. + Driving a complete engineering process, including refining…
- NVIDIA (Santa Clara, CA)
- NVIDIA is searching for a highly motivated, creative engineer with experience in data center system security to join the Server Platform Software team. In this role, ... you will focus on securing NVIDIA's Data Center Systems. NVIDIA is leading the way in groundbreaking developments...the crowd: + Prior hands-on experience with GPU computing, HPC, or large clusters. Prior experience in developing Data…