- Stanford University (Stanford, CA)
- Senior HPC Cloud Engineer **Doerr School of Sustainability, Stanford, California, United States** Information Technology Services Post Date Sep 28, 2024 ... input and requirements of the SDSS research community members. + Manage HPC Cloud budget, cost and capacity management including planning resource capacity to… more
- NVIDIA (Santa Clara, CA)
- …and implementation of distributed storage services. + Design, implement an on-prem AI/ HPC infrastructure supplemented with cloud computing to support the growing ... join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the...for large scale, high performance workloads, evolving our private/public cloud strategy, capacity modelling, and growth planning across our… more
- NVIDIA (Santa Clara, CA)
- …Make the choice to join us today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... resource utilization in a heterogeneous compute environment, evolving our private/public cloud strategy, capacity modeling, and growth planning across our global… more
- NVIDIA (Santa Clara, CA)
- …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... are even higher at huge scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of the art in this space. Are… more
- NVIDIA (Santa Clara, CA)
- …one else can solve. Make the choice to join us today. We are looking for a Senior Software Engineer to join our mission to continue improving our HPC ... and scalable systems to meet the demands of our HPC clusters + Evaluate new and innovative technologies as...and management using automation + Support a globally distributed, multi- cloud hybrid environment - AWS, GCP and On-prem +… more
- NVIDIA (Santa Clara, CA)
- …and see how you can make a lasting impact on the world. We are looking for an outstanding engineer for a Senior HPC Systems Engineer role for at scale AI ... workflows and develop new, leading differentiated solutions. You will interact with HPC , OS, CPU and GPU compute, and systems specialist to architect, develop… more
- Amazon (Cupertino, CA)
- Description We are seeking an experienced software engineer with low-level latency networking or interconnect expertise to optimize customer experience by designing ... services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set… more
- Amazon (Sunnyvale, CA)
- …we're building an environment that celebrates knowledge sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. ... will help each team member develop into a better-rounded engineer and enable them to take on more complex...the planet's largest, fastest growing and most feature-rich compute cloud . Nitro is AWS's ground-up design for virtualization at… more
- NVIDIA (Santa Clara, CA)
- …to support their future chip design needs, understand their workflow characteristics, and engineer an efficient HPC environment. Work with IT and engineering ... intelligence to autonomous cars. We are now looking for a highly motivated HPC Operations Manager to join this multifaceted and innovative infrastructure team to… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …to solve routine support issues, all while delivering world-class customer support. Role: Senior Cloud Solutions Engineer Location: Atlanta, GA Must Haves: ... Cloud portfolio. You will use your skills and knowledge of Linux, HPC on public cloud infrastructure, infrastructure-as-code, and shell and configuration… more
- NVIDIA (Santa Clara, CA)
- …global scale PaaS and SaaS offering! We are seeking a highly motivated Senior Site Reliability Engineer to join our Omniverse Infrastructure organization which ... also make feature development faster and safer. As a Senior Omniverse Cloud SRE, you will architect...working with partners across multiple teams + Background with HPC or Model Training Operations or related experience. Ways… more
- NVIDIA (Santa Clara, CA)
- …artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial role in designing, ... HPC ) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and...What you'll be doing: + Design, implement an on-prem HPC infrastructure supplemented with cloud computing to… more
- NVIDIA (Santa Clara, CA)
- …human imagination and intelligence. Join us today! As a member of the GPU/ HPC Infrastructure team, you will provide leadership in the design and implementation of ... to identify architectural changes and/or completely new approaches for improving HPC schedulers for serving many simultaneous and large multi-node GPU workloads… more
- NVIDIA (Santa Clara, CA)
- …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/ HPC applications run on large clusters with high-speed networking...to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP, etc.) + Guide our… more
- NVIDIA (Santa Clara, CA)
- …the choice, join our diverse team today! As a member of the GPU AI/ HPC Infrastructure team, you will provide leadership in the design and implementation of ground ... automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big picture of...You will also be maintaining and building deep learning AI- HPC GPU clusters at scale and supporting our researchers… more
- Microsoft Corporation (Mountain View, CA)
- …cost is of paramount importance. To achieve this goal, we are looking for a ** Senior Mechanical Engineer .** The Senior Mechanical Engineer will ... materials across product families and product generations. As a senior mechanical engineer , you will perform structural...analysis for complex hardware systems, with a focus on cloud or HPC infrastructure + Proficiency in… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a dedicated and motivated senior build and continuous integration (CI/CD) engineer for its GenAI Frameworks (NeMo, ... Core) team. NVIDIA NeMo is an open-source, scalable and cloud -native framework built for researchers and developers working on... technologies, eg: SLURM, Lustre, k8s + Experience with HPC hardware systems such as compute clusters and … more
- Amazon (Cupertino, CA)
- …of these accelerator SoCs for use by internal partner teams. We're looking for a Senior SoC Modeling Engineer to join the team and deliver new functional models, ... including support for customers who require specialized security solutions for their cloud services. Additionally, this role may involve exposure to and experience… more
- NVIDIA (Santa Clara, CA)
- …from the crowd: + Have built , deployed and operated AI platforms on HPC clusters. Have built, deployed and operated cloud native system including distributed ... We are seeking a Sr System Software Engineer to help us build out our scientific...build out our scientific computing platform on Nvidia DGX Cloud . We are building a cloud based… more
- NVIDIA (Santa Clara, CA)
- …you want to help bring AI responsibly to the world? We are seeking a Senior Systems Software Engineer who is technical, creative and driven to provide enterprise ... the world. You will build and deploy high-performance, scalable cloud native solutions for AI deployment in all clouds...inference to new levels using GPUs and distributed systems. Senior engineers are expected to mentor members of the… more