- Amazon (Cupertino, CA)
- …a better-rounded professional. Basic Qualifications - 5+ years of non-internship professional software development experience - 5+ years of programming with at ... Description We are seeking an experienced engineer to work on distributed AI/ML systems. This...and existing systems experience - 5+ years of full software development life cycle, including coding standards,… more
- Amazon (Sunnyvale, CA)
- …in Python and Lua. Basic Qualifications - 3+ years of non-internship professional software development experience - 2+ years of non-internship design or ... software programming language Preferred Qualifications - 3+ years of full software development life cycle, including coding standards, code reviews, source… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on ... development of runtimes such as LLM disaggregated runtime. HPC specialists spend time optimizing the program to reduce...workload needs.We are hiring in multiple locations. **Required Skills:** Software Engineer , Systems ML - HPC… more
- NVIDIA (Santa Clara, CA)
- …can solve. Make the choice to join us today. We are looking for a Senior Software Engineer to join our mission to continue improving our HPC infrastructure. ... and manage this infrastructure. Ideal candidate is strong in software development , designing and creating reliable distributed...and scalable systems to meet the demands of our HPC clusters + Evaluate new and innovative technologies as… more
- NVIDIA (Santa Clara, CA)
- …UCX for Deep Learning and HPC . We are looking for a motivated Performance engineer to influence the roadmap of our communication libraries. The DL and HPC ... scales! This is an outstanding opportunity for someone with HPC and performance background to advance the state of...space. Are you ready for to contribute to the development of innovative technologies and help realize NVIDIA's vision?… more
- NVIDIA (Santa Clara, CA)
- …, infrastructure technologies, and GPU technology. + Prior experience in infrastructure software , production application software development , software ... NVIDIA's Hardware Infrastructure organization is seeking a Senior Observability Engineer to help architect and implement our distributed observability systems for AI… more
- NVIDIA (Santa Clara, CA)
- …of experience in designing, developing, testing, maintenance, and performance optimization of HPC software using C++. + Strong fundamentals in kernel generation ... NVIDIA Math Libraries team is looking for an expert engineer to join our development efforts in...design for linear algebra. + Leadership skills in driving software development projects. + Strong collaboration, communication,… more
- NVIDIA (Santa Clara, CA)
- …Pure Storage, Lustre, ZFS, Isilon) + Infiniband (operations, debugging, performance tuning) + Software development , especially in a devops context + Knowledge of ... to support their future chip design needs, understand their workflow characteristics, and engineer an efficient HPC environment. Work with IT and engineering… more
- NVIDIA (Santa Clara, CA)
- …issues + Learn to use the latest applications in the fields of Deep Learning and HPC + Assist with the development of tools and processes that improve our ... We are now looking for a Performance Engineer Intern focused on Deep Learning (DL) &...Intern focused on Deep Learning (DL) & High-Performance Computing ( HPC ) applications to join our diverse team. NVIDIA builds… more
- NVIDIA (Santa Clara, CA)
- …in providing excellent, comprehensive support to our customers! The Technical Support Engineer in this role will significantly impact and contribute to the overall ... support to our customers for our Linux-based cluster management software product, ensuring customers get the help they require...require to support their clusters. + Collaborate with the development team to get the right information and to… more
- Amazon (Cupertino, CA)
- …the success of peer teams? We want to talk to you! We seek a Sr. Software Development Engineer for the Machine Learning (ML) Infrastructure team to build ... a better-rounded professional. Basic Qualifications - 5+ years of non-internship professional software development experience - 5+ years of leading design or… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an experienced software engineer to join our Accelerator Solutions & Technologies group, supporting the development of Meta's ... in some of the world's largest scale clusters. **Required Skills:** Software Engineer , Accelerator Solutions & Technologies Responsibilities: 1. Contribute… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an experienced software engineer to join our Accelerator Solutions & Technologies group, supporting the development of Meta's ... in some of the world's largest scale clusters. **Required Skills:** Software Engineer , Accelerator Systems & Technologies Responsibilities: 1. Understand… more
- NVIDIA (Santa Clara, CA)
- …We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking for a motivated Partner Enablement Engineer ... guide our key partners and customers with NCCL. Most DL/ HPC applications run on large clusters with high-speed networking...stack. Are you ready for to contribute to the development of innovative technologies and help realize NVIDIA's vision?… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an experienced software engineer to join our Accelerator Solutions & Technologies group, supporting the development of Meta's ... in some of the world's largest scale clusters. **Required Skills:** Software Engineer , Accelerator Solutions & Technologies Responsibilities: 1. Contribute… more
- Amazon (Cupertino, CA)
- …AI accelerators in order to solve our customers toughest problems. As a Runtime Software Development Engineer you will have experience with high-performance ... and training. The Neuron team is hiring senior Runtime Software Development Engineers with a background in...Linux drivers, HPC technologies including: libfabric, MPI, and delivering products to… more
- Amazon (Sunnyvale, CA)
- …C/C++ or Rust with supporting script and tests in Python and Lua. Senior Software Development Engineers work closely with EC2 Principal Engineers and other ... High performance computing workloads. The Nitro High Memory and HPC team owns the purpose built platform development...10017 Basic Qualifications - 5+ years of non-internship professional software development experience - 5+ years of… more
- NVIDIA (Santa Clara, CA)
- …are pushing the boundaries of what's possible in AI. We are currently seeking a Senior Software Engineer to lead the development of AI software ... We are now looking for a Senior Software Engineer for AI Resiliency. At...and performance tuning large-scale AI workloads in cloud and HPC environments, ensuring seamless operation of AI training and… more
- Google (Sunnyvale, CA)
- …Bachelor's degree or equivalent practical experience. + 8 years of experience in software development , and with data structures/algorithms. + 5 years of ... goes on and is growing every day. As a software engineer , you will work on a...fully managed file services for Enterprise, High Performance Computing ( HPC ), and AI/ML customers, supporting workloads such as SAP,… more
- Meta (Menlo Park, CA)
- …on the space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , AI Networking (PhD) Responsibilities: 1. Enabling reliable ... you will be a member of the AI Networking Software team and part of the bigger DC networking...Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and… more