- Sandia National Laboratories (Livermore, CA)
- …and risk models. Deep knowledge of performance issues related to training large AI models on HPC systems. Background in digital-twin or real-time ... Architect and implement the hybrid compute fabric. Integrate exascale HPC systems with elastic cloud resources and specialized AI accelerator clusters…
- Sandia National Laboratories (Livermore, CA)
- …acquisition systems with robotics and instrument integration. Develop data/AI benchmarks (datasets, tools, and metrics) for pipeline performance, model ... a multi-disciplinary, mission-focused team delivering foundational data capabilities for transformative AI systems in national security, energy, and critical…
- Sandia National Laboratories (Livermore, CA)
- …or digital twins. Distributed training frameworks: MPI, Horovod, Ray. Hyperparameter tuning. HPC systems. Implementing secure AI workflows in classified ... and proficiency with: Developing and deploying large language models, multimodal AI systems, or advanced reinforcement-learning agents. Model optimization…
- Cisco Systems, Inc. (San Jose, CA)
- …teams while collaborating on ASIC Design and Verification for reliable, high-performance products. Drive innovation in System/Board Design, leveraging excellent ... data, collaboration, web, Internet of Things, routing, switching, IPv6, data center, HPC, Telepresence and many more. Your work will impact billions globally. Supply…
- PsiQuantum (Palo Alto, CA)
- …and implementation of system software for HPC, robotics, AI, quantum computing, semiconductor fabrication, or control systems. Proven track record ... of quantum mechanics to solve problems that even the most advanced supercomputers or AI systems will never reach. Their impact will span energy, pharmaceuticals,…
- Cisco Systems, Inc. (San Jose, CA)
- …data, collaboration, web, Internet of Things, routing, switching, IPv6, data center, HPC, Telepresence and many more. Your work will affect billions globally. Supply ... change the product development from the lowest levels of circuit design to large system design and see your contribution all the way through to high volume…
- Cisco Systems, Inc. (San Jose, CA)
- …data, collaboration, web, Internet of Things, routing, switching, IPv6, data center, HPC, Telepresence and many more. Your work will impact billions globally. Your ... from the lowest levels of circuit design to large system design and see your contribution all the way...data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly…
- Cisco Systems, Inc. (San Jose, CA)
- …mobile/wireless, video, VoIP, collaboration, web, routing, switching, IPv6, data center, HPC, Telepresence and many more. Your work will impact billions globally. ... from the lowest levels of circuit design to large system design and see your contribution all the way...data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly…
- Cisco Systems, Inc. (San Jose, CA)
- …mobile/wireless, video, VoIP, collaboration, web, routing, switching, IPv6, data center, HPC, Telepresence and many more. Your work will impact billions globally. ... from the lowest levels of circuit design to large system design and see your contribution all the way...data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly…
- PsiQuantum (Palo Alto, CA)
- …of quantum mechanics to solve problems that even the most advanced supercomputers or AI systems will never reach. Their impact will span energy, pharmaceuticals, ... analysis across services. Mentor engineers in platform design, Python performance optimization, and large-scale system composition. Experience/Qualifications…
- PsiQuantum (Palo Alto, CA)
- …of quantum mechanics to solve problems that even the most advanced supercomputers or AI systems will never reach. Their impact will span energy, pharmaceuticals, ... access. Security & Compliance: Implemented Auth0/Okta, KMS encryption, or FedRAMP/IRAP-aligned systems for public sector clients. HPC/Advanced Compute: Deployed…
- Micron Technology, Inc. (San Jose, CA)
- …position in the Artificial Intelligence (AI), Machine Learning (ML) and High Performance Computing (HPC) business segments. You will be working on innovative ... you will be charged with defining and accomplishing the strategy for a High Performance Memory product portfolio that will further fortify Micron's leadership…
- Micron Technology, Inc. (San Jose, CA)
- …in growing the Artificial Intelligence (AI), Machine Learning (ML) and High-Performance Computing (HPC) business segments. You will be working on innovative ... of Work (SOWs), business term sheets, and other customer-facing documents for high-performance memory products. + Represent the Product Management team in Product…
- Meta (Menlo Park, CA)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI/HPC System Performance Engineer Responsibilities: ... a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look...teamwork and close collaboration 3. Responsible for the overall performance of the communication system, including…
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...training systems. 2. Responsible for the overall performance of the communication system, including…
- Deloitte (San Francisco, CA)
- …building secure networks and modern data centers, to enabling the adoption of AI or high-performance computing (HPC), you'll gain firsthand experience with ... organizations through Data Center and infrastructure transformation journeys, such as adopting AI, deploying high-performance computing (HPC) or edge…
- Meta (Menlo Park, CA)
- …These workloads expect a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI/HPC Network Engineering Manager Responsibilities: 1. Manage engineers…
- Pfizer (South San Francisco, CA)
- …and requires a hands-on approach to designing and delivering robust High Performance Computing (HPC) solutions supporting computational workloads across the ... role you will design, implement, operate, and own robust and dependable infrastructure for HPC and ML/AI workloads in a cloud environment (AWS/GCP). + Lead…
- IBM (San Jose, CA)
- …in system design * Experience with GPU Systems * Familiarity with HPC system performance evaluation. * Familiarity with system architectures * ... technical areas in the context of hybrid cloud, AI systems, networking, security, high-speed networked-storage, accelerators, and HPC principles. The…
- Meta (Menlo Park, CA)
- …on existing accelerator systems and guiding the future of models and AI HW at Meta. This drives improved performance, new model architectures and ... RCCL, UCC and MPI. 7. Guide Meta's AI HW requirements and design focusing on performance at System and Silicon levels. Co-design and optimize our AI HW…