- Meta (Menlo Park, CA)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI/HPC System Performance Engineer Responsibilities: ... a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look ... teamwork and close collaboration 3. Responsible for the overall performance of the communication system, including …
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially ... training systems. 2. Responsible for the overall performance of the communication system, including …
- Deloitte (San Francisco, CA)
- …building secure networks and modern data centers, to enabling the adoption of AI or high-performance computing (HPC), you'll gain firsthand experience with ... organizations through Data Center and infrastructure transformation journeys, such as adopting AI, deploying high-performance computing (HPC) or edge…
- Meta (Menlo Park, CA)
- …These workloads expect a loss-less fabric interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI/HPC Network Engineering Manager Responsibilities: 1. Manage engineers…
- Pfizer (South San Francisco, CA)
- …and requires a hands-on approach to designing and delivering robust High Performance Computing (HPC) solutions supporting computational workloads across the ... role you will design, implement, operate, and own robust and dependable infrastructure for HPC and ML/AI workloads in a cloud environment (AWS/GCP). + Lead…
- Meta (Menlo Park, CA)
- …on existing accelerator systems and guiding the future of models and AI HW at Meta. This drives improved performance, new model architectures and ... RCCL, UCC and MPI. 7. Guide Meta's AI HW requirements and design focusing on performance at System and Silicon levels. Co-design and optimize our AI HW…
- Deloitte (San Francisco, CA)
- …Engineer, Solutions Architect) + 2+ years of experience with GPU computing (CUDA, OpenCL) and HPC system software stack The wage range for this role takes into ... drug discovery, optimizing population health and clinical trials, autonomous systems and edge AI, and renewable energy. ... on prem + Adopt best engineering practices in automation, HPC and AI/GenAI infrastructure and design patterns…
- Deloitte (San Francisco, CA)
- …Engineer, Solutions Architect) + 2+ years of experience with GPU computing (CUDA, OpenCL) and HPC system software stack The wage range for this role takes into ... in the cloud or on prem + Adopt best engineering practices in automation, HPC and AI/GenAI infrastructure and design patterns + Define and lead technology…
- Meta (Menlo Park, CA)
- …AI product introductions and AI operations initiatives supporting Meta's growing AI/HPC infrastructure for our Family of Apps. They will be responsible ... deliver on shared goals 10. The ideal candidate will have experience in AI/HPC product development and operations, demonstrated experience in the Network…
- quadric.io, Inc (Burlingame, CA)
- …wide variety of edge and endpoint devices, ranging from battery operated smart-sensor systems to high-performance automotive or autonomous vehicle systems. ... implementations using Quadric's SDK. This senior technical role demands expertise in system architecture, AI frameworks, algorithm optimization, and the ability…
- Meta (Menlo Park, CA)
- …following machine learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure, high performance computing, ... large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI/GPU communication stack. Currently, one of the…
- Meta (Menlo Park, CA)
- …for network devices, transport stacks, and AI workloads 2. Debug complex system-level issues and lead performance tuning exercises to optimize software stack ... networks, powering our global data centers and supporting cutting-edge technologies like AI, Generative AI, Recommendation engines, and Metaverse. Our network…
- Meta (Menlo Park, CA)
- …for network devices, transport stacks, and AI workloads 2. Debug complex system-level issues and lead performance tuning exercises to optimize software stack ... FPGAs, sensors, fan control, power etc), Board Support Package (BSP), Operating Systems, Kernel, Bootloader, Power Management, Real-Time Operating System (RTOS),…
- Meta (Menlo Park, CA)
- …levels 9. Experience in leading teams working on high performance computing (HPC) and AI/ML systems, including: 10. GPU/ASIC-based kernel development and ... systems for our fleet 4. Technical management 5. Experience in systems architecture, performance, workload-analysis and large scale distributed systems…
- Meta (Menlo Park, CA)
- …10. Experience in leading teams working on high performance computing (HPC) and AI/ML systems, including: GPU/ASIC-based kernel development and ... ROCm), distributed systems for large scale training and serving, and systems architecture and performance 11. Accelerator (GPU/ASIC) kernel development and…
- Cisco (San Francisco, CA)
- …teams while collaborating on ASIC Design and Verification for reliable, high-performance products. + Drive innovation in System/Board Design, leveraging ... data, collaboration, web, Internet of Things, routing, switching, IPv6, data center, HPC, Telepresence and many more. Your work will impact billions globally. Supply…
- Genentech (South San Francisco, CA)
- …in vitro liver models reproduce in vivo toxicant responses. Using AI-based modeling, the intern will determine whether cell-specific chromatin accessibility changes ... patterns. These insights will define the translational utility of in vitro systems and inform the design of more predictive, human-relevant liver toxicity models.…