- Meta (Menlo Park, CA)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC Network Engineering Manager Responsibilities: ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like...responsible for design, model, develop, test, deploy and operate AI / HPC Networks at scale 2. Provide continual… more
- Meta (Menlo Park, CA)
- …5. Work with cross functional teams and provide guidance on the AI network architecture including topologies, transport, congestion control techniques **Minimum ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI / HPC System Performance Engineer Responsibilities: 1. Lead… more
- Meta (Menlo Park, CA)
- …and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI / HPC Systems Performance Engineer Responsibilities: 1. Active member of ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like… more
- Meta (Menlo Park, CA)
- …operate in a multi-organization landscape. **Required Skills:** Technical Program Manager, AI Network Infra Responsibilities: 1. Lead technical program ... and AI operations initiatives supporting Meta's growing AI / HPC infrastructure for our Family of Apps...matrix organization covering a range of areas (Data Center, Network , Hardware Systems, Infrastructure Engineering , Software … more
- Meta (Menlo Park, CA)
- …many aspects of the system from models and runtime all the way to the AI hardware, optimizing across compute, network and storage. The team invests significantly ... develop and help productionize high performance software & hardware technologies for AI at datacenter scale. We achieve this via concurrent design and optimization… more
- Deloitte (San Francisco, CA)
- …Maintain up-to-date knowledge of advancements in AI , cloud computing, network technologies, infrastructure automation, and trends in HPC infrastructure, ... modern data centers , to enabling the adoption of AI or high-performance computing ( HPC ), you'll gain...path that can evolve toward ML engineer ing , AI architect ure , cloud adoption, network … more
- Cisco (Milpitas, CA)
- …an agile team engaged in the design, development and execution of tests to qualify network performance for AI /ML capability. You will be a part of our solutions ... per week **Meet the Team** The Cisco Distributed System Engineering (DSE) group is at the forefront of developing...the next generation infrastructure to meet the needs of AI /ML workloads and continuously increasing internet users and application.… more
- Deloitte (San Jose, CA)
- …operating model across Infrastructure Managed Services (IMS), Cloud Managed Services, AI / HPC environments, and End-User Support Services- all within Deloitte's ... (IMS), Hybrid Managed Services (HMS), Cloud Managed Services, and supporting AI / HPC environments and End-User Services-ensuring availability, performance, and… more
- Meta (Menlo Park, CA)
- **Summary:** In this role, you will be a member of the Network . AI Software team and part of the bigger DC networking organization. The team develops and owns the ... Communications Library), which enables multi-GPU and multi-node data communication through HPC -style collectives. NCCL has been integrated into PyTorch and is on… more
- Meta (Menlo Park, CA)
- …solutions to our challenges, and ship them into production. As part of our network engineering teams, you'll have the opportunity to work on cutting-edge ... and supporting cutting-edge technologies like AI , Generative AI , Recommendation engines, and Metaverse. Our network ...6. Design, develop, and deploy services to manage datacenter network switches and forwarding functions 7. Enhance HPC… more
- Meta (Menlo Park, CA)
- …solutions to our challenges, and ship them into production. As part of our network engineering teams, you'll have the opportunity to work on cutting-edge ... and supporting cutting-edge technologies like AI , Generative AI , Recommendation engines, and Metaverse. Our network ...6. Design, develop, and deploy services to manage datacenter network switches and forwarding functions 7. Enhance HPC… more
- Meta (Menlo Park, CA)
- …solutions to our challenges, and ship them into production. As part of our network engineering teams, you'll have the opportunity to work on cutting-edge ... and supporting cutting-edge technologies like AI , Generative AI , Recommendation engines, and Metaverse. Our network ...6. Design, develop, and deploy services to manage datacenter network switches and forwarding functions 7. Enhance HPC… more
- Cisco (Milpitas, CA)
- …works + Passionate about open-source development and contribution As a Tech Lead for AI /ML Test Engineering , you'll lead an agile team engaged in the design, ... development and execution of tests to qualify network performance for AI .ML capability. In this...with Quality of Service (QoS) policies to ensure optimal network performance + Exposure to RDMA, HPC … more
- Cisco (San Jose, CA)
- …may contact you directly if a relevant position opens. **Meet the Team** ** Engineering :** Open-minded, driven, diverse and deeply creative people at Cisco design the ... mobile/wireless, video, VoIP, collaboration, web, routing, switching, IPv6, data center, HPC , Telepresence and many more. Your work will impact billions globally.… more
- Cisco (San Francisco, CA)
- …contact you directly if a relevant position opens. **Meet the Team ** Engineering : Open-minded, driven, diverse and deeply creative people at Cisco design the ... web, Internet of Things, routing, switching, IPv6, data center, HPC , Telepresence and many more. Your work will impact...product specifications. **Your Impact ** Join our Creative Hardware Engineering team and make a tangible impact across the… more
- Broadcom (San Jose, CA)
- …L2/L3 protocols especially RoCE( RDMA over Converged Ethernet ) protocol & use cases in AI /ML, HPC cluster is a plus + Having Knowledge of deep learning models ... PCI-E-based designs, and hands-on experience in Python programming. Good understanding of AI /ML clusters, Deep learning models, and GPU Micro benchmarks is a plus.… more
- Cisco (San Jose, CA)
- …may contact you directly if a relevant position opens. **Meet the Team** Engineering : Open-minded, driven, diverse and deeply creative people at Cisco design the ... data, collaboration, web, Internet of Things, routing, switching, IPv6, data center, HPC , Telepresence and many more. Your work will affect billions globally. Supply… more
- Cisco (San Jose, CA)
- …mobile/wireless, video, VoIP, collaboration, web, routing, switching, IPv6, data center, HPC , Telepresence and many more. Your work will impact billions globally. ... the way through to high volume manufacturing. Creative Hardware Engineering positions are available in: + ASIC Design and...data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly… more