- Meta (Austin, TX)
- …objectives Work with cross functional teams and provide guidance on the AI network architecture including topologies, transport, congestion control techniques ... host networking, communications lib and scheduling infrastructure. Required Skills: AI/HPC System Performance Engineer Responsibilities: Lead multi-disciplinary…
- Oracle (Indianapolis, IN)
- …AI Infrastructure Innovation team is pioneering the creation of next-generation AI/HPC networking for GPU superclusters at massive scale. Our mission is ... system design, and implementation for high-performance RDMA solutions across OCI's AI/HPC platforms, including frontend and backend fabrics. Innovate on…
- Microsoft Corporation (Redmond, WA)
- …design and develop the systems that support large-scale AI and HPC workloads. This role involves working on network infrastructure, automation workflows, ... that underpin large-scale AI and high-performance computing (HPC) environments. Network Deployment: Support the deployment...Microsoft and the country where you work. #azurecorejobs Cloud Network Engineering IC2 - The typical base…
- Oracle (Nashville, TN)
- …at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the ... automation, and diagnostic services. These are essential for running distributed AI/ML/HPC workloads across thousands of GPUs, leveraging technologies like…
- Oracle (Hartford, CT)
- …problems in distributed systems, infrastructure, and highly available services. Collaborate with Network Engineering, Network Automation, Network ... and design of state-of-the-art RDMA clusters tailored specifically for AI, ML, HPC workloads. We strive to... devices. Qualifications: BS or MS in Computer Science, Network/Electrical Engineering, or equivalent experience. Minimum 7+…
- Oracle (Little Rock, AR)
- …Qualifications Bachelor's degree in CS or related engineering field with 2+ years of Network Engineering experience or master's with 1+ years of Network ... and design of state-of-the-art RDMA clusters tailored specifically for AI, ML, HPC workloads. We strive to...(OCI). Primarily focused on the development and support of network fabric and systems through a combination of a…
- Intel (Phoenix, AZ)
- …tailored innovation for diverse needs across general-purpose compute, web services, HPC, and AI-accelerated systems. Our charter encompasses defining business ... of data center technologies from the cloud to the network and to the edge. The Data Center Group...is seeking an exceptional Vice President of Silicon Design Engineering with a strong business development focus to lead…
- Microsoft Corporation (Redmond, WA)
- …Network Architect to join our team! Responsibilities Own end-to-end network architecture for AI training/inference clusters: topology, routing, transport, ... experience. 10+ years designing and operating large-scale L2/L3 Ethernet fabrics for HPC/AI or hyperscale services. 5+ years of experience with Ethernet,…
- Sandia National Laboratories (Albuquerque, NM)
- About Sandia Sandia National Laboratories is the nation's premier science and engineering lab for national security and technology innovation, with teams of ... by job classification. What Your Job Will Be Like Sandia's artificial intelligence (AI) team is building the US Department of Energy's (DOE) next-generation AI…
- Oracle (Seattle, WA)
- …at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the ... automation, and diagnostic services. These are essential for running distributed AI/ML/HPC workloads across thousands of GPUs, leveraging technologies like…
- Deloitte (Hartford, CT)
- …operating model across Infrastructure Managed Services (IMS), Cloud Managed Services, AI/HPC environments, and End-User Support Services - all within Deloitte's ... (IMS), Hybrid Managed Services (HMS), Cloud Managed Services, and supporting AI/HPC environments and End-User Services - ensuring availability, performance, and…
- Intel (Santa Clara, CA)
- …industry by delivering cutting-edge silicon process and packaging technology leadership for the AI era. With a focus on scalability, AI advancement, and shaping ... computing, server, mobile, networking, and automotive systems for the AI era. About the Role Intel Foundry Services is...sales plans, manage goals and budgets for CSP and HPC accounts. Build strategic customer relationships and identify new…
- Meta (Menlo Park, CA)
- Summary: In this role, you will be a member of the Network.AI Software team and part of the bigger DC networking organization. The team develops and owns the ... Communications Library), which enables multi-GPU and multi-node data communication through HPC-style collectives. NCCL has been integrated into PyTorch and is on…
- Intel (Santa Clara, CA)
- …a strong network and partner with various teams - product, engineering, marketing, sales Behavior skills we are looking for Excellent analytical skills, ability ... Silicon Valley. No one else is this obsessed with engineering a brighter future. Every day, we create world...innovation for diverse needs across general-purpose compute, web services, HPC, and AI-accelerated systems. Our charter encompasses…
- Microsoft Corporation (Mountain View, CA)
- …predictive maintenance. Contributions to large-scale infrastructure operations, supercomputing centers, or AI hardware design. Software Engineering IC5 - The ... Overview Microsoft AI operates one of the world's most advanced...We work closely with research, hardware, datacenter, and platform engineering teams to develop predictive health models, failure detection…
- Robert Half (Raymond, OH)
- …to maintain and optimize the Linux environment supporting Computer-Aided Engineering (CAE) development across virtual and physical machines, both on-premises ... patches to systems. Maintain and develop automation for application installation, network storage management, and user management. Annual Tasks Collaborate with CAE…
- Curtiss-Wright Corporation (Davidson, NC)
- …Your Challenge Develop architectures for high performance software, advanced analytics, and AI/ML solution to solve industry wide and customer specific challenges in ... of business capture and top-line growth. Application areas include autonomy, AI/ML enhancements, sensor processing, and complex command & control applications.…
- Broadcom (San Jose, CA)
- …L2/L3 protocols especially RoCE (RDMA over Converged Ethernet) protocol & use cases in AI/ML, HPC cluster is a plus Having Knowledge of deep learning models - ... PCI-E-based designs, and hands-on experience in Python programming. Good understanding of AI/ML clusters, Deep learning models, and GPU Micro benchmarks is a plus.…
- Eaton Corporation (Beachwood, OH)
- …industry presence to accelerate Eaton's position as a global leader in sustainable, HPC and AI next generation data center infrastructure. Key Responsibilities: ... partner engagements when needed. You will leverage your extensive industry network to build collaborations across hyperscalers, neoclouds, colocation and cloud…
- Meta (Menlo Park, CA)
- …fabric and host networking, communications lib and scheduling infrastructure. **Required Skills:** AI/HPC Network Engineering Manager Responsibilities: ... daily basis. We need to build and evolve our network infrastructure that connects myriads of training accelerators like...responsible for design, model, develop, test, deploy and operate AI/HPC Networks at scale 2. Provide continual…