- Google (Sunnyvale, CA)
- Software Engineer, Cluster Networking (Google; Sunnyvale, CA, USA) **Mid** Experience driving progress, solving problems, and ... for software that manages changes to Google's cluster networks worldwide. Networking enables the growing...team builds systems to make this possible. As a Software Engineer, you will be working with… more
- NVIDIA (Santa Clara, CA)
- …diverse environments. Ways to stand out from the crowd: + Proficiency with cluster networking including InfiniBand and Spectrum-X. + Experience with NVIDIA ... them operational in production? We are seeking a dedicated Cluster Deployment Operations Engineer to support product...with large language models (LLMs) as part of a software development or content creation workflow - we rely… more
- NVIDIA (Santa Clara, CA)
- …cluster availability and performance. + Manage the rollout and rollback of cluster software and firmware updates, ensuring smooth transitions and minimal ... via NVLink and InfiniBand + Implement modern DevOps tools to automate software updates, perform maintenance tasks, and monitor cluster availability, ensuring… more
- NVIDIA (Santa Clara, CA)
- …lasting impact on the world. We are seeking a highly skilled and experienced HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters for EDA and ... the management of large-scale HPC systems including the deployment of compute, networking, and storage. + Develop and improve our ecosystem around GPU-accelerated… more
- NVIDIA (Santa Clara, CA)
- …the management of large-scale HPC systems including the deployment of compute, networking, and storage. + Develop and improve our ecosystem around GPU-accelerated ... occur. + Build innovative tooling to accelerate researchers' velocity, troubleshooting, and software performance at scale. What we need to see: + Bachelor's degree… more
- Walmart (Sunnyvale, CA)
- **Position Summary** We are seeking a highly skilled Principal Engineer (Ceph/Scale-Out Storage) with 10+ years of deep technical experience in distributed storage ... environments. The ideal candidate will have strong expertise across Linux, networking, storage internals, and distributed systems, with the ability to diagnose… more
- Oracle (Santa Clara, CA)
- …in making critical technical decisions and setting engineering vision. + Knowledge of cluster networking and cloud networking. + Solid understanding of ... and more reliable. If you're passionate about building high-performance software, value clean design at scale, and enjoy working...Oracle Cloud, AWS, and Azure. + Hands-on experience in **cluster networking** and **cloud networking**… more
- NVIDIA (Santa Clara, CA)
- …involves developing tools for AI researchers and SW/HW teams running AI workloads in GPU clusters. As a member of the software development team, we will work with ... of GPU cluster job scheduling (Slurm or Kubernetes), storage and networking + Experience with NVIDIA GPUs, CUDA Programming and NCCL + Motivated self-starter… more
- Google (Sunnyvale, CA)
- Staff Software Engineer, Borglet Offloads (Google; Sunnyvale, CA, USA; Kirkland, WA, USA; +2 more; +1 more) **Advanced** Experience owning ... goes on and is growing every day. As a software engineer, you will work on a...a strong, systems-heavy team that works closely with Borg cluster management software, the kernel, with Hardware… more
- Google (Sunnyvale, CA)
- Staff Software Engineer, Emerging On-prem AI Infrastructure (Google; Kirkland, WA, USA; Sunnyvale, CA, USA) **Advanced** Experience owning ... goes on and is growing every day. As a software engineer, you will work on a...clusters using the latest technologies for AI acceleration and cluster interconnects and networking. The ML, Systems,… more
- Broadcom (Palo Alto, CA)
- …have a Candidate Account, please Sign-In before you apply.** **Job Description:** Senior Kubernetes Software Engineer - Palo Alto, CA VMware by Broadcom is the ... vSphere Supervisor. The Kubernetes Distribution team is looking for a Principal Kubernetes Engineer with deep expertise in Open Source Software (OSS) and… more
- Palo Alto Networks (Santa Clara, CA)
- …on-premise remote networks and mobile users. We are seeking an experienced Software Engineer to design, develop and deliver next-generation technologies within ... code and build great products. Engineers who bring new ideas in all facets of software development. We are looking for leaders who take ownership of their areas of… more
- pony.ai (Fremont, CA)
- …public at NASDAQ in November 2024. Responsibilities As a (Senior) Kubernetes Engineer, you will: + Design, operate, and optimize Kubernetes clusters across hybrid ... CRDs, APIs) to automate and productize internal use cases. + Own cluster lifecycle management including upgrades, patching, configuration, and governance. + Define… more
- LinkedIn (Mountain View, CA)
- …select days, as determined by the business needs of the team. As a Sr. Staff Software Engineer of the Compute Infrastructure team at LinkedIn, you will play a ... talented developers. Basic Qualifications -5+ years of industry experience in software design, development, and algorithm related solutions -5+ years of experience… more
- Google (Sunnyvale, CA)
- Senior Staff Software Engineer, SRE, ML Fleet Systems (Google; Sunnyvale, CA, USA; Kirkland, WA, USA; +2 more; +1 more) **Advanced** ... Understanding of resource management systems (e.g., Borg, Kubernetes, Flex), cluster management, and scheduling algorithms. + Familiarity with Machine...needed to learn and grow. As a Senior Staff Software Engineer in the ML Fleet Systems… more
- Google (Sunnyvale, CA)
- Staff Software Engineer, Cloud Compute, Kubernetes (Google; Sunnyvale, CA, USA) **Advanced** Experience owning outcomes and decision ... UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google Cloud's needs… more
- NVIDIA (Santa Clara, CA)
- …and PCIe devices + Experience working with hardware clusters, distributed systems, networking, GPU interconnects (PCIe, NVLink), node and cluster interconnect ... which every new AI-powered application is built. We are seeking a senior engineer to design and build factory automation for NVIDIA Inference Microservices (NIMs).… more
- NVIDIA (Santa Clara, CA)
- …for a highly motivated, technical architect to champion work across NVIDIA's Software, Architecture, Networking and Systems engineering teams in defining the ... full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA...CPUs, and a fully optimized NVIDIA AI and HPC software stack. NVIDIA NVLink Fusion will enable industry-leading AI… more
- LinkedIn (Sunnyvale, CA)
- …days, as determined by the business needs of the team. As a Principal Staff Software Engineer of the Compute Infrastructure team at LinkedIn, you will play a ... talented developers. Basic Qualifications: + 10+ years of industry experience in software design, development, and algorithm related solutions + 10+ years of… more
- NVIDIA (Santa Clara, CA)
- …robotics platform. This is not a typical DevOps job. You'll help engineer the cloud-native backend that drives simulation, synthetic data generation, multi-stage ... GPUs. We're looking for a pragmatic Kubernetes-native backend and infrastructure engineer who excels in solving complex orchestration problems in distributed AI/ML… more