- NVIDIA (Santa Clara, CA)
- We are looking for a Senior Software Engineer based in Santa Clara, CA, to join the Shoreline team with experience in building highly scalable and robust ... enterprise software to join us. We are building and improving...platform that will automate diagnosis and repair of a cluster of GPUs or CPUs across public clouds, private… more
- NVIDIA (Santa Clara, CA)
- … team with high standards! This software engineering role involves developing tools for GPU Cluster users and admins. As a member of the software ... work with users from different departments like Architecture teams, Software teams. Our work brings the users intuitive, rich...+ Build debugging tools for common encountered problems in GPU cluster + Work with our users… more
- NVIDIA (Santa Clara, CA)
- …hardware integration and bare-metal provisioning related functionality in our Linux-based cluster management software environment. NVIDIA's Bright Cluster ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and...Work on adding features to our Ansible collections for Cluster Installation and Management. + Assist our support team… more
- NVIDIA (Santa Clara, CA)
- …architecture. Expertise with NVIDIA systems is a huge bonus. + Any experience with cluster software automation tools such as Ansible, Jenkins NVIDIA is leading ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and... accelerated applications, such as Deep Learning. As a Senior NVLink System Software Bringup Engineer within… more
- NVIDIA (Santa Clara, CA)
- …of experience + Knowledge of HPC and AI solution technologies from CPU's and GPU 's to high speed interconnects and supporting software + Experience with job ... We are looking for an outstanding architect for a senior HPC, be a key player to the most...of Kubernetes, container related microservice technologies + Experience with GPU -focused hardware/ software (DGX, Cuda) + Background with… more
- NVIDIA (Santa Clara, CA)
- …to be used for a variety of AI workloads. This includes working on custom software related to managing fleets of GPU nodes. + Implementing monitoring and health ... NVIDIA is hiring experienced software engineers with bare metal hardware experience to...You will be harnessing multiple data streams, ranging from GPU hardware diagnostics to cluster and network… more
- NVIDIA (Santa Clara, CA)
- We are looking for a highly experienced AI Senior Software Test development engineer in NVIDIA's Deep Learning SWQA team. The position is in NVIDIA Deep Learning ... to validate robustness and measure the performance of NVIDIA's Deep Learning software and GPU Infrastructure for autonomous driving, healthcare, speech… more
- NVIDIA (Santa Clara, CA)
- …and alerting. Additional responsibilities include: + Design and implement state-of-the-art GPU compute clusters. + Optimize cluster operations for maximum ... complex systems in the world. + Implement remediations across software and hardware stack according to plan, while keeping...environments with operational experience of at least 5K GPUs cluster . + Deep understanding of GPU computing… more
- NVIDIA (Santa Clara, CA)
- … software and firmware stack for these systems. We are looking for a Senior Software Architect who has deep expertise in designing server platforms and has ... We are building innovative server systems for GPU accelerated applications, such as Deep Learning. Data...customers. What you'll be doing: + You will lead software activities for NVIDIA's deep learning server platforms, from… more
- NVIDIA (Santa Clara, CA)
- … developers for the development of cloud related functionality in our Linux-based cluster software environment. NVIDIA's Base Command Manager is used to power ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and...take pride in producing extremely clean code. + Our cluster management software is based on Linux.… more
- SHINE Systems & Technologies (Bethesda, MD)
- …3DS Max, Maya, Photoshop, or similar modeling tools. + For the senior level: Familiarity with cluster computing, signal processing, Kubernetes, numerical ... and script in C++ and OpenGL, utilizing the Unreal graphics engine and various software packages. This position supports a US Navy program whose mission is to… more
- Lenovo (Morrisville, NC)
- …Our Team** We are looking for a **Sr N** **etwork Architect** for the GPU cluster solutions we develop for the Cloud Service Provider market. **Location:** ... Senior Network Architect **General Information** Req # WD00074557...in Computer Science or Computer Engineering + Experience with GPU rack-scale and cluster networking in production… more
- NVIDIA (Santa Clara, CA)
- …and models + Familiarity with InfiniBand with IBOIP and RDMA + Background with Software Defined Networking and AI/HPC cluster networking + Familiarity with deep ... reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC...environment [AWS, Azure or GCP] + Experience with AI/HPC cluster job schedulers such as SLURM, LSF + In… more
- NVIDIA (Santa Clara, CA)
- …Escalation Solution Architect with experience in validation and debugging of large-scale GPU clusters focused on performance. As part of the Solution Architecture ... organization, we work with the most sophisticated computing hardware and software , driving the latest deep learning and machine learning breakthroughs with NVIDIA's… more
- University of Pennsylvania (Philadelphia, PA)
- …health and wellness programs and resources, and much more. Posted Job Title Senior HPC Systems Administrator Job Profile Title Systems Administrator Senior Job ... school-wide shared condo system that forms our biggest HPC cluster . Resources across our systems include high-density CPU nodes,.... Resources across our systems include high-density CPU nodes, GPU nodes, I nfiniband networking and PB of data… more
- NVIDIA (Durham, NC)
- …their adoption of end-to-end Agentic AI solutions, using NVIDIA's compute, networking, and software stacks. Don't think this is a high-level slideshow job - we are ... and cloud based. + 4 plus years of proven experience with cluster management and related tools, including Docker Containers, Slurm, Kubernetes, and Ansible.… more
- SpaceX (Hawthorne, CA)
- …+ Knowledge of solidification physics COMPENSATION & BENEFITS: Pay Range: Software Engineer/ Senior : $160,000.00 - $220,000.00/per year Your actual level ... Sr. Software Engineer (Computational Geometry) at SpaceX Hawthorne, CA...+ Demonstrated proficiency in Python + Experience with compute cluster management tools such as SLURM + Demonstrated ability… more
- NVIDIA (Santa Clara, CA)
- …+ Background in computer science, machine learning, deep learning, open-source software , infrastructure technologies, and GPU technology. + Prior experience ... NVIDIA's Hardware Infrastructure organization is seeking a Senior or Princip al Data and Observability Architect....teams to define a vision and roadmap for AI/HPC cluster observability. + Architect and lead teams to develop,… more
- University of Massachusetts Amherst (Amherst, MA)
- …Fellow - Research Software & Research Computing Facilitation Apply now ... (https://secure.dc4.pageuppeople.com/apply/822/gateway/default.aspx?c=apply&lJobID=526435&lJobSourceTypeID=801&sLanguage=en-us) Job no: 526435 Work type: Senior /Research Fellow (Amherst Only) Location: UMass Amherst Department:Computer Science… more
- NVIDIA (Santa Clara, CA)
- …and Deep Learning Software Stack + Good knowledge of container and cluster technologies like slurm, kubernetes, jenkins, gitlab-ci, and zabbix + Experience with ... the next wave of NVIDIA's highest performing deep learning software stacks. Your role spans multiple products such as...GPU computing systems + Track record of identifying useful… more