• Together AI (San Francisco, CA)
    …including fine-tuning & inference LLMs & multi-modality genAI models, model training & GPU cluster deployment / management. You will identify opportunities to ... overall customer experience. By aligning and influencing the Executive Team and senior leadership on Customer Success and Go-To-Market strategies, you will play a… more
    JobGet (10/01/24)
  • Senior GPU Cluster

    NVIDIA (Santa Clara, CA)
    …working with distributed system software architecture + Basic understanding of HPC GPU cluster, slurm + Basic understanding of Machine learning concepts and ... experience for customer as well as engineers supporting the cluster. Much of our software development focuses...running and instrumenting distributed LLM training on a multi gpu HPC cluster + Knowledge of LLM… more
    NVIDIA (08/13/24)
  • Senior High Performance Computing…

    NVIDIA (Santa Clara, CA)
    …for a deeply technical HPC cluster administrator to lead a diverse cluster of GPU-accelerated systems and provide architectural mentorship to product teams ... team, you will provide leadership in the design and implementation of groundbreaking GPU compute cluster that runs demanding deep learning, high performance… more
    NVIDIA (09/24/24)
  • Senior Infrastructure Software

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior Infrastructure Software Engineer! NVIDIA's Deep Learning Architecture and Libraries group is seeking excellent Software ... stack for our next generation test and development cluster, the core infrastructure that provides a foundation for...a member of our team, your work will enable GPU architects and deep learning software engineers… more
    NVIDIA (09/23/24)
  • Senior Software Test Development…

    NVIDIA (Santa Clara, CA)
    We are looking for a highly experienced AI Senior Software Test development engineer in NVIDIA's Deep Learning SWQA team. The position is in NVIDIA Deep Learning ... to validate robustness and measure the performance of NVIDIA's Deep Learning software and GPU Infrastructure for autonomous driving, healthcare, speech… more
    NVIDIA (09/06/24)
  • Senior Software Test Development…

    NVIDIA (Santa Clara, CA)
    …to validate robustness and measure the performance of NVIDIA's Deep Learning software and GPU Infrastructure for autonomous driving, healthcare, speech ... We are looking for a Software Test development engineer in NVIDIA's Deep Learning...improve test automation. + Experience in validating Data Center GPU based infrastructure (multi-GPUs, multi-nodes, cluster). +… more
    NVIDIA (09/05/24)
  • Senior Software Architect - Data…

    NVIDIA (Santa Clara, CA)
    software and firmware stack for these systems. We are looking for a Senior Software Architect who has deep expertise in designing server platforms and has ... We are building innovative server systems for GPU accelerated applications, such as Deep Learning. Data...customers. What you'll be doing: + You will lead software activities for NVIDIA's deep learning server platforms, from… more
    NVIDIA (07/16/24)
  • Senior Site Reliability Engineer - Internal…

    NVIDIA (Santa Clara, CA)
    …+ Finding and fixing problems before they occur + Building automation for AI-HPC GPU Cluster bring up and scaled up operation + Improving Operational Excellence ... reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC...including workflows that uses MPI + Working knowledge of cluster configuration management tools such as BCM, Ansible, Puppet,… more
    NVIDIA (09/25/24)
  • Senior DevOps Engineer - DGX Cloud

    NVIDIA (Santa Clara, CA)
    …be used for a variety of AI workloads. This includes working on custom software related to GPU asset provisioning, configuration, and lifecycle management across ... deployments and toil elimination. We view DevOps as a software engineering discipline and expect significant contributions to our...You will be harnessing multiple data streams, ranging from GPU hardware diagnostics to cluster and network… more
    NVIDIA (08/29/24)
  • Senior Software QA Test Development…

    NVIDIA (Santa Clara, CA)
    NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our ... Computing Company', and NVIDIA GPUs are the brains powering Deep Learning software frameworks, analytics, data centers, and driving autonomous vehicles. We have some… more
    NVIDIA (09/05/24)
  • Senior Cloud Services Software

    NVIDIA (Santa Clara, CA)
    …seeking a distributed software engineer to join our team! As a Senior engineer, you'll be instrumental in developing and optimizing AI infrastructure services to ... resiliency for DGX Cloud. Your expertise in cloud services software architecture that drives the full resilience stack that...that allows the framework to be integrated with the cluster scheduler visibly to the users + Strong understanding… more
    NVIDIA (09/18/24)
  • Senior AI-HPC Storage Engineer

    NVIDIA (Santa Clara, CA)
    …and models + Familiarity with InfiniBand with IBOP and RDMA + Background with Software Defined Networking and AI/HPC cluster networking + Familiarity with deep ... reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC...[ AWS, Azure or GCP] + Experience with AI/HPC cluster job schedulers such as SLURM, LSF + In… more
    NVIDIA (09/18/24)
  • Senior Principal Systems Development…

    Dell Technologies (Austin, TX)
    Senior Principal Systems Development Engineer. Our customers' system requirements are usually highly complex. Bringing together hardware and software systems ... best work of your career and make a profound social impact as a Senior Principal Systems Development Engineer on our Systems Development Engineering Team in Austin,… more
    Dell Technologies (09/14/24)
  • Senior Solutions Architect, NPN

    NVIDIA (Santa Clara, CA)
    …end-to-end Machine Learning and Deep Learning solutions, using NVIDIA's compute, networking, and software stacks. Don't think this is a high-level slideshow job - we ... on-premises and cloud based. + 12+ years of proven experience with cluster management and related tools, including Docker Containers, Slurm, Kubernetes, and Ansible.… more
    NVIDIA (09/18/24)
  • Linux Systems Administrator- Senior

    IMRI (Vicksburg, MS)
    …systems, including Linux-based application and license servers, virtual machines, and GP/GPU cluster-based systems. The selected candidate will collaborate with ... Linux Systems Administrator- Senior Apply Now! Back to search Location: Vicksburg,...maintaining system documentation, tuning system performance, installing system wide software and allocate mass storage space. + Developing and… more
    IMRI (09/05/24)
  • Senior Systems Analyst (TS Clearance)

    All In Solutions, LLC (Patterson, OH)
    Senior Systems Analyst (TS Clearance) - (207) Share this job as a link in your status update to LinkedIn. Job Title Senior Systems Analyst (TS Clearance) ... Travel Job Description All In-Solutions is seeking an experienced Senior Systems Analyst to join our team supporting the...using consistent and repeatable processes, + Install and configure software on HPC systems, + Collaborate with users to… more
    All In Solutions, LLC (08/21/24)
  • Senior Linux Systems Engineer

    NVIDIA (Santa Clara, CA)
    …container runtimes, drivers+containers, and containerization of various high performance computing cluster software elements within a variety of environments. + ... next era of computing. An era in which our GPU acts as the brains of computers, robots, and...impact on the world. We are looking for a Senior Linux Software Engineer to join the… more
    NVIDIA (08/24/24)
  • Senior System/High Performance Computing…

    Integration Innovation, Inc. (i3) (Vicksburg, MS)
    Overview i3 is seeking a Senior System/High Performance Computing Administrator to join our team in Vicksburg, MS in support of the Engineering Research and ... operations for a High Performance Computer (HPC) including different nodes (compute, GPU, etc), storage, and network management. + Extensive knowledge and experience… more
    Integration Innovation, Inc. (i3) (09/13/24)
  • Senior Linux Administration

    SOL Engineering, LLC (Vicksburg, MS)
    …consisting of Linux-based application and license servers, virtual machines, and GP/GPU cluster-based systems. Coordination with network administrators is ... accounts, maintaining system documentation, tuning system performance, installing system-wide software, and allocating mass storage space. + Developing and… more
    SOL Engineering, LLC (08/28/24)
  • Solutions Architect - InfiniBand and HPC

    NVIDIA (CA)
    …knowledge (bonus credit for BCM (Base Command Manager).) + Experience with GPU (Graphics Processing Unit) focused hardware/software. + Experience with MPI ... NVIDIA is looking for a Senior HPC Engineer to join its Professional Services...support and deployment services; solving problems for hardware and software products. + Knowledge and experience with Linux system… more
    NVIDIA (08/20/24)