• Senior GenAI Algorithms Engineer…

    NVIDIA (Santa Clara, CA)
    …and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI ... vLLM, SGLang). You may also dive deeper into GPU-level optimization, including custom kernel development with CUDA and Triton. This role offers a unique opportunity… more
    NVIDIA (01/10/26)
    - Save Job - Related Jobs - Block Source
  • Software Engineering Manager, ML Kernel

    Amazon (Cupertino, CA)
    …Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. ... The Acceleration Kernel Library team is at the forefront of maximizing...AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators.… more
    Amazon (12/04/25)
    - Save Job - Related Jobs - Block Source
  • Senior Software Development Engineer…

    Amazon (Seattle, WA)
    …Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. ... AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators....such large models across the stack from system level optimizations through to Pytorch or JAX is a must… more
    Amazon (12/26/25)
    - Save Job - Related Jobs - Block Source
  • Senior Deep Learning Software Engineer, LLM…

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing ... GPUs to edge SoCs. Implement LLM inference, serving and deployment algorithms and optimizations using TensorRT LLM, VLLM, SGLang, Triton and CUDA kernels. Work and… more
    NVIDIA (11/25/25)
    - Save Job - Related Jobs - Block Source