  • AI Inference Engineer

    quadric.io, Inc (Burlingame, CA)
    …GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world ... of AI/LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform; [2] optimize the… more
    quadric.io, Inc (11/25/25)
  • Lead AI Engineer (FM Hosting, LLM…

    Capital One (San Francisco, CA)
    Lead AI Engineer (FM Hosting, LLM Inference) **Overview** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. ... AI and ML algorithms or technologies (e.g., LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python,...regularly worked. Cambridge, MA: $193,400 - $220,700 for Lead AI Engineer McLean, VA: $193,400 - $220,700… more
    Capital One (11/04/25)
  • Lead Engineer, Inference Platform

    MongoDB (Palo Alto, CA)
    We're looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search, ... retrieval, and AI-native features across MongoDB Atlas. This role is part...Atlas and optimized for developer experience. As a Lead Engineer, Inference Platform, you'll be hands-on with… more
    MongoDB (12/27/25)
  • AI Kernel Engineer

    quadric.io, Inc (Burlingame, CA)
    …GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Kernel Engineer in Quadric plays the key role to enable a large number ... of AI kernels/operators to run efficiently on the Quadric platform. The AI Kernel Engineer at Quadric will [1] develop a highly efficient Quadric kernel… more
    quadric.io, Inc (11/25/25)
  • Senior/Principal Software Engineer

    Genentech (South San Francisco, CA)
    …scale and optimise workflows. We also work on scaling up model training and inference, evaluating the quality of AI/ML models and output, and building impactful ... evolving to meet the scientific needs. **The Opportunity:** As a software engineer in AI Enablement with a focus on full stack engineering, you will be… more
    Genentech (12/06/25)
  • Senior Forward Deployed Engineer, Data…

    Google (San Francisco, CA)
    Senior Forward Deployed Engineer, Data and AI - Google - Sunnyvale, CA, USA; Kirkland, WA, USA; +3 more; +2 more **Mid** Experience driving ... 5 years of experience in a customer-facing technical role (e.g., Forward Deployed Engineer, Solutions Architect, Sales Engineer) leading technical engagements. + 5… more
    Google (12/11/25)
  • AI/ML Engineer

    Avispa Technology (Stanford, CA)
    AI/ML Engineer 1464457 * Hourly pay: $60/hr * Worksite: Leading university (Stanford, CA 94305 - Hybrid, Must be onsite 2-3 days on campus) * W2 Employment, ... Possible extension or conversion A leading university seeks an AI/ML Engineer. The successful candidate will focus...to design, automate, and maintain data pipelines for model inference, evaluation, and monitoring. * Experience with both GenAI… more
    Avispa Technology (12/19/25)
  • Principal/Senior Principal Machine Learning…

    Genentech (South San Francisco, CA)
    …scale and optimise workflows. We also work on scaling up model training and inference, evaluating the quality of AI/ML models and output, and building impactful ... evolving to meet the scientific needs. **The Opportunity:** As a machine learning engineer in AI Enablement, you will be working closely with folks that span the… more
    Genentech (12/06/25)
  • Senior Principal Software Engineer

    Oracle (Redwood City, CA)
    … at the Principal Engineer level. We are at the forefront of AI innovation, exploring the next generation of AI accelerators and hardware solutions. As ... a Senior Principal software engineer, part of our growing team, you will be...will be involved in evaluation, prototyping, and optimizing cutting-edge AI hardware, AI accelerators, including custom-designed … more
    Oracle (11/25/25)
  • Sr. Software Development Engineer, FAR…

    Amazon (San Francisco, CA)
    Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-renowned AI pioneers like Pieter ... run at production scale. As a Senior Machine Learning Engineer embedded in our science team, you'll be instrumental...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll… more
    Amazon (12/02/25)
  • Senior AI/ML Tooling Engineer

    General Motors (San Francisco, CA)
    **Job Description** **Senior AI/ML Tooling Engineer** Role: We are looking for an ML tooling engineer to build tools to analyze and optimize distillation, ... training, and inference of ML models. You will develop and enhance...toolchain and stack, to leverage the latest advancements in AI + Influence model architecture decisions and strategy within… more
    General Motors (11/04/25)
  • AI Applications Engineer

    quadric.io, Inc (Burlingame, CA)
    …both NN graph code and conventional C++ DSP and control code. Role: The AI Applications Engineer is the key bridge between development engineering and hands-on ... and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and...users in the field. The AI Application Engineer will [1] integrate Quadric… more
    quadric.io, Inc (11/25/25)
  • Software Development Engineer, Frontier…

    Amazon (San Francisco, CA)
    Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll contribute to breakthrough foundation models run at production ... scale. As a Software Development Engineer embedded in our science team, you'll be instrumental...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll… more
    Amazon (12/02/25)
  • Research Engineer, Language - Generative…

    Meta (Menlo Park, CA)
    inference; and/or multilingual and multimodal modeling. **Required Skills:** Research Engineer, Language - Generative AI Responsibilities: 1. Design methods, ... **Summary:** Meta is seeking a Research Engineer to join our Large Language Model (LLM)...for strong engineers who have a background in generative AI and NLP, with experience in areas like language… more
    Meta (12/20/25)
  • AI/HPC System Performance Engineer

    Meta (Menlo Park, CA)
    **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially to support ever increasing use cases of AI. This results in a dramatic ... host networking, communications lib and scheduling infrastructure. **Required Skills:** AI/HPC System Performance Engineer Responsibilities: 1. Lead… more
    Meta (12/20/25)
  • Software Engineer, SystemML - AI

    Meta (Menlo Park, CA)
    …space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer, SystemML - AI Networking Responsibilities: 1. Tech-leading the ... this role, you will be a member of the AI Networking Software team and part of the bigger...and innovations to leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed… more
    Meta (12/20/25)
  • Senior Machine Learning Engineer, Model…

    Amazon (San Francisco, CA)
    Description The Generative AI Innovation Center at AWS empowers customers to harness state of the art AI technologies for transformative business opportunities. ... with customers across industries to fine-tune and deploy customized generative AI applications at scale. Additionally, we work closely with foundational model… more
    Amazon (12/12/25)
  • Senior Software Engineer, Machine Learning…

    DoorDash (San Francisco, CA)
    …systems that empower efficient machine learning at scale, especially in the Generative AI area. This is a remote opportunity, with Pacific Time working hours ... data and ML pipelines-from retrieval (e.g., RAG) to batch inference-that adapt quickly to new technologies. + Develop an...fast iteration and deployment of products powered by Generative AI. + Improve the reliability, scalability, and monitoring of… more
    DoorDash (10/28/25)
  • Software Engineer, Systems ML - Frameworks…

    Meta (Menlo Park, CA)
    …authoring for specific hardware, directly impacts performance and deployment velocity of both AI training and inference platforms at Meta. You will be working on ... help in driving next generation hardware software codesign for AI domain specific problems. **Required Skills:** Software Engineer...core compilers to support new state of the art inference and training AI hardware accelerators and… more
    Meta (12/20/25)
  • Software Engineer, ML Infrastructure,…

    Snap Inc. (San Francisco, CA)
    …forefront. You'll play a critical role in scaling our ML Infrastructure, optimizing AI training and inference systems, and driving innovations that make ... more efficient and impactful. We're looking for a Software Engineer, ML Infrastructure to join Snap Inc! What you'll...models for content ranking and recommendations + Develop high-performance inference systems to ensure fast and efficient AI… more
    Snap Inc. (12/18/25)