- quadric.io, Inc (Burlingame, CA)
- …GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world ... of AI /LLM models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform; [2] optimize the… more
- Capital One (San Francisco, CA)
- Lead AI Engineer (FM Hosting, LLM Inference ) **Overview** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. ... AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,...regularly worked. Cambridge, MA: $193,400 - $220,700 for Lead AI Engineer McLean, VA: $193,400 - $220,700… more
- quadric.io, Inc (Burlingame, CA)
- …GPNPU executes both NN graph code and conventional C++ DSP and control code. Role: The AI Kernel Engineer in Quadric plays the key role to enable a large number ... of AI kernels/operators to run efficiently on the Quadric platform. The AI Kernel Engineer at Quadric will [1] develop a highly efficient Quadric kernel… more
- Genentech (South San Francisco, CA)
- …scale and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI /ML models and output, and building impactful ... evolving to meet the scientific needs. **The Opportunity:** As a software engineer in AI Enablement with a focus on full stack engineering, you will be… more
- Google (San Francisco, CA)
- Senior Forward Deployed Engineer , Data and AI _corporate_fare_ Google _place_ Sunnyvale, CA, USA; Kirkland, WA, USA; +3 more; +2 more **Mid** Experience driving ... 5 years of experience in a customer-facing technical role (eg, Forward Deployed Engineer , Solutions Architect, Sales Engineer ) leading technical engagements. + 5… more
- Genentech (South San Francisco, CA)
- …scale and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI /ML models and output, and building impactful ... evolving to meet the scientific needs. **The Opportunity:** As a machine learning engineer in AI Enablement, you will be working closely with folks that span the… more
- Amazon (San Francisco, CA)
- Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll work alongside world-renowned AI pioneers like Pieter ... run at production scale. As a Senior Machine Learning Engineer embedded in our science team, you'll be instrumental...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll… more
- General Motors (San Francisco, CA)
- **Job Description** **Senior AI /ML Tooling Engineer ** Role: We are looking for an ML tooling engineer to build tools to analyze and optimize distillation, ... training, and inference of ML models. You will develop and enhance...toolchain and stack, to leverage the latest advancements in AI + Influence model architecture decisions and strategy within… more
- quadric.io, Inc (Burlingame, CA)
- …both NN graph code and conventional C++ DSP and control code. Role: The AI Applications Engineer is the key bridge between development engineering and hands-on ... and hardware is targeted to run neural network (NN) inference workloads in a wide variety of edge and...users in the field. The AI Application Engineer will [1] integrate Quadric… more
- Amazon (San Francisco, CA)
- Description Join the next revolution in robotics at Amazon's Frontier AI & Robotics team, where you'll contribute to breakthrough foundation models run at production ... scale. As a Software Development Engineer embedded in our science team, you'll be instrumental...your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale. In this role, you'll… more
- Amazon (San Francisco, CA)
- Description The Generative AI Innovation Center at AWS empowers customers to harness state of the art AI technologies for transformative business opportunities. ... with customers across industries to fine-tune and deploy customized generative AI applications at scale. Additionally, we work closely with foundational model… more
- DoorDash (San Francisco, CA)
- …systems that empower efficient machine learning at scale, especially in the Generative AI area. This is a remote opportunity, with Pacific Time working hours ... data and ML pipelines-from retrieval (eg, RAG) to batch inference -that adapt quickly to new technologies. + Develop an...fast iteration and deployment of products powered by Generative AI . + Improve the reliability, scalability, and monitoring of… more
- Snap Inc. (San Francisco, CA)
- …forefront. You'll play a critical role in scaling our ML Infrastructure, optimizing AI training and inference systems, and driving innovations that make ... more efficient and impactful. We're looking for a Software Engineer , ML Infrastructure to join Snap Inc! What you'll...models for content ranking and recommendations + Develop high-performance inference systems to ensure fast and efficient AI… more
- General Motors (San Francisco, CA)
- **Job Description** **About the Team** **:** We are building a state-of-the-art AI training platform in close collaboration with model teams to deliver Autonomous ... milestones, maximizing model flop utilization and iteration speed, and enhancing inference systems. We are augmenting training, data processing, evaluation and the… more
- Oracle (San Francisco, CA)
- …serving frameworks like vLLM, DeepSpeed, or FasterTransformer. - Exposure to agent-based AI systems or tool-based inference workflows. - Knowledge of ... future of cloud computing-designed for enterprises, engineered for performance, and optimized for AI at scale. We are a fast-paced, mission-driven team within one of… more
- DataRobot (San Francisco, CA)
- …makes sense for their business - today and in the future. As a Principal Software Engineer for Generative AI at DataRobot, you will be the technical anchor for ... **Job Description:** DataRobot delivers AI that maximizes impact and minimizes business risk. Our platform and applications integrate into core business processes so… more
- DataRobot (San Francisco, CA)
- …GCP, Azure) and/or hybrid environments. **Nice to Have:** + Experience with AI /ML model deployment, serving, inference , and production integration. + Experience ... **Job Description:** DataRobot delivers AI that maximizes impact and minimizes business risk....today and in the future. As a Staff Software Engineer focused on Application Scalability & Performance, you will… more
- Oracle (San Francisco, CA)
- …building the core cloud services that power modern enterprise applications. The Generative AI Service team focuses on building and scaling systems that enable Large ... Language Models (LLMs) and agent-based AI applications to run reliably and efficiently in production...area within OCI. Role Summary As a Senior Software Engineer (IC3), you will be responsible for developing and… more
- ServiceNow, Inc. (San Francisco, CA)
- …group of builders, thinkers, and problem-solvers dedicated to delivering scalable, AI -powered software products that elevate how organizations work. We value clean ... architecture, intuitive user experiences, and a culture of continuous improvement. Every engineer here plays a key role in shaping the quality and reliability of our… more
- Amazon (San Francisco, CA)
- …human interfaces in the digital world. We are seeking a Machine Learning Engineer who combines strong ML expertise with software engineering excellence to scale and ... our leading computer-use agent. We are seeking a strong engineer who has a passion for scaling ML models...frameworks, improving engineering practices, and accelerating the velocity of AI development. You will be hired as a Member… more