• OpenAI (San Francisco, CA)
    …tighter coordination with product and research. About the Role We're looking for a software engineer to help us serve OpenAI's multimodal models at scale. You'll ... About the Team OpenAI's Inference team powers the deployment of our most...of what AI can do. We're expanding into multimodal inference , building the infrastructure needed to serve models that… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Apple Inc. (San Francisco, CA)
    Senior Software Engineer , Model Inference San Francisco Bay Area, California, United States Software and Services Join Apple Maps to help build the best ... take end-to-end ownership, and deliver measurable results at global scale. Description As a Software Engineer on the Apple Maps team, you will lead the design… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • San Francisco Compute Co. (San Francisco, CA)
    …GPU offtake. About the Role As an engineer working on Large Scale Inference you will build and scale software systems that accept, distribute, and purchase ... This isn't SaaS anymore - application layer companies sign multi -year contracts for computer and inference , but...fit for you if You enjoy the craftsmanship of software You're a thoughtful high-agency engineer Have… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Deep Learning Software Engineer , Inference page is loaded## Senior Deep Learning Software Engineer , Inferencelocations: US, CA, Santa Clara: ... on: Posted Yesterdayjob requisition id: JR2002670NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Virtue AI (San Francisco, CA)
    …we're looking for passionate builders to join our core team. What You'll Do As an Inference Engineer , you will own how models are served in production. Your job ... versioning, and backward compatibility Build routing and load‑balancing logic for inference traffic Multi ‑model routing Fallback and degradation strategies vLLM… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    …and contribute to our innovative projects. Position Overview We are looking for a Software Engineer to work at the forefront of deploying our cutting-edge AI ... scenarios (autoregressive models, denoising models, hierarchical models, state machines, multi -agent systems, cloud-based inference ). Adapt optimization solutions… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • General Motors (Sunnyvale, CA)
    …This job is eligible for relocation assistance. About the Team The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure ... of state-of-the‑art (SOTA) machine learning models for experimental and bulk inference , with a focus on performance, availability, concurrency, and scalability.… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Pulse (San Francisco, CA)
    …models. Own profiling, batching, and autoscaling across single-tenant and multi -tenant environments. Responsibilities Build inference services with smart ... and growing quickly. What makes our tech special is our multi -stage architecture: Layout understanding with specialized component detection models Low-latency OCR… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Hamilton Barnes Associates Limited (San Francisco, CA)
    …with event-driven or serverless architectures. Exposure to hybrid cloud or multi -cluster environments. Contributions to open-source ML or inference systems ... and B200s, ready to go for experimentation, full-scale model training, or inference . Our client operates high-performance GPU clusters powering some of the most… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Menlo Ventures (San Francisco, CA)
    …leaders working together to build beneficial AI systems. About the role Our Inference team is responsible for building and maintaining the critical systems that ... to life by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack from intelligent request… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Sanas (Palo Alto, CA)
    Staff Software Engineer : Microservice Infrastructure & Real-Time ML Inference Sanas.ai is pioneering the future of human communication. Founded by a team of ... growth. The company's valuation has a clear trajectory toward multi -billion-dollar market capitalization as it continues to expand into...communication. About the role We're looking for a Staff Software Engineer (Backend) to design and build… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • MongoDB (Palo Alto, CA)
    …build next-generation, AI-powered applications. About the Role We're looking for a Lead Engineer , Inference Platform to join our team building the inference ... transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes...Atlas and optimized for developer experience. As a Lead Engineer , Inference Platform, you'll be hands-on with… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Google Inc. (Sunnyvale, CA)
    Senior Software Engineer , Machine Learning, Kernal Apply Note: By applying to this position you will have an opportunity to share your preferred working location ... like PyTorch or JAX. 3 years of experience in software development for machine learning model inference ...goes on and is growing every day. As a software engineer , you will work on a… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Etched.ai, Inc. (San Jose, CA)
    …high-performance networking solutions for large-scale inference workloads. As a Pod Software Engineer , you will focus on developing and qualifying ... software that drives communication amongst Sohu inference nodes in multi -rack inference clusters. You will collaborate closely with kernel, platform, and… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Amazon (San Francisco, CA)
    …a variety of benefits, including business-only pricing and selection, a multi -seller marketplace, single- or multi -user business accounts, approval workflow, ... rapid pace. We are looking for a Machine Learning Engineer (MLE) to join the team to drive key...building data pipelines to produce inputs for training and inference in both online and offline contexts; 2) Training… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    Principal Software Engineer - Large-Scale LLM Memory and Storage Systems page is loaded## Principal Software Engineer - Large-Scale LLM Memory and ... Posted Todayjob requisition id: JR2010271NVIDIA Dynamo is a high-throughput, low-latency inference framework for serving generative AI and reasoning models across … more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Vinci4d (Palo Alto, CA)
    …cloud integrations to accelerate development. We're looking for a C++-heavy engineer who writes production application code-owning features end-to-end and making ... both highly performant and broadly useful. You will implement parsers, simulation/ inference pipelines, and distributed execution engines that scale across many CPUs… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Voxel (San Francisco, CA)
    …by industry leading VC's. Voxel is looking for a Staff Machine-Learning Infrastructure Engineer to drive the next wave of our computer-vision platform for workplace ... stay within cost constraints. Build and operate training infrastructure - create multi -GPU / multi -node training frameworks (Ray, Spark, Kubernetes), optimize… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • GEICO (Palo Alto, CA)
    GEICO . For more information, please .Staff Software Engineer - AI/ML Infra page is loaded## Staff Software Engineer - AI/ML Infralocations: Chevy Chase, ... Infrastructure team is seeking an exceptional Senior ML Platform Engineer to build and scale our machine learning infrastructure...and maintain feature stores for ML model training and inference pipelines* Build and optimize LLM inference more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Apple Inc. (Cupertino, CA)
    AIML - Sr Software Engineer , Machine Learning Platform Technologies Cupertino, California, United States Machine Learning and AI Are you an open-source ... Kubernetes, and you're driven to build infrastructure for ML training and inference -including optimizing for performance, cost, and automation-this role is for you.… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source