• Acceler8 Talent (San Francisco, CA)
    A dynamic technology company in San Francisco is seeking a Machine Learning Engineer (Inference) to enhance AI inference efficiency. This mid-senior level ... role focuses on building and optimizing inference stacks, requiring experience with tools such as vLLM and TensorRT. The compensation includes a competitive salary,…
    job goal (01/14/26)
  • Together AI (San Francisco, CA)
    About the Role: Together AI is seeking a Distributed ML Systems Engineer to design and build scalable machine learning systems that power our accelerated ... high-load and high-performance requirements. If you are passionate about designing ML systems that operate at scale and eager to create impactful solutions,…
    job goal (01/14/26)
  • Serve Robotics (San Francisco, CA)
    Sr. Software Engineer, ML Edge Inference Engineer at Serve ... seeking a highly skilled Sr. Software Engineer, ML Edge Inference Engineer to...as NVIDIA Jetson platforms. You will work closely with ML researchers, embedded systems engineers, and robotics…
    job goal (01/14/26)
  • Aldea Inc (San Francisco, CA)
    …AI capabilities. The role involves building and optimizing infrastructure for training and inference systems at scale. Ideal candidates will have experience with ... deep learning frameworks, large model training, and production-grade systems. The company offers comprehensive benefits including equity participation and flexible…
    job goal (01/14/26)
  • Together AI (San Francisco, CA)
    A leading AI research company in San Francisco is seeking a skilled software engineer to focus on optimizing AI systems and ensuring robust performance. ... Candidates should have extensive experience in building scalable distributed systems and proficiency in modern programming languages. The role also involves new…
    job goal (01/14/26)
  • Apple Inc. (San Francisco, CA)
    …equivalent experience). 5+ years in software engineering focused on ML inference, GPU acceleration, and large-scale systems. Expertise in deploying and ... Senior Software Engineer, Model Inference, San Francisco Bay...Continuously evaluate emerging research and industry trends in LLM inference, distributed systems, and ML…
    job goal (01/14/26)
  • MongoDB (Palo Alto, CA)
    …for developer-first experiences. As a Senior Engineer, you'll focus on building core systems and services that power model inference at scale. You'll own key ... We're looking for a Senior Engineer to help build the next-generation inference...developers. Nice to Have: Experience integrating infrastructure with production ML workloads. Understanding of hybrid retrieval, prompt-driven systems…
    job goal (01/14/26)
  • Jobright.ai (San Francisco, CA)
    Site Reliability Engineer - Inference at Jobright.ai ...models and building a high-throughput, low-latency API for distributed systems. Responsibilities: Work on our Inference…
    job goal (01/14/26)
  • Red Hat (Boston, MA)
    Machine Learning Engineer, vLLM Inference - Tool Calling and Structured Output ... LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings...optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM, you will be at the…
    job goal (01/14/26)
  • Red Hat, Inc. (Boston, MA)
    …the power of open-source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI ... optimize, and scale LLM deployments. As a Machine Learning Engineer focused on vLLM, you will be at the...you. Join us in shaping the future of AI Inference! What You Will Do: Write robust Python and…
    job goal (01/14/26)
  • Red Hat (Boston, MA)
    Principal Machine Learning Engineer, AI Inference. Remote type: Hybrid. Locations: ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings...optimize, and scale LLM deployments. As a Principal Machine Learning Engineer focused on vLLM, you will be at the…
    job goal (01/14/26)
  • Parafin (San Francisco, CA)
    …deploy to batch or real-time targets with minimal boilerplate. Build our real-time ML inference platform. Stand up and scale low-latency model serving. Expand ... batch ML inference. Improve scheduling, parallelism, cost controls,...platforms, shadow/canary deployments, and automated rollback. Experience with low‑latency inference systems. What We Offer: Salary Range:…
    job goal (01/14/26)
  • SoFi (San Francisco, CA)
    Senior Software Engineer, ML Platform at SoFi ...platforms (AWS preferred) and containerization (Docker, Kubernetes). Familiarity with ML workflows, including model training, batch/online inference,…
    job goal (01/14/26)
  • GEICO (Palo Alto, CA)
    …Experience with Staff Engineer or Tech Lead roles in ML/AI organizations; background in distributed systems and high-performance computing; open-source ... ML Infrastructure team is seeking an exceptional Senior ML Platform Engineer to build and scale...and inference pipelines; build and optimize LLM inference systems using frameworks like vLLM, TensorRT-LLM,…
    job goal (01/14/26)
  • Scale AI, Inc. (San Francisco, CA)
    Machine Learning Research Engineer, Enterprise ML Systems. Join the team shaping the future of AI at Scale. AI is becoming vitally important ... that serve all of our enterprise clients. As an ML Sys Research Engineer, you'll work on...You will: Build, profile and optimize our training and inference framework. Post‑train state of the art models, developed…
    job goal (01/14/26)
  • Genies (San Francisco, CA)
    …the visual and interactive layer for the LLM-powered internet. Genies is looking for an ML Infra and Model Optimization Engineer to join our R&D team. Based ... APIs/services (e.g., FastAPI, Flask, gRPC). Hands-on experience deploying and operating ML models at scale, including: GPU-based inference services concurrency…
    job goal (01/14/26)
  • Waymo (Seattle, WA)
    …through major evolutions while supporting high production demands. Experienced at modern ML data, training, and inference systems for large models. ... Perception engineers and build a deep understanding of the ML development workflows and systems, their requirements...extraction for model training, large model training pipelines, model inference systems, etc. Work closely with all…
    job goal (01/14/26)
  • Proximity Works (San Francisco, CA)
    ML systems optimized for low latency, high throughput, and cost‑efficient inference. Implement ML pipeline best practices (versioning, monitoring, A/B testing, ... About the Role: This role is for a hands‑on ML Engineer who can design, train, and...vector databases, TensorFlow / PyTorch. Proven experience building production‑grade ML systems at scale. Familiarity with LLMs,…
    job goal (01/14/26)
  • Roche Holdings Inc. (Santa Clara, CA)
    …today and for generations to come. Join Roche, where every voice matters. Principal DevOps Engineer - ML/AI Algorithms. As Principal DevOps Engineer - ML ... & Development team contributing to digital health products touching Imaging, ML/AI, and computational science. The Opportunity: As Principal DevOps Engineer…
    job goal (01/14/26)
  • Amazon (San Francisco, CA)
    Firmware Engineer, Annapurna Labs, ML Acceleration - Performance Instrumentation & Developer Tools. AWS Utility Computing (UC) provides product Annapurna Labs ... pipelines and scripting (Python, shell) for algorithm validation. Understanding of ML training/inference workloads and their performance characteristics. Takes…
    job goal (01/14/26)