- MongoDB (Palo Alto, CA)
- …for developer-first experiences. As a Senior Engineer, you'll focus on building core systems and services that power model inference at scale. You'll own key ... Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic...Voyage. ai to bring novel ideas into scalable systems Tackle challenging problems in inference , observability,… more
- Amazon (Boston, MA)
- …for designing, developing, and operating high-performance machine learning inference systems that power real-time computer vision and AI applications across ... Amazon Worldwide Grocery Tech is seeking an exceptional Software Development Manager to lead our Edge ML Inference Systems team. This team is responsible… more
- Adobe Inc. (San Jose, CA)
- …7 years of experience in machine learning and expertise in GPU-intensive generative AI systems . Responsibilities include designing inference pipelines and ... A leading software company in California is seeking a Senior Machine Learning Engineer for their Generative AI Services team. The ideal candidate will have over… more
- Databricks (San Francisco, CA)
- A leading data and AI company is seeking a Senior Software Engineer, Model Serving to design and implement core systems that ensure scalability and ... excellence. You will drive architectural decisions for high-throughput, low-latency inference and collaborate with cross-functional teams. The ideal candidate has… more
- Jobright.ai (San Francisco, CA)
- Join to apply for the Site Reliability Engineer - Inference role at Jobright. ai 2 days ago Be among the first 25 applicants Join to apply for the Site ... Reliability Engineer - Inference role at Jobright. ai Get AI...models and building a high-throughput, low-latency API for distributed systems . Responsibilities: * Work on our Inference … more
- Acceler8 Talent (San Francisco, CA)
- …to join a Stanford spin out scale up building a foundational infrastructure layer for AI inference . The team were founded on the back of a successful exit, ... $350,000.00/yr Direct message the job poster from Acceler8 Talent Compilers, Kernels, Performance. Systems Software & HPC Recruiter We are seeking an Inference … more
- Acceler8 Talent (San Francisco, CA)
- …in San Francisco is seeking a Machine Learning Engineer ( Inference ) to enhance AI inference efficiency. This mid- senior level role focuses on building ... and optimizing inference stacks, requiring experience with tools such as vLLM and TensorRT. The compensation includes a competitive salary, meaningful equity percentage, and potential sign-on bonus, making this an excellent opportunity for skilled… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …of resilient fault-tolerant queues, model catalogs, and scheduling mechanisms. Build and optimize systems for high-volume AI inference , handling millions of ... cloud infrastructure. About This Role: The Crusoe Cloud Managed AI team seeks an ambitious and experienced Senior...in shaping the architecture and scalability of our next-generation AI inference platform. You will lead the… more
- Capital One (San Francisco, CA)
- Senior Lead AI Engineer ( AI ...At Capital One, we are creating responsible and reliable AI systems , changing banking for good. For years, ... techniques to improve the performance-scalability, cost, latency, throughput-of large‑scale production AI systems . Contribute to the technical vision and the… more
- GEICO (Palo Alto, CA)
- …ML model training and inference pipelines* Build and optimize LLM inference systems using frameworks like vLLM, TensorRT-LLM, and custom serving solutions* ... AI ML Infrastructure team is seeking an exceptional Senior ML Platform Engineer to build and scale our...Experience with Staff Engineer or Tech Lead roles in ML/ AI organizations* Background in distributed systems and… more
- Capital One (San Francisco, CA)
- Senior AI Engineer ( AI Foundations,...At Capital One, we are creating responsible and reliable AI systems , changing banking for good. For years, ... scalability, cost, latency, throughput - of large scale production AI systems . Contribute to the technical vision...AI and ML algorithms or technologies (eg LLM inference , similarity search and vector databases, guardrails, memory) using… more
- Capital One (San Francisco, CA)
- Senior Lead AI Engineer ( AI ...At Capital One, we are creating responsible and reliable AI systems , changing banking for good. For years, ... - scalability, cost, latency, throughput - of large‑scale production AI systems . Contribute to the technical vision...AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more
- Capital One (San Francisco, CA)
- Overview: At Capital One, we are creating responsible and reliable AI systems , changing banking for good. For years, Capital One has been an industry leader in ... cost, latency, throughput, and throughput of large scale production AI systems . Contribute to the technical vision...AI and ML algorithms or technologies (eg LLM Inference , Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more
- Proximity Works (Los Angeles, CA)
- …Proxonauts from San Francisco, Dubai, India, and beyond. High autonomy, direct accountability, and the opportunity to ship AI systems at scale. #J-18808-Ljbffr ... Demonstrable improvements in grounding accuracy and hallucination reduction across deployed systems . Consistent delivery of sub-100ms inference latency for… more
- GMI Cloud (Mountain View, CA)
- Senior Solution Architect - AI / GPU Cloud We are seeking a Senior Solution Architect to design GPU‑cloud and AI infrastructure solutions, lead PoCs and ... footprint in Asia, delivering a full spectrum of services from GPU compute to AI model inference API solutions. As an NVIDIA Reference Platform Cloud Partner,… more
- Blue Origin LLC (Seattle, WA)
- Senior Principal Software Engineer, Factory Automation AI page is loaded## Senior Principal Software Engineer, Factory Automation AIlocations: Seattle, WA: ... understanding of software development as well as automated manufacturing processes and systems , including industrial automation hardware and software. AI will… more
- Oracle (Redwood City, CA)
- Senior Principal Software Engineer - AI ...and optimizing cutting‑edge AI hardware, including custom‑designed AI chips and systems and software to ... for the Senior Principal Software Engineer - AI Infrastructure Innovation role at Oracle 5 days ago...with industry‑standard hardware (eg, NVIDIA GPUs) for training and inference workloads. Develop tools and processes for evaluating the… more
- Oracle (Redwood City, CA)
- Senior Principal Software Engineer - AI GPU...AI hardware, AI accelerators, including custom‑designed AI chips and systems and software to drive ... AI innovation, exploring the next generation of AI accelerators and hardware solutions. As a Senior...with industry‑standard hardware (eg, NVIDIA GPUs) for training and inference workloads. Develop tools and processes for evaluating the… more
- Qualified (San Francisco, CA)
- …Pay Range $170,000 - $230,000 per year Overview We are seeking a seasoned, product‑focused Senior AI Backend Engineer to develop and scale the core backend ... Senior Software Engineer, Backend - AI ...with cross‑functional teams. Why Join Qualified Foundational Impact: Shape AI integration and backend systems from day… more
- Genentech (San Francisco, CA)
- …scale and optimise workflows. We also work on scaling up model training and inference , evaluating the quality of AI /ML models and output, and building impactful ... DevOps, and everyone in between. You'll build, own, and constantly improve scalable AI /ML based systems that unlock the potential of our diverse scientific… more