- NVIDIA (CA)
- …distributed model management systems, including Rust-based runtime components, for large-scale AI inference workloads. + Implement inference scheduling ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...+ Collaborate with infrastructure engineers and researchers to develop scalable APIs, services, and end-to-end inference workflows.… more
- Capital One (San Francisco, CA)
- Lead AI Engineer (FM Hosting, LLM Inference) **Overview** At Capital One, we are creating responsible and reliable AI systems, changing banking for good. ... upon number of hours to be regularly worked. Cambridge, MA: $193,400 - $220,700 for Lead AI Engineer McLean, VA: $193,400 - $220,700 for Lead AI Engineer… more
- Argonne National Laboratory (Lemont, IL)
- … working in the space of enabling AI for science, specifically targeting scalable inference leveraging HPC systems and AI accelerators. The successful ... In this position, the candidate can expect to explore and engineer solutions for AI inference integrated within scientific workflows, via programmatic access… more
- Amazon (Cupertino, CA)
- …optimization and system reliability across the Neuron ecosystem * Design and implement scalable solutions for both offline and online inference workloads * Lead ... stack About the team The Neuron Serving team is at the forefront of scalable and resilient AI infrastructure at AWS. We focus on developing model-agnostic … more
- Bank of America (Addison, TX)
- Senior Engineer - AI Inference Addison, Texas; Plano, Texas; Newark, Delaware; Charlotte, North Carolina; Kennesaw, Georgia **To proceed with your application, ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/Senior- Engineer - AI - Inference \_25029879) **Job Description:** At Bank… more
- MongoDB (Palo Alto, CA)
- We're looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search, ... retrieval, and AI-native features across MongoDB Atlas. This role is part...Atlas and optimized for developer experience. As a Lead Engineer, Inference Platform, you'll be hands-on with… more
- Red Hat (Boston, MA)
- …will collaborate with our team to tackle the most pressing challenges in scalable inference systems and Kubernetes-native deployments. Your work with distributed ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise... infrastructure leveraging Kubernetes APIs, operators, and the Gateway Inference Extension API for scalable LLM deployments.… more
- MongoDB (Palo Alto, CA)
- …with ML researchers from Voyage.ai to bring novel ideas into scalable systems + Tackle challenging problems in inference, observability, and distributed ... **About the Role** We're looking for a Senior Engineer to help build the next-generation inference...supporting semantic search and hybrid retrieval + Collaborate with AI engineers and researchers to productionize inference… more
- NVIDIA (Santa Clara, CA)
- …with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, ... What you'll be doing: + Design and build modular, scalable model optimization software platforms that deliver exceptional user...NVIDIA platform integration and expand market adoption across the AI inference ecosystem. What we need to… more
- NVIDIA (CA)
- …to stand out from the crowd: + Prior work experience improving performance of AI inference systems. + Background with deep learning algorithms and frameworks. ... We are now looking for a Senior System Software Engineer to work on Dynamo & Triton Inference...role, you will develop open source software to serve inference of trained AI models running on… more
- NVIDIA (Santa Clara, CA)
- We're forming a team of innovators to roll out and enhance AI inference solutions at scale, demonstrating NVIDIA's GPU technology and Kubernetes. As a Solutions ... Solutions Architecture with a proven track record of moving AI inference from POC to production on...AWS Lambda, NVCF) with NVIDIA GPUs. + NVIDIA Certified AI Engineer or similar. + Active contributions… more
- Guidehouse (Huntsville, AL)
- …solutions that leverage large language models (LLMs), retrieval systems, and secure, scalable inference pipelines to enable data-driven decision-making. You will ... Active Top Secret (TS) Guidehouse is seeking a Lead AI/ML Engineer to join our Technology /...the FBI adjudication platform. Drives design and implementation of inference pipelines, RAG workflows, retrieval systems, prompt architectures, and… more
- Amazon (Seattle, WA)
- …and Llama, and collaborate directly with silicon architects to influence the future of AI hardware. Our systems handle millions of inference calls daily, while ... accelerators Inferentia and Trainium. As a Senior Software Engineer in our Machine Learning Applications team, you'll be...deep technical experts transform complex ML challenges into elegant, scalable solutions that define how AI workloads… more
- Oracle (San Juan, PR)
- **Job Description** The Senior Principal AI/ML Software Engineer is responsible for evaluating, integrating, and optimizing cutting-edge technologies for AI ... AI infrastructure offerings, spearheads the design and implementation of scalable orchestration for AI/ML workloads, incorporating the latest research in… more
- General Motors (Sunnyvale, CA)
- …such as Embodied AI, Simulation, Data Science, and more. We enable scalable and efficient ML experimentation, enhance the productivity of ML engineers, and drive ... the adoption of cutting-edge ML techniques. Our AV ML infrastructure includes: **AI Validation & Inference**: Ensures robust model performance by running… more
- Oracle (Frankfort, KY)
- …serving frameworks like vLLM, DeepSpeed, or FasterTransformer. - Exposure to agent-based AI systems or tool-based inference workflows. - Knowledge of ... future of cloud computing-designed for enterprises, engineered for performance, and optimized for AI at scale. We are a fast-paced, mission-driven team within one of… more
- Mastercard (O'Fallon, MO)
- …mining, cleaning, processing, and validation. -Contribute to the development of scalable AI infrastructure and reusable components for enterprise-wide use. ... businesses and governments realize their greatest potential._ **Title and Summary** Lead, AI Data Engineer Mastercard Security Services is responsible for… more
- General Motors (Austin, TX)
- **Job Description** **Staff AI/ML Engineer, AV ML Infra** We're General Motors (GM), a company driving the future of mobility with advanced self-driving and ... AI, Simulation, Data Science, and more. We enable scalable and efficient ML experimentation, enhance the productivity of...of cutting-edge ML techniques. Our ML infrastructure includes: + **AI Validation & Inference:** Ensures robust model… more
- SAIC (VA)
- **Description** We are seeking a **hands-on AI & LLM Engineer** to join the AWS AI/GenAI Solutions Team within the IRS Advanced Analytics Program (AAP). This ... Step Functions) in production workflows. + Proven ability to build and deploy **inference endpoints and APIs** for AI/ML workloads. + Familiarity with **CI/CD… more
- Palo Alto Networks (Santa Clara, CA)
- …is to create an environment where we all win with precision. **Your Career** As a Principal AI Engineer for the Enterprise AI Platform, you will be a pivotal ... assistant, agent, app, supporting both traditional and Generative AI model development, deployment, and real-time inference...Research: Actively learn and incorporate advances in Agents, Generative AI, LLMs, and scalable AI… more