- MongoDB (Palo Alto, CA)
- …We're looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, ... in MongoDB Atlas. You'll join the broader Search and AI Platform organization and collaborate with ML...Do** + Design and build components of a multi-tenant inference platform integrated directly with MongoDB Atlas,… more
- Bank of America (Addison, TX)
- Senior Engineer- AI Inference Addison, Texas; Plano,.... We are building the next generation of Gen AI platform, empowering new AI initiatives ... must be at least 18 years of age.** Acknowledge (https://ghr.wd1.myworkdayjobs.com/Lateral-US/job/Addison/ Senior -Engineer- AI - Inference \_25029879) **Job Description:** At Bank… more
- Red Hat (Raleigh, NC)
- …this role, your primary responsibility will be to build and release the Red Hat AI Inference runtimes, continuously improve the processes and tooling used by the ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise...techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM… more
- quadric.io, Inc (Burlingame, CA)
- …unique platforms. The AI Inference Engineer at Quadric will [1] port AI models to Quadric platform; [2] optimize the model deployment for efficient ... and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the...; [3] profile and benchmark the model performance. This senior technical role demands deep knowledge of AI… more
- Amazon (Seattle, WA)
- …accelerators that power the latest AI models As the Sr. SDM for the Inference Technology Team, you will lead a strong team of managers and engineers to build ... inference technology building blocks and libraries to enable AI developers to optimize model for inference ...You will work with the executive leadership and other senior management and technical leaders to define product directions… more
- NVIDIA (Santa Clara, CA)
- …will influence NVIDIA's entire technical marketing strategy to showcase our leadership position in AI inference. Want to join a fun, creative company that is at ... companies! What You'll Be Doing: + Help drive NVIDIA's inference platform technical go-to-market efforts + Work...cache implementations) and identify intersection points between the latest AI models and NVIDIA's platform to maximize… more
- Amazon (Seattle, WA)
- … Inference Technology building blocks team, you will guide your expert AI engineers to build fundamental inference technology building blocks and libraries ... enable AI developers to optimize model for inference on Trainium and Inferentia devices. We're currently focusing...day in the life You will work with your senior management and technical leaders to define the building… more
- NVIDIA (Santa Clara, CA)
- …performance to strengthen NVIDIA platform integration and expand market adoption across the AI inference ecosystem. What we need to see: + Master's, PhD, or ... to neural architecture search, and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer… more
- NVIDIA (Durham, NC)
- …Architecture team does groundbreaking hardware-software co-design work that focuses on accelerating AI Inference workloads. In this role, you will write ... We are now looking for a Senior Deep Learning Inference Performance Architect!...architectures to extend the state of the art in AI Inference performance and efficiency + Model,… more
- Red Hat (Boston, MA)
- …bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to ... Summary** At Red Hat we believe the future of AI is open and we are on a mission...model quantization and sparsification, our team provides a stable platform for enterprises to build, optimize, and scale LLM… more
- NVIDIA (CA)
- …to stand out from the crowd: + Prior work experience improving performance of AI inference systems. + Background with deep learning algorithms and frameworks. ... We are now looking for a Senior System Software Engineer to work on Dynamo...role, you will develop open source software to serve inference of trained AI models running on… more
- Red Hat (Boston, MA)
- …bring the power of open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to ... high performance computing will directly impact the development of our cutting-edge software platform, helping to shape the future of AI deployment and… more
- Amazon (Cupertino, CA)
- …as Llama and GPT-OSS to run really fast on Trainium. As the SDM for the LLM Inference Model Enablement team, you will lead a team of expert AI/ML engineers to ... open-source and customer LLMs, both dense and MoE, for inference on Neuron and Trainium and Inferentia accelerators. You...day in the life You will work with your senior management and technical leaders to define the model… more
- Coinbase (Charlotte, NC)
- …in the job description. For select roles, Coinbase is also piloting an AI interview intelligence platform to transcribe and summarize interview notes, allowing ... us, every day, as we build the emerging onchain platform - and with it, the future global financial...(i.e., Job Duties) * Develop and deploy robust causal inference models to quantify the holistic business impact of… more
- Capital One (New York, NY)
- Senior AI Engineer (Gen AI Platform Services, Agentic Systems) **Overview:** At Capital One, we are creating responsible and reliable AI systems, ... AI and ML algorithms or technologies (e.g., LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python,...be regularly worked. Cambridge, MA: $158,600 - $181,000 for Senior AI Engineer McLean, VA: $158,600 -… more
- PennyMac (Westlake Village, CA)
- …achieve aspirations of homeownership through the complete mortgage journey. A Typical Day The Senior AI Platform Engineer will: + Design, implement, and ... ensuring they can be securely and efficiently consumed by platform services. + Enable AI/ML workflows by...AI workloads, including monitoring and resource scaling for inference services. + Implement and enforce security best practices… more
- NVIDIA (Santa Clara, CA)
- We are looking for a Developer Relations Manager - AI Platform SW, passionate about developing modern Artificial Intelligence, and Generative AI applications ... Focus will be on accelerating GenAI model training and inference, which is making a major impact across research...throughout the developer applications. At NVIDIA, we are enabling AI platform software solutions for accelerated compute… more
- Veralto (MA)
- Position Summary We're seeking a Senior Data & AI Platform Architect to lead the design and implementation of TraceGains' next-generation data and MLOps ... 80% of their platform needs * Successfully hire and onboard 2-3 senior data engineers and AI/ML engineers Required Qualifications Experience & Background *… more
- LinkedIn (Mountain View, CA)
- …Attention. PyTorch Lightning and more. Feature Engineering: this team shapes the future of AI with the state-of-the-art Feature Platform, which empowers AI ... models together. The team is responsible for scaling LinkedIn's AI model training, feature engineering and serving with hundreds...infra on top of native cloud, enable GPU-based inference for a large variety of use cases, CUDA… more
- Capital One (San Jose, CA)
- Senior Manager, AI Engineering (People Leader) (GenAI Platform Services) **Overview:** At Capital One, we are creating responsible and reliable AI ... us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering...AI and ML algorithms or technologies (e.g., LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python,… more