- Amazon (Seattle, WA)
- …with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team ... to provide seamless integration and bring peak performance at scale for customers and developers. This role is responsible...models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side… more
- Google (Sunnyvale, CA)
- …deploying multimodal signals and autoraters, including tools for prompting, large- scale inference , visualization, and evaluation. + Address unique and ... Staff Software Engineer , AI Data, Multimodal _corporate_fare_ Google...cross-business projects. + Experience with innovation of technology at scale and passion for development and the use of… more
- NVIDIA (Santa Clara, CA)
- …open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI models like LLMs, VLMs, multimodal ... as large language models (LLM) and diffusion models for maximal inference efficiency using techniques ranging from quantization, speculative decoding, sparsity,… more
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and ... open-source frameworks, which are at the forefront of efficient large- scale model serving and inference . You will...tuning of DL models in various domains like LLM, Multimodal and Generative AI. + Scale performance… more
- Amazon (Cupertino, CA)
- …and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale . The Neuron Serving team develops infrastructure ... to pushing the boundaries of what's possible in large- scale ML serving. Recent shares: https://github.com/aws-neuron/upstreaming-to-vllm/releases/tag/2.25.0 https://awsdocs-neuron.readthedocs-hosted.com/en/latest/libraries/nxd- inference… more
- NVIDIA (Santa Clara, CA)
- NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will help design, build, and ... and vLLM, which are at the forefront of efficient large- scale model serving and inference . You will...tuning of DL models in various domains like LLM, Multimodal and Generative AI. + Scale performance… more
- NVIDIA (Santa Clara, CA)
- …data processing and model post-training. + Deep understanding of distributed systems for large- scale model inference and serving. Your base salary will be ... We are now looking for a DL Algorithms Engineer ! We are seeking a highly skilled Deep...and deploying deep learning models for efficient and fast inference across diverse GPU platforms, particularly for physical AI… more
- NVIDIA (Santa Clara, CA)
- …developers to build, optimize, and deploy GPU-accelerated pipelines that process multimodal sensor data in real time. Originally pioneered for healthcare imaging, ... earth observation, and beyond. Wherever high-throughput sensor processing, AI inference , and visualization meet, Holoscan provides the foundation. The field… more
- Google (Mountain View, CA)
- Snapshot We are seeking a highly motivated and innovative Software Engineer to join us in rapidly exploring and implementing the state-of-the-art Gemini models ... to validate new ideas, utilizing the latest advancements in multimodal large language models (vision, audio, text) to build...are the highest priority. The Role As a Software Engineer at Google DeepMind, you will be at the… more
- Amazon (San Francisco, CA)
- …Rocky Duan, and Peter Chen to make breakthrough foundation models run at production scale . As a Senior Machine Learning Engineer embedded in our science team, ... your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale . In this...spans the full spectrum of robotics intelligence - from multimodal perception using images, videos, and sensor data, to… more
- Microsoft Corporation (Redmond, WA)
- …part of AI Platform team in CoreAI Organization is responsible for large- scale , highly reliable and efficient GPU management infrastructure and the inference ... as M365 CoPilot, Github CoPilot, Microsoft CoPilot, AI Foundry's Inference and Fine-Tuning offering of OAI and OSS models,...OSS models, and many more. As a Senior Software Engineer on the training infrastructure team, you will work… more
- Amazon (San Francisco, CA)
- …Robotics team, where you'll contribute to breakthrough foundation models run at production scale . As a Software Development Engineer embedded in our science ... your expertise in CUDA and TensorRT to achieve unprecedented inference efficiency at Amazon scale . In this...spans the full spectrum of robotics intelligence - from multimodal perception using images, videos, and sensor data, to… more
- NVIDIA (Santa Clara, CA)
- …engineering team is pushing the boundaries of what's possible across multimodal learning, video generation, synthetic data, intelligent simulation, and agentic ... crafting or contributing to benchmarks or evaluation datasets, especially for multimodal or agentic systems. + Familiarity with evaluating models via prompting,… more
- Microsoft Corporation (Redmond, WA)
- …tuning and reinforcement learning from feedback + Reasoning: Enabling LLMs to scale inference time compute via reinforcement learning + Action Models: ... close links to top academic institutions around the world. This Principal Research Engineer position is a unique opportunity to contribute towards tackling some of… more
- Meta (Bellevue, WA)
- **Summary:** Meta Superintelligence Labs is seeking a Staff Research Engineer to provide technical leadership for the team working on the next generation of AI ... Assistants powered by frontier- scale foundation models. This is a high-impact Individual Contributor...model development with real end-user value.As a Staff Research Engineer , you will define the scientific "North Star" behind… more
- JPMorgan Chase (Plano, TX)
- …adding business value and enhance user experiences. As a Senior Lead Software Engineer at JPMorgan Chase within the Consumer & Community Banking - Deposits Platform ... Designs and implements generative AI models including large language models, multimodal systems, and specialized domain-specific models. + Builds scalable, robust AI… more
- Insight Global (Whitpain, PA)
- …Software Engineer to lead the design and delivery of enterprise- scale Generative AI solutions that power next-generation healthcare experiences. This role goes ... Job Description Our Healthcare Insurance Client is looking to hire a Principal Software Engineer - Generative AI. This position is remote work from home and will… more
- Norbert Health (Brooklyn, NY)
- …that matter. The position We are looking for our lead deep learning engineer to spearhead the development of our groundbreaking sensing technology. What you will ... facial landmark detection, object tracking, pose estimation) for real-time inference on the edge + Optimize models for embedded...best practices and help reduce technical debt as we scale + Contribute to the architecture and implementation of… more
- Microsoft Corporation (Mountain View, CA)
- **Overview** As a Principal Software Engineer on the Azure Artificial Intelligence Core team at Microsoft, you will design, build, and maintain AI systems that power ... We enable secure, scalable, and high-performance AI experiences across multimodal verticals, including real-time audio interaction, image generation, video… more
- ManTech (Ashburn, VA)
- …environments (eg, cloud, edge devices). Develop and optimize models for real-time inference and efficiently handle large- scale data processing pipelines. + ... **MANTECH** seeks a motivated, career and customer-oriented **Senior Computer Vision Engineer ** to join our AI innovation team in **Ashburn, VA.** This is currently… more