- Amazon (Cupertino, CA)
- …Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more
- Amazon (Cupertino, CA)
- …Acceleration Kernel Library team is at the forefront of maximizing performance for AWS's custom ML accelerators. Working at the hardware-software boundary, ... our engineers craft high- performance kernels for ML functions, ensuring every...Neuron architecture and programming models * Analyze and optimize kernel -level performance across multiple generations of Neuron… more
- Amazon (Cupertino, CA)
- …stack for Trainium and Inferentia, the AWS Machine Learning chips, delivering best-in-class ML performance in the cloud. You will lead NKI requirements working ... to define and drive product strategy for the Neuron Kernel Interface (NKI), a compiler library enabling custom ...Inferentia, the AWS Machine Learning chips. Inferentia delivers best-in-class ML inference performance at the lowest cost… more
- Amazon (Cupertino, CA)
- …work backward from Trainium customers and drive the developer experience for running high- performance ML workloads at scale on AWS Trainium, from getting started ... as well as how Neuron Runtime System interacts with ML frameworks to ensure scale and high performance...nodes, and cost-optimized inference powered by optimized kernels. For performance optimization, Neuron provides the Neuron Kernel … more
- Google (Sunnyvale, CA)
- Senior Software Engineer, Machine Learning, Kernel _corporate_fare_ Google _place_ Sunnyvale, CA, USA; Kirkland, WA, USA **Mid** Experience driving progress, ... + Experience in Kernel development for TPU. + Experience in low-level ML model optimization and willingness to learn new architectures and tools. + Experience in… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines ... deep technical expertise in embedded firmware, Linux kernel development, and middleware development, with customer-facing responsibilities to enable cloud service… more
- Palo Alto Networks (Santa Clara, CA)
- …community. Beyond individual contribution, you will lead complex technical projects, mentor senior engineers, and set the standard for performance , scalability, ... contributions in these areas are a significant plus. + Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton… more
- Amazon (Cupertino, CA)
- …with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance . The Inference Enablement and Acceleration team ... a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from… more
- quadric.io, Inc (Burlingame, CA)
- …and endpoint devices, ranging from battery operated smart-sensor systems to high- performance automotive or autonomous vehicle systems. Unlike other NPUs or neural ... Domain Knowledge: Demonstrated ability to drive complex technical projects in the AI/ ML and embedded processing domain. Highly Desired Skills and Experience (Pluses)… more
- NVIDIA (Santa Clara, CA)
- …learning compilers (eg Apache TVM, MLIR) + Strong experience in GPU kernel development and performance optimizations (especially using CUDA C/C++, cuTile, ... We are now looking for a Senior Deep Learning Software Engineer, FlashInfer. NVIDIA has...the team, you'll develop libraries, code generators, and GPU kernel technologies for NVIDIA's hardware architecture. This means designing… more
- NVIDIA (Santa Clara, CA)
- …solutions. Your problem-solving prowess will be crucial for AON's success with ML workloads. + Architect and Build High- Performance Platforms: Transform user ... our Developer Tools Always-On Profiling (AON) team as a Senior Software Architect, where you'll be pivotal in designing,...CUDA APIs, runtime, streams, kernels, and GPU architecture. + ML Ecosystem & Performance Analysis: Familiarity with… more
- NVIDIA (Santa Clara, CA)
- NVIDIA networking designs and manufactures high- performance networking equipment that enable the most powerful super computers in the largest data centers in the ... or RoCE (RDMA over Converged Ethernet) we make powerful ML /AI platforms possible. We believe in our people and...in large clusters even more performant. As a networking Sr . Solutions Architect at NVIDIA you will have agency… more
- NVIDIA (Santa Clara, CA)
- …for customer use cases, ensuring optimal end-to-end workflows and balanced accuracy- performance trade-offs. + Conduct deep GPU kernel -level profiling to ... and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative generative AI… more
- NVIDIA (Santa Clara, CA)
- …for an experienced engineer to triage customers' hardware platform issues and AI/ ML workloads in huge datacenters of rack-scale platforms, solve customer problems, ... solid programming skills, and experience with multi-GPU platforms. Expertise analyzing performance of distributed GPU-accelerated workloads is a plus. What you'll be… more
- NVIDIA (Santa Clara, CA)
- …of experience; or PhD degree with the thesis and top-tier publications in ML Systems, GPU architecture, or high- performance computing. + Strong programming ... distributed systems, deep learning theories. + Knowledgeable and passionate about performance engineering in ML frameworks (eg, PyTorch) and inference… more
- NVIDIA (Santa Clara, CA)
- NVIDIA's AI Developer Tools organization is seeking a Senior Research Engineer to join our Quality team, where we're building the definitive benchmarks and ... teams developing CUDA-focused AI tools to provide evaluation insights, identify performance gaps, and integrate novel capabilities (eg, RAG, profiling, web research)… more
- Deloitte (San Jose, CA)
- …critical to businesses. Your contributions can help clients improve financial performance , accelerate new digital ventures, and fuel growth through innovation. AI ... and business applications - delivering production-grade reliability, scalability, and performance . + Engineer core solution components directly, including data… more
- NVIDIA (Santa Clara, CA)
- …and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning Algorithms Engineer to improve innovative LLMs, VLMs, ... of use, compute and memory efficiency, and achieving the best accuracy- performance tradeoffs through software-hardware co-design. Your work will span multiple layers… more
- Microsoft Corporation (Mountain View, CA)
- …compiler engineering, programming language design, algorithmic innovation, AI, and high- performance computing. Our culture is highly collaborative and we regularly ... AI. + Collaborate broadly across multiple disciplines from hardware architects to ML developers. + Identify requirements, plan and design solutions, estimate effort,… more
- Broadcom (San Jose, CA)
- …designed for high performance computing and networking applications including AI and ML . This is driven by the growing need for high server bandwidth, highest ... of the next generation of Ethernet NIC solutions for AI/ ML and High performance computing applications. We...8+ years of experience in Linux Systems programming, Linux kernel , Linux Network Drivers, Linux Kernel Networking,… more