- Meta (Burlingame, CA)
- …in machine learning and software engineering. **Required Skills:** Software Engineer, Systems ML - Edge Inference Responsibilities: 1. Contribute to ... EMG Wristbands, and other upcoming products). This role is focused on efficient ML inference via use of edge hardware accelerators, including NPUs and DSPs. The…
- Amazon (Seattle, WA)
- …and the Trn1 and Inf2 servers that use them. This role is for a software engineer in the Neuron Inference (ML Apps) team for AWS Neuron. This role is ... Stable Diffusion, Vision Transformers and many more. The Neuron Inference team works side by side with compiler engineers...running on the AWS Trainium and Inferentia silicon. Strong software development using C++/Python and ML knowledge…
- Amazon (Seattle, WA)
- …the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role ... and runtime engineers to create, build and tune distributed inference solutions with Trn1. Experience optimizing inference ...Inferentia silicon and the Trn1, Inf1 servers. Strong software development using C++/Python and ML knowledge…
- NVIDIA (Santa Clara, CA)
- …upon which every new AI-powered application is built. We are seeking a Principal Engineer for NVIDIA Inference Microservices (NIMs). The right person for this ... You will apply your deep technical expertise in AI Inference and containerized software to design an...traceability. What you'll be doing: + You are the engineer's engineer demonstrating good engineering practice and…
- Bloomberg (New York, NY)
- …hybrid cloud environment. We are founding members of the KServe project to standardize ML Inference within the Kubernetes ecosystem. As part of that, we ... first-class support for a variety of workloads such as ML training job and inference services, Spark,...Kubeflow/KServe, MLflow, SageMaker. + Experience working with GPU compute software and hardware. + Ability to identify and perform…
- NVIDIA (Santa Clara, CA)
- As a System Software Engineer (LLM Inference & Performance Optimization) you will be at the heart of our AI advancements. Our team is dedicated to pushing ... on GPU. + Experience with low-latency optimizations and real-time system constraints for ML inference. The base salary range is 180,000 USD - 339,250 USD.…
- Amazon (New York, NY)
- …inference engines. You will leverage advanced hardware, innovative software architecture, and distributed computing techniques to enable breakthrough research ... technologies, including Amazon's most expansive multimodal Large Language Models. Our inference engines power these initiatives. Key job responsibilities You will…
- NVIDIA (Santa Clara, CA)
- …alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer to develop and scale up our automated inference and ... on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM, TRT Model...design. + Strong proficiency in Python, PyTorch, and related ML tools (e.g. HuggingFace). + Strong algorithms and programming…
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a ... apply to Senior Engineering positions in the Deep Learning Inference TensorRT software team. What you'll be...working with TensorRT, PyTorch, TensorFlow, ONNX Runtime or other ML frameworks. NVIDIA is widely considered to be one…
- NVIDIA (CA)
- We are now looking for a Senior System Software Engineer to work on Triton Inference Server! NVIDIA is hiring software engineers for its GPU-accelerated ... doing: In this role, you will develop open source software to serve inference of trained AI...design. + Experience with high scale distributed systems and ML systems Ways to stand out from the crowd:…
- JPMorgan Chase (McLean, VA)
- …Chase's research teams, ensuring alignment with business goals. As a Senior Lead Software - Machine Learning Engineer at JPMorgan Chase within the Corporate ... Generative AI. **Job responsibilities** + Architects and implements distributed ML infrastructure, including inference, training, scheduling, orchestration, and…
- Meta (Columbus, OH)
- …generation hardware software codesign for AI domain specific problems. **Required Skills:** Software Engineer, Systems ML - Frameworks / Compilers / ... be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of...a highly flexible platform to train & serve new DL/ML model architectures, combined with auto-tuned high performance for…
- Google (Sunnyvale, CA)
- …UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google Cloud's needs ... provides ML infrastructure customers with large-scale, cloud-based access to Google's ML supercomputers to run training and inference workloads using PyTorch…
- Amazon (Cupertino, CA)
- …customer AWS Trainium and Inferentia silicon and the Trn1, Inf1 servers. Strong software development using C++/Python and ML knowledge are both critical to this ... Description AWS Neuron is the complete software stack for the AWS Inferentia and Trainium...You will develop and extend support for the leading ML frameworks, delivering an outstanding user experience for PyTorch…
- Amazon (Seattle, WA)
- …and Inf1 servers that use them. This role is for a senior software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role ... will help lead the efforts building distributed training and inference support into PyTorch, TensorFlow using XLA and the...Inferentia silicon and the Trn1, Inf1 servers. Strong software development and ML knowledge are both…
- Meta (Menlo Park, CA)
- …do deep technical work. Our work is open-source, cutting-edge, and industry-leading. **Required Skills:** Software Engineer, Systems ML - PyTorch Compiler / ... GPU performance and writing high-performance CUDA kernels. 14. Research and software engineer experience demonstrated via fellowships, patents, internships, or…
- Walmart (Bentonville, AR)
- …do It's an exciting time to join our Walmart journey, we are seeking a Staff Software Engineer. At Walmart, technology empowers us and people to lead. You will ... role requires close collaboration with Computer Vision data scientists, software engineers, ML Ops, and DevOps teams. Together, you will design and construct inference pipelines that leverage a combination of AI and…
- Amazon (Cupertino, CA)
- Description We are seeking an experienced software engineer with low-level latency networking or interconnect expertise to optimize customer experience by ... and TPUs. This role is on the forefront of AI/ML, we spend a good deal of the day...customers that uses AWS for large models training and inference will be using your software, and…
- CACI International (Annapolis Junction, MD)
- AI/ML Senior Software Engineer Job Category: Engineering Time Type: Full time Minimum Clearance Required to Start: TS/SCI with Polygraph Employee Type: ... experience in developing Artificial Intelligence (AI) and Machine Learning (ML) algorithms. This is an opportunity to work with...technical degree + 7+ years of experience as a software engineer + Understanding of principles in…
- Amazon (Seattle, WA)
- …Spark MLlib, TensorFlow, developing machine learning models - Experience with ML model training/inference optimization Preferred Qualifications - 3+ years ... while optimizing Basic Qualifications - 3+ years of non-internship professional software development experience - 2+ years of non-internship design or architecture…