- NVIDIA (Santa Clara, CA)
- NVIDIA is looking for a Principal Engineer to join our Distributed Machine Learning team focused on GPU accelerated Apache Spark. Data scientists often apply machine ... (eg, XGBoost, RAPIDS cuML, PyTorch, and TensorFlow) have been extended for distributed training in a GPU-accelerated computer cluster. NVIDIA plans to work with… more
- Meta (Menlo Park, CA)
- **Summary:** The vision for the PyTorch Distributed team is to make PyTorch the industry leading machine learning framework for Efficient and Large Scale ... Distributed Training and Inference.This vision includes:- Enabling scaling to...competitors as the preferred framework across the industry for distributed training- Enable cutting edge research for the largest… more
- NVIDIA (NY)
- NVIDIA is looking to hire a Senior Distributed Acceleration Engineer to work on RAPIDS, a suite of open-source software libraries that accelerates end-to-end data ... and optimizing how RAPIDS can leverage multiple GPUs for distributed execution. The team's charter is to explore, develop,...is a great chance to take advantage of your distributed systems knowledge, CUDA C++, and Python programming skills.… more
- Electric Power Research Institute (Knoxville, TN)
- **Job Title:** Information & Communication Technology for Distributed Energy Resources (DER) Integration Technical Leader II **Location:** Knoxville, TN **Job ... Technical Leader II in the area of Information and Communication Technology for Distributed Energy Resources (DER) Integration. As the penetration of DER on the grid… more
- eightfold.ai (Santa Clara, CA)
- …What you will do + Design, develop, maintain, and support highly distributed systems in Eightfold's core infrastructure. + Build out large-scale software platforms ... standards. + Diagnose and solve problems that can arise in a complex distributed environment. What you bring + Proven Track Record: Demonstrable experience in… more
- Motion Recruitment Partners (St. Louis, MO)
- Software Developer (Mainframe & Distributed Systems) St. Louis, MO **Hybrid** Contract $69.5/hr - $78.31/hr Outstanding long-term contract opportunity! A well-known ... people succeed financially, apply today. **Title: Software Developer (Mainframe & Distributed Systems)** **Location:** Charlotte, NC or St. Louis, MO **Contract… more
- Amazon (Seattle, WA)
- Description Does working on a cutting edge database excite you? The Distributed SQL team at AWS is building revolutionary transactional database technology, ... shaping the future of databases at Amazon and beyond! Distributed data management is at the heart of AWS...scalable performance. We are building and operating large scale, distributed , fault tolerant data and transaction management solutions using… more
- Netflix (UT)
- …and pervasive destination for global internet entertainment. We are looking for Distributed Systems Engineers to help evolve and innovate our infrastructure. We are ... and infer business metadata across all datasets at Netflix. We own the Distributed Search and Lineage Infrastructure to efficiently discover data and track data… more
- Amazon (Cupertino, CA)
- …as well as stable diffusion, Vision Transformers and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create , build and tune distributed training solutions with Trn1. Experience training these large...using Python is a must. FSDP, Deepspeed and other distributed training libraries are central to this and extending… more
- Amazon (Seattle, WA)
- …well as Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a… more
- Amazon (Seattle, WA)
- …well as Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a… more
- Amazon (Seattle, WA)
- …well as Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a… more
- Netflix (UT)
- …and pervasive destination for global internet entertainment. We are looking for Distributed Systems Engineers to help evolve and innovate our infrastructure. We are ... Architecting and building a robust, scalable, and highly available distributed infrastructure. + Leading cross-functional initiatives and collaborating with… more
- Netflix (Los Gatos, CA)
- …and pervasive destination for global internet entertainment. We are looking for Distributed Systems Engineers to help evolve and innovate our infrastructure. We are ... Architecting and building a robust, scalable, and highly available distributed infrastructure. + Leading cross-functional initiatives and collaborating with… more
- Bosch (Sunnyvale, CA)
- …frameworks that take advantage of technologies in the field of reliable distributed computing. We work with internal partners of different Bosch business units ... CoRL. **Job Description** + Conduct advanced research and engineering in reliable distributed systems, especially using AI approaches, to enable Cloud and Edge… more
- Netflix (UT)
- …and pervasive destination for global internet entertainment. We are looking for distributed systems engineers to help evolve and innovate our infrastructure. We are ... Architecting and building a robust, scalable, and highly available distributed infrastructure. + Leading cross-functional initiatives and collaborating with… more
- Amazon (Seattle, WA)
- …Does working on a cutting edge serverless database excite you? The Distributed SQL team at AWS is building revolutionary transactional database technology, ... shaping the future of databases at Amazon and beyond! Distributed data management is at the heart of AWS...scalable performance. We are building and operating large scale, distributed , fault tolerant data and transaction management solutions using… more
- Netflix (Los Gatos, CA)
- …and pervasive destination for global internet entertainment. We are looking for distributed systems engineers to help evolve and innovate our infrastructure. We are ... Architecting and building a robust, scalable, and highly available distributed infrastructure. + Leading cross-functional initiatives and collaborating with… more
- MyFlorida (Tallahassee, FL)
- DISTRIBUTED COMPUTER SYSTEMS SPECIALIST - 37020831 Date: Nov 26, 2024 The State Personnel System is an E-Verify employer. For more information click on our E-Verify ... . Requisition No: 832803 Agency: Environmental Protection Working Title: DISTRIBUTED COMPUTER SYSTEMS SPECIALIST - 37020831 Pay Plan: Career Service… more
- NVIDIA (Santa Clara, CA)
- …level experienced software professionals to help build the next generation of distributed backends for premier Deep Learning frameworks like PyTorch, JAX and ... of runtime systems that underlay the foundation of all distributed GPU computing at NVIDIA What We Need To...and C++ programming + Strong background with parallel and distributed programming, preferably on GPUs + Hands-on development skills… more