  • Senior Distributed Storage Engineer

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Senior Software Engineer to develop a distributed storage service for AI/ML. The goal is to craft a reliable, scalable, and efficient ... for an engineer who has a deep understanding of distributed systems, outstanding design skills, and a track record in building and delivering large-scale distributed services. What you will be doing: + Leading… more
    NVIDIA (12/20/24)
  • Principal Engineer, Distributed Machine…

    NVIDIA (Santa Clara, CA)
    NVIDIA is looking for a Principal Engineer to join our Distributed Machine Learning team focused on GPU-accelerated Apache Spark. Data scientists often apply machine ... (e.g., XGBoost, RAPIDS cuML, PyTorch, and TensorFlow) have been extended for distributed training in a GPU-accelerated computer cluster. NVIDIA plans to work with… more
    NVIDIA (11/27/24)
  • Research Engineer, Pytorch Distributed

    Meta (Menlo Park, CA)
    **Summary:** The vision for the PyTorch Distributed team is to make PyTorch the industry-leading machine learning framework for Efficient and Large Scale ... Distributed Training and Inference. This vision includes: - Enabling scaling to...competitors as the preferred framework across the industry for distributed training - Enable cutting-edge research for the largest… more
    Meta (10/24/24)
  • Principal Software Engineer (GCP,…

    CVS Health (Hartford, CT)
    …advanced analytics capabilities (predictive models, LLMs, vector search) into complex distributed systems built within the Google Cloud Platform. You will work ... products, leveraging your expertise in cloud technologies, software architecture, and distributed system design. You will join a high-caliber engineering team… more
    CVS Health (01/05/25)
  • Postdoctoral Appointee - Distributed

    Sandia National Laboratories (Albuquerque, NM)
    …by job classification. What Your Job Will Be Like: The Renewable and Distributed Systems Integration Department has a postdoctoral research opening in the area of ... Distributed Systems Integration at our Albuquerque, New Mexico facility....called on to: + Conduct research on renewable and distributed energy systems and their integration with electrical power… more
    Sandia National Laboratories (12/20/24)
  • Software Engineering Manager, Distributed

    NVIDIA (Santa Clara, CA)
    …an experienced software engineering manager to lead the development of NVIDIA's distributed runtime stack for large-scale distributed computing that attempts to ... autonomous vehicles and countless others. Our team develops foundational distributed computing software that greatly simplifies development of such applications!… more
    NVIDIA (12/11/24)
  • Distributed Systems Engineer (L4) - Data…

    Netflix (UT)
    …and pervasive destination for global internet entertainment. We are looking for distributed systems engineers to help evolve and innovate our infrastructure. We are ... and infer business metadata across all datasets at Netflix. We own the Distributed Search and Lineage Infrastructure to efficiently discover data and track data… more
    Netflix (12/14/24)
  • Distributed Systems Engineer (L5) - Data…

    Netflix (UT)
    …and pervasive destination for global internet entertainment. We are looking for Distributed Systems Engineers to help evolve and innovate our infrastructure. We are ... and infer business metadata across all datasets at Netflix. We own the Distributed Search and Lineage Infrastructure to efficiently discover data and track data… more
    Netflix (12/06/24)
  • Information & Communication Technology…

    Electric Power Research Institute (Knoxville, TN)
    **Job Title:** Information & Communication Technology for Distributed Energy Resources (DER) Integration Technical Leader II **Location:** Knoxville, TN **Job ... Technical Leader II in the area of Information and Communication Technology for Distributed Energy Resources (DER) Integration. As the penetration of DER on the grid… more
    Electric Power Research Institute (11/23/24)
  • Staff Software Engineer - Distributed

    eightfold.ai (Santa Clara, CA)
    …What you will do + Design, develop, maintain, and support highly distributed systems in Eightfold's core infrastructure. + Build out large-scale software platforms ... standards. + Diagnose and solve problems that can arise in a complex distributed environment. What you bring + Proven Track Record: Demonstrable experience in… more
    eightfold.ai (11/23/24)
  • Software Development Engineer, AWS…

    Amazon (Seattle, WA)
    Description: Does working on a cutting-edge database excite you? The Distributed SQL team at AWS is building revolutionary transactional database technology, ... shaping the future of databases at Amazon and beyond! Distributed data management is at the heart of AWS...scalable performance. We are building and operating large-scale, distributed, fault-tolerant data and transaction management solutions using… more
    Amazon (10/10/24)
  • Distributed Systems Engineer, L5

    Netflix (UT)
    …and pervasive destination for global internet entertainment. We are looking for Distributed Systems Engineers to help evolve and innovate our infrastructure. We are ... Architecting and building a robust, scalable, and highly available distributed infrastructure. + Leading cross-functional initiatives and collaborating with… more
    Netflix (12/29/24)
  • Sr. Software Engineer- AI/ML, AWS Neuron…

    Amazon (Seattle, WA)
    …well as Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a… more
    Amazon (12/13/24)
  • Sr. Software Engineer- AI/ML, AWS Neuron…

    Amazon (Cupertino, CA)
    …well as Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a… more
    Amazon (12/11/24)
  • Distributed Systems Engineering - Intern

    Bosch (Sunnyvale, CA)
    …frameworks that take advantage of technologies in the field of reliable distributed computing. We work with internal partners of different Bosch business units ... RSS, NeurIPS and CoRL. **Job Description** + Conduct advanced engineering in distributed systems and middleware frameworks to address practical challenges found in… more
    Bosch (12/07/24)
  • Distributed Systems Engineer, Gen AI L4

    Netflix (Los Gatos, CA)
    …and pervasive destination for global internet entertainment. We are looking for distributed systems engineers to help evolve and innovate our infrastructure. We are ... Architecting and building a robust, scalable, and highly available distributed infrastructure. + Leading cross-functional initiatives and collaborating with… more
    Netflix (12/06/24)
  • Software Engineer - AI/ML, AWS Neuron…

    Amazon (Seattle, WA)
    …well as Stable Diffusion, Vision Transformers (ViT) and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a… more
    Amazon (12/01/24)
  • Software Engineer- AI/ML, AWS Neuron…

    Amazon (Cupertino, CA)
    …as well as Stable Diffusion, Vision Transformers and many more. The ML Distributed Training team works side by side with chip architects, compiler engineers and ... runtime engineers to create, build and tune distributed training solutions with Trn1. Experience training these large...using Python is a must. FSDP, DeepSpeed and other distributed training libraries are central to this and extending… more
    Amazon (11/28/24)
  • Distributed Systems Engineer, Gen AI L5

    Netflix (Los Gatos, CA)
    …and pervasive destination for global internet entertainment. We are looking for Distributed Systems Engineers to help evolve and innovate our infrastructure. We are ... Architecting and building a robust, scalable, and highly available distributed infrastructure. + Leading cross-functional initiatives and collaborating with… more
    Netflix (11/05/24)
  • Research Scientist for Distributed

    Bosch (Sunnyvale, CA)
    …frameworks that take advantage of technologies in the field of reliable distributed computing. We work with internal partners of different Bosch business units ... CoRL. **Job Description** + Conduct advanced research and engineering in reliable distributed systems, especially using AI approaches, to enable Cloud and Edge… more
    Bosch (10/16/24)