• Production Systems Engineer

    Meta (Menlo Park, CA)
    …to track the health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems Responsibilities: ... **Summary:** Meta is seeking a Production Systems Engineer to...3. Develop test framework for large-scale test automation inside fleet during product development and after mass production more
    Meta (05/31/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    …to track the health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems Responsibilities: ... **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are… more
    Meta (06/01/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    production issue triage, rolling out new features in FW/Driver. **Required Skills:** Production Systems Engineer , AI Systems Responsibilities: 1. ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production...Support new AI platform introduction in to Meta fleet by driving scale up (eg NVlink, XGMI) and… more
    Meta (07/04/24)
    - Save Job - Related Jobs - Block Source
  • Production Engineer

    Meta (Menlo Park, CA)
    …experts. Relevant industry experience is important (Software Engineer , Site Reliability Engineer (SRE), Systems Engineer ,, DevOps Engineer , Network ... **Summary:** Production Engineers at Meta are hybrid software/ systems...we are always learning.This position is full-time. **Required Skills:** Production Engineer Responsibilities: 1. Own back-end services… more
    Meta (06/13/24)
    - Save Job - Related Jobs - Block Source
  • Network Production Engineer

    Meta (Menlo Park, CA)
    …click "Apply to Job" online on this web page. **Required Skills:** Network Production Engineer Responsibilities: 1. Build experience with Backbone, Data Center, ... center and backbone network. 3. Develop optimized network monitoring systems . 4. Design, and deploy new network architectures. 5....9. Test new network products and pilot them in production DC networks before a mass rollout. 10. Develop… more
    Meta (07/06/24)
    - Save Job - Related Jobs - Block Source
  • SiteOps Global Systems Engineer

    Meta (Fremont, CA)
    **Summary:** Meta is seeking a forward thinking, experienced Data Center Systems Engineer to join the Data Center Site Operations team. Our data centers, and the ... diagnostic tooling which enables quality and efficient delivery of production servers. They should be able to perform deep... systems is required. **Required Skills:** SiteOps Global Systems Engineer Responsibilities: 1. Identify and root… more
    Meta (05/24/24)
    - Save Job - Related Jobs - Block Source
  • Software Development Engineer - SDE II,…

    Amazon (Los Gatos, CA)
    Description eero is looking for an experienced software development engineer on the Connectivity - Systems software team. Responsibilities will include improving ... the management plane, implementing new customer features, improving our automation and fleet observability. This engineer will focus on L3+ in the network… more
    Amazon (06/26/24)
    - Save Job - Related Jobs - Block Source
  • Hardware Development Engineer , Storage…

    Amazon (Cupertino, CA)
    …servers solutions for the AWS environment * You will support launching our servers into production and operating our fleet of servers About the team Why AWS ... with an interdisciplinary team to execute product designs from concept to production including design, development, validation, and the deployment of servers at… more
    Amazon (06/12/24)
    - Save Job - Related Jobs - Block Source
  • Staff Site Reliability Engineer

    General Motors (Palo Alto, CA)
    …cloud software - to deliver seamless solutions to our customers, including fleet management, energy optimization, transportation logistics, safety systems , and ... journey to pioneer next-generation software solutions tailored for commercial fleet owners and their drivers, ranging from small and...SRE and Observability platform to monitor health of our production system and provide a holistic view of the… more
    General Motors (04/20/24)
    - Save Job - Related Jobs - Block Source
  • Software Engineer , SystemML - Scaling…

    Meta (Menlo Park, CA)
    …training. In other words, nearly every distributed GPU-based ML workload in Meta Production goes through the SW stack the team owns.At the high level, the ... and innovations to leverage our large-scale GPU training and inference fleet through an observable, reliable and high-performance distributed AI/GPU communication… more
    Meta (04/12/24)
    - Save Job - Related Jobs - Block Source
  • Senior Compliance Engineer

    FreeWire Technologies (Newark, CA)
    …charging when and where they need it. We build battery-based energy storage systems that provide ultrafast charging for EVs and clean, affordable, resilient power ... and customers from Fortune 100 companies including leading corporate, utility, retail, and fleet operators. To meet our ambitious goals, we are growing rapidly and… more
    FreeWire Technologies (07/06/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Software Engineer - Compute Infra SRE

    LinkedIn (Mountain View, CA)
    …relevant work experience. . Experience in building and running distributed large-scale systems in production (24x7). . Experience with Kubernetes controller ... valued. Suggested Skills: . API Development . Kubernetes Infrastructure . Distributed Large-Scale Systems in Production You will Benefit from our Culture: We… more
    LinkedIn (06/16/24)
    - Save Job - Related Jobs - Block Source
  • Distinguished Engineer - Infra Automation

    ServiceNow, Inc. (Santa Clara, CA)
    …**The Role** Design and develop the services and tools to manage our massive fleet of SaaS instances on our infrastructure and scale it to meet our business ... with automating their lifecycle. + Experience building and operating massive distributed systems for mission critical functions. + "Hands on the keyboard" coding in… more
    ServiceNow, Inc. (06/01/24)
    - Save Job - Related Jobs - Block Source
  • Research Scientist, Systems ML - SW/HW…

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking a software engineer to join our AI & Systems Co-Design team to drive the definition of our next-generation compute and storage ... path-finding, roadmap definition and co-design activities to deliver new capabilities and efficient systems for our fleet . They will also work with external… more
    Meta (07/03/24)
    - Save Job - Related Jobs - Block Source
  • SDE III, DBS Redshift

    Amazon (East Palo Alto, CA)
    …metric gathering technologies that help us understand the current state of the production fleet . You will build out automation of critical operational functions ... Redshift. We are building the next generation data warehouse systems , creating an impact on its hundreds of thousands...analyze all your data seamlessly. As a Software Development Engineer with Amazon Redshift, you will have the opportunity… more
    Amazon (06/30/24)
    - Save Job - Related Jobs - Block Source