• Production Systems Engineer

    Meta (Menlo Park, CA)
    production issue triage, rolling out new features in FW/Driver. **Required Skills:** Production Systems Engineer , AI Systems Responsibilities: ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI /ML initiatives supporting large scale AI more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    …health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems (NetZero) Responsibilities: 1. ... **Summary:** Meta is seeking a Production Systems Engineer to...full system technologies, full system lifecycle 16. Experience supporting AI /HPC systems and/or related components at scale.… more
    Meta (07/25/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    …health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems Responsibilities: 1. Interface ... **Summary:** Meta is seeking a Production Systems Engineer to...life cycle for computer system products 17. Experience supporting AI /HPC systems and/or related components at scale… more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • AI /HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI /HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network… more
    Meta (08/01/24)
    - Save Job - Related Jobs - Block Source
  • Engineer , Silicon Systems AI

    Micron Technology, Inc. (San Jose, CA)
    …or related field and 24 months of experience in the job offered or in an Engineer , Silicon Systems AI - related occupation. Position requires 24 months of ... System Design. 2. C/C++/Python in a Linux development environment. 3. AI model deployment in production environment. 4. AI algorithms to modify silicon… more
    Micron Technology, Inc. (09/23/24)
    - Save Job - Related Jobs - Block Source
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …availability of the platform, as well as applying SRE principles to improve production systems and optimize service SLOs. Additionally, collaboration with our ... Joining NVIDIA's AI Efficiency Team means contributing to the infrastructure...foster innovation. We are seeking a Senior Site Reliability Engineer (SRE) to join our team. You'll be instrumental… more
    NVIDIA (09/24/24)
    - Save Job - Related Jobs - Block Source
  • Principal Staff Machine Learning Engineer

    LinkedIn (Sunnyvale, CA)
    …Machine Learning and Artificial Intelligence Preferred QualificationsExperience in bringing large scale AI systems to production .PhD in Computer Science, ... within FAIT and across the company to realize these AI innovations. As a Principal Staff Engineer ...define the bar for quality and efficiency of software systems while balancing business impact, operational impact and cost… more
    LinkedIn (07/21/24)
    - Save Job - Related Jobs - Block Source
  • Senior High-Performance AI Training…

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior High-Performance AI Training Engineer : NVIDIA is seeking senior engineers who are obsessed with performance analysis and ... help us squeeze every last clock cycle out of AI training, the workload driving the design and construction...and construction of the largest and most powerful compute systems in the world. This role offers the opportunity… more
    NVIDIA (09/11/24)
    - Save Job - Related Jobs - Block Source
  • Machine Learning Engineer - AI

    eightfold.ai (Santa Clara, CA)
    …for Machine Learning & AI + Implement best practices for building AI -enabled products + Develop AI -based systems for Natural Language Processing ... is NOT a remote position ) About Eightfold Eightfold AI is the industry leader in AI -powered...frameworks (scikit-learn, tensorflow, torch, etc.) + Experience with implementing production machine learning systems and working with… more
    eightfold.ai (08/14/24)
    - Save Job - Related Jobs - Block Source
  • Senior Platform Software Engineer

    NVIDIA (Santa Clara, CA)
    …can perceive and understand the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company and establish teams with the ... productivity required for strong scaling for HPC and generative AI workload.Scale out is inherent to design of this...We are looking for a strong technical platform software engineer focused on PCIe firmware, you will own PCIe… more
    NVIDIA (09/05/24)
    - Save Job - Related Jobs - Block Source
  • Senior Software Engineer , Cybersecurity…

    NVIDIA (Santa Clara, CA)
    …which enable cybersecurity ecosystems in their creation of detection, alerting, and observability systems . We are seeking a Senior Software Engineer to help ... by great technology-and outstanding people! Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU… more
    NVIDIA (09/20/24)
    - Save Job - Related Jobs - Block Source
  • Software Engineer , AI Innovation…

    Google (Mountain View, CA)
    AI /ML models, ensuring their robustness, scalability, and successful integration into production systems . + Work closely with team based in other countries, ... on and is growing every day. As a software engineer , you will work on a specific project critical...play a pivotal role in shaping how we bridge AI expertise from research to the product team. This… more
    Google (09/28/24)
    - Save Job - Related Jobs - Block Source
  • Principal Software Engineer - Employee…

    NVIDIA (Santa Clara, CA)
    Engineer to lead our efforts in designing, developing, and deploying generative AI systems and productivity solutions. As a key leader in our technology ... 12+ years of proven experience in software engineering experience on large-scale production systems + Proven expertise of performance, reliability in… more
    NVIDIA (09/19/24)
    - Save Job - Related Jobs - Block Source
  • Software Engineer , SystemML - AI

    Meta (Menlo Park, CA)
    …space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - AI Networking Responsibilities: 1. Enabling reliable ... learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, performance optimizations,… more
    Meta (09/04/24)
    - Save Job - Related Jobs - Block Source
  • Machine Learning Engineer , Security…

    Cisco (San Jose, CA)
    …window is expected to close on October 18, 2024. Who We Are The Cisco Security AI team delivers AI products and platform for all Cisco Secure products and ... customers secure by simplifying security with zero compromise using AI and Machine Learning. Who You Are You are...Who You Are You are a passionate Machine Learning Engineer who is building their career through successfully building,… more
    Cisco (10/03/24)
    - Save Job - Related Jobs - Block Source
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are ... cycle of servers in production . **Required Skills:** Production Systems Engineer , Sustaining Responsibilities:...hardware at scale 9. Experience in deploying and productionizing AI /HPC systems and/or related components at scale… more
    Meta (07/19/24)
    - Save Job - Related Jobs - Block Source
  • AI Cloud k8s Infrastructure Engineer

    NVIDIA (Santa Clara, CA)
    We are looking for a highly motivated AI cloud k8s infrastructure engineer to join our team in the fastest growing organization at NVIDIA. There is an excellent ... new ways to improve availability of our Cloud Native AI Platform by automating critical processes on the multiple...at problem solving and complexity analysis of the distributed systems + Working experience with the observability tooling -… more
    NVIDIA (09/25/24)
    - Save Job - Related Jobs - Block Source
  • Senior AI /ML Ops Engineer

    Tarana Wireless (Milpitas, CA)
    …and help us to consistently deliver high-quality, high-reliability, cost-effective machine learning and AI systems as part of various products. You will guide ... preferred + 5-12 years of experience building large scale ML/ AI models and systems Knowledge, Skills and...markets, using either licensed or unlicensed spectrum. G1 started production in mid 2021 and has now been installed… more
    Tarana Wireless (09/05/24)
    - Save Job - Related Jobs - Block Source
  • Staff Engineer - AI /Machine…

    LinkedIn (Sunnyvale, CA)
    …roles, our engineers work on projects from ideation to implementation. As a Staff Engineer at LinkedIn, you will be responsible for the development and training of ... of samples for statistical modeling, data mining, recommendation solutions * Write production quality code and influence the next generation of LinkedIn's system *… more
    LinkedIn (09/29/24)
    - Save Job - Related Jobs - Block Source
  • Sr. Hardware Dev Engineer (AWS Generative…

    Amazon (Cupertino, CA)
    Description Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the cloud for AI training and inference? Want to ... delivering continuous price performance improvements in the cloud for AI model training for multi billion variable LLMs? Come...the current customer experience as well as developing improved systems for future designs. You will work directly with… more
    Amazon (07/13/24)
    - Save Job - Related Jobs - Block Source