  • Production Systems Engineer

    Meta (Menlo Park, CA)
    …platforms, all the way to mass production and deployment. **Required Skills:** Production Systems Engineer, AI Systems Responsibilities: ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production...Inference Accelerator (MTIA) program as a part of the AI/ML initiatives supporting large scale AI Training… more
    Meta (11/06/24)
  • Scientific Computing Engineer (Machine…

    Cadence Design Systems, Inc. (San Jose, CA)
    …who want to make an impact on the world of technology. Cadence Design Systems is a world leader in providing computational software for all aspects of intelligent ... R&D team working on the emerging boundary of scientific computing and machine learning/AI, with specific emphasis in the analog electronic circuit analysis area. The… more
    Cadence Design Systems, Inc. (10/23/24)
  • Systems Software Engineer

    NVIDIA (Santa Clara, CA)
    …boundaries of what's possible in AI and cloud computing. As a versatile System Software Engineer - AI and Cloud, you will be part of a team of dedicated ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...solid understanding of API design principles for building scalable, production-ready inference systems. Ways to stand out… more
    NVIDIA (12/04/24)
  • AI/HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. Lead ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...interconnect with minimal latency. To improve performance of these systems we constantly look for opportunities across stack: network… more
    Meta (10/27/24)
  • AI/HPC Systems Performance…

    Meta (Menlo Park, CA)
    …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI/HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network… more
    Meta (10/31/24)
  • Partner Engineer, Generative AI

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking a Partner Engineer to join Meta's AI Partner Engineering team, a highly technical team that works with strategic partners, machine ... evangelize Meta's AI design patterns and best practices. **Required Skills:** Partner Engineer, Generative AI Responsibilities: 1. Apply relevant AI and… more
    Meta (11/06/24)
  • Senior AI Engineer

    Capital One (San Jose, CA)
    …to improve the performance - scalability, cost, latency, throughput - of large scale production AI systems. + Contribute to the technical vision and ... (61069), United States of America, SAN JOSE, California Senior AI Engineer At Capital One, we are...At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For… more
    Capital One (12/04/24)
  • Principal, Software Engineer - Gen…

    Walmart (Sunnyvale, CA)
    …**RAG frameworks** to lead the design, development, and deployment of advanced AI systems. This role involves architecting scalable solutions, integrating ... redefine customer experiences. We are seeking a **Principal, Software Engineer** with deep expertise in **Generative AI**.... 2. **Architecture & Scalability:** + Architect scalable, distributed AI systems with a focus on performance,… more
    Walmart (12/21/24)
  • Senior Site Reliability Engineer

    NVIDIA (Santa Clara, CA)
    …and blameless postmortems + Be part of an on call rotation to support production systems + Write and review code, develop documentation and capacity plans, ... automation to improve researchers productivity. As a Site Reliability Engineer, you are responsible for the big picture of...Deployment, BCM, Terraform. + Understanding of fast, distributed storage systems like Lustre and GPFS for AI/HPC… more
    NVIDIA (12/25/24)
  • Principal Staff Machine Learning Engineer

    LinkedIn (Sunnyvale, CA)
    …Machine Learning and Artificial Intelligence Preferred Qualifications Experience in bringing large scale AI systems to production. PhD in Computer Science, ... within FAIT and across the company to realize these AI innovations. As a Principal Staff Engineer...define the bar for quality and efficiency of software systems while balancing business impact, operational impact and cost… more
    LinkedIn (10/20/24)
  • Staff Software Engineer, Futures…

    Intuit (Mountain View, CA)
    …AutoML modeling, etc + Skilled in evaluating and monitoring the performance of AI technology in production and making necessary adjustments to ensure optimal ... the evolving needs of our customers. As a Staff Engineer within the Intuit Futures organization, you will play...expertise to ensure efficient and effective outcomes. + Launch AI integrations in production and evaluate their… more
    Intuit (11/24/24)
  • Senior High-Performance AI Training…

    NVIDIA (Santa Clara, CA)
    We are now looking for a Senior High-Performance AI Training Engineer: NVIDIA is seeking senior engineers who are obsessed with performance analysis and ... help us squeeze every last clock cycle out of AI training, the workload driving the design and construction...and construction of the largest and most powerful compute systems in the world. This role offers the opportunity… more
    NVIDIA (12/11/24)
  • Machine Learning Engineer - AI

    eightfold.ai (Santa Clara, CA)
    …for Machine Learning & AI + Implement best practices for building AI-enabled products + Develop AI-based systems for Natural Language Processing ... is NOT a remote position) About Eightfold Eightfold AI is the industry leader in AI-powered...frameworks (scikit-learn, tensorflow, torch, etc.) + Experience with implementing production machine learning systems and working with… more
    eightfold.ai (11/13/24)
  • Staff Software Engineer, AI

    Google (Mountain View, CA)
    …and operability, and follow production principles in maintaining customer-facing production systems. + Influence, mentor, and coach a distributed team ... Model (LLM) or Machine Learning (ML) Infrastructure, and working with Generative AI/ML technologies or similar. Preferred qualifications: + Master's degree or PhD in… more
    Google (12/13/24)
  • Production Systems Engineer

    Meta (Menlo Park, CA)
    **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are ... cycle of servers in production. **Required Skills:** Production Systems Engineer, Sustaining Responsibilities:...hardware at scale 9. Experience in deploying and productionizing AI/HPC systems and/or related components at scale… more
    Meta (10/24/24)
  • Senior Lead Software Engineer - AI

    JPMorgan Chase (Palo Alto, CA)
    …and applications. For this role you will build and deploy cutting edge Generative AI RAG and Agentic systems to augment processes for the software development ... deliver top-notch technology products. As a Senior Lead Software Engineer at JPMorgan Chase within the Engineer's...on distributed compute platforms + Deploy and operate Generative AI benchmarking systems to advance the firm's… more
    JPMorgan Chase (12/23/24)
  • Staff Software Engineer - FullStack Web…

    eightfold.ai (Santa Clara, CA)
    About Eightfold Eightfold is rapidly growing and revolutionizing talent management with AI. We are seeking exceptional engineers to join our team and help us build ... the next generation of our AI-powered talent platform. About the Job As the most...About the Job As the most senior-level Staff Software Engineer on the team, you will own major roadmap… more
    eightfold.ai (12/10/24)
  • Principal Software Engineer - Enterprise…

    NVIDIA (Santa Clara, CA)
    …generative AI platform and products + Develop platform and systems enabling unified experience across applications and driving insights for end-to-end user ... people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An...shaping the architecture, development, and scaling of our software systems. This role will give an opportunity to collaborate… more
    NVIDIA (12/19/24)
  • Machine Learning Engineer, Security…

    Cisco (San Jose, CA)
    …window is expected to close on January 3rd, 2025. Who We Are The Cisco Security AI team delivers AI products and platform for all Cisco Secure products and ... customers secure by simplifying security with zero compromise using AI and Machine Learning. Who You Are You are...Who You Are You are a passionate Machine Learning Engineer who is building their career through successfully building,… more
    Cisco (10/30/24)
  • Software Engineer, SystemML - AI

    Meta (Menlo Park, CA)
    …space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer, SystemML - AI Networking Responsibilities: 1. Enabling reliable ... learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems, AI infrastructure, high performance computing, performance optimizations,… more
    Meta (12/20/24)