- Meta (Menlo Park, CA)
- … production issue triage, rolling out new features in FW/Driver. **Required Skills:** Production Systems Engineer , AI Systems Responsibilities: ... **Summary:** Meta is seeking a Systems Engineer to join our Release to Production (RTP) team working on AI /ML initiatives supporting large scale AI … more
- Meta (Menlo Park, CA)
- …health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems (NetZero) Responsibilities: 1. ... **Summary:** Meta is seeking a Production Systems Engineer to...full system technologies, full system lifecycle 16. Experience supporting AI /HPC systems and/or related components at scale.… more
- Meta (Menlo Park, CA)
- …health and lifecycle of servers in production . **Required Skills:** Production Systems Engineer , Fleet AI Systems Responsibilities: 1. Interface ... **Summary:** Meta is seeking a Production Systems Engineer to...life cycle for computer system products 17. Experience supporting AI /HPC systems and/or related components at scale… more
- Meta (Menlo Park, CA)
- …fabric and host networking, comms lib and scheduling infrastructure. **Required Skills:** AI /HPC Systems Performance Engineer Responsibilities: 1. Active ... **Summary:** Meta's AI Training and Inference Infrastructure is growing exponentially...a loss-less fabric interconnect. To improve performance of these systems we constantly look for opportunities across stack: network… more
- Micron Technology, Inc. (San Jose, CA)
- …or related field and 24 months of experience in the job offered or in an Engineer , Silicon Systems AI - related occupation. Position requires 24 months of ... System Design. 2. C/C++/Python in a Linux development environment. 3. AI model deployment in production environment. 4. AI algorithms to modify silicon… more
- NVIDIA (Santa Clara, CA)
- …availability of the platform, as well as applying SRE principles to improve production systems and optimize service SLOs. Additionally, collaboration with our ... Joining NVIDIA's AI Efficiency Team means contributing to the infrastructure...foster innovation. We are seeking a Senior Site Reliability Engineer (SRE) to join our team. You'll be instrumental… more
- LinkedIn (Sunnyvale, CA)
- …Machine Learning and Artificial Intelligence Preferred QualificationsExperience in bringing large scale AI systems to production .PhD in Computer Science, ... within FAIT and across the company to realize these AI innovations. As a Principal Staff Engineer ...define the bar for quality and efficiency of software systems while balancing business impact, operational impact and cost… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior High-Performance AI Training Engineer : NVIDIA is seeking senior engineers who are obsessed with performance analysis and ... help us squeeze every last clock cycle out of AI training, the workload driving the design and construction...and construction of the largest and most powerful compute systems in the world. This role offers the opportunity… more
- eightfold.ai (Santa Clara, CA)
- …for Machine Learning & AI + Implement best practices for building AI -enabled products + Develop AI -based systems for Natural Language Processing ... is NOT a remote position ) About Eightfold Eightfold AI is the industry leader in AI -powered...frameworks (scikit-learn, tensorflow, torch, etc.) + Experience with implementing production machine learning systems and working with… more
- NVIDIA (Santa Clara, CA)
- …can perceive and understand the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company and establish teams with the ... productivity required for strong scaling for HPC and generative AI workload.Scale out is inherent to design of this...We are looking for a strong technical platform software engineer focused on PCIe firmware, you will own PCIe… more
- NVIDIA (Santa Clara, CA)
- …which enable cybersecurity ecosystems in their creation of detection, alerting, and observability systems . We are seeking a Senior Software Engineer to help ... by great technology-and outstanding people! Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU… more
- Google (Mountain View, CA)
- … AI /ML models, ensuring their robustness, scalability, and successful integration into production systems . + Work closely with team based in other countries, ... on and is growing every day. As a software engineer , you will work on a specific project critical...play a pivotal role in shaping how we bridge AI expertise from research to the product team. This… more
- NVIDIA (Santa Clara, CA)
- … Engineer to lead our efforts in designing, developing, and deploying generative AI systems and productivity solutions. As a key leader in our technology ... 12+ years of proven experience in software engineering experience on large-scale production systems + Proven expertise of performance, reliability in… more
- Meta (Menlo Park, CA)
- …space of GenAI/LLM scaling reliability and performance. **Required Skills:** Software Engineer , SystemML - AI Networking Responsibilities: 1. Enabling reliable ... learning/deep learning domains: Distributed ML Training, GPU architecture, ML systems , AI infrastructure, high performance computing, performance optimizations,… more
- Cisco (San Jose, CA)
- …window is expected to close on October 18, 2024. Who We Are The Cisco Security AI team delivers AI products and platform for all Cisco Secure products and ... customers secure by simplifying security with zero compromise using AI and Machine Learning. Who You Are You are...Who You Are You are a passionate Machine Learning Engineer who is building their career through successfully building,… more
- Amazon (Sunnyvale, CA)
- …- Experience handling and organizing video assets. - - Ideally familiar with Amazon systems and deployment methods for production . Amazon is committed to a ... access to high-quality creatives (audio, images, videos, description) by building AI -driven solutions for advertisers. To accomplish this, we are investing in… more
- Meta (Menlo Park, CA)
- **Summary:** Meta is seeking an experienced Production Systems Engineer to join our Release to Production (RTP) team. Our servers and data centers are ... cycle of servers in production . **Required Skills:** Production Systems Engineer , Sustaining Responsibilities:...hardware at scale 9. Experience in deploying and productionizing AI /HPC systems and/or related components at scale… more
- NVIDIA (Santa Clara, CA)
- We are looking for a highly motivated AI cloud k8s infrastructure engineer to join our team in the fastest growing organization at NVIDIA. There is an excellent ... new ways to improve availability of our Cloud Native AI Platform by automating critical processes on the multiple...at problem solving and complexity analysis of the distributed systems + Working experience with the observability tooling -… more
- Tarana Wireless (Milpitas, CA)
- …and help us to consistently deliver high-quality, high-reliability, cost-effective machine learning and AI systems as part of various products. You will guide ... preferred + 5-12 years of experience building large scale ML/ AI models and systems Knowledge, Skills and...markets, using either licensed or unlicensed spectrum. G1 started production in mid 2021 and has now been installed… more
- Google (Sunnyvale, CA)
- …Experience in architecting and developing software or infrastructure for scalable, distributed systems . + Experience in data and information management as it relates ... that drive differentiation, customer acquisition, and business acceleration. As a Customer Engineer , you will partner with technical Sales teams as a subject matter… more