- NVIDIA (Santa Clara, CA)
- …for these meaningful changes in automotive industry, We are now looking for a Senior Reliability Engineer . The position is an individual contributor role, ... auto industry, as the car is becoming a connected system , and graphics and computing power is redefining the...engineer must have skills in mathematical analysis and reliability modellings to improve product quality and reliability… more
- NVIDIA (Santa Clara, CA)
- …our global scale PaaS and SaaS offering! We are seeking a highly motivated Senior Site Reliability Engineer to join our Omniverse Infrastructure organization ... system design, complexity analysis, software design in Unix/Linux systems , performance, and application issues + 8+ years' of...for us. Are you a creative and autonomous Site Reliability Engineer , who loves challenges? Do you… more
- NVIDIA (Santa Clara, CA)
- …us accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a ... procedures and practices, perform technology evaluations, related to distributed file systems . + Collaborate across teams to better understand developers' workflows… more
- NVIDIA (Santa Clara, CA)
- …efficient and reliable systems is an imperative. We are looking for a System Reliability Engineer to join NVIDIA's existing Reliability Engineering ... Center Servers. What you'll be doing: + Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards, server, rack,… more
- NVIDIA (Santa Clara, CA)
- …drive foundational improvements and automation to improve researchers productivity. As a Site Reliability Engineer , you are responsible for the big picture of ... of deep learning workflows. You will design, implement and support operational and reliability aspects of large scale distributed systems with focus on… more
- NVIDIA (Santa Clara, CA)
- …once they are live by measuring and monitoring availability, latency and overall system health. + Scale systems sustainably through mechanisms like automation, ... time enabling developers to make changes to the existing system through careful preparation and planning while keeping an... systems by pushing for changes that improve reliability and velocity + Practice sustainable incident response and… more
- NVIDIA (Santa Clara, CA)
- …intelligence. We are seeking a highly skilled and experienced Staff Software Engineer to lead the design, deployment, and management of our large-scale GPU ... - like GB200 - and cloud technologies to improve system performance. What we need to see: + Minimum...Python, Go, Ruby). + Strong proficiency with Linux operating systems and TCP/IP fundamentals. + Proficient in modern CI/CD… more
- NVIDIA (Santa Clara, CA)
- …see how you can make a lasting impact on the world. As a Sr Staff Engineer , you will drive the design and development of scalable database platform systems in ... operating our existing infrastructure to the highest level of reliability and security. You will work side by side...technology. + Source code management: GitLab, GitHub + Strong systems engineering background in Linux or Windows. + Proficiency… more
- Google (Sunnyvale, CA)
- …Preferred qualifications: + Master's degree in Computer Science or Engineering. Site Reliability Engineering (SRE) combines software and systems engineering to ... that Google Cloud's services-both our internally critical and our externally-visible systems -have reliability , uptime appropriate to customer's needs and a… more
- Abbott (Pleasanton, CA)
- …for diversity, working mothers, female executives, and scientists. **The Opportunity** **Staff Site Reliability Engineer ** As senior member of Site ... serve people in more than 160 countries. **Staff Site Reliability Engineer ** **Working at Abbott** At Abbott,...delivery, and implementation of highly complex and critical software systems . Expertise in the value and principles of SRE… more
- General Motors (Mountain View, CA)
- …the scale, availability, and operations for our applications. As a staff engineer , you'll be creating patterns for reliability , implementing reliability ... a week, at minimum **The Role:** This is a senior technical leadership role, and we are looking for...and observability tools (eg Prometheus, Grafana, Datadog) to ensure system health and reliability + Lead automation… more
- NVIDIA (Santa Clara, CA)
- …module and etc. + Work with design and operations teams to create board and system reliability test plans for various products such as GPU, Server, Automotive, ... of technological advancement. What you will be doing: + Reliability (DfR) qualification and testing for various packaging including...testing data; + Perform FEA to assist PCBA and system mechanical structural design; + Analyze and track failure… more
- Palo Alto Networks (Santa Clara, CA)
- …insights into our systems ' performance and health. **Your Impact** As a Senior Staff SRE with the Cortex Observability team, you will: + Cloud Expertise - ... including the design, implementation, and continuous enhancement of our comprehensive observability systems . To meet the opportunities that such a role provides, you… more
- Palo Alto Networks (Santa Clara, CA)
- …Alto Networks runs a large infrastructure and is one of the largest GCP customers. As a Senior Staff DevOps Engineer for the CDL/SLS team, you will be part of a ... of critical business and production issues **Your Experience** + 4+ years as an engineer in Infrastructure, Operations, DevOps, or System Engineering + 3+ years… more
- NVIDIA (Santa Clara, CA)
- …of performance, security and reliability in complex distributed systems . Familiarity with system level architecture, data synchronization, fault ... have a strong programming background, a deep understanding of distributed systems , familiarity with software testing and deployment, and excellent communication and… more
- SLAC National Accelerator Laboratory (Menlo Park, CA)
- Senior Electrical Engineer - Electrical Systems Development Job ID 6172 Location SLAC - Menlo Park, CA Full-Time Regular **SLAC Job Postings** **Position ... Electrical Power Department (EPD) is seeking an experienced Electrical Engineer to join our team. Members of EPD are...design for SLAC electrical distribution, emergency and standby power systems . + Provide support to the Electrical System… more
- NVIDIA (Santa Clara, CA)
- …places to work in the world. We are now looking for a Senior System Power Validation & Applications Engineer in the Datacenter System Engineering Team. ... solutions of density, performance, transient response, manageability, scalability, manufacturability, reliability , security, protection, and cost. You will gain a… more
- Amazon (Cupertino, CA)
- …Linux/Unix environment experience - 5+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience - 5+ ... day in the life Lead the Hardware Engineering (HWEng) System Development (SysDE) effort to define and build software...EE, CE, or CS - 7+ years of SysDE ( Systems Development Engineer ) or equivalent experience -… more
- Silicon Valley Power (Santa Clara, CA)
- ** Senior Electric Utility Engineer ** Print (https://www.governmentjobs.com/careers/cityofsantaclaraca/jobs/newprint/4679157) ** Senior Electric Utility ... NVIDIA. **The Positions** SVP is seeking dynamic and innovative Senior Electric Utility Engineer candidates to fill...This will include running models for contingency analysis and reliability analysis of the transmission system . This… more
- RTX Corporation (San Jose, CA)
- …security and safeguard the global community. We're on the lookout for a dedicated Senior Principal Systems Engineer with specialization in Airborne ... protect the interests of the US and its allies. From high reliability telecommunication processing to airborne signals intelligence collection and signal processing… more