- NVIDIA (Santa Clara, CA)
- …the first people to make them operational in production? We are seeking a dedicated Cluster Deployment Operations Engineer to support product deployments ... team, acting as the link between engineering and the NVIS field team for cluster deployment and management solutions! We bridge the gap between product roadmaps…
- TEKsystems (Santa Clara, CA)
- Sr. SQL Database Administrator/Engineer, 100% onsite in Santa Clara, CA. Job Description: We are looking for a world-class DBA to join its Software Infrastructure and ... Operations team. The position will be part of a...in DBA Activities - Provision MySQL instances: standalone, replication, cluster, HA configurations - Should have good exposure in…
- Cadence Design Systems, Inc. (San Jose, CA)
- …world of technology. We are seeking a highly skilled and experienced AI Systems Engineer to join our team. This is a hands-on, senior individual contributor role ... that will be pivotal in leading the development, operations, and support of our entire AI infrastructure. You...services on both GCP and Azure. + Hands-on GPU Cluster Management: Take a leadership role in the configuration,…
- NVIDIA (Santa Clara, CA)
- …operations, and networking, familiarity with software testing and deployment, familiarity with distributed systems, and excellent communication and planning ... management systems (Kubernetes, SLURM). Hands-on experience in Machine Learning Operations. Hands-on experience with Bright Cluster Manager. + Hands-on…
- NVIDIA (Santa Clara, CA)
- We are looking for a Senior Software Development Engineer in Test to join our Confidential Computing team for NVIDIA's Enterprise SWQA team. Are you passionate about ... to have your skills on the team! As an engineer on this Confidential Computing QA team, you will...tools to significantly enhance our testing capabilities and streamlining operations for more efficient and accurate results. + Improve…
- NVIDIA (Santa Clara, CA)
- …Artifactory, Jira) in hybrid on-premise and cloud environments. + Assist with cluster operations and system administration (managing: servers, team accounts, ... dedicated and motivated senior build and continuous integration (CI/CD) engineer for its GenAI Frameworks (Megatron-LM (https://github.com/NVIDIA/Megatron-LM) and NeMo…
- NVIDIA (Santa Clara, CA)
- …experience. Ways to stand out from the crowd: + Knowledge of cloud and cluster level deployment and management systems. + Experience with GPU computing (CUDA), ... NVIDIA is seeking a Senior Firmware Engineer to join our CSP Engagements team, focusing...hardware and software, driving technical solutions from concept through deployment. What you will be doing: + Design and…
- Oracle (Santa Clara, CA)
- **Job Description** Supports the design, deployment, and operations of a large-scale global Oracle cloud computing environment (Oracle Cloud Infrastructure - ... OCI). Primarily focused on development and support of AI cluster network fabric and GPU compute server systems through a combination of a deep level understanding of…
- NVIDIA (Santa Clara, CA)
- …Operations team to manage crucial accomplishments in new product operations, from Development introduction to Production deployment. The product portfolio ... encompasses HGX GPU accelerators, DGX GPU/CPU products, and L11 cluster solutions. What you'll be doing: + Developing the...are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want…