- Unknown (Houston, TX)
- …generation to deal close, and ensuring that clients have access to reliable , distributed power solutions for their mission-critical operations. Applicants for ... center ecosystem or large-scale energy infrastructure buyers. Technical fluency in distributed power systems , grid interconnection, or related engineering… more
- Bosch (Sunnyvale, CA)
- …journals such as CVPR, ICRA, IROS, RSS, NeurIPS and CoRL. **Job Description** As the Distributed Embodied AI Systems intern, you will perform research on ... that take advantage of technologies in the field of reliable distributed computing. We work with internal...future prediction for latency mitigation in distributed embodied AI systems . A… more
- Public Storage (Plano, TX)
- …recognized for its iconic orange doors and commitment to delivering simple, reliable solutions to millions of customers across the country. We are expanding ... Lead to join our team and spearhead the development of our next-generation, AI -powered applications. In this pivotal role, you will bridge the gap between complex… more
- Microsoft Corporation (Redmond, WA)
- …healthcare, economics, and the environment. Are you passionate about building the future of reliable , large-scale cloud and AI systems ? The ** Systems ... Interns to tackle cutting-edge challenges at the intersection of distributed systems , AI systems...letter. **Preferred Qualifications** + Experience of building scalable and reliable systems . + Demonstrated ability to develop… more
- Microsoft Corporation (Redmond, WA)
- **Overview** **Help shape the future of reliable AI systems ** . At Microsoft Research's AI and Systems Reliability Group (Redmond, WA), we push the ... the computing landscape. We are seeking **Senior Researcher - AI and Systems Reliability - Microsoft Research**...Systems Reliability - Microsoft Research** areas such as distributed systems and reliability, formal methods and… more
- NVIDIA (Santa Clara, CA)
- …design, or enterprise platform engineering. + Deep expertise in architecting large-scale distributed systems with a focus on reliability, performance, and ... record of publishing technical papers, architecture patterns, or thought leadership in AI systems . + Knowledge of observability tools, telemetry dashboards, and… more
- Cisco (San Jose, CA)
- …platforms, such as AWS, Azure, or Google Cloud. + Understanding of distributed systems concepts, including scalability, reliability, fault tolerance, and data ... Team** Our dedicated team members are building the future of Cisco's AI -driven platforms and data infrastructure, supporting innovation across the globe. You will… more
- Oracle (Nashville, TN)
- …Work closely with a collaborative and experienced global team. - Expand your knowledge in AI , cloud computing, and distributed systems . - Contribute to one ... tools to operationalize Large Language Models (LLMs) and agentic AI systems . Our goal is to empower...will contribute to the design and implementation of scalable, distributed systems that serve LLMs and support… more
- NVIDIA (Santa Clara, CA)
- …and inference more reliable , scalable, and efficient. If you're passionate about AI , distributed systems , and high-performance computing, we want to hear ... driving down cluster downtime towards zero, ensuring that our AI systems remain robust and reliable...detection. + Hands-On Coding & Optimization: Contribute to large-scale distributed systems with high-quality, production-level C++ and… more
- Oracle (Raleigh, NC)
- …learning, LLM applications, and agentic AI . Our team builds real-world AI systems and deploys scalable, production-ready solutions across Oracle's enterprise ... engineer to contribute to the design and deployment of advanced AI systems , including LLM-powered agents, Retrieval-Augmented Generation (RAG) pipelines,… more
- Oracle (Redwood City, CA)
- …data architectures (data mesh, lakehouse, etc.). + Expertise in **data modeling, distributed systems , and performance optimization.** + Proven ability to ... you ready to shape the future of intelligent data systems ? We're seeking an ** AI and Data...Collaborate with engineers, product teams, and researchers to build systems that are ** reliable , scalable, and production-ready.**… more
- Oracle (Frankfort, KY)
- …Work closely with a collaborative and experienced global team. - Expand your knowledge in AI , cloud computing, and distributed systems . - Contribute to one ... tools to operationalize Large Language Models (LLMs) and agentic AI systems . Our goal is to empower...will contribute to the design and implementation of scalable, distributed systems that serve LLMs and support… more
- Charles Schwab (San Francisco, CA)
- …+ Champion reliability, monitoring, observability, and operational best practices for AI systems and data pipelines. + Collaborate with cross-functional ... in the development process. You will ensure that the systems we build are robust, reliable , and...troubleshoot complex problems with ambiguous or incomplete data in distributed systems . + Curiosity about new technologies… more
- Amazon (Redmond, WA)
- …is for a Software Engineer who will design, implement, and operate globally distributed systems that enable Leo to achieve low single-digit-second query ... real-time analytics layer or lakehouse, and to support agentic AI capabilities on top. You'll build these systems...user experience in real time. We combine expertise in distributed systems , data lakehouse architectures, and applied… more
- Guidehouse (Huntsville, AL)
- …+ Experience implementing MLOps workflows, evaluation frameworks, drift detection, and responsible‑ AI safeguards. + Experience delivering ML systems in secure ... Active Top Secret (TS) Guidehouse is seeking a Lead AI /ML Engineer to join our Technology / AI... solutions that leverage large language models (LLMs), retrieval systems , and secure, scalable inference pipelines to enable data-driven… more
- Walmart (Sunnyvale, CA)
- …build dynamic, context-aware systems . 2. **Architecture ; Scalability:** + Architect scalable, distributed AI systems with a focus on performance, fault ... to lead the design, development, and deployment of advanced AI systems . This role involves architecting scalable...Walmart GTP, you will be building highly scalable and reliable APIs, services and applications which will drive the… more
- Amazon (Redmond, WA)
- …for a Data Engineering Manager who will design, implement, and operate globally distributed systems that enable Leo to achieve low single-digit-second query ... real-time analytics layer or lakehouse, and to support agentic AI capabilities on top. You'll build these systems...user experience in real time. We combine expertise in distributed systems , data lakehouse architectures, and applied… more
- Amazon (Redmond, WA)
- …is for a Data Engineer who will design, implement, and operate globally distributed systems that enable Leo to achieve low single-digit-second query responses ... real-time analytics layer or lakehouse, and to support agentic AI capabilities on top. You'll build these systems...user experience in real time. We combine expertise in distributed systems , data lakehouse architectures, and applied… more
- Charles Schwab (San Francisco, CA)
- …bring curiosity, creativity, and technical depth to help shape the next generation of reliable AI at Schwab. **What you have** **Required Qualifications** + 8+ ... you will play a key role in ensuring our AI solutions are reliable , scalable, and resilient-enabling...Experience implementing monitoring, alerting, and incident response for large-scale distributed systems . + Proven track record in… more
- GE Vernova (Niskayuna, NY)
- …neural network architectures (eg, CNNs, RNNs, Transformers). + Expertise in designing scalable, distributed architectures for AI systems . + Strong experience ... Azure, GCP) and containerization (Kubernetes, Docker). + Familiarity with large-scale distributed systems and database technologies. + Experience in creating… more