Research Engineer Audio Speech Jobs

53 jobs (page 1)

Research Engineer - Audio…

Zyphra Technologies Inc. (Palo Alto, CA)

…intelligence company based in Palo Alto, California. The Role: As a Research Engineer - Audio & Speech Models, you will be a core contributor on ... Zyphra's Audio Team, building the next generation of open-source text-to-speech and audio models. You will be deeply involved in the entire model training…

job goal (01/13/26)
Audio & Speech AI Research…

Zyphra Technologies Inc. (Palo Alto, CA)

A cutting-edge AI firm in Palo Alto is seeking a Research Engineer specializing in audio and speech models. You will contribute to the development of ... advanced open-source audio technologies. Responsibilities include data processing, model training, and performance optimization. Ideal candidates possess strong…

job goal (01/13/26)
Founding Audio AI Research…

David AI (New York, NY)

…stakeholders to productionalize the latest research insights. About this role: As a Founding Audio AI Research Engineer, you'll define the research ... build comprehensive evaluation frameworks for audio AI capabilities across speech, emotion, conversation dynamics, and acoustic patterns. Research and…

job goal (01/13/26)
Member of Technical Staff - ML Research…

Liquid AI (San Francisco, CA)

…like DeepSpeed, FSDP, or Megatron-LM. You've worked with multimodal data (e.g., audio, text, image, video). You've contributed to research papers, open-source ... If: You have experience with machine learning at scale. You have worked with audio models and understand the effects of architecture choices on runtime, latency, and…

job goal (01/13/26)
Sr Audio Digital Signal Processing…

Disney (San Francisco, CA)

Sr Audio Digital Signal Processing Engineer. The Skywalker Sound Development Group is seeking an innovative and experienced Audio Digital Signal Processing ... Engineer to develop and implement advanced audio processing solutions that integrate seamlessly into modern media ... work will focus on enabling high-quality, efficient, and reliable audio workflows for applications such as speech…

job goal (01/13/26)
Audio Systems Engineer

Sesame (San Francisco, CA)

…System Engineer to define, tune, evaluate, and ship high‑quality speech reproduction systems and spatialized audio capture solutions. Responsibilities: Own ... transducer requirements to ensure product success. Design, analyze, and refine real‑time audio capture and render systems. Research and implement advanced…

job goal (01/12/26)
Sr. Machine Learning Engineer, Siri…

Apple Inc. (Cupertino, CA)

Sr. Machine Learning Engineer, Siri Speech. Cupertino, California, United States. Machine Learning and AI. The Speech Team within the Siri organization drives ... research anchored in Apple‑specific production needs, while improving speech interaction experiences for Apple's customers around the world. Description: You…

job goal (01/12/26)
Staff Speech Scientist

Trades Workforce Solutions (San Francisco, CA)

…language switching, and accurate pronunciation across languages and accents. As a Staff Research Engineer, you'll work hands‑on to expand their foundation models ... that balance speed with control. What you'll do: Conduct research to advance core speech models and ... systems. What you'll bring: 3+ years of experience in speech synthesis, audio generation, or generative modeling…

job goal (01/13/26)
Staff Speech Scientist - Real-Time…

Trades Workforce Solutions (San Francisco, CA)

A speech AI technology firm in San Francisco seeks a Staff Research Engineer to advance core speech models and develop new architectures. The ideal ... candidate has over 3 years of experience in speech synthesis and audio generation, with a solid background in model training. This role offers up to $250K base…

job goal (01/12/26)
Product Engineer

David AI (San Francisco, CA)

About David AI: David AI is the first audio data research company. We bring an R&D approach to data, developing datasets with the same rigor AI labs bring to ... bring AI into the real world, and we believe audio is the gateway. Speech is versatile, ... machine learning experts focused on building the world's first audio data research company. We move fast,…

job goal (01/13/26)
Staff Product Engineer, Tech Lead

David AI (San Francisco, CA)

About David AI: David AI is the first audio data research company. We bring an R&D approach to data, developing datasets with the same rigor AI labs bring to ... bring AI into the real world, and we believe audio is the gateway. Speech is versatile, ... machine learning experts focused on building the world's first audio data research company. We move fast,…

job goal (01/13/26)
Research Engineer, Synthetic Data

Cartesia (San Francisco, CA)

…even the best models can continuously process and reason over a year-long stream of audio, video and text: 1B text tokens, 10B audio tokens and 1T video ... where you will solve critical data bottlenecks and directly accelerate our research progress. What you'll do: Evaluate fidelity, diversity, and usefulness of…

job goal (01/12/26)
Founding Applied ML Engineer

Recruiting From Scratch (San Francisco, CA)

…1 year of prior software development experience before transitioning into ML. Experience with speech or audio research, particularly in applying ML models to ... the best candidate for our clients. Applied Machine Learning Engineer - Audio AI. Location: San Francisco, ... and models for collecting and training on large-scale, high-fidelity speech and audio datasets across diverse languages…

job goal (01/12/26)
Research Engineer, Multilingual…

Cartesia (San Francisco, CA)

…even the best models can continuously process and reason over a year-long stream of audio, video and text: 1B text tokens, 10B audio tokens and 1T video ... build large-scale multilingual datasets for model training. Build evaluations of multilingual speech models, both via manual annotation and at scale with automated…

job goal (01/12/26)
Senior Software Engineer, Voice Agent

Decagon AI, Inc. (San Francisco, CA)

…conversations across our omnichannels (phone, web, mobile, etc.). We work across speech understanding, audio streaming, synthesis quality, and the voice‑specific ... to end. About the Role: As a Senior Software Engineer focused on Voice Agent, you will design and ..., networking, or real-time pipelines. An interest in speech, audio, or multimodal AI. Even better…

job goal (01/13/26)
Staff Software Engineer (Voice Agent)

Recruiting From Scratch (San Francisco, CA)

…the Voice Agent group, a technically challenging surface responsible for real-time speech understanding, audio streaming, synthesis quality, and voice-specific ... measure and iterate on voice performance. Partner closely with Research to integrate new speech models and ... Experience with automatic speech recognition (ASR) or text-to-speech (TTS) systems. Familiarity with VAD, audio…

job goal (01/13/26)
Principal ML Engineer

Sanas (Palo Alto, CA)

…Deep experience with ML tooling for training and serving models, ideally in audio or speech domains (e.g., PyTorch, ONNX, Hugging Face Transformers, torchaudio). ... deep industry experience, Sanas has developed the world's first real-time speech transformation platform capable of accent translation, noise elimination, speech…

job goal (01/13/26)
Sr. AI Engineer

Mursion, Inc. (San Francisco, CA)

…evolving and fast-paced startup environment. About the role: As a Senior AI Engineer for Real-Time Conversational Agents, you will play a crucial role in developing ... conversational agents. You'll work primarily with large language models (LLMs), automatic speech recognition (ASR), text-to-speech (TTS), and emerging speech…

job goal (01/13/26)
Software Engineer, Inference - Multi Modal

OpenAI (San Francisco, CA)

…multimodal models. Optimize systems for high-throughput, low-latency delivery of image and audio inputs and outputs. Enable experimental research workflows to ... and scalable in production, and we partner closely with Research to bring the next generation of models into ... the infrastructure needed to serve models that handle image, audio, and other non-text modalities. These workloads are inherently…

job goal (01/13/26)
Principal Data Engineer

Sanas (Palo Alto, CA)

…deep industry experience, Sanas has developed the world's first real-time speech transformation platform capable of accent translation, noise elimination, speech ... algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity. Pioneered by seasoned startup founders with a proven track…

job goal (01/13/26)
