<?xml version="1.0" encoding="UTF-8"?>
<source>
  <jobs>
    <job>
      <externalid>c2c97849-e31</externalid>
      <Title>Senior Machine Learning Engineer, Voice Experience</Title>
      <Description><![CDATA[<p>We are looking for a Senior Machine Learning Engineer, Voice Experience to help build the next generation of AI-powered voice systems for the contact center. In this role, you will work at the intersection of speech, language, and real-time production systems, improving how AI listens, understands, reasons, empathizes, and responds in live customer conversations.</p>
<p>You will develop and improve machine learning systems that power voice experiences end to end, including automatic speech recognition, turn detection, downstream language understanding, retrieval-augmented and agentic workflows, quality measurement, text to speech, and production optimization.</p>
<p>Responsibilities:</p>
<ul>
<li>Design, train, evaluate, and deploy machine learning systems that power real-time voice experiences, including ASR, speech understanding, turn detection, text to speech, speech to speech, classification, entity extraction, summarization, and structured insight generation.</li>
<li>Improve the quality of voice AI systems through error analysis, data curation, metric design, benchmarking, and iterative model improvement, with a strong focus on real-world performance.</li>
<li>Build evaluation frameworks for complex voice and agentic systems, measuring metrics such as accuracy, robustness, latency, faithfulness, naturalness, professionalism, task completion, and cost.</li>
<li>Diagnose and mitigate failure modes across the voice stack, including transcription errors, hallucinations, retrieval failures, tool misuse, prompt brittleness, context drift, and multi-step reasoning breakdowns.</li>
<li>Design and optimize low-latency ML workflows for live conversations, balancing model quality with system responsiveness, scalability, and reliability.</li>
<li>Partner with platform and backend engineers to productionize real-time inference, streaming pipelines, quality monitoring, and continuous model iteration.</li>
<li>Collaborate cross-functionally with product, design, frontend, and backend teams to integrate voice intelligence seamlessly into Cresta’s platform.</li>
<li>Establish best practices for offline evaluation, online experimentation, model validation, observability, and ongoing quality monitoring in production.</li>
<li>Mentor engineers, contribute to technical strategy, and help shape the roadmap for Cresta’s voice AI systems.</li>
</ul>
<p>Qualifications:</p>
<ul>
<li>Bachelor’s degree in Computer Science, Mathematics, Machine Learning, AI, or a related field; Master’s or Ph.D. preferred.</li>
<li>5+ years of experience building, evaluating, and deploying machine learning systems in production.</li>
<li>Strong background in one or more of the following: speech recognition, speech processing, NLP, generative AI, or conversational AI.</li>
<li>Deep experience with model evaluation, benchmarking, error analysis, and quality improvement for production ML systems.</li>
<li>Strong expertise with modern ML frameworks and tooling such as PyTorch, TensorFlow, and Hugging Face.</li>
<li>Solid understanding of transformer-based models, embeddings, retrieval systems, and large-scale training or inference workflows.</li>
<li>Experience designing and deploying real-time ML systems with strong requirements around latency, scalability, and reliability.</li>
<li>Experience building data pipelines and tooling for experimentation, measurement, and large-scale quality analysis.</li>
<li>Ability to work across research and engineering boundaries and translate promising ideas into production-grade systems.</li>
<li>Strong communication and technical leadership skills, with the ability to influence cross-functional decisions and raise the engineering bar.</li>
</ul>
<p>Nice to have:</p>
<ul>
<li>Hands-on experience with ASR quality metrics such as WER and task-level evaluation methodologies.</li>
<li>Experience with RAG systems, agentic workflows, multi-step reasoning systems, or LLM-as-a-judge evaluation methods.</li>
<li>Familiarity with streaming inference, real-time voice pipelines, or media systems.</li>
<li>Experience working closely with infrastructure or platform teams on production ML deployment, observability, and reliability.</li>
<li>Experience in contact center AI, conversational intelligence, or enterprise voice products.</li>
</ul>
<p style="margin-top:24px;font-size:13px;color:#666;">XML job scraping automation by <a href="https://yubhub.co">YubHub</a></p>]]></Description>
      <Jobtype>full-time</Jobtype>
      <Experiencelevel>senior</Experiencelevel>
      <Workarrangement>remote</Workarrangement>
      <Salaryrange>$205,000–$270,000</Salaryrange>
      <Skills>speech recognition, speech processing, NLP, generative AI, conversational AI, PyTorch, TensorFlow, Hugging Face, transformer-based models, embeddings, retrieval systems, large-scale training, inference workflows, real-time ML systems, latency, scalability, reliability, data pipelines, tooling, experimentation, measurement, quality analysis</Skills>
      <Category>Engineering</Category>
      <Industry>Technology</Industry>
      <Employername>Cresta</Employername>
      <Employerlogo>https://logos.yubhub.co/cresta.ai.png</Employerlogo>
      <Employerdescription>Cresta is a technology company that specializes in contact center AI and conversational intelligence.</Employerdescription>
      <Employerwebsite>https://www.cresta.ai/</Employerwebsite>
      <Compensationcurrency></Compensationcurrency>
      <Compensationmin></Compensationmin>
      <Compensationmax></Compensationmax>
      <Applyto>https://job-boards.greenhouse.io/cresta/jobs/5199747008?utm_source=yubhub.co&amp;utm_medium=jobs_feed&amp;utm_campaign=apply</Applyto>
      <Location>United States (Remote)</Location>
      <Country></Country>
      <Postedate>2026-04-24</Postedate>
    </job>
  </jobs>
</source>