Senior Machine Learning Engineer (Speech Synthesis)
Join Telnyx as a Senior ML Engineer focused on next‑generation speech synthesis systems. This greenfield opportunity lets you build end‑to‑end multilingual text‑to‑speech (TTS) platforms from training pipelines to low‑latency inference services, enabling voice AI agents that scale globally.
Impact You’ll Drive
As a founding member of the speech synthesis team, you’ll define the stack, architecture, and best practices for training and deploying state‑of‑the‑art multilingual TTS models. Your work will shape how millions experience real‑time conversational AI.
What You’ll Work On
* Own the stack from day one: Design and implement the ML training and inference pipelines for multilingual speech synthesis.
* Low‑latency TTS: Engineer systems optimized for real‑time, streaming speech generation with sub‑100 ms response targets.
* Train cutting‑edge models: Build and fine‑tune multilingual TTS systems using modern architectures, including LLM‑based, diffusion, and flow‑matching approaches.
* Massive‑scale data processing: Develop pipelines for ingesting, aligning, and normalizing text, audio, and phonetic data across dozens of languages.
* Experimentation at scale: Run distributed training across multi‑node GPU clusters, tracking results and iterating quickly.
* Cross‑functional collaboration: Work with infrastructure and voice platform teams to deploy models that scale globally.
* Research meets production: Evaluate emerging techniques (LLM‑guided synthesis, zero/few‑shot voice cloning, full‑duplex modeling) and bring them to life in production‑grade systems.
What You’ll Work With
* Infrastructure: Docker, Kubernetes, Ray, Kubeflow, MLflow, Weights & Biases
* Data Systems: Kafka, Redis, PostgreSQL, Parquet
* You’ll define it: You’ll help select and implement the stack that supports distributed training, data processing, and inference for global deployment.
What We’re Looking For
* 6+ years of experience in machine learning or speech systems engineering.
* Hands‑on expertise with neural TTS, speech synthesis, or adjacent areas (ASR, voice cloning, multilingual modeling).
* Obsessed over hard problems such as building multilingual TTS from noisy data, teaching LLMs to speak, designing self‑supervised audio encoders, or making diffusion models run in real time.
* Experience with LLM‑based approaches to speech synthesis or prosody control.
* Proven track record leading small teams and defining technical direction or team executables.
* Production mindset: build systems that run fast, stay stable, and are easy to maintain.
Why Telnyx
You’ll be joining a company where voice, infrastructure, and AI converge. Telnyx is building the foundation for real‑time, intelligent global communications, and your work on multilingual TTS will be at the core of that vision.
Location: Dublin, County Dublin, Ireland
#J-18808-Ljbffr