Audio Data Engineer Speech Cleaning & Pipeline Automation (TTS) Job at Hippocratic Ai, Palo Alto, CA

TmRneTlGTkJSelI5VTM2Z0RnTUNRQ0xNdkE9PQ==
  • Hippocratic Ai
  • Palo Alto, CA

Job Description

About Us:

Hippocratic AI has developed a safety-focused Large Language Model (LLM) for healthcare. The company believes that a safe LLM can dramatically improve healthcare accessibility and health outcomes in the world by bringing deep healthcare expertise to every human. No other technology has the potential to have this level of global impact on health. 

About the Role

Hippocratic AI is seeking a skilled Audio Data Engineer to help us scale and improve our speech datasets for use in Text-to-Speech (TTS) and speech synthesis systems. In this role, you will clean and enhance real-world audio data, build automation pipelines for processing, and ensure our voice models are trained on the highest quality inputs. This work will directly shape the clarity and expressiveness of the voices used in healthcare AI applications.

Responsibilities

  • Clean, denoise, and enhance large volumes of recorded speech data for use in TTS and voice synthesis pipelines.

  • Build and maintain automated audio preprocessing pipelines using scripting tools and open-source libraries.

  • Apply techniques such as background noise removal, silence trimming, gain normalization, and sample rate conversion.

  • Integrate tools like ffmpeg, sox, or Python-based scripts (pydub, torchaudio, librosa) into scalable workflows.

  • Collaborate with ML researchers and speech scientists to deliver high-quality, ready-to-train datasets.

  • Evaluate audio quality using perceptual and quantitative metrics, and maintain audio QA checklists.

Required Qualifications

  • Strong experience with speech/audio cleaning using tools such as iZotope RX, Audacity, Adobe Audition, or SoX.

  • Proficiency in Python and audio-related scripting for automation and batch processing.

  • Familiarity with digital audio principles, including sample rates, bit depth, frequency bands, and compression artifacts.

  • Experience designing or operating scalable, automated workflows for handling audio at volume.

  • Meticulous attention to detail in audio quality control and error spotting.

Nice to Have

  • Experience working on TTS model pipelines (e.g., Tacotron, VITS, FastSpeech) or speech synthesis datasets.

  • Background in audio engineering, phonetics, or signal processing.

  • Familiarity with real-time or low-latency audio processing constraints.

  • Experience with cloud platforms and tools for automation (e.g., AWS, Airflow, or containerized audio workflows).

Why Join Our Team:

  • Innovative Mission: We are developing a safe, healthcare-focused large language model (LLM) designed to revolutionize health outcomes on a global scale.

  • Visionary Leadership: Hippocratic AI was co-founded by CEO Munjal Shah, alongside a group of physicians, hospital administrators, healthcare professionals, and artificial intelligence researchers from leading institutions, including El Camino Health, Johns Hopkins, Stanford, Microsoft, Google, and NVIDIA.

  • Strategic Investors: We have raised a total of $278 million in funding, backed by top investors such as Andreessen Horowitz, General Catalyst, Kleiner Perkins, NVIDIA’s NVentures, Premji Invest, SV Angel, and six health systems.

  • World-Class Team: Our team is composed of leading experts in healthcare and artificial intelligence, ensuring our technology is safe, effective, and capable of delivering meaningful improvements to healthcare delivery and outcomes.

For more information, visit .

Our team values in-person collaboration, with on-site presence expected five days a week in Palo Alto, CA.

Job Tags

Full time,

Similar Jobs

Physician's Practice Enhancement LLC

Emergency Medicine Physician Job at Physician's Practice Enhancement LLC

Emergency Medicine Physician at Physician's Practice Enhancement LLC summary: Physician role in a low-volume VA Emergency Department in Clarksburg, WV, covering ~25 patients/day with ~1,400 annual admissions. Shifts are primarily 12-hour day and night blocks using the...

Samaritan Health Services Clinician Recruitment

Orthopedic Surgeon - Trauma Job at Samaritan Health Services Clinician Recruitment

 ...Orthopedic Trauma Surgeon Opportunity in Corvallis, OR Samaritan Health Services is seeking an Orthopedic Trauma Surgeon to join our team in Corvallis, Oregon. Our Orthopedic Surgeons based at Good Samaritan Regional Medical Center, a level II trauma center, are... 

Kaav Inc.

IT Security Manager Job at Kaav Inc.

Onsite in Detroit for Hybrid model 3 days a week. **IT Security Manager - ** Exceptional understanding of Risk and Regulatory requirements...  ...in Quality Assurance/Quality Control, IT Risk Management, and Information Security - ability to evaluate and design effective controls;... 

Garment Decor

Promo, Screen Printing & Embroidery Sales - Work Remotely Job at Garment Decor

 ...Sales Representative Location: Remote Hours: Flexible What Were Looking For: A thorough understanding of promo, screen printing, and embroidery decoration methods A sales rep with their own book of business that is looking for improved service for... 

Circuit Court of Cook County, Illinois

OCJ - Certified Court Interpreter Job at Circuit Court of Cook County, Illinois

 ...CIRCUIT COURT OF COOK COUNTY, ILLINOIS OFFICE OF THE CHIEF JUDGE JOB DESCRIPTION JOB TITLE: CERTIFIED COURT INTERPRETER LANGUAGE: SPANISH, POLISH GRADE: 17 UNION: CHICAGO NEWS GUILD SALARY: UNION PAY SCHEDULE, FIRST STEP: $70,137 LOCATION: COOK COUNTY...