Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

Data Engineer

ConnexAI
Manchester
2 days ago
Create job alert
Build the Future of Conversational AI with ConnexAI

As a Speech Data Engineer, your work will power the data behind real-time speech systems used by millions worldwide, ensuring our AI learns from clean, accurate, and reliable datasets. By curating, analysing, and engineering the voice data that fuels our models, you’ll help shape products that transform how people and businesses communicate.

You’ll be part of the team that manages and scales our massive speech corpora, builds automated pipelines for cleaning and validation, and works closely with annotation and Machine Learning teams to keep our models at the cutting edge.

Core Responsibilities
  • Organise and maintain large-scale audio and text corpora, ensuring they are versioned correctly, catalogued, and easy to retrieve.
  • Build automated pipelines using Python, AWS, and Docker to clean, validate, and standardise speech data, detecting duplicates, transcription inconsistencies, or quality issues.
  • Develop and integrate APIs to streamline ingestion and processing of new datasets.
  • Analyse speech datasets to support ASR and TTS model development, performance evaluation, and linguistic research.
  • Implement and manage data version control tools to ensure dataset reproducibility and traceability.
  • Contribute to evaluation frameworks for ASR and TTS performance by analysing metrics such as Word Error Rate (WER), Speaker Similarity (SSim), and Mean Opinion Score (MOS) to generate data-driven insights.
  • Document data processes and tools, ensuring all datasets and analyses are well-documented, reproducible, and compliant with internal standards.
  • Collaborate closely with data scientists, ML engineers, and product teams to identify opportunities to improve data quality, balance, and diversity through targeted analysis and feedback loops.
Key Skills & Experience
  • Strong programming skills in Python for data processing, analysis, and automation.
  • Proficiency with SQL for developing and managing large-scale datasets.
  • Experience with AWS cloud services.
  • Hands‑on experience with Docker and containerised development environments.
  • Familiarity with data versioning tools (e.g., LakeFS, DVC) and dataset reproducibility principles.
  • Strong collaboration and communication skills.
  • Background in speech, audio, or NLP data processing is highly desirable.
Interview Process
  • 30‑minute video call with the team lead
  • Take‑home technical exercise
  • 90‑minute face‑to‑face interview
About ConnexAI

ConnexAI is an award‑winning Conversational AI platform designed by an elite engineering team. ConnexAI’s technology enables organisations to maximise profitability, increase revenue, and take productivity to new levels. ConnexAI provides cutting‑edge, enterprise‑grade AI applications, including AI Agent, AI Guru, AI Analytics, ASR, AI Voice, and AI Quality. We value growth both for our products and our people. As we scale, there will be clear opportunities to progress into senior data science, leadership, or principal research roles. Our high retention rate reflects our inclusive, supportive, and empowering environment.

Seniority level
  • Associate
Employment type
  • Full‑time
Job function
  • Information Technology and Research
Industries
  • Software Development, Data Infrastructure and Analytics, and Research Services


#J-18808-Ljbffr

Related Jobs

View all jobs

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Machine Learning Recruitment Trends 2025 (UK): What Job Seekers Need To Know About Today’s Hiring Process

Summary: UK machine learning hiring has shifted from title‑led CV screens to capability‑driven assessments that emphasise shipped ML/LLM features, robust evaluation, observability, safety/governance, cost control and measurable business impact. This guide explains what’s changed, what to expect in interviews & how to prepare—especially for ML engineers, applied scientists, LLM application engineers, ML platform/MLOps engineers and AI product managers. Who this is for: ML engineers, applied ML/LLM engineers, LLM/retrieval engineers, ML platform/MLOps/SRE, data scientists transitioning to production ML, AI product managers & tech‑lead candidates targeting roles in the UK.

Why Machine Learning Careers in the UK Are Becoming More Multidisciplinary

Machine learning (ML) has moved from research labs into mainstream UK businesses. From healthcare diagnostics to fraud detection, autonomous vehicles to recommendation engines, ML underpins critical services and consumer experiences. But the skillset required of today’s machine learning professionals is no longer purely technical. Employers increasingly seek multidisciplinary expertise: not only coding, algorithms & statistics, but also knowledge of law, ethics, psychology, linguistics & design. This article explores why UK machine learning careers are becoming more multidisciplinary, how these fields intersect with ML roles, and what both job-seekers & employers need to understand to succeed in a rapidly changing landscape.

Machine Learning Team Structures Explained: Who Does What in a Modern Machine Learning Department

Machine learning is now central to many advanced data-driven products and services across the UK. Whether you work in finance, healthcare, retail, autonomous vehicles, recommendation systems, robotics, or consumer applications, there’s a need for dedicated machine learning teams that can deliver models into production, maintain them, keep them secure, efficient, fair, and aligned with business objectives. If you’re hiring for or applying to ML roles via MachineLearningJobs.co.uk, this article will help you understand what roles are typically present in a mature machine learning department, how they collaborate through project lifecycles, what skills and qualifications UK employers look for, what the career paths and salaries are, current trends and challenges, and how to build an effective ML team.