Data Engineer

ConnexAI
Manchester
4 months ago
Applications closed

Build the Future of Conversational AI with ConnexAI

As a Speech Data Engineer, you will power the data behind real-time speech systems used by millions worldwide, ensuring our AI learns from clean, accurate, and reliable datasets. By curating, analysing, and engineering the voice data that fuels our models, you’ll help shape products that transform how people and businesses communicate.

You’ll be part of the team that manages and scales our massive speech corpora, builds automated pipelines for cleaning and validation, and works closely with annotation and Machine Learning teams to keep our models at the cutting edge.

Core Responsibilities
  • Organise and maintain large-scale audio and text corpora, ensuring they are versioned correctly, catalogued, and easy to retrieve.
  • Build automated pipelines using Python, AWS, and Docker to clean, validate, and standardise speech data, and to detect duplicates, transcription inconsistencies, and quality issues.
  • Develop and integrate APIs to streamline ingestion and processing of new datasets.
  • Analyse speech datasets to support ASR and TTS model development, performance evaluation, and linguistic research.
  • Implement and manage data version control tools to ensure dataset reproducibility and traceability.
  • Contribute to evaluation frameworks for ASR and TTS performance by analysing metrics such as Word Error Rate (WER), Speaker Similarity (SSim), and Mean Opinion Score (MOS) to generate data-driven insights.
  • Document data processes and tools, ensuring all datasets and analyses are well-documented, reproducible, and compliant with internal standards.
  • Collaborate closely with data scientists, ML engineers, and product teams to identify opportunities to improve data quality, balance, and diversity through targeted analysis and feedback loops.
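
For candidates less familiar with the evaluation metrics named above, Word Error Rate is conventionally the number of word-level substitutions, deletions, and insertions needed to turn the model's transcript into the reference transcript, divided by the number of words in the reference. The following is a minimal illustrative sketch only, not ConnexAI's evaluation code: the function name `word_error_rate` is hypothetical, and it assumes plain whitespace tokenisation with no text normalisation (casing, punctuation), which a production evaluation would normally handle.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = word-level edit distance / number of reference words.

    Assumption: words are split on whitespace and compared exactly.
    """
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # previous_row[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    previous_row = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current_row = [i] + [0] * len(hyp_words)
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            current_row[j] = min(
                previous_row[j] + 1,                      # deletion
                current_row[j - 1] + 1,                   # insertion
                previous_row[j - 1] + substitution_cost,  # match or substitution
            )
        previous_row = current_row

    return previous_row[-1] / len(ref_words)
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.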
Key Skills & Experience
  • Strong programming skills in Python for data processing, analysis, and automation.
  • Proficiency with SQL for developing and managing large-scale datasets.
  • Experience with AWS cloud services.
  • Hands‑on experience with Docker and containerised development environments.
  • Familiarity with data versioning tools (e.g., LakeFS, DVC) and dataset reproducibility principles.
  • Strong collaboration and communication skills.
  • Background in speech, audio, or NLP data processing is highly desirable.
Interview Process
  • 30‑minute video call with the team lead
  • Take‑home technical exercise
  • 90‑minute face‑to‑face interview
About ConnexAI

ConnexAI is an award‑winning Conversational AI platform designed by an elite engineering team. ConnexAI’s technology enables organisations to maximise profitability, increase revenue, and take productivity to new levels. ConnexAI provides cutting‑edge, enterprise‑grade AI applications, including AI Agent, AI Guru, AI Analytics, ASR, AI Voice, and AI Quality.

We value growth for both our products and our people. As we scale, there will be clear opportunities to progress into senior data science, leadership, or principal research roles. Our high retention rate reflects our inclusive, supportive, and empowering environment.

Seniority level
  • Associate
Employment type
  • Full‑time
Job function
  • Information Technology and Research
Industries
  • Software Development, Data Infrastructure and Analytics, and Research Services

