Principal Data Engineer (Azure, PySpark, Databricks)

PEXA
Leeds
1 month ago
Applications closed

Hi, we’re Smoove, part of the PEXA Group. Our vision is to simplify and revolutionise the home moving and ownership experience for everyone. We are on a mission to deliver products and services that remove the pain, frustration, uncertainty, friction and stress that the current process creates. We are a leading provider of tech in the property sector. Founded in 2003, our product focus has been our conveyancer two-sided marketplace, connecting consumers with a range of quality conveyancers to choose from at competitive prices via our easy-to-use tech platform. We are now building out our ecosystem so consumers can benefit from our services either via their Estate Agent or their Mortgage Broker, through smarter conveyancing platforms, making the home buying or selling process easier, quicker, safer and more transparent.

Role

We are seeking an experienced Principal Data Engineer to define, lead, and scale the technical strategy of our data platform. This is a senior, hands-on leadership role at the intersection of architecture, governance, and engineering excellence, where you will shape how data is collected, processed, and delivered across the organisation. You will own the end-to-end quality, performance, and scalability of our data systems, from raw ingestion through to trusted datasets powering business-critical analytics and reporting. This includes setting standards and influencing the strategic roadmap for data infrastructure. Our stack is built on both AWS and Azure, using Databricks across data domains, and you will lead the evolution of this ecosystem to meet future business needs. You’ll ensure that data is secure, compliant, discoverable, and business-ready, enabling analysts, data scientists, and stakeholders to make confident, data-driven decisions. This role is ideal for a highly technical leader who thrives at both the strategic and execution levels: someone equally comfortable defining architecture with executives, mentoring senior engineers, and optimising distributed pipelines at scale.

Role Responsibilities

  • Design and oversee scalable, performant, and secure architectures on Databricks and distributed systems.
  • Anticipate scaling challenges and ensure platforms are future-proof.
  • Lead the design and development of robust, high-performance data pipelines using PySpark and Databricks.
  • Define and enforce testing frameworks for data workflows.
  • Ensure end-to-end data quality from raw ingestion to curated, trusted datasets powering analytics.
  • Establish and enforce best practices for data governance, lineage, metadata, and security controls.
  • Ensure compliance with GDPR and other regulatory frameworks.
  • Act as a technical authority and mentor, guiding data engineers.
  • Influence cross-functional teams to align on data strategy, standards, and practices.
  • Partner with product, engineering, and business leaders to prioritise and deliver high-impact data initiatives.
  • Build a culture of data trust, ensuring downstream analytics and reporting are always accurate and consistent.
  • Evaluate and recommend emerging technologies where they add value to the ecosystem.

Skills & Experience Required

  • Broad experience as a Data Engineer, including technical leadership.
  • Broad cloud experience, ideally both Azure and AWS.
  • Deep expertise in PySpark and distributed data processing at scale.
  • Extensive experience designing and optimising in Databricks.
  • Advanced SQL optimisation and schema design for analytical workloads.
  • Strong understanding of data security, privacy, and GDPR/PII compliance.
  • Experience implementing and leading data governance frameworks.
  • Proven experience leading the design and operation of a complex data platform.
  • Track record of mentoring engineers and raising technical standards.
  • Ability to influence senior stakeholders and align data initiatives with wider business goals.
  • Strategic mindset with a holistic view of data reliability, scalability, and business value.

Sound like you? We at Smoove are ready, so if this role sounds like you, apply today.

To be conducted as part of post offer employment checks: The personal information we have collected from you will be shared with Cifas who will use it to prevent fraud, other unlawful or dishonest conduct, malpractice, and other seriously improper conduct. If any of these are detected, you could be refused certain services or employment. Your personal information will also be used to verify your identity. Further details of how your information will be used by us and Cifas, and your data protection rights, can be found at [Cifas].

GDPR Compliance: Digital Completion UK Limited (trading name “PEXA”), Optima Legal Services Limited (trading name "Optima Legal") and Smoove Limited are owned by DigCom UK Holdings Limited, a subsidiary of PEXA Group. When we process your applicant personal data for recruitment purposes, we do so as a controller. If as part of the recruitment process we share your personal data with another company within the PEXA Group, that company may process your personal data as either an independent controller or, in certain circumstances, a joint controller. By applying for this role, you consent to us processing your personal data in accordance with the UK GDPR and the Data Protection Act 2018, and further information can be found in our privacy notice.

Seniority level: Not Applicable
Employment type: Full-time
Job function: Information Technology
Industries: Information Services, Financial Services, and IT Services and IT Consulting
