Principal Data Engineer (Azure, PySpark, Databricks)

PEXA Group Limited
Leeds
3 weeks ago
Create job alert

Hi, we’re Smoove, part of the PEXA Group.


Our vision is to simplify and revolutionise the home moving and ownership experience for everyone. We are on a mission to deliver products and services that remove the pain, frustration, uncertainty, frictionand stress that the current process creates.


We are a leading provider of tech in the property sector – founded in 2003, our product focus has been our conveyancer two‑sided marketplace, connecting consumers with a range of quality conveyancers to choose from at competitive prices via our easy‑to‑use tech platform. We are now building out our ecosystem so consumers can benefit from our services either via their Estate Agent or their Mortgage Broker, through smarter conveyancing platforms, making the home buying or selling process easier, quicker, safer and more transparent.


Why join Smoove?

Great question! We pride ourselves on attracting, developing and retaining a diverse range of people in an equally diverse range of roles and specialisms – who together achieve outstanding results. Our transparent approach and open‑door policy make Smoove a great place to work and as our business expands, we are looking for ambitious, talented people to join us.


We are seeking an experienced Principal Data Engineer to define, lead, and scale the technical strategy of our data platform. This is a senior, hands‑on leadership role at the intersection of architecture, governance, and engineering excellence, where you will shape how data is collected, processed, and delivered across the organisation.


You will own the end‑to‑end quality, performance, and scalability of our data systems – from raw ingestion through to trusted datasets powering business‑critical analytics and reporting. This includes setting standards and influencing the strategic roadmap for data infrastructure.


Our stack is built on both AWS and Azure, using Databricks across data domains and you will lead the evolution of this ecosystem to meet future business needs.


You’ll ensure that data is secure, compliant, discoverable, and business‑ready, enabling analysts, data scientists, and stakeholders to make confident, data‑driven decisions.


This role is ideal for a highly technical leader who thrives at both the strategic and execution levels: someone equally comfortable defining architecture with executives, mentoring senior engineers, and optimising distributed pipelines at scale.


Role Responsibilities

  • Design and oversee scalable, performant, and secure architectures on Databricks and distributed systems.
  • Anticipate scaling challenges and ensure platforms are future‑proof.
  • Lead the design and development of robust, high‑performance data pipelines using PySpark and Databricks.
  • Define and ensure testing frameworks for data workflows.
  • Ensure end‑to‑end data quality from raw ingestion to curated, trusted datasets powering analytics.
  • Establish and enforce best practices for data governance, lineage, metadata, and security controls.
  • Ensure compliance with GDPR and other regulatory frameworks.
  • Act as a technical authority and mentor, guiding data engineers.
  • Influence cross‑functional teams to align on data strategy, standards, and practices.
  • Partner with product, engineering, and business leaders to prioritise and deliver high‑impact data initiatives.
  • Build a culture of data trust, ensuring downstream analytics and reporting are always accurate and consistent.
  • Evaluate and recommend emerging technologies where they add value to the ecosystem.

Skills & Experience Required

  • Broad experience as a Data Engineer including technical leadership.
  • Broad cloud experience, ideally both Azure and AWS.
  • Deep expertise in PySpark and distributed data processing at scale.
  • Extensive experience designing and optimising in Databricks.
  • Advanced SQL optimisation and schema design for analytical workloads.
  • Strong understanding of data security, privacy, and GDPR/PII compliance.
  • Experience implementing and leading data governance frameworks.
  • Proven experience leading the design and operation of a complex data platform.
  • Track record of mentoring engineers and raising technical standards.
  • Ability to influence senior stakeholders and align data initiatives with wider business goals.
  • Strategic mindset with a holistic view of data reliability, scalability, and business value.

£80,000 – £100,000 a year


Sound like you?

We at Smoove are ready so if this role sounds like you, apply today.


#J-18808-Ljbffr

Related Jobs

View all jobs

Principal Data Engineer

Principal Data Engineer...

Principal Data Engineer...

Principal Data Engineer

Principal Data Engineer

Principal Data Engineer

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How to Write a Machine Learning Job Ad That Attracts the Right People

Machine learning now sits at the heart of many UK organisations, powering everything from recommendation engines and fraud detection to forecasting, automation and decision support. As adoption grows, so does demand for skilled machine learning professionals. Yet many employers struggle to attract the right candidates. Machine learning job adverts often generate high volumes of applications, but few applicants have the blend of modelling skill, engineering awareness and real-world experience the role actually requires. Meanwhile, strong machine learning engineers and scientists quietly avoid adverts that feel vague, inflated or confused. In most cases, the issue is not the talent market — it is the job advert itself. Machine learning professionals are analytical, technically rigorous and highly selective. A poorly written job ad signals unclear expectations and low ML maturity. A well-written one signals credibility, focus and a serious approach to applied machine learning. This guide explains how to write a machine learning job ad that attracts the right people, improves applicant quality and strengthens your employer brand.

Maths for Machine Learning Jobs: The Only Topics You Actually Need (& How to Learn Them)

Machine learning job adverts in the UK love vague phrases like “strong maths” or “solid fundamentals”. That can make the whole field feel gatekept especially if you are a career changer or a student who has not touched maths since A level. Here is the practical truth. For most roles on MachineLearningJobs.co.uk such as Machine Learning Engineer, Applied Scientist, Data Scientist, NLP Engineer, Computer Vision Engineer or MLOps Engineer with modelling responsibilities the maths you actually use is concentrated in four areas: Linear algebra essentials (vectors, matrices, projections, PCA intuition) Probability & statistics (uncertainty, metrics, sampling, base rates) Calculus essentials (derivatives, chain rule, gradients, backprop intuition) Basic optimisation (loss functions, gradient descent, regularisation, tuning) If you can do those four things well you can build models, debug training, evaluate properly, explain trade-offs & sound credible in interviews. This guide gives you a clear scope plus a six-week learning plan, portfolio projects & resources so you can learn with momentum rather than drowning in theory.

Neurodiversity in Machine Learning Careers: Turning Different Thinking into a Superpower

Machine learning is about more than just models & metrics. It’s about spotting patterns others miss, asking better questions, challenging assumptions & building systems that work reliably in the real world. That makes it a natural home for many neurodivergent people. If you live with ADHD, autism or dyslexia, you may have been told your brain is “too distracted”, “too literal” or “too disorganised” for a technical career. In reality, many of the traits that can make school or traditional offices hard are exactly the traits that make for excellent ML engineers, applied scientists & MLOps specialists. This guide is written for neurodivergent ML job seekers in the UK. We’ll explore: What neurodiversity means in a machine learning context How ADHD, autism & dyslexia strengths map to ML roles Practical workplace adjustments you can ask for under UK law How to talk about neurodivergence in applications & interviews By the end, you’ll have a clearer sense of where you might thrive in ML – & how to turn “different thinking” into a genuine career advantage.