Engineer the Quantum RevolutionYour expertise can help us shape the future of quantum computing at Oxford Ionics.

View Open Roles

Data Engineer

Sandtech
London
2 months ago
Applications closed

Related Jobs

View all jobs

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Sand Technologies is a fast-growing enterprise AI company that solves real-world problems for large blue-chip companies and governments worldwide.

We’re pioneers of meaningful AI: our solutions go far beyond chatbots. We are using data and AI to solve the world’s biggest issues in telecommunications, sustainable water management, energy, healthcare, climate change, smart cities, and other areas that have a real impact on the world. For example, our AI systems help to manage the water supply for the entire city of London. We created the AI algorithms that enabled the 7th largest telecommunications company in the world to plan its network in 300 cities in record time. And we built a digital healthcare system that enables 30m people in a country to get world-class healthcare despite a shortage of doctors.

We’ve grown our revenues by over 500% in the last 12 months while winning prestigious scientific and industry awards for our cutting-edge technology. We’re underpinned by over 300 engineers and scientists working across Africa, Europe, the UK and the US.

ABOUT THE ROLE

Sand Technologies focuses on cutting-edge cloud-based data projects, leveraging tools such as Databricks, DBT, Docker, Python, SQL, and PySpark to name a few. We work across a variety of data architectures such as Data Mesh, lakehouse, data vault and data warehouses. Our data engineers create pipelines that support our data scientists and power our front-end applications. This means we do data-intensive work for both OLTP and OLAP use cases. Our environments are primarily cloud-native spanning AWS, Azure and GCP, but we also work on systems running self-hosted open source services exclusively. We strive towards a strong code-first, data as a product mindset at all times, where testing and reliability with a keen eye on performance is a non-negotiable.

JOB SUMMARY

A Data Engineer, has the primary role of designing, building, and maintaining scalable data pipelines and infrastructure to support data-intensive applications and analytics solutions. They closely collaborate with data scientists, analysts, and software engineers to ensure efficient data processing, storage, and retrieval for business insights and decision-making. From their expertise in data modelling, ETL (Extract, Transform, Load) processes, and big data technologies it becomes possible to develop robust and reliable data solutions.

RESPONSIBILITIES

  • Data Pipeline Development: Design, implement, and maintain scalable data pipelines for ingesting, processing, and transforming large volumes of data from various sources using tools such as databricks, python and pyspark.
  • Data Modeling: Design and optimize data models and schemas for efficient storage, retrieval, and analysis of structured and unstructured data.
  • ETL Processes: Develop and automate ETL workflows to extract data from diverse sources, transform it into usable formats, and load it into data warehouses, data lakes or lakehouses.
  • Big Data Technologies: Utilize big data technologies such as Spark, Kafka, and Flink for distributed data processing and analytics.
  • Cloud Platforms: Deploy and manage data solutions on cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP), leveraging cloud-native services for data storage, processing, and analytics.
  • Data Quality and Governance: Implement data quality checks, validation processes, and data governance policies to ensure accuracy, consistency, and compliance with regulations.
  • Monitoring, Optimization and Troubleshooting: Monitor data pipelines and infrastructure performance, identify bottlenecks and optimize for scalability, reliability, and cost-efficiency. Troubleshoot and fix data-related issues.
  • DevOps: Build and maintain basic CI/CD pipelines, commit code to version control and deploy data solutions.
  • Collaboration: Collaborate with cross-functional teams, including data scientists, analysts, and software engineers, to understand requirements, define data architectures, and deliver data-driven solutions.
  • Documentation: Create and maintain technical documentation, including data architecture diagrams, ETL workflows, and system documentation, to facilitate understanding and maintainability of data solutions.
  • Best Practices: Continuously learn and apply best practices in data engineering and cloud computing.

QUALIFICATIONS

  • Proven experience as a Data Engineer, or in a similar role, with hands-on experience building and optimizing data pipelines and infrastructure.
  • Proven experience working with Big Data and tools used to process Big Data
  • Strong problem-solving and analytical skills with the ability to diagnose and resolve complex data-related issues.
  • Solid understanding of data engineering principles and practices.
  • Excellent communication and collaboration skills to work effectively in cross-functional teams and communicate technical concepts to non-technical stakeholders.
  • Ability to adapt to new technologies, tools, and methodologies in a dynamic and fast-paced environment.
  • Ability to write clean, scalable, robust code using python or similar programming languages. Background in software engineering a plus.

DESIRABLE LANGUAGES/TOOLS

  • Proficiency in programming languages such as Python, Java, Scala, or SQL for data manipulation and scripting.
  • Strong understanding of data modelling concepts and techniques, including relational and dimensional modelling.
  • Experience in big data technologies and frameworks such as Databricks, Spark, Kafka, and Flink.
  • Experience in using modern data architectures, such as lakehouse.
  • Experience with CI/CD pipelines and version control systems like Git.
  • Knowledge of ETL tools and technologies such as Apache Airflow, Informatica, or Talend.
  • Knowledge of data governance and best practices in data management.
  • Familiarity with cloud platforms and services such as AWS, Azure, or GCP for deploying and managing data solutions.
  • Strong problem-solving and analytical skills with the ability to diagnose and resolve complex data-related issues.
  • SQL (for database management and querying)
  • Apache Spark (for distributed data processing)
  • Apache Spark Streaming, Kafka or similar (for real-time data streaming)
  • Experience using data tools in at least one cloud service - AWS, Azure or GCP (e.g. S3, EMR, Redshift, Glue, Azure Data Factory, Databricks, BigQuery, Dataflow, Dataproc

Would you like to join us as we work hard, have fun and make history?

Apply for this job

*

indicates a required field

First Name *

Last Name *

Preferred First Name

Email *

Phone

Resume/CV *

Enter manually

Accepted file types: pdf, doc, docx, txt, rtf

Enter manually

Accepted file types: pdf, doc, docx, txt, rtf

What interests and excites you about joining Sand? *

Where are you currently located? *

What are your gross annual salary expectations (in USD)? * Select...

When would you be able to join us? *

How did you hear about the role? * Select...

If you selected other, Sand Staff or Media, please specify *


#J-18808-Ljbffr

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Seasonal Hiring Peaks for Machine Learning Jobs: The Best Months to Apply & Why

The UK's machine learning sector has evolved into one of Europe's most intellectually stimulating and financially rewarding technology markets, with roles spanning from junior ML engineers to principal machine learning scientists and heads of artificial intelligence research. With machine learning positions commanding salaries from £32,000 for graduate ML engineers to £160,000+ for senior principal scientists, understanding when organisations actively recruit can dramatically accelerate your career progression in this pioneering and rapidly evolving field. Unlike traditional software engineering roles, machine learning hiring follows distinct patterns influenced by AI research cycles, model development timelines, and algorithmic innovation schedules. The sector's unique combination of mathematical rigour, computational complexity, and real-world application requirements creates predictable hiring windows that strategic professionals can leverage to advance their careers in developing tomorrow's intelligent systems. This comprehensive guide explores the optimal timing for machine learning job applications in the UK, examining how enterprise AI strategies, academic research cycles, and deep learning initiatives influence recruitment patterns, and why strategic timing can determine whether you join a groundbreaking AI research team or miss the opportunity to develop the next generation of machine learning algorithms.

Pre-Employment Checks for Machine Learning Jobs: DBS, References & Right-to-Work and more Explained

Pre-employment screening in machine learning reflects the discipline's unique position at the intersection of artificial intelligence research, algorithmic decision-making, and transformative business automation. Machine learning professionals often have privileged access to proprietary datasets, cutting-edge algorithms, and strategic AI systems that form the foundation of organizational competitive advantage and automated decision-making capabilities. The machine learning industry operates within complex regulatory frameworks spanning AI governance directives, algorithmic accountability requirements, and emerging ML ethics regulations. Machine learning specialists must demonstrate not only technical competence in model development and deployment but also deep understanding of algorithmic fairness, AI safety principles, and the societal implications of automated decision-making at scale. Modern machine learning roles frequently involve developing systems that impact hiring decisions, financial services, healthcare diagnostics, and autonomous operations across multiple regulatory jurisdictions and ethical frameworks simultaneously. The combination of algorithmic influence, predictive capabilities, and automated decision-making authority makes thorough candidate verification essential for maintaining compliance, fairness, and public trust in AI-powered systems.

Why Now Is the Perfect Time to Launch Your Career in Machine Learning: The UK's Intelligence Revolution

The United Kingdom stands at the epicentre of a machine learning revolution that's fundamentally transforming how we solve problems, deliver services, and unlock insights from data at unprecedented scale. From the AI-powered diagnostic systems revolutionising healthcare in Manchester to the algorithmic trading platforms driving London's financial markets, Britain's embrace of intelligent systems has created an extraordinary demand for skilled machine learning professionals that dramatically exceeds the current talent supply. If you've been seeking a career at the forefront of technological innovation or looking to position yourself in one of the most impactful sectors of the digital economy, machine learning represents an exceptional opportunity. The convergence of abundant data availability, computational power accessibility, advanced algorithmic development, and enterprise AI adoption has created perfect conditions for machine learning career success.