Geospatial Data Engineer

TransitionZero
London
1 month ago
Create job alert

About Us

We are a climate analytics nonprofit established in 2021. We provide system modelling data, software and analysis to support energy transition planning and decision-making. We are grant-funded by the Quadrature Climate Foundation, Google.org, Sequoia Climate Foundation, Bloomberg Philanthropies, European Climate Foundation, among others. Our data, software and analysis is used by developers, financiers, planners and think tanks internationally.

At TransitionZero, we understand that diversity is an essential component of a successful team, whether that be diverse ways of thinking, personal or professional backgrounds and skills. We aim to be the most talented nonprofit in our vertical, spearheading innovative data solutions through our culture of inclusivity and adaptability. Now we are looking for an experienced Data Engineer. 

Our people are our greatest asset, and the diverse experience, skills and perspectives individuals bring to our organisation are the driving force of our success. We will therefore consider all qualified applicants in the recruitment process as we welcome all the unique qualities and experiences that make you, you.

About the Role

Heavy industry activity contributes around a quarter of global anthropogenic emissions, yet unlike the power sector, data remains sparse and inaccurate. TransitionZero is attempting to address this problem as a member of Climate TRACE, a coalition of non-profits founded by former U.S. Vice President Al Gore, which offers a global, independent inventory of greenhouse gas emissions. We use earth observation methods to estimate productivity and emissions from heavy industry facilities in regions where data is unavailable, untimely or unreliable. TransitionZero has been a core member of Climate TRACE since its inception in 2021, providing emissions estimates for the steel, cement, aluminium, pulp & paper, and chemicals subsectors. 

We are looking for a Data Engineer to develop our remote sensing tooling, as well as assist with an exciting new project exploring potential pathways to reduce steel industry emissions using an open-source energy system modelling package. You will design, develop, maintain, test and optimise scalable data pipelines to populate our Data Warehouse with model data. As you will be working closely with remote sensing data and pipelines, prior experience of geospatial technologies is highly beneficial, in addition to cloud-based architecture (our tools are deployed in GCP) and github version control. You will work with data across all of the listed sectors, with a particular focus on steel and cement.

The successful candidate will work in collaboration with members of the Data, Software and Analysis teams under the supervision of the grant lead. 

Responsibilities

The responsibilities of the successful candidate will include:

  • Process, clean, and integrate remote sensing data into our Data Warehouse

  • Design, develop, and maintain non-geospatial data pipelines (geospatial and non-geospatial e.g. heavy industry demand projections, production capacity, emissions factors, technology costs, raw material supply sources, commodity prices etc.) 

  • Work with GIS technologies to analyse and visualise geospatial data.

  • Deploy automated solutions on Google Cloud Platform

  • Ensure data quality while handling large-scale (geospatial) datasets by engineering a suite of validation checks and analysis tooling on our outputs

  • Optimize performance of spatial queries and GIS services for real-time and batch processing applications

  • Productionise geospatial visualisation tooling for use in external-focussed communications

  • Prepare monthly data releases

About You

You are a Python-based data engineer with a particular interest in geospatial data/tooling and a passion for the energy transition. You are interested in using your development skills to design and build an infrastructure to improve global heavy industry data. You are curious and open to learning domain knowledge about heavy industry subsectors and the wider energy system. You take a collaborative approach to your work, make time for others, and promote a no-blame culture when things break. You take pride in writing clean, well-tested code, and documenting and sharing your work with others. You are interested in continued learning and growth, are open to asking questions, and will bring your ideas to our company. 

We welcome people with a proven ability to juggle multiple projects at a time and to collaborate across teams, some of whom are based outside of the UK. 

While we are a rapidly growing tech startup, we are also a non-profit, so a high level of adaptability and willingness to approach challenges with creativity and curiosity is essential.

Skills & Experience

Essential:

  • Interest in geospatial data, processing packages and software

  • Degree-level qualifications in software engineering or STEM field (or equivalent professional experience), with up to 3 years of software development experience 

  • Experience with Python-based development including scripting, data retrieval, and data manipulation (Pandas, Numpy, etc.)

  • Experience with SQL (for interacting with our Data Warehouse in BigQuery)

  • Experience with containerisation (e.g. Docker) and pipeline orchestration (e.g. Airflow, GCP Batch/Jobs)

  • Experience with Github-based version control

  • Attention to detail for writing clean, well-documented, well-tested code

  • Collaborative team player with good communication skills (we work in Agile)

  • Proactivity and ability to self-organise and think outside the box

  • An interest in sustainability and a desire to work in a start-up environment

We’d love to see:

  • Experience using GIS or other geospatial data, processing packages and software

  • Domain-relevant knowledge, postgraduate education, and/or experience, e.g. energy systems engineering, earth observation, environmental science, meteorology, climate science, material science, mining, industry decarbonisation

  • Experience with CloudOps, serverless deployment, and continuous integration/deployment

  • Data modelling, data architecture, schema design

  • Experience using Google Cloud Platform (e.g. BigQuery, DataFlow, CloudBuild, CloudRun)

  • Experience with scientific computing (e.g. machine learning, numerical optimisation)

Salary & Benefits

  • Competitive salary based on experience

  • Enhanced pension scheme (5% employer contribution)

  • 25 days annual leave (excluding UK public holidays) and an additional day off on your birthday

  • 20 days annual allowance to work from anywhere in the world

  • Hybrid working and core working hours model

  • Allowance to set up your home office

  • Annual budget and dedicated leave time for relevant training courses

  • Enhanced gender-neutral parental leave (4 months full paid)

  • Private healthcare following successful completion of the probation period

  • Yearly team offsite

Our Commitment to Diversity, Equity, & Inclusion

"Studieshave shown that some people from marginalised or underrepresented groups are less likely to apply to a role unless they meet all of the hiring guidelines or qualifications.  Whoever you are, even if you don't meet all of the criteria, if you can demonstrate a variety of skills and experience relevant to this role, we encourage you to apply as you might just be the candidate we hire! At TransitionZero, we're looking for people who are genuinely passionate about what they do, and we welcome all people, regardless of their background.

If you would like to discuss any reasonable adjustments to the application or hiring process that may better facilitate your participation, please contact our People team  () for an informal chat. We will make every effort to respond to your request for assistance as soon as possible.

Application Process

We are only accepting direct applications for this role, we will review applications on a rolling basis and may close the role earlier than the anticipated close date (31/03/25) so we encourage you to send your application as soon as possible.

About us

We are a climate analytics nonprofit established in 2021. We provide system modelling data, software and analysis to support energy transition planning and decision-making. We are funded by the Quadrature Climate Foundation, Google.org, Sequoia Climate Foundation, Bloomberg Philanthropies, European Climate Foundation, among others. Our data, software and analysis is used by developers, financiers, planners and think tanks internationally.

Related Jobs

View all jobs

GIS Data Engineer

Principal Data Scientist and Senior Data Engineer

Software Developer for GCS

Python Engineer / Data Scientist

▷ (3 Days Left) Drone Pilot for AI Training and DataCollection

Data Scientist

Get the latest insights and jobs direct. Sign up for our newsletter.

By subscribing you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Machine‑Learning Jobs for Non‑Technical Professionals: Where Do You Fit In?

The Model Needs More Than Math When ChatGPT went viral and London start‑ups raised seed rounds around “foundation models,” many professionals asked, “Do I need to learn PyTorch to work in machine learning?” The answer is no. According to the Turing Institute’s UK ML Industry Survey 2024, 39 % of advertised ML roles focus on strategy, compliance, product or operations rather than writing code. As models move from proof‑of‑concept to production, demand surges for specialists who translate algorithms into business value, manage risk and drive adoption. This guide reveals the fastest‑growing non‑coding ML roles, the transferable skills you may already have, real transition stories and a 90‑day action plan—no gradient descent necessary.

Quantexa Machine‑Learning Jobs in 2025: Your Complete UK Guide to Joining the Decision‑Intelligence Revolution

Money‑laundering rings, sanctioned entities, synthetic identities—complex risks hide in plain sight inside data. Quantexa, a London‑born scale‑up now valued at US $2.2 bn (Series F, August 2024), solves that problem with contextual decision‑intelligence (DI): graph analytics, entity resolution and machine learning stitched into a single platform. Banks, insurers, telecoms and governments from HSBC to HMRC use Quantexa to spot fraud, combat financial crime and optimise customer engagement. With the launch of Quantexa AI Studio in February 2025—bringing generative AI co‑pilots and large‑scale Graph Neural Networks (GNNs) to the platform—the company is hiring at record pace. The Quantexa careers portal lists 450+ open roles worldwide, over 220 in the UK across data science, software engineering, ML Ops and client delivery. Whether you are a graduate data scientist fluent in Python, a Scala veteran who loves Spark or a solutions architect who can turn messy data into knowledge graphs, this guide explains how to land a Quantexa machine‑learning job in 2025.

Machine Learning vs. Deep Learning vs. MLOps Jobs: Which Path Should You Choose?

Machine Learning (ML) continues to transform how businesses operate, from personalised product recommendations to automated fraud detection. As ML adoption accelerates in nearly every industry—finance, healthcare, retail, automotive, and beyond—the demand for professionals with specialised ML skills is surging. Yet as you browse Machine Learning jobs on www.machinelearningjobs.co.uk, you may encounter multiple sub-disciplines, such as Deep Learning and MLOps. Each of these fields offers unique challenges, requires a distinct skill set, and can lead to a rewarding career path. So how do Machine Learning, Deep Learning, and MLOps differ? And which area best aligns with your talents and aspirations? This comprehensive guide will define each field, highlight overlaps and differences, discuss salary ranges and typical responsibilities, and explore real-world examples. By the end, you’ll have a clearer vision of which career track suits you—whether you prefer building foundational ML models, pushing the boundaries of neural network performance, or orchestrating robust ML pipelines at scale.