Engineer the Quantum RevolutionYour expertise can help us shape the future of quantum computing at Oxford Ionics.

View Open Roles

Principal Data Engineer

Atorus Research
London
1 month ago
Create job alert

Principal Data Engineer
full-time
remote from anywhere in the UK #LI-Remote

Description:
The Principal will be responsible for supporting complex or leading singular projects related to data engineering requirements and initiatives across Research and Development. The Principal will support data projects from across the business including Clinical, Pre-Clinical, Non-Clinical, Chemistry, RWD and Omics.

Essential Functions:
• Support the design, development and maintenance of data pipelines for processing Research and Development data from diverse sources (Clinical Trials, Medical Devices, Pre-Clinical, Omics, Real World Data) utilizing the AWS technology platform.
• Create and optimize ETL/ELT processes for structured and unstructured data using Python, R, SQL, AWS services and other tools.
• Build and maintain data repositories using AWS S3 and FSx technologies. Establish data warehousing solutions using Amazon Redshift.
• Build and maintain standard data models.
• Develop data quality frameworks, validation processes and KPIs to ensure accuracy and consistency of data pipelines.
• Implement data versioning and lineage tracking to support data traceability, regulatory compliance and audit requirements.
• Create and maintain documentation for data processes, architectures, and workflows.
• Implement modern software development best practices (e.g. Code Versioning, DevOps, CD/CI).
• Maintain compliance with data privacy regulations such as HIPAA, GDP
• May be required to develop, deliver or support data literacy training across R&D.

Required Knowledge, Skills and Abilities:
• Strong knowledge of data engineering tools such as Python, R and SQL for data processing.
• Strong proficiency with AWS services particularly S3, Redshift, FSx, Glue, Lambda.
• Strong proficiency with relational databases.
• Strong background in data modeling and database design.
• Familiarity with unstructured database technologies (e.g. NoSQL) and other database types (e.g. Graph).
• Familiarity with Containerization such as Docker and EKS/Kubernetes.
• Familiarity with one or more RnD research process and associated regulatory requirements.
• Exposure to healthcare data standards (CDISC, HL7, FHIR, SNOMED CT, OMOP, DICOM).
• Exposure to big data technologies and handling.
• Knowledge of machine learning operations (MLOps) and model deployment.
• Strong problem-solving and analytical abilities.
• Excellent communication and collaboration skills.
• Experience working in an Agile development environment.

Minimum Requirements:
• Bachelor's Degree in Computer Science, Statistics, Mathematics, Life Sciences, or other relevant scientific fields; Master's Degree preferred
• 3-5 years of experience in data engineering, with at least 1.5 years focusing on healthcare, research or clinical related data
#J-18808-Ljbffr

Related Jobs

View all jobs

Principal Data Engineer - Azure Databricks (Unity Catalog)

Principal Data Engineer - Azure Databricks (Unity Catalog) - Contract

Principal Data Engineer

Principal Data Engineer – Azure Databricks (Unity Catalog) - Contract

Principal Data Engineer - Azure Databricks (Unity Catalog) - Contract

Principal Data Engineer - Azure Databricks (Unity Catalog)

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Pre-Employment Checks for Machine Learning Jobs: DBS, References & Right-to-Work and more Explained

Pre-employment screening in machine learning reflects the discipline's unique position at the intersection of artificial intelligence research, algorithmic decision-making, and transformative business automation. Machine learning professionals often have privileged access to proprietary datasets, cutting-edge algorithms, and strategic AI systems that form the foundation of organizational competitive advantage and automated decision-making capabilities. The machine learning industry operates within complex regulatory frameworks spanning AI governance directives, algorithmic accountability requirements, and emerging ML ethics regulations. Machine learning specialists must demonstrate not only technical competence in model development and deployment but also deep understanding of algorithmic fairness, AI safety principles, and the societal implications of automated decision-making at scale. Modern machine learning roles frequently involve developing systems that impact hiring decisions, financial services, healthcare diagnostics, and autonomous operations across multiple regulatory jurisdictions and ethical frameworks simultaneously. The combination of algorithmic influence, predictive capabilities, and automated decision-making authority makes thorough candidate verification essential for maintaining compliance, fairness, and public trust in AI-powered systems.

Why Now Is the Perfect Time to Launch Your Career in Machine Learning: The UK's Intelligence Revolution

The United Kingdom stands at the epicentre of a machine learning revolution that's fundamentally transforming how we solve problems, deliver services, and unlock insights from data at unprecedented scale. From the AI-powered diagnostic systems revolutionising healthcare in Manchester to the algorithmic trading platforms driving London's financial markets, Britain's embrace of intelligent systems has created an extraordinary demand for skilled machine learning professionals that dramatically exceeds the current talent supply. If you've been seeking a career at the forefront of technological innovation or looking to position yourself in one of the most impactful sectors of the digital economy, machine learning represents an exceptional opportunity. The convergence of abundant data availability, computational power accessibility, advanced algorithmic development, and enterprise AI adoption has created perfect conditions for machine learning career success.

Automate Your Machine Learning Jobs Search: Using ChatGPT, RSS & Alerts to Save Hours Each Week

ML jobs are everywhere—product companies, labs, consultancies, fintech, healthtech, robotics—often hidden in ATS portals or duplicated across boards. The fastest way to stay on top of them isn’t more scrolling; it’s automation. With keyword-rich alerts, RSS feeds, and a reusable ChatGPT workflow, you can bring relevant roles to you, triage them in minutes, and tailor strong applications without burning your evenings. This is a copy-paste playbook for www.machinelearningjobs.co.uk readers. It’s UK-centric, practical, and designed to save you hours each week. What You’ll Have Working In 30 Minutes A role & keyword map spanning LLM/NLP, Vision, Core ML, Recommenders, MLOps/Platform, Research/Applied Science, and Edge/Inference optimisation. Shareable Boolean searches you can paste into Google & job boards to cut noise. Always-on alerts & RSS feeds delivering fresh roles to your inbox/reader. A ChatGPT “ML Job Scout” prompt that deduplicates, scores fit, and outputs tailored actions. A lightweight pipeline tracker so deadlines and follow-ups never slip.