Director, Data Architecture

GlaxoSmithKline
Cambridge
1 month ago
Applications closed

Related Jobs

View all jobs

Director of Data Science & AI – Global Manufacturing Transformation

Director of Artificial Intelligence - Manufacturing & Industrial

Principal Data Scientist

Lead Machine Learning Engineer, Associate Director, London

Senior Data Engineer

Director of Data Science & Analytics

The Onyx Research Data Tech organization is GSK’s Research data ecosystem which has the capability to bring together, analyze, and power the exploration of data at scale. We partner with scientists across GSK to define and understand their challenges and develop tailored solutions that meet their needs. The goal is to ensure scientists have the right data and insights when they need it to give them a better starting point for and accelerate medical discovery. Ultimately, this helps us get ahead of disease in more predictive and powerful ways.

Onyx is a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data / metadata / knowledge platforms, and AI/ML and analysis platforms, all geared toward:

  • Building a next-generation, metadata- and automation-driven data experience for GSK’s scientists, engineers, and decision-makers, increasing productivity and reducing time spent on “data mechanics”
  • Providing best-in-class AI/ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent
  • Aggressively engineering our data at scale, as one unified asset, to unlock the value of our unique collection of data and predictions in real-time

The Onyx Data Architecture team sits within the Data Engineering team, which is responsible for the design, delivery, support, and maintenance of industrialized automated end-to-end data services and pipelines. They apply standardized data models and mapping to ensure data is accessible for end users in end-to-end user tools through use of APIs. They define and embed best practices and ensure compliance with Quality Management practices and alignment to automated data governance. They also acquire and process internal and external, structured and unstructured data in line with Product requirements.

This role is responsible for building and leading a scrum team of world-class data architects focused on creating the data ecosystem of Onyx. They support the head of Data Engineering in building a strong culture of accountability and ownership in their team, as well as developing innovative data solutions to ensure the data is integrated into our platforms and downstream applications. They work in close partnership with the rest of data engineering teams to ensure we are following the best engineering standards and practices, and with our customers and product teams to ensure the use of appropriate schemas, vocabularies, and ontologies.

Key Responsibilities

  • Lead a team of data architects in delivering robust data and knowledge products that advance GSK R&D
  • Experience influencing, motivating and aligning others towards common technical decisions
  • Experience with modern data architecture approaches and data management capabilities such as metadata management, lineage, data governance, data discovery, and data quality
  • Define and implement data modelling and design principles, ensuring data structures are optimized for performance, scalability, and data integrity
  • Partner closely with other data engineering leads to conceptualize the design of new data flows aimed at maximizing reuse and aligning with an event-driven microservice enabled architecture
  • Partner with data engineering leads, stakeholders and product team to develop data standards and best practices to deliver seamless data flow and interoperability across systems and applications
  • Evaluate and select, deploy and use modern data architecture and modelling technologies that align with GSK’s data strategy & technology stack
  • Exemplar and strategic leader that can envision the future of data architecture in GSK

Why You?Basic Qualifications:

We are looking for professionals with these required skills to achieve our goals:

  • Bachelor’s degree in Data Engineering, Computer Science, Software Engineering or related field.
  • 8+ years of relevant work experience
  • 5+ years of people management experience
  • Cloud experience (e.g., AWS, Google Cloud, Azure, Kubernetes)
  • Experience in leading teams and fostering collaboration to deliver data architecture solutions
  • Knowledge of Data architecture principles, data modelling techniques and data integration

Preferred Qualifications:

  • Deep knowledge and use of at least one common programming language: e.g., Python, Scala, Java
  • Expertise in data modelling, database concepts and SQL
  • Experience with agile software development environments using tools like Jira and Confluence
  • Strong understanding of data architecture principles and practices such as data as a product, Data as a Service, Data Mesh, Data Fabric, Data Virtualization, etc.
  • Experience in data governance, data security and privacy relevant to the industry
  • Experience with common big data tools (e.g., Spark, Kafka, Storm, …) a plus

J-18808-Ljbffr

Get the latest insights and jobs direct. Sign up for our newsletter.

By subscribing you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Portfolio Projects That Get You Hired for Machine Learning Jobs (With Real GitHub Examples)

In today’s data-driven landscape, the field of machine learning (ML) is one of the most sought-after career paths. From startups to multinational enterprises, organisations are on the lookout for professionals who can develop and deploy ML models that drive impactful decisions. Whether you’re an aspiring data scientist, a seasoned researcher, or a machine learning engineer, one element can truly make your CV shine: a compelling portfolio. While your CV and cover letter detail your educational background and professional experiences, a portfolio reveals your practical know-how. The code you share, the projects you build, and your problem-solving process all help prospective employers ascertain if you’re the right fit for their team. But what kinds of portfolio projects stand out, and how can you showcase them effectively? This article provides the answers. We’ll look at: Why a machine learning portfolio is critical for impressing recruiters. How to select appropriate ML projects for your target roles. Inspirational GitHub examples that exemplify strong project structure and presentation. Tangible project ideas you can start immediately, from predictive modelling to computer vision. Best practices for showcasing your work on GitHub, personal websites, and beyond. Finally, we’ll share how you can leverage these projects to unlock opportunities—plus a handy link to upload your CV on Machine Learning Jobs when you’re ready to apply. Get ready to build a portfolio that underscores your skill set and positions you for the ML role you’ve been dreaming of!

Machine Learning Job Interview Warm‑Up: 30 Real Coding & System‑Design Questions

Machine learning is fuelling innovation across every industry, from healthcare to retail to financial services. As organisations look to harness large datasets and predictive algorithms to gain competitive advantages, the demand for skilled ML professionals continues to soar. Whether you’re aiming for a machine learning engineer role or a research scientist position, strong interview performance can open doors to dynamic projects and fulfilling careers. However, machine learning interviews differ from standard software engineering ones. Beyond coding proficiency, you’ll be tested on algorithms, mathematics, data manipulation, and applied problem-solving skills. Employers also expect you to discuss how to deploy models in production and maintain them effectively—touching on MLOps or advanced system design for scaling model inferences. In this guide, we’ve compiled 30 real coding & system‑design questions you might face in a machine learning job interview. From linear regression to distributed training strategies, these questions aim to test your depth of knowledge and practical know‑how. And if you’re ready to find your next ML opportunity in the UK, head to www.machinelearningjobs.co.uk—a prime location for the latest machine learning vacancies. Let’s dive in and gear up for success in your forthcoming interviews.

Negotiating Your Machine Learning Job Offer: Equity, Bonuses & Perks Explained

How to Secure a Compensation Package That Matches Your Technical Mastery and Strategic Influence in the UK’s ML Landscape Machine learning (ML) has rapidly shifted from an emerging discipline to a mission-critical function in modern enterprises. From optimising e-commerce recommendations to powering autonomous vehicles and driving innovation in healthcare, ML experts hold the keys to transformative outcomes. As a mid‑senior professional in this field, you’re not only crafting sophisticated algorithms; you’re often guiding strategic decisions about data pipelines, model deployment, and product direction. With such a powerful impact on business results, companies across the UK are going beyond standard salary structures to attract top ML talent. Negotiating a compensation package that truly reflects your value means looking beyond the numbers on your monthly payslip. In addition to a competitive base salary, you could be securing equity, performance-based bonuses, and perks that support your ongoing research, development, and growth. However, many mid‑senior ML professionals leave these additional benefits on the table—either because they’re unsure how to negotiate them or they simply underestimate their long-term worth. This guide explores every critical aspect of negotiating a machine learning job offer. Whether you’re joining an AI-focused start-up or a major tech player expanding its ML capabilities, understanding equity structures, bonus schemes, and strategic perks will help you lock in a package that matches your technical expertise and strategic influence. Let’s dive in.