Overview
This role presents an opportunity to engage deeply with MLOps, vector databases, and Retrieval-Augmented Generation (RAG) pipelines - skills that are in incredibly high demand. If you are passionate about shaping the future of AI and thrive on complex, high-impact challenges, we encourage you to apply.
Responsibilities
- Design and Build Scalable Data Pipelines: Architect, implement, and optimize robust, high-performance real-time and batch ETL pipelines to ingest, process, and transform massive datasets for LLMs and foundational AI models.
- Cloud-Native Innovation: Leverage your deep expertise across AWS, Azure, and/or GCP to build cloud-native data solutions, ensuring efficiency, scalability, and cost-effectiveness.
- Power Generative AI: Develop and manage specialized data flows for generative AI applications, including integrating with vector databases and constructing sophisticated RAG pipelines.
- Champion Data Governance & Ethical AI: Implement best practices for data quality, lineage, privacy, and security, ensuring our AI systems are developed and used responsibly and ethically.
- Tooling the Future: Get hands-on with cutting-edge technologies like Hugging Face, PyTorch, TensorFlow, Apache Spark, Apache Airflow, and other modern data and ML frameworks.
- Collaborate and Lead: Partner closely with ML Engineers, Data Scientists, and Researchers to understand their data needs, provide technical leadership, and translate complex requirements into actionable data strategies.
- Optimize and Operate: Monitor, troubleshoot, and continuously optimize data pipelines and infrastructure for peak performance and reliability in production environments.
What You'll Bring
We are seeking a seasoned professional who is excited by the unique challenges of AI data.
Qualifications
What are we looking for?
Must-Have Skills
- Extensive Data Engineering Experience: 3+ years designing, building, and maintaining large-scale data pipelines and data warehousing solutions.
- Cloud Platform Mastery: Expert-level proficiency with at least one major cloud provider (GCP preferred; AWS or Azure also considered), including its data, compute, and storage services.
- Programming Prowess: Strong programming skills in Python and SQL.
- Big Data Ecosystem Expertise: Hands-on experience with Apache Spark, Kafka, and data orchestration tools such as Apache Airflow or Prefect.
- ML Data Acumen: Solid understanding of data requirements for machine learning models, including feature engineering, data validation, and dataset versioning.
- Vector Database Experience: Practical experience with vector databases (e.g., Pinecone, Milvus, Chroma) for embedding storage and retrieval.
- Generative AI Familiarity: Understanding of data paradigms for LLMs, RAG architectures, and how data pipelines support fine-tuning or pre-training.
- MLOps Principles: Familiarity with MLOps best practices for deploying and managing ML models in production.
- Data Governance & Ethics: Experience implementing data governance frameworks, ensuring data quality, privacy, and compliance, with awareness of ethical AI considerations.
Bonus Points If You Have
- Direct experience with the Hugging Face ecosystem, PyTorch, or TensorFlow for data preparation in an ML context.
- Experience with real-time data streaming architectures.
- Familiarity with containerization (Docker, Kubernetes).
- Master's or Ph.D. in Computer Science, Data Engineering, or a related quantitative field.
Additional Information
Starcom has fantastic benefits on offer to all of our employees. In addition to the classics (Pension, Life Assurance, Private Medical and Income Protection Plans), we also offer:
- WORK YOUR WORLD - Opportunity to work anywhere in the world where there is a Publicis office, for up to 6 weeks a year.
- REFLECTION DAYS - Two additional days of paid leave to step away from your usual day-to-day work and create time to focus on your well-being and self-care.
- HELP@HAND BENEFITS - 24/7 helpline to support you on a personal and professional level. Access to remote GPs, mental health support and CBT. Wellbeing content and lifestyle coaching.
- FAMILY FRIENDLY POLICIES - We provide 26 weeks of full pay for the following family milestones: Maternity, Adoption, Surrogacy and Shared Parental Leave.
- FLEXIBLE WORKING, BANK HOLIDAY SWAP & BIRTHDAY DAY OFF - You are entitled to an additional day off for your birthday, from your first day of employment.
- GREAT LOCAL DISCOUNTS - This includes membership discounts with Soho Friends, local restaurants and retailers in Westfield White City and Television Centre.
Full details of our benefits will be shared when you join us.
Publicis Groupe operates a hybrid working pattern, with full-time employees office-based three days a week.
We are supportive of all candidates and are committed to providing a fair assessment process. If you have any circumstances (such as neurodiversity, physical or mental impairments or a medical condition) that may affect your assessment, please inform your Talent Acquisition Partner. We will discuss possible adjustments to ensure fairness. Rest assured, disclosing this information will not impact your treatment in our process.
Please check out the Publicis Career Page, which showcases our Inclusive Benefits and our EAGs (Employee Action Groups).