ML Engineer (LLM)

OhChat
London
2 months ago
Applications closed

Related Jobs

View all jobs

AI Engineer - Jan 25

Machine Learning Engineer

Founding AI Engineer

ML/AI Software Engineer

Lead Machine Learning Engineer

Machine Learning Engineer

Machine Learning / LLM Engineer


About Oh:

Oh is pioneeringhyper-realistic, uncensored AI-driven content, building a full-spectrum ecosystem of multimodal AI products. Our platform powerslifelike digital twins and AI charactersacross text, voice, and images.

With a mission to become theOpenAI of the spicy content industry, we iterate fast, push boundaries, and deploycutting-edge, real-time conversational AI experiences at scale.


The Role:

Our platform integrates a variety of multimodal GenAI models. You willown the technical roadmapandfull lifecycleof ourlarge language models, most notably our flagshipLlama 3.1 70Band otheropen-source models.


Your responsibilities will include:

  • Fine-tuningwithcustom and synthetic datasets
  • Deploying on GPU platformsto ensurelow-latency, cost-efficient, and safereal-time interactions
  • Driving multimodal expansion—integratingtext, voice, and image capabilities
  • Embedding robust safety and compliance measures
  • Keeping on top of recent development in the field and auditing new models for a wide range of purposes (e.g. conversational AI, intent classification, AI agents life planner)

 

Key Responsibilities:

LLM Fine-Tuning & Optimization

  • Fine-tune and optimize models (Llama 3.1 70B, GPT-based, Mistral, etc.) usingdomain-specific and synthetic datasets
  • Enhanceaccuracy, reduce hallucinations, and improve alignment with user intent


Deployment & Infrastructure Management

  • Deployscalable, memory-efficient modelsonGPU-based platforms(Runpod, AWS, Kubernetes clusters)
  • Optimize GPU inference withTorch,CUDA, TensorRT, vLLM, and DeepSpeed


Multimodal & Cross-Model Integration

  • Integrateadditional open-source modelsto enableimage prompt generation, voice synthesis, and dynamic character personalization
  • Expand multimodal AI capabilities (e.g. improve LLava-based vision models)

Data Pipeline & Evaluation

  • Designrobust data pipelinesforcuration, cleaning, synthetic data generation, and versioning(DVC)
  • Implementevaluation metrics and continuous monitoringto ensure model quality


Real-Time Performance & System Optimization

  • Ensurelow-latency, real-time performanceusingmixed-precision training, quantization, pruning, and distillation techniques


Safety, Moderation & Compliance

  • Embedrobust safety, content moderation, and ethical AI frameworksto comply withGDPR and industry standards
  • Developcustom token filters and controlled response mechanisms


Monitoring, Diagnostics & Cost Management

  • Set up and maintainmonitoring tools(Prometheus, Grafana, TensorBoard, Weights & Biases, Sentry) forperformance tracking and cost optimization


 

Technical Skills & Requirements:

Experience:

  • 5+ yearsinmachine learning engineering, NLP, or AI researchwith deep expertise inTransformer-based LLMs


Programming & Frameworks:

  • Strong proficiency inPythonand Bash scripting
  • Hands-on experience withPyTorch,HuggingFacelibraries (Transformers, Diffusers, PEFT, Accelerate), and the common ML toolkit (e.g. SKLearn, Pandas, Numpy)
  • Familiarity withJAX/TensorFlowis a plus

LLM Specialization:

  • Proven expertise infine-tuning LLMsusing techniques likeLoRA, QLoRA, PEFT, RLHF, and prompt engineering


GPU & Inference Optimization:

  • Experience with common inference speed optimisation and model quantization techniques.


Deployment & Orchestration:

  • Skilled incontainerization (Docker) and orchestration (Kubernetes)for scalable ML deployments
  • Experience with major MLOps frameworks (MLFlow / KubeFlow) preferred


Data Handling:

  • Proficient indata wrangling and preprocessing(Pandas,Dask)
  • Experience managinglarge-scale datasetsusing AWS (S3,RedShift,EC2)
  • Knowledge of data QC and monitoring tools (DVC,Great Expectations)

Additional Knowledge:

  • Understanding ofretrieval-augmented generation (RAG) techniques
  • Familiarity withvector databases(FAISS, Pinecone, Weaviate)

 

Preferred Qualifications:

✅ Experience integrating and optimizingmultimodal models(text, voice, image, video)

✅ Background inAI-driven gaming, digital experiences, or adult content

✅ Familiarity withCI/CD pipelines(GitLab CI, Jenkins) for ML workflows

✅ Interest or experience incrypto, Web3, or NFT-based AI models

✅ Prior exposure toAI governance, safety, or ethical AI frameworks

 

What We Offer:

Competitive Compensation:

  • Attractivesalary, benefits, and equity participation


Remote & Flexible:

  • Remote-first work environmentwithflexible hours


Growth & Leadership:

  • Rapidcareer advancementand the opportunity toshape our AI strategy


Innovative Culture:

  • Join afast-pacedteam at the forefront ofadvanced, uncensored AI applications


 

If you’re passionate aboutpushing the boundaries of AI-driven experiencesand have a track record indeveloping, deploying, and optimizing cutting-edge LLMs, we want to hear from you!

Get the latest insights and jobs direct. Sign up for our newsletter.

By subscribing you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Contract vs Permanent Machine Learning Jobs: Which Pays Better in 2025?

Machine learning (ML) has swiftly become one of the most transformative forces in the UK technology landscape. From conversational AI and autonomous vehicles to fraud detection and personalised recommendations, ML algorithms are reshaping how organisations operate and how consumers experience products and services. In response, job opportunities in machine learning—including roles in data science, MLOps, natural language processing (NLP), computer vision, and more—have risen dramatically. Yet, as the demand for ML expertise booms, professionals face a pivotal choice about how they want to work. Some choose day‑rate contracting, leveraging short-term projects for potentially higher immediate pay. Others embrace fixed-term contract (FTC) roles for mid-range stability, or permanent positions for comprehensive benefits and a well-defined career path. In this article, we will explore these different employment models, highlighting the pros and cons of each, offering sample take‑home pay scenarios, and providing insights into which path might pay better in 2025. Whether you’re a new graduate with a machine learning degree or an experienced practitioner pivoting into an ML-heavy role, understanding these options is key to making informed career decisions.

Machine‑Learning Jobs for Non‑Technical Professionals: Where Do You Fit In?

The Model Needs More Than Math When ChatGPT went viral and London start‑ups raised seed rounds around “foundation models,” many professionals asked, “Do I need to learn PyTorch to work in machine learning?” The answer is no. According to the Turing Institute’s UK ML Industry Survey 2024, 39 % of advertised ML roles focus on strategy, compliance, product or operations rather than writing code. As models move from proof‑of‑concept to production, demand surges for specialists who translate algorithms into business value, manage risk and drive adoption. This guide reveals the fastest‑growing non‑coding ML roles, the transferable skills you may already have, real transition stories and a 90‑day action plan—no gradient descent necessary.

Quantexa Machine‑Learning Jobs in 2025: Your Complete UK Guide to Joining the Decision‑Intelligence Revolution

Money‑laundering rings, sanctioned entities, synthetic identities—complex risks hide in plain sight inside data. Quantexa, a London‑born scale‑up now valued at US $2.2 bn (Series F, August 2024), solves that problem with contextual decision‑intelligence (DI): graph analytics, entity resolution and machine learning stitched into a single platform. Banks, insurers, telecoms and governments from HSBC to HMRC use Quantexa to spot fraud, combat financial crime and optimise customer engagement. With the launch of Quantexa AI Studio in February 2025—bringing generative AI co‑pilots and large‑scale Graph Neural Networks (GNNs) to the platform—the company is hiring at record pace. The Quantexa careers portal lists 450+ open roles worldwide, over 220 in the UK across data science, software engineering, ML Ops and client delivery. Whether you are a graduate data scientist fluent in Python, a Scala veteran who loves Spark or a solutions architect who can turn messy data into knowledge graphs, this guide explains how to land a Quantexa machine‑learning job in 2025.