Jobs

ML Engineer (LLM)


Job details
  • OhChat
  • Bristol
  • 3 days ago

Machine Learning / LLM Engineer


About Oh:

Oh is pioneeringhyper-realistic, uncensored AI-driven content, building a full-spectrum ecosystem of multimodal AI products. Our platform powerslifelike digital twins and AI charactersacross text, voice, and images.

With a mission to become theOpenAI of the spicy content industry, we iterate fast, push boundaries, and deploycutting-edge, real-time conversational AI experiences at scale.


The Role:

Our platform integrates a variety of multimodal GenAI models. You willown the technical roadmapandfull lifecycleof ourlarge language models, most notably our flagshipLlama 3.1 70Band otheropen-source models.


Your responsibilities will include:

  • Fine-tuningwithcustom and synthetic datasets
  • Deploying on GPU platformsto ensurelow-latency, cost-efficient, and safereal-time interactions
  • Driving multimodal expansion—integratingtext, voice, and image capabilities
  • Embedding robust safety and compliance measures
  • Keeping on top of recent development in the field and auditing new models for a wide range of purposes (e.g. conversational AI, intent classification, AI agents life planner)

 

Key Responsibilities:

LLM Fine-Tuning & Optimization

  • Fine-tune and optimize models (Llama 3.1 70B, GPT-based, Mistral, etc.) usingdomain-specific and synthetic datasets
  • Enhanceaccuracy, reduce hallucinations, and improve alignment with user intent


Deployment & Infrastructure Management

  • Deployscalable, memory-efficient modelsonGPU-based platforms(Runpod, AWS, Kubernetes clusters)
  • Optimize GPU inference withTorch,CUDA, TensorRT, vLLM, and DeepSpeed


Multimodal & Cross-Model Integration

  • Integrateadditional open-source modelsto enableimage prompt generation, voice synthesis, and dynamic character personalization
  • Expand multimodal AI capabilities (e.g. improve LLava-based vision models)

Data Pipeline & Evaluation

  • Designrobust data pipelinesforcuration, cleaning, synthetic data generation, and versioning(DVC)
  • Implementevaluation metrics and continuous monitoringto ensure model quality


Real-Time Performance & System Optimization

  • Ensurelow-latency, real-time performanceusingmixed-precision training, quantization, pruning, and distillation techniques


Safety, Moderation & Compliance

  • Embedrobust safety, content moderation, and ethical AI frameworksto comply withGDPR and industry standards
  • Developcustom token filters and controlled response mechanisms


Monitoring, Diagnostics & Cost Management

  • Set up and maintainmonitoring tools(Prometheus, Grafana, TensorBoard, Weights & Biases, Sentry) forperformance tracking and cost optimization


 

Technical Skills & Requirements:

Experience:

  • 5+ yearsinmachine learning engineering, NLP, or AI researchwith deep expertise inTransformer-based LLMs


Programming & Frameworks:

  • Strong proficiency inPythonand Bash scripting
  • Hands-on experience withPyTorch,HuggingFacelibraries (Transformers, Diffusers, PEFT, Accelerate), and the common ML toolkit (e.g. SKLearn, Pandas, Numpy)
  • Familiarity withJAX/TensorFlowis a plus

LLM Specialization:

  • Proven expertise infine-tuning LLMsusing techniques likeLoRA, QLoRA, PEFT, RLHF, and prompt engineering


GPU & Inference Optimization:

  • Experience with common inference speed optimisation and model quantization techniques.


Deployment & Orchestration:

  • Skilled incontainerization (Docker) and orchestration (Kubernetes)for scalable ML deployments
  • Experience with major MLOps frameworks (MLFlow / KubeFlow) preferred


Data Handling:

  • Proficient indata wrangling and preprocessing(Pandas,Dask)
  • Experience managinglarge-scale datasetsusing AWS (S3,RedShift,EC2)
  • Knowledge of data QC and monitoring tools (DVC,Great Expectations)

Additional Knowledge:

  • Understanding ofretrieval-augmented generation (RAG) techniques
  • Familiarity withvector databases(FAISS, Pinecone, Weaviate)

 

Preferred Qualifications:

✅ Experience integrating and optimizingmultimodal models(text, voice, image, video)

✅ Background inAI-driven gaming, digital experiences, or adult content

✅ Familiarity withCI/CD pipelines(GitLab CI, Jenkins) for ML workflows

✅ Interest or experience incrypto, Web3, or NFT-based AI models

✅ Prior exposure toAI governance, safety, or ethical AI frameworks

 

What We Offer:

Competitive Compensation:

  • Attractivesalary, benefits, and equity participation


Remote & Flexible:

  • Remote-first work environmentwithflexible hours


Growth & Leadership:

  • Rapidcareer advancementand the opportunity toshape our AI strategy


Innovative Culture:

  • Join afast-pacedteam at the forefront ofadvanced, uncensored AI applications


 

If you’re passionate aboutpushing the boundaries of AI-driven experiencesand have a track record indeveloping, deploying, and optimizing cutting-edge LLMs, we want to hear from you!

Sign up for our newsletter

The latest news, articles, and resources, sent to your inbox weekly.

Similar Jobs

ML Engineer (LLM)

Machine Learning / LLM EngineerAbout Oh:Oh is pioneeringhyper-realistic, uncensored AI-driven content, building a full-spectrum ecosystem of multimodal AI products. Our platform powerslifelike digital twins and AI charactersacross text, voice, and images.With a mission to become theOpenAI of the spicy content industry, we iterate fast, push boundaries, and deploycutting-edge, real-time conversational...

OhChat Bristol

ML Engineer (LLM)

Machine Learning / LLM EngineerAbout Oh:Oh is pioneeringhyper-realistic, uncensored AI-driven content, building a full-spectrum ecosystem of multimodal AI products. Our platform powerslifelike digital twins and AI charactersacross text, voice, and images.With a mission to become theOpenAI of the spicy content industry, we iterate fast, push boundaries, and deploycutting-edge, real-time conversational...

OhChat

ML Engineer (LLM)

Machine Learning / LLM EngineerAbout Oh:Oh is pioneeringhyper-realistic, uncensored AI-driven content, building a full-spectrum ecosystem of multimodal AI products. Our platform powerslifelike digital twins and AI charactersacross text, voice, and images.With a mission to become theOpenAI of the spicy content industry, we iterate fast, push boundaries, and deploycutting-edge, real-time conversational...

OhChat London

ML Engineer

About Dalton:Dalton is on a mission to make the world’s drug design more efficient. We are building the AI ecosystem for drug design and solving real-world problems that transform the efficiency of the pharmaceutical industry. Our mission is to harness cutting-edge technology and turn it into impactful products for our...

Dalton London

Snr ML Engineer – Machine Learning, LLMs, MLOps, RAG, Prompt Engineering, UK Remote

Snr ML Engineer – Machine Learning, LLMs, MLOps, RAG, Prompt Engineering, UK RemoteMy client is revolutionizing the way businesses are leveraging AI with cutting edge Machine Learning technologies. Recently funded and looking for Snr ML Engineers to join the mission to innovate and make an impact.What You'll DoAs a Senior...

WMtech Sheffield

Snr ML Engineer – Machine Learning, LLMs, MLOps, RAG, Prompt Engineering, UK Remote

Snr ML Engineer – Machine Learning, LLMs, MLOps, RAG, Prompt Engineering, UK RemoteMy client is revolutionizing the way businesses are leveraging AI with cutting edge Machine Learning technologies. Recently funded and looking for Snr ML Engineers to join the mission to innovate and make an impact.What You'll DoAs a Senior...

WMtech Bolton