Machine Learning / LLM Engineer
About Oh:
Oh is pioneeringhyper-realistic, uncensored AI-driven content, building a full-spectrum ecosystem of multimodal AI products. Our platform powerslifelike digital twins and AI charactersacross text, voice, and images.
With a mission to become theOpenAI of the spicy content industry, we iterate fast, push boundaries, and deploycutting-edge, real-time conversational AI experiences at scale.
The Role:
Our platform integrates a variety of multimodal GenAI models. You willown the technical roadmapandfull lifecycleof ourlarge language models, most notably our flagshipLlama 3.1 70Band otheropen-source models.
Your responsibilities will include:
- Fine-tuningwithcustom and synthetic datasets
- Deploying on GPU platformsto ensurelow-latency, cost-efficient, and safereal-time interactions
- Driving multimodal expansion—integratingtext, voice, and image capabilities
- Embedding robust safety and compliance measures
- Keeping on top of recent development in the field and auditing new models for a wide range of purposes (e.g. conversational AI, intent classification, AI agents life planner)
Key Responsibilities:
LLM Fine-Tuning & Optimization
- Fine-tune and optimize models (Llama 3.1 70B, GPT-based, Mistral, etc.) usingdomain-specific and synthetic datasets
- Enhanceaccuracy, reduce hallucinations, and improve alignment with user intent
Deployment & Infrastructure Management
- Deployscalable, memory-efficient modelsonGPU-based platforms(Runpod, AWS, Kubernetes clusters)
- Optimize GPU inference withTorch,CUDA, TensorRT, vLLM, and DeepSpeed
Multimodal & Cross-Model Integration
- Integrateadditional open-source modelsto enableimage prompt generation, voice synthesis, and dynamic character personalization
- Expand multimodal AI capabilities (e.g. improve LLava-based vision models)
Data Pipeline & Evaluation
- Designrobust data pipelinesforcuration, cleaning, synthetic data generation, and versioning(DVC)
- Implementevaluation metrics and continuous monitoringto ensure model quality
Real-Time Performance & System Optimization
- Ensurelow-latency, real-time performanceusingmixed-precision training, quantization, pruning, and distillation techniques
Safety, Moderation & Compliance
- Embedrobust safety, content moderation, and ethical AI frameworksto comply withGDPR and industry standards
- Developcustom token filters and controlled response mechanisms
Monitoring, Diagnostics & Cost Management
- Set up and maintainmonitoring tools(Prometheus, Grafana, TensorBoard, Weights & Biases, Sentry) forperformance tracking and cost optimization
Technical Skills & Requirements:
Experience:
- 5+ yearsinmachine learning engineering, NLP, or AI researchwith deep expertise inTransformer-based LLMs
Programming & Frameworks:
- Strong proficiency inPythonand Bash scripting
- Hands-on experience withPyTorch,HuggingFacelibraries (Transformers, Diffusers, PEFT, Accelerate), and the common ML toolkit (e.g. SKLearn, Pandas, Numpy)
- Familiarity withJAX/TensorFlowis a plus
LLM Specialization:
- Proven expertise infine-tuning LLMsusing techniques likeLoRA, QLoRA, PEFT, RLHF, and prompt engineering
GPU & Inference Optimization:
- Experience with common inference speed optimisation and model quantization techniques.
Deployment & Orchestration:
- Skilled incontainerization (Docker) and orchestration (Kubernetes)for scalable ML deployments
- Experience with major MLOps frameworks (MLFlow / KubeFlow) preferred
Data Handling:
- Proficient indata wrangling and preprocessing(Pandas,Dask)
- Experience managinglarge-scale datasetsusing AWS (S3,RedShift,EC2)
- Knowledge of data QC and monitoring tools (DVC,Great Expectations)
Additional Knowledge:
- Understanding ofretrieval-augmented generation (RAG) techniques
- Familiarity withvector databases(FAISS, Pinecone, Weaviate)
Preferred Qualifications:
✅ Experience integrating and optimizingmultimodal models(text, voice, image, video)
✅ Background inAI-driven gaming, digital experiences, or adult content
✅ Familiarity withCI/CD pipelines(GitLab CI, Jenkins) for ML workflows
✅ Interest or experience incrypto, Web3, or NFT-based AI models
✅ Prior exposure toAI governance, safety, or ethical AI frameworks
What We Offer:
Competitive Compensation:
- Attractivesalary, benefits, and equity participation
Remote & Flexible:
- Remote-first work environmentwithflexible hours
Growth & Leadership:
- Rapidcareer advancementand the opportunity toshape our AI strategy
Innovative Culture:
- Join afast-pacedteam at the forefront ofadvanced, uncensored AI applications
If you’re passionate aboutpushing the boundaries of AI-driven experiencesand have a track record indeveloping, deploying, and optimizing cutting-edge LLMs, we want to hear from you!