Sr Research Engineer, Computer Vision

Autodesk
Bishop Auckland
4 days ago
Overview

Job Requisition ID 26WD96331

Senior Software Engineer, Computer Vision & Multimodal AI (Applied Research)

Location

Flexible / Hybrid / Remote (team-dependent)

About the Role

We are hiring a Senior Software Engineer focused on Computer Vision and Multimodal AI to build robust perception and understanding systems used across multiple teams and product areas. You will develop end-to-end pipelines that transform images and video into structured, reliable observations by combining modern vision models with multimodal reasoning and contextual signals (for example: domain metadata, documents, and sensor inputs).

This role blends applied research with strong software engineering: rapid iteration, rigorous evaluation, and production-minded implementation for cloud-scale batch processing and interactive workflows.


Responsibilities
  • Design, build, and improve multi-stage computer vision pipelines that may include segmentation, detection, tracking, and VLM-based analysis, producing structured outputs (entities, attributes, actions/events, confidence, provenance)
  • Build systems that handle real-world variability in visual inputs (for example: low resolution, poor lighting, motion blur, cluttered scenes, inconsistent capture devices)
  • Work with diverse media types such as photos, video, timelapse, 360 video, and RGB-D when available
  • Fuse visual evidence with contextual inputs such as metadata, documents, and sensor streams to improve recognition quality and reduce ambiguity
  • Evaluate and integrate state-of-the-art vision and vision-language foundation models, including open-vocabulary recognition, grounded perception, segmentation, and multimodal reasoning
  • Apply fine-tuning or adaptation approaches when needed; partner with ML teams on training, data strategy, and infrastructure best practices
  • Define measurable acceptance criteria and benchmarking for accuracy, robustness, latency/cost, and reliability across datasets and domains
  • Build scalable cloud workflows for batch processing and integrate outputs with APIs and downstream consumers
  • Improve operational performance and cost via batching, caching, model selection, and pipeline observability
  • Write maintainable code and contribute to design docs, code reviews, shared libraries, and cross-team technical decisions

Minimum Qualifications
  • Bachelor’s degree in Computer Science, Electrical Engineering, Robotics, or a related field (or equivalent practical experience)
  • 4+ years of experience building computer vision systems using Python
  • Strong experience with deep learning for computer vision (detection, segmentation, and/or video understanding) using modern frameworks such as PyTorch
  • Experience taking ML prototypes into reliable pipelines, including evaluation, monitoring, and failure analysis
  • Experience building or integrating ML systems into cloud or backend workflows (batch processing and/or services)
  • Strong collaboration and communication skills; ability to work across teams and stakeholders

Preferred Qualifications
  • Experience with vision-language models (VLMs) and multimodal systems (for example: grounded vision, open-vocabulary recognition, retrieval-augmented multimodal reasoning)
  • Experience with multimodal fusion (combining imagery/video with metadata, documents, and sensor signals)
  • Experience with video pipelines (tracking, temporal aggregation, long-video processing)
  • Experience with real-world datasets, including data curation, labeling strategy, augmentation, and quality control under limited data constraints
  • Experience developing reusable platform components adopted across multiple teams

What Success Looks Like
  • Delivered an end-to-end system that ingests real-world image/video inputs and outputs a structured, queryable set of observations (objects plus activities/events), with clear accuracy and reliability metrics
  • Demonstrated robustness to common visual failure modes (lighting, occlusion, clutter, camera variation) and measurable improvements when contextual signals are available
  • Built a modular pipeline architecture (segmentation/detection/VLM reasoning components) that can be reused and extended across domains and teams
  • Maintained strong engineering quality: reproducible experiments, documented decisions, maintainable code, and dependable integrations

Keywords (for candidate matching)

Computer Vision, Deep Learning, PyTorch, Object Detection, Segmentation, Tracking, Video Understanding, Vision-Language Models (VLM), Multimodal AI, Open-Vocabulary, Grounding, Sensor Fusion, Data Curation, Model Evaluation, Benchmarking, Cloud ML Pipelines, Batch Processing, MLOps, Observability


About Autodesk

Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.

We take great pride in our culture here at Autodesk – it’s at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.

When you’re an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!


Salary transparency

Salary is one part of Autodesk’s competitive compensation package. Offers are based on the candidate’s experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.


Diversity & Belonging

We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging


Are you an existing contractor or consultant with Autodesk?

Please search for open jobs and apply internally (not on this external site).



