Autodesk

Make anything.

Sr Research Engineer, Computer Vision

職種データサイエンス

経験シニア級

勤務地London; Toronto, United Kingdom

勤務オンサイト

雇用正社員

掲載1ヶ月前

応募する

必須スキル

Python

PyTorch

Job Requisition ID #

26WD96331

Senior Software Engineer, Computer Vision & Multimodal AI (Applied Research)Location

Flexible / Hybrid / Remote (team-dependent)

About the Role

We are hiring a Senior Software Engineer focused on Computer Vision and Multimodal AI to build robust perception and understanding systems used across multiple teams and product areas. You will develop end-to-end pipelines that transform images and video into structured, reliable observations by combining modern vision models with multimodal reasoning and contextual signals (for example: domain metadata, documents, and sensor inputs)

This role blends applied research with strong software engineering: rapid iteration, rigorous evaluation, and production-minded implementation for cloud-scale batch processing and interactive workflows

Key Responsibilities

Design, build, and improve multi-stage computer vision pipelines that may include segmentation, detection, tracking, and VLM-based analysis, producing structured outputs (entities, attributes, actions/events, confidence, provenance)
Build systems that handle real-world variability in visual inputs (for example: low resolution, poor lighting, motion blur, cluttered scenes, inconsistent capture devices)
Work with diverse media types such as photos, video, timelapse, 360 video, and RGB-D when available
Fuse visual evidence with contextual inputs such as metadata, documents, and sensor streams to improve recognition quality and reduce ambiguity
Evaluate and integrate state-of-the-art vision and vision-language foundation models, including open-vocabulary recognition, grounded perception, segmentation, and multimodal reasoning
Apply fine-tuning or adaptation approaches when needed; partner with ML teams on training, data strategy, and infrastructure best practices
Define measurable acceptance criteria and benchmarking for accuracy, robustness, latency/cost, and reliability across datasets and domains
Build scalable cloud workflows for batch processing and integrate outputs with APIs and downstream consumers
Improve operational performance and cost via batching, caching, model selection, and pipeline observability
Write maintainable code, contribute to design docs, code reviews, shared libraries, and cross-team technical decisions

Minimum Qualifications

Bachelor’s degree in Computer Science, Electrical Engineering, Robotics, or related field (or equivalent practical experience)
4+ years of experience building computer vision systems using Python
Strong experience with deep learning for computer vision (detection, segmentation, and/or video understanding) using modern frameworks such as Py Torch
Experience taking ML prototypes into reliable pipelines, including evaluation, monitoring, and failure analysis
Experience building or integrating ML systems into cloud or backend workflows (batch processing and/or services)
Strong collaboration and communication skills; ability to work across teams and stakeholders

Preferred Qualifications

Experience with vision-language models (VLMs) and multimodal systems (for example: grounded vision, open-vocabulary recognition, retrieval-augmented multimodal reasoning)
Experience with multimodal fusion (combining imagery/video with metadata, documents, and sensor signals)
Experience with video pipelines (tracking, temporal aggregation, long-video processing)
Experience with real-world datasets, including data curation, labelling strategy, augmentation, and quality control under limited data constraints
Experience developing reusable platform components adopted across multiple teams

What Success Looks Like

Delivered an end-to-end system that ingests real-world image/video inputs and outputs a structured, queryable set of observations (objects plus activities/events), with clear accuracy and reliability metrics
Demonstrated robustness to common visual failure modes (lighting, occlusion, clutter, camera variation) and measurable improvements when contextual signals are available
Built a modular pipeline architecture (segmentation/detection/VLM reasoning components) that can be reused and extended across domains and teams
Maintained strong engineering quality: reproducible experiments, documented decisions, maintainable code, and dependable integrations

Keywords (for candidate matching)

Computer Vision, Deep Learning, Py Torch, Object Detection, Segmentation, Tracking, Video Understanding, Vision-Language Models (VLM), Multimodal AI, Open-Vocabulary, Grounding, Sensor Fusion, Data Curation, Model Evaluation, Benchmarking, Cloud ML Pipelines, Batch Processing, MLOps, Observability

Learn More About Autodesk

Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.

We take great pride in our culture here at Autodesk – it’s at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.

When you’re an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!

Salary transparency

Salary is one part of Autodesk’s competitive compensation package. Offers are based on the candidate’s experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.

Diversity & BelongingWe take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging

Are you an existing contractor or consultant with Autodesk?

Please search for open jobs and apply internally (not on this external site).

閲覧数

応募クリック

Mock Apply

スクラップ

類似の求人

Process Development Principal Scientist

Amgen · US - Massachusetts - Cambridge

Senior Principal Data Scientist

LexisNexis (RELX) · 2 Locations

Senior Data Scientist

LexisNexis (RELX) · London

Senior Data Scientist

Monzo · Cardiff, London or Remote (UK)

Senior Research Scientist, Cohere Labs

Cohere · London

Autodeskについて

Autodesk

Public

Autodesk, Inc. is an American multinational software corporation that provides software products and services for the architecture, engineering, construction, manufacturing, media, education, and entertainment industries.

10,001+

従業員数

San Francisco

本社所在地

$50B

企業価値

レビュー

10件のレビュー

3.9

10件のレビュー

ワークライフバランス

3.8

報酬

3.2

企業文化

4.1

キャリア

3.0

経営陣

3.5

75%

知人への推奨率

良い点

Flexible schedules and remote work options

Supportive and approachable management

Great team dynamics and friendly coworkers

改善点

Work-life balance challenges and stress

Communication issues

Limited advancement opportunities

給与レンジ

864件のデータ

Junior/L3

Mid/L4

Senior/L5

Director

Junior/L3 · Machine Learning Engineer

2件のレポート

$160,615

年収総額

基本給

$123,550

ストック

ボーナス

$130,000

$191,070

面接レビュー

レビュー1件

難易度

3.0

/ 5

期間

14-28週間

面接プロセス

Application Review

Recruiter Screen

Technical Phone Screen

Onsite/Virtual Interviews

Team Matching

Offer

よくある質問

Coding/Algorithm

Technical Knowledge

Behavioral/STAR

Past Experience