Senior GPU Networking Architect

NVIDIA

Switzerland

Job Type: Permanent
Work Pattern: Full-time
Work Location: On-site
Seniority: Senior
Education: Masters
Posted: 30 Mar 2026 (Last month)

Benefits

Competitive salary
Diverse and supportive environment
Opportunity to work on cutting-edge technology

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

We are looking for a Senior GPU Networking Architect to join our networking software group, bringing strong GPU architecture and programming skills to build and improve GPU communication kernels. This role links GPU computing with networking by making sure communication primitives are carefully developed alongside GPU hardware capabilities. Join our team of engineers developing the software foundation for the largest AI systems globally.

What you will be doing:

Build, implement, and optimize GPU communication kernels that underpin collective and point-to-point operations in large-scale AI systems.
Leverage deep knowledge of GPU architecture—thread scheduling, memory hierarchy, execution pipelines—to improve kernel efficiency, minimize latency, and overlap computation with communication.
Develop GPU-resident communication primitives and device-side APIs that enable fine-grained, kernel-initiated data movement across nodes and accelerators.
Profile and tune GPU kernels end-to-end, identifying bottlenecks at the intersection of compute, memory, and network, and driving targeted optimizations.
Collaborate with network software, hardware, and AI framework teams to co-design communication strategies that align with GPU execution patterns and emerging model architectures.
Build proofs-of-concept, conduct experiments, and perform quantitative modeling to evaluate and validate new communication strategies before committing them to production.
Contribute to the evolution of programming models that expose GPU-aware networking capabilities to application developers.

What we need to see:

5+ years of hands-on CUDA programming, including writing and optimizing non-trivial GPU kernels.
M.Sc. or equivalent experience in computer science, computer engineering, or a closely related field.
Strong understanding of GPU architecture fundamentals: warp scheduling, shared memory, L2 cache, memory coalescing, occupancy tuning, and asynchronous execution.
Experience with systems-level C/C++ development in performance-critical environments.
Familiarity with GPU data movement mechanisms such as GPUDirect RDMA and GPU-initiated communication.
Ability to read and reason about GPU performance profiles (e.g., Nsight Compute, Nsight Systems) and translate observations into actionable optimizations.
Strong collaboration skills in a multi-national, interdisciplinary environment.

Ways to stand out from the crowd:

Experience developing or optimizing communication kernels in libraries such as NCCL, NVSHMEM, or similar GPU-aware communication frameworks.
Understanding of distributed deep learning parallelism techniques, including data parallelism, tensor parallelism, pipeline parallelism, and expert parallelism for mixture-of-experts models, and the communication patterns they impose on GPU kernels.
Background in RDMA, InfiniBand, high-speed networking, and GPU system topology, including NVLink, NVSwitch, PCIe, and network fabrics, and their impact on communication kernel design.
Experience with overlap techniques such as kernel pipelining, persistent kernels, or cooperative groups to hide communication latency behind compute.
Proven experience evaluating and optimizing large-scale LLM training or inference workloads, including hands-on work with frameworks such as PyTorch, TensorRT-LLM, or vLLM, and familiarity with emerging serving architectures such as disaggregated serving.

At NVIDIA, you'll work alongside colleagues who demonstrate deep expertise and innovative thinking in the industry, pushing the boundaries of what's possible in AI and high-performance computing. If you're passionate about GPU architecture, low-level kernel optimization, and building the communication fabric for next-generation AI, we want to hear from you!

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family at www.nvidiabenefits.com/

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. For Poland: The base salary range is 292,500 PLN - 507,000 PLN for Level 4, and 375,000 PLN - 650,000 PLN for Level 5.
