ML Runtime Engineer (Mid-Level and Senior)
This role involves developing and integrating the ML runtime stack for Fractile's AI accelerators, focusing on high-performance inference systems. You'll work closely with hardware and software teams to co-design solutions, integrating with open-source frameworks like PyTorch, vLLM, and SGLang. The position emphasizes Rust-based runtime development and deep collaboration across disciplines to optimize AI hardware performance.
Bristol, United Kingdom