Lead/Staff AI Runtime

2 weeks ago


Sverige, Sweden FlexAI Full time 120,000 - 180,000 per year
Join FlexAI:

FlexAI is at the forefront of revolutionizing AI computing by reengineering infrastructure at the system level. Our groundbreaking architecture, combined with sophisticated software intelligence, abstraction, and an orchestration layer, allows developers to leverage a diverse array of compute, resulting in efficient, more reliable computing at a fraction of the cost. We are seeking a skilled and experienced Lead/Staff AI Runtime Engineer.

Founded by Brijesh Tripathi, who bring experience from Nvidia, Apple, Tesla, Intel, and Zoox, FlexAI is not just building a product – we're shaping the future of AI. Our teams are strategically distributed across Paris, Silicon Valley, and Bangalore, united by a shared mission: to deliver more compute with less complexity.

Position Overview:

At FlexAI, we're building a high-performance, cloud-agnostic AI compute platform designed for next-generation training and inference workloads. As Lead/Staff AI Runtime Engineer, you'll play a pivotal role in the design, development, and optimization of the core runtime infrastructure that powers distributed training and deployment of large AI models (LLMs and beyond).

This is a hands-on leadership role - perfect for a systems-minded software engineer who thrives at the intersection of AI workloads, runtimes, and performance-critical infrastructure. You'll own critical components of our PyTorch-based stack, lead technical direction, and collaborate across engineering, research, and product to push the boundaries of elastic, fault-tolerant, high-performance model execution.

What you'll do:

Lead Runtime Design & Development

  • Own the core runtime architecture supporting AI training and inference at scale.
  • Design resilient and elastic runtime features (e.g. dynamic node scaling, job recovery) within our custom PyTorch stack.
  • Optimize distributed training reliability, orchestration, and job-level fault tolerance.

Drive Performance at Scale

  • Profile and enhance low-level system performance across training and inference pipelines.
  • Improve packaging, deployment, and integration of customer models in production environments.
  • Ensure consistent throughput, latency, and reliability metrics across multi-node, multi-GPU setups.

Build Internal Tooling & Frameworks

  • Design and maintain libraries and services that support model lifecycle: training, checkpointing, fault recovery, packaging, and deployment.
  • Implement observability hooks, diagnostics, and resilience mechanisms for deep learning workloads.
  • Champion best practices in CI/CD, testing, and software quality across the AI Runtime stack.

Collaborate & Mentor

  • Work cross-functionally with Research, Infrastructure, and Product teams to align runtime development with customer and platform needs.
  • Guide technical discussions, mentor junior engineers, and help scale the AI Runtime team's capabilities.

What you'll need to be successful:

  • 8+ years of experience in systems/software engineering, with deep exposure to AI runtime, distributed systems, or compiler/runtime interaction.
  • Experience in delivering PaaS services.
  • Proven experience optimizing and scaling deep learning runtimes (e.g. PyTorch, TensorFlow, JAX) for large-scale training and/or inference.
  • Strong programming skills in Python and C++ (Go or Rust is a plus).
  • Familiarity with distributed training frameworks, low-level performance tuning, and resource orchestration.

  • Experience working with multi-GPU, multi-node, or cloud-native AI workloads.

  • Solid understanding of containerized workloads, job scheduling, and failure recovery in production environments.

Bonus Points:

  • Contributions to PyTorch internals or open-source DL infrastructure projects.
  • Familiarity with LLM training pipelines, checkpointing, or elastic training orchestration.
  • Experience with Kubernetes, Ray, TorchElastic, or custom AI job orchestrators.
  • Background in systems research, compilers, or runtime architecture for HPC or ML.
  • Start up previous experience

What we offer:

  • A competitive salary and benefits package, tailored to recognize your dedication and contributions.
  • The opportunity to collaborate with leading experts in AI and cloud computing, learning from the best and the brightest, fostering continuous growth.
  • An environment that values innovation, collaboration, and mutual respect.
  • Support for personal and professional development, empowering you with the tools and resources to elevate your skills and leave a lasting impact.
  • A pivotal role in the AI revolution, shaping the technologies that power the innovations of tomorrow.
Offices :

Our teams are strategically distributed across three continents - Europe, North America, and Asia—united by a shared mission: to deliver more compute with less complexity.

  • Paris - HQ
  • San Francisco (Bay Area) - US office
  • Bangalore - India office

Location: Based in Paris (Hybrid) or EU (full-remote) - with at least two trips to Paris per month to sync with the team.

Apply NOW

You've seen what this role entails. Now we want to hear from you Does this opportunity align with your aspirations? If you're even slightly curious, we encourage you to apply – it could be the start of something extraordinary

At FlexAI, we believe diverse teams are the most innovative teams. We're committed to creating an inclusive environment where everyone feels valued, and we proudly offer equal opportunities regardless of gender, sexual orientation, origin, disabilities, veteran status, or any other facets of your identity that make you uniquely you.



  • Sverige, Sweden Hope Global School Full time 900,000 - 1,200,000 per year

    Job DescriptionInfosys is seeking a visionary SAP Business AI Architect to lead enterprise-scale transformations in Sweden. The role involves designing and implementing end-to-end SAP solutions, integrating AI capabilities, and guiding clients through their digital transformation journeys. The architect will collaborate with cross-functional teams, mentor...


  • Sverige, Sweden The European Spallation Source Full time 65,000 - 95,000 per year

    The European Spallation Source (ESS) is one of the largest science and technology infrastructure projects being built today. Our goal is to become the brightest source of neutrons for science and technology and be operational by the end of this decade. A state-of-the-art suite of neutron scattering instruments will be made available to address a wide range...


  • Sverige, Sweden Palta (Simple Zing Lovi) Full time 1,200,000 - 1,800,000 per year

    Simple Life is the #1 AI-powered health coaching app for adults who want to lose weight and enjoy a healthier lifestyle—without the stress or extremes. Our mission is to empower people to feel their best every day. By challenging traditional, restrictive approaches, Simple offers a more sustainable method grounded in ease, personalization, and real-life...


  • Sverige, Sweden Telia Company Full time 105,000 - 240,000 per year

    We're looking for a Lead Mobile Core Network Engineer to join us at Telia.Do you share our passion for 5G and the future of mobile data communication? If yes, this might be the next step in your career.I'm Taimur Lodhi, Head of Packet Core Development Team. We are an experienced and dedicated telecommunications team responsible for the development of Mobile...


  • Sverige, Sweden +Moveco AB Full time 250,000 - 400,000 per year

    Moveco söker en LIA-praktikant inför våren 2026 som vill växa och utvecklas inom digital marknadsföring, sociala medier, AI-optimering och kommunikation. Är du affärsdriven, kreativ och vill göra skillnad i ett växande entreprenörsbolag utsedda till ett av Sveriges bästa solcellsföretag 2025? Då är det dig vi sökerOm MovecoPå Moveco arbetar...


  • Sverige, Sweden Fluke Health Solutions Full time 250,000 - 400,000 per year

    Job Description – Customer Service Administrator - STUDENT POOLDepartment/Region: Customer Service / Operations SwedenReports to: Customer Service ManagerPurpose of RoleThe Customer Service Administrator ensures smooth handling of customer service tasks, order processing, and documentation to support efficient operations and high-quality service...


  • Sverige, Sweden clickhouse Full time $100,000 - $200,000 per year

    About ClickHouseRecognized on the 2025 Forbes Cloud 100 list, ClickHouse is one of the most innovative and fast-growing private cloud companies. With over 2,000 customers and ARR that has more than quadrupled over the past year, ClickHouse leads the market in real-time analytics, data warehousing, observability, and AI workloads. ClickHouse's incredible...

  • Security Engineer

    1 day ago


    Sverige, Sweden Kognity Full time 120,000 - 180,000 per year

    Education changes lives. But tech hasn't lived up to its promise yet. At Kognity, we're here to change that.We're a 125-person EdTech scale-up powering learning in 120+ countries. Our intelligent platform combines rich pedagogy with smart AI to help students and teachers thrive, from international schools to US high schools.Why Kognity is the place to...

  • Marketing Designer

    2 weeks ago


    Sverige, Sweden Kognity Full time 80,000 - 120,000 per year

    Education changes lives. But tech hasn't lived up to its promise, yet. At Kognity, we're here to change that.We're a 125-person EdTech scale-up powering learning in 120+ countries. Our intelligent platform combines rich pedagogy with smart AI to help students and teachers thrive — from international schools to US high schools.Why Kognity is the place to...


  • Sverige, Sweden Microsoft Full time 120,000 - 180,000 per year

    Partner Solution Architect - AI Business SolutionsMultiple Locations, SwedenDate postedOct 14, 2025Job number1891322Work site3 days / week in-officeTravel0-25%Role typeIndividual ContributorProfessionCustomer SuccessDisciplineCloud Solution ArchitectureEmployment typeFull-TimeOverviewWith a vision to "Build and sell Microsoft AI, Cloud applications,...