FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control¶
Venue: iclr2026 (Withdraw) Authors: Yutong Ye, Yingbo Zhou, Xiao Du, Zhusen Liu, Hao Zhou, Li Sun, Xiang Lian, Mingsong Chen OpenReview: https://openreview.net/forum?id=0XiGLcOpLK
Relevance¶
LLM score: 1/3 — Mentions model pruning for resource-constrained deployment, but the main contribution is federated imitation learning for traffic signal control, not advances in energy-efficient training or hardware-aware training methods.
Keyword hits: pruning
TLDR¶
(none provided)
Abstract¶
Although Reinforcement Learning (RL)-based Traffic Signal Control (TSC) methods have been extensively studied, their practical applications still raise some serious issues such as high learning cost and poor generalizability. This is because the ``trial-and-error'' training style makes RL agents extremely dependent on the specific traffic environment, which also requires a long convergence time. To address these issues, we propose a novel Federated Imitation Learning (FIL)-based framework for multi-intersection TSC, named FitLight, which allows RL agents to plug-and-play for any traffic environment without additional pre-training cost. Unlike existing imitation learning approaches that rely on pre-training RL agents with demonstrations, FitLight allows real-time imitation learning and seamless transition to reinforcement learning. Due to our proposed knowledge-sharing mechanism and novel hybrid pressure-based agent design, RL agents can quickly find a best control policy with only a few episodes. Moreover, for resource-constrained TSC scenarios, FitLight supports model pruning and heterogeneous model aggregation, such that RL agents can work on a micro-controller with merely 16{\it KB} RAM and 32{\it KB} ROM. Extensive experiments demonstrate that, compared to state-of-the-art methods, FitLight not only provides a superior starting point but also converges to a better final solution on both real-world and synthetic datasets, even under extreme resource limitations.
Keywords¶
Autonomous System, Traffic Signal Control, Federated Imitation Learning