Fig. 1From: A dynamic approach to support outbreak management using reinforcement learning and semi-connected SEIQR modelsRL environment design and interactions with RL agent. a Transition of the SEIQR model. b Population flow governed by PWD via the transport hub, using Tokyo as an example. When inter-regional traveling occurs, the passenger will randomly appear at the edge of the destination’s transport hub and then keep moving. c Interactions between the Agent and the EnvironmentBack to article page