The theoretical analysis demonstrates that EDIS exhibits reduced suboptimality compared to solely utilizing online data or directly reusing offline data. EDIS is a plug-in approach and can be combined with existing methods in the offline-to-online RL setting. By applying EDIS to the off-the-shelf methods Cal-QL and IQL, we noti