通过可配置的半POMDPS优化空容器重新定位和车队部署

论文标题

通过可配置的半POMDPS优化空容器重新定位和车队部署

Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs

论文作者

Poiani, Riccardo, Stirbu, Ciprian, Metelli, Alberto Maria, Restelli, Marcello

论文摘要

随着全球经济和市场的持续增长，资源失衡已成为实际逻辑场景中的核心问题之一。在海洋运输中，这种贸易不平衡导致空容器重新定位（ECR）问题。一旦将货物从出口国交付到进口国，Laden将变成空容器，需要重新安置以满足出口国中新商品请求的需求。在这样的问题中，任何合作重新定位政策的绩效都可以严格取决于船只将遵循的路线（即车队部署）。从历史上看，提出了行动研究（OR）方法，以与船队一起共同优化重新定位政策。但是，容器的未来供应和需求的随机性以及环境中存在的黑盒和非线性约束，使这些方法不适合这些情况。在本文中，我们介绍了一个新颖的框架，可配置的半POMDP，以建模这种类型的问题。此外，我们提供了一种两阶段的学习算法“配置和征服”（CC），该算法首先通过找到最佳车队部署策略的近似来配置环境，然后通过在这种调整后的环境环境中学习ECR政策来“征服”它。我们在这个问题的大型和现实世界中验证了我们的方法。我们的实验强调，CC避免了或方法的陷阱，并且成功地优化了ECR政策和船队的船队，从而在世界贸易环境中取得了出色的表现。

With the continuous growth of the global economy and markets, resource imbalance has risen to be one of the central issues in real logistic scenarios. In marine transportation, this trade imbalance leads to Empty Container Repositioning (ECR) problems. Once the freight has been delivered from an exporting country to an importing one, the laden will turn into empty containers that need to be repositioned to satisfy new goods requests in exporting countries. In such problems, the performance that any cooperative repositioning policy can achieve strictly depends on the routes that vessels will follow (i.e., fleet deployment). Historically, Operation Research (OR) approaches were proposed to jointly optimize the repositioning policy along with the fleet of vessels. However, the stochasticity of future supply and demand of containers, together with black-box and non-linear constraints that are present within the environment, make these approaches unsuitable for these scenarios. In this paper, we introduce a novel framework, Configurable Semi-POMDPs, to model this type of problems. Furthermore, we provide a two-stage learning algorithm, "Configure & Conquer" (CC), that first configures the environment by finding an approximation of the optimal fleet deployment strategy, and then "conquers" it by learning an ECR policy in this tuned environmental setting. We validate our approach in large and real-world instances of the problem. Our experiments highlight that CC avoids the pitfalls of OR methods and that it is successful at optimizing both the ECR policy and the fleet of vessels, leading to superior performance in world trade environments.

下载PDF全文

下载文献需遵守相关版权规定

论文标题