论文标题

数据驱动的在线交互式招标策略的需求响应

Data-Driven Online Interactive Bidding Strategy for Demand Response

论文作者

Lee, Kuan-Cheng, Yang, Hong-Tzer, Tang, Wenjun

论文摘要

需求响应(DR)是未来网格中重要的能源之一,它提供了剃须峰值的服务,从而提高了可再生能源利用的效率,并在短时间内响应期和低成本。建立了各种DR的类别,例如自动化博士,激励DR,紧急DR和需求招标。但是,由于住宅和商业消费者公用事业模型的不了解的实际问题,有关电力市场的需求招标聚合商的研究正处于开始阶段。对于这个问题,竞标价格和竞标数量是两个必需的决策变量,同时考虑了由于市场和参与者的不确定性。在本文中,我们确定同时使用智能电表数据和功能的竞标和采购策略。开发了一种两种代理的深层确定性政策梯度方法,以通过学习历史招标经验来优化决策。在线学习进一步利用了每天获得的最新招标经验,以确保趋势追踪和自我适应。采用了两个环境模拟器来证明模型的鲁棒性。结果证明,当面对各种情况时,提出的模型可以通过OFF/在线学习招标规则并稳健地进行适当的出价来赚取最佳利润。

Demand response (DR), as one of the important energy resources in the future's grid, provides the services of peak shaving, enhancing the efficiency of renewable energy utilization with a short response period, and low cost. Various categories of DR are established, e.g. automated DR, incentive DR, emergency DR, and demand bidding. However, with the practical issue of the unawareness of residential and commercial consumers' utility models, the researches about demand bidding aggregator involved in the electricity market are just at the beginning stage. For this issue, the bidding price and bidding quantity are two required decision variables while considering the uncertainties due to the market and participants. In this paper, we determine the bidding and purchasing strategy simultaneously employing the smart meter data and functions. A two-agent deep deterministic policy gradient method is developed to optimize the decisions through learning historical bidding experiences. The online learning further utilizes the daily newest bidding experience attained to ensure trend tracing and self-adaptation. Two environment simulators are adopted for testifying the robustness of the model. The results prove that when facing diverse situations the proposed model can earn the optimal profit via off/online learning the bidding rules and robustly making the proper bid.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源