Abstract

In the location-related planning of a hydropower system, the consideration of future operations under uncertainties can make the decisions sustainable and robust. Then, it is of great importance to develop an effective approach that deals with the long-term stochasticity due to the long-lasting effects of the location selections. Thus, we propose a multistage stochastic programming model to optimize the planning decisions of cascade hydropower stations and the long-term stochastic operations in an integrated way. The first stage (i.e., the planning stage) in the model deals with the location and capacity decisions of the hydropower stations, while the subsequent stages implement the scheduling decisions under each stagewise stochastic scenario. To address the curse of dimensionality caused by the long-term stochastic operations, we further propose a novel dimensionality reduction approach based on dual equilibrium to transform the multistage model into a tractable two-stage stochastic program. The applicability of our approach is validated by a case study based on a basin of Yangtze River, China, and corresponding sensitivity analysis.

1. Introduction

Given a basin, the hydropower stations to-be-built usually could be multiple. Even only a hydropower station in the current timetable, it is still necessary to consider the impacts of other possible hydropower stations in the future because of their interdependence. Moreover, such a plan that involves major capital investments must be made for a longstanding future. Hence, subsequent operations of all the stations, which influence the effect of the planning decisions, should be considered. Such an integrated way can help to get the global optimum in disasters mitigation, irrigation water supply, and clean power production [1, 2]. However, in the long-term operations, natural or social uncertainties, such as water inflows and demands, are inevitable and influence the performance of the hydropower system significantly [3]. Thus, the problem faced by the decision-maker is long-term stochastic optimization. The “long-term” we mentioned here is far beyond the long-term scale considered in previous works, such as 3–5 years in Gjelsvik et al. [4] or 12-period used in Cheng et al. [5], Wang et al. [6], and Xu and Mei [7], because of the long-lasting effects of the location decisions. Hence, the challenge is how to handle such overlong stochastic nature.

In previous works, the decisions on locations and operations of a hydropower system have been extensively, but separately, studied. Specifically, when a decision-maker plans a hydropower station, advanced technologies, such as geographical information system (GIS), are often utilized to obtain geological, climatic, and topographical information in most recent studies. Based on this information, different mathematical methods are further utilized for spotting site by synthesizing natural, engineering, and social criteria. For example, Larentis et al. [8] propose a GIS-based DEM (digital elevation model) composed of the preidentification of promising sites and the multicriteria feasibility assessment of the final set, in which energy, technical, and environmental factors are considered together. Another GIS-based DEM is proposed by Kusre et al. [9] which use a hydrologic model to assess water resource utilization under different site candidates. Similar works can be referred to Serpoush et al. [10], Zaidi and Khan [11], etc. Besides the above assessment models, the optimization technique becomes another alternative in recent years. For example, Hosnar and Kovač-Kralj [12] identify the optimal installation locations by maximizing an ecoprofit objective at technological, economic, environmental, and social constraints. To identify the appropriate hydropower dam location, Loannidou and O’Hanley [13] develop a mixed-integer linear programming model to optimize the hydropower potential with the consideration of river connectivity. In these studies, the future power generation and hydrologic dynamics are described by the empirical formulas in the cumulative form. Thus, time-varying operations under uncertainties are ignored in the setting of location-related planning.

The optimization of the multistage operational activities, such as water storage, supply, and power generation, is an important issue which has drawn lots of studies. Most of them focus on a given hydropower system involving a single hydropower plant (e.g., Vieira et al. [14]) or multistations. For the multistation operations in the deterministic setting, two main stream of approaches have been applied. At first, to pursue the optimality, mathematical programming methods, especially dynamic programming (DP), are often used but also suffer from the computational challenge due to the number of stages and stations. Thus, Cheng et al. [15] reduce the scale of the problem by limiting their work in the “short-term” horizon. Li et al. [16] consider a decomposition-coordination mechanism to reduce dimensionality. Li et al. [17] and Cheng et al. [5] propose to parallelize the DP algorithm to reduce the computation time. The former uses the distributed memory architecture and the message passing interface protocol, while the latter considers the Fork/Join parallel framework in a multicore environment. Feng et al. [18] and Feng et al. [19] focus on the simplification of state set to accelerate the implementation of DP. Cheng et al. [20] and Feng et al. [21] adopt progressive optimality algorithm to modify the conventional DP by dividing the multistage problem into a sequence of subproblems and thus reduce the computational burden. Second, in order to address the computational complexity, numerous heuristic algorithms have been considered by sacrificing some optimality in recent years. Typical methods used in this stream include particle swarm optimization [22, 23, 24], electromagnetism-like mechanism [25, 26], genetic algorithm [27], water cycle algorithm [7], and artificial intelligence algorithms [28]. However, all these studies focus on the deterministic setting.

Uncertainties extensively exist in hydropower systems. For example, water inflows usually vary and cannot be accurately predicted since long-term meteorological forecasts are unreliable [29, 30]. Thus, the optimization of hydropower operations under uncertainties is basically a risk-based decision-making problem [31]. The stagewise stochastic process can be represented as a scenario tree; hence, the multistage stochastic programming (MSSP) model, which optimizes the expected value based on the scenario set, has been popularly used. For example, Fleten and Kristoffersen [32] develop an MSSP model to make the decisions on power generation, in which uncertain water inflows and electricity market prices are considered, with the optimization of the expected benefit. Chazarra et al. [33] propose an MSSP model for a hydropower system, taking uncertain water inflows and electricity market prices into account, to simultaneously maximize the expected profit in both energy and regulation reserve markets. To deal with uncertain streamflow, a multiobjective MSSP model is presented by Xu et al. [34] to optimize scheduling strategies. The first objective is to maximize direct revenue from energy production, and the second one is to minimize the expected energy shortfall percentage. Séguin et al. [35] address the stochastic hydropower unit commitment and loading problem under the uncertain inflows. However, all these works only consider the short-term horizon. Thus, off-the-shelf solvers can be applied to solve these models directly.

With the increase of the number of decision stages, the size of the MSSP grows dramatically which often requires decomposition methods [36]. A state-of-the-art one is the stochastic dual DP algorithm which is a sampling-based variant of nested Benders decomposition. Hjelmeland et al. [37] use it to handle a medium-term scheduling issue for a single producer under uncertain inflows and prices within one to three-year horizon. Most related works consider the medium-term setting as Gjelsvik et al. [4]; Helseth et al. [38]; Hjelmeland et al. [39]; and Poorsepahy-Samian et al. [40]. However, as indicated by Hjelmeland et al. [39], although decomposition methods can help to alleviate the solving complexity, the computation would significantly become slow with the increase of system size and decision stages. It shows the difficulty of multistage stochastic optimization with high dimensions of uncertainty.

In summary, a considerable amount of research has been conducted for the hydropower system design and operation. However, to the best of our knowledge, the location decisions in existing studies have not taken multistage operations into account. In terms of operations, numerous studies focus on developing efficient exact or heuristic algorithms for the deterministic setting, whereas many endeavors utilize the MSSP to address the stochastic operations. However, the high dimensionality characteristic has become the main bottleneck limiting the application of optimization approaches [41]. Even in the deterministic setting, the computation of the hydropower system operation problem is highly complex when the number of operational stages is big [19]. With the consideration of randomness, the optimization of the MSSP should be implemented on all the stagewise stochastic realizations, which further increase the solving burden. Hence, most existing works only focus on short- or medium-term stochasticity. However, the location selection is the planning decision that involves major capital investments and has an overlong effect. Thus, such a decision should be made from the long-lasting perspective, in which uncertainties would occur inevitably. Hence, the long-term stochasticity should be considered in the hydropower design, which is a difficult task.

We contribute to the existing literature by planning a cascade hydropower system with the consideration of long-term stochastic operations. Most mainstream algorithms, including DP for the deterministic setting and the decomposition methods for the stochastic setting, suffer from the number of stages. Few papers have looked specifically into long-term stochastic models. Considering that the purpose of our work is the location decisions rather than providing accurate schedules, in order to address the dimensionality issue, an intuitive idea is to keep the influence of stochastic operations while reducing the number of stages. Thus, applying the method of dual equilibrium (DE) [42, 43], we propose a novel dimensionality reduction approach which aggregates the long-term operational impacts on the present and thus simplifies the MSSP model to a two-stage one. Our method can handle the stochastic problem regardless of the number of stages and provide an alternative approach for the overlong setting.

The remainder of the study is presented as follows. In Section 2, the MSSP model is developed. The dimensionality reduction approach based on DE is presented in Section 3. Specifically, Section 3.1 focuses on the reduction of the scenario tree while Section 3.2 implements the transformation of the MSSP model accordingly. In Section 4, the application in one section of Yangtze River, China, is displayed and analyzed. The last section presents conclusions and remarks about some directions for future research.

2. Problem Definition and Mathematical Formulation

2.1. Problem Statement

Consider a basin, where a government prepares a construction plan of cascade hydropower stations to control flood or drought, satisfy irrigation, and pursue profits from power generation. Thus, the location and capacity decision-making problem of cascade hydropower stations is studied here, in which the long-term stochastic operations should be taken into account. The integrated MSSP model is formulated to optimize the total performance which involves the construction costs in the planning stage, as well as operational costs, penalty costs, and profits of power generation in the subsequent multiple stages.

Six decisions of the model can be divided into two groups. The first group should be made in the planning stage, including (1) the final selection of hydropower stations from the candidates; (2) the capacity of each selected hydropower station. The decisions in the second group are in each operational stage. They are (3) water storage of each selected hydropower station in each stage; (4) loss flow of each selected hydropower station in each stage; (5) water discharge to downstream from each hydropower station in each stage; and (6) water discharge of power generation in each selected hydropower station each stage. During these operational stages, we consider uncertain water inflows (including precipitation and inflows from tributaries) and water demands (from residents, agriculture, and industry). The number of stages, defined as T, could be extremely large.

As shown in Figure 1, given the locations of cascade hydropower stations, three scheduling activities: water discharge of power generation, abandoned water spill (go downstream directly without passing generator units) and loss flow should be periodically made. The water storage of each hydropower station is determined by its scheduling activities and random water inflows. Notice that the loss flow mentioned is discharged to local areas, rather than downstream. When the loss flow is larger than the actual demand, it would incur corresponding penalties. Moreover, the physical limits on the levels of water discharge, loss flow, and power generation of each hydropower station should be satisfied in each operational stage.

2.2. Model Formulation

Sets, exogenous deterministic and stochastic variables are given in Table 1. Table 2 displays the decision variables in all stages.

The MSSP model which integrates the planning decisions and stochastic operational impacts is proposed as follows:

The objective function (1a) is composed of the construction costs of all selected hydropower stations and the expected value of the operational costs, the environmental penalty costs, and the benefits of power generation under all the stagewise scenarios.

Constraint (1b) limits the number of hydropower stations to be chosen. Thus, Constraints (1c) and (1d) ensure that the corresponding maximum capacity and water discharge of unselected hydropower stations are zero in which MM is a number big enough. Constraint (1e) gives the power generation limits. Constraint (1f) ensures that the water storage of the selected hydropower station is between the minimum required water storage and maximum capacity. Constraint (1g) gives water balance equations. Constraint (1h) shows the relationship between water discharge to downstream and water discharge of power generation, as well as water spill in each hydropower station candidate. Constraint (1i) presents the upper and lower limits of water discharge between two adjacent selected hydropower stations. Constraint (1j) guarantees that if the candidate is chosen, its loss flow should meet the local demand and cannot exceed the maximum discharge limit. And if a candidate is not selected, its loss flow is equal to the water demands. Constraint (1k) specifies the domains of the decision variables.

3. Model Transformation

Even the deterministic version of the proposed MSSP model is NP-hard. Besides, the dimension of stochastic scenarios in the MSSP model increases with the number of stages exponentially. It would further increase the solving difficulty. DE is a useful approach to handle the dimension problem of the large-scale setting [44]. It was first proposed in [42] to simplify the multistage deterministic convex optimization problem and then was applied to a multistage stochastic production-inventory programming model in [43]. The essence of DE is going to add up all the influence after a time point in the future by a so-called discount factor to compress stages. Hence, it coincides with our idea of aggregating the operational impacts on the planning decisions.

Next, we proposed a DE-based dimensionality reduction approach to address the above MSSP model. Specifically, because the size of the scenario tree is one of the main causes resulting in the computational difficulty, we first show how to compress the stages of the scenario tree at first (see Section 3.1). Along with the simplification of the scenario tree, we further show how to aggregate the operational decision variables and the deterministic parameters. With the compression of all these variables, the proposed MSSP model would be integrated into a two-stage one which is tractable (see Section 3.2).

3.1. Scenarios Generation and Simplification

Because the acceptance level of water discharge includes the quantity of water demands, we assume these two factors have a deterministic linear relationship. Thus, we focus on random water inflows and water demands here. Before we give the dimensionality reduction approach, we first show the generation and simplification procedures of stochastic scenarios by taking uncertain water inflows as an example. Uncertain water demands can be handled similarly.

We match each operational stage with a season. At seasonal stage t = 1, 2, …, T, assume that ξ = 1, 2, and 3 correspond to three levels of random inflows: high, medium, and low, with the corresponding probability of p1H, p1M, and p1L, respectively. The quantities of water inflows in hydropower station candidate i at stage t with level ξ can be represented as , i ∈ R. There are multiple stages considered in this study. Hence, each stagewise scenario of inflows, defined as s1 (∈S1), can be represented as a branch from t = 0∼T of the scenario tree, in which t = 0 represents the planning stage. The corresponding probability p(s1) is the product of occurrence probabilities of all the nodes on this branch. Figure 2 gives the scenario tree faced by hydropower station candidate 1.

As can be seen, the number of inflow scenarios of one candidate is up to 3T in T seasonal stages. For example, if we consider a 10 year horizon, the scenario size of inflows of one candidate would be 340(≈1.049 × 1019). Obviously, a simplification way is required. Here, we can remain the scenarios in early stages, while inflows in subsequent stages are replaced by corresponding constant estimated values. We do this for two reasons. The first is that the stochastic scenarios in subsequent stages are harder to give than these of early stages due to forecasting difficulty. Second, random inflows have intrinsic periodicity which means inflows in subsequent stages can be offset. Hence, an approximated way is to use the corresponding estimated values to replace the seasonal inflows in subsequent stages.

Thus, the scenario tree in Figure 2 is simplified to that in Figure 3 in which the uncertainties of t = 1∼4 remain. Denote the stagewise scenario in Figure 3 as and the corresponding scenario set as . After simplification, the number of the decision variables is still large because of the big T. Hence, the dimensionality reduction approach is proposed to address this issue.

3.1.1. Step 1 of Dimensionality Reduction—Stages Reduction by Aggregating the Random Factors of the Same Season

The dimensionality reduction approach includes two steps. We focus on the first one in this section.

Due to the seasonal periodicity of inflows, we propose to aggregate the inflows of the same season together (see Figure 4). Specifically, take the inflows of candidate 1 in the first stage (t = 1), i.e., , as an example. Without loss of generality, assume that T is the multiple of four. The subsequent water inflows with the same season, i.e., stage 4τ + 1 (τ = 1, 2, …, (T/4)–1), have the same value, denoted as . Similarly, , , and represent other three seasonal values of water inflows at candidate 1. Referring to the concept of present value, we use factor δ (0 < δ < 1) to discount the impacts of subsequent stages. Thus, the corresponding aggregated water inflows of candidate 1 in the first stage (t = 1) under the case of ξ, denoted as , are the discounted value which can be expressed as

Then, , , and can be obtained similarly. Thus, the scenario tree in Figure 3 is compressed into Figure 5, in which the value associated with each node is a discounted value now. However, the number of branches () and their corresponding occurrence probabilities (p()) remain unchanged.

3.1.2. Step 2 of Dimensionality Reduction—Stages Reduction Across Seasons

It is worth recalling that this study focuses on the planning problem with the consideration of future operational activities under uncertainties. Thus, the second step of the dimensionality reduction approach is to implement the further aggregation of the impacts of operational stages together. In other words, the random variables (t = 2, 3, 4) on a branch in Figure 5 will be integrated into the first operational stage (t = 1). For the example that the inflows of candidate 1 under stagewise scenario which is composed of four high-level stages (i.e., ξ = 1), its present value can be given as

Although the impacts of scenario is aggregated now, such process would not change the number of scenario and its probability p().

The stochastic scenario of random water demands could be handled similarly, and the corresponding final scenario set is denoted as which is composed by scenario with the probability p(). Hence, the set of total scenarios S used in the proposed MSSP model is simplified to , which is the Cartesian product of and :

The corresponding probability of new scenario (∈), which is the union of and , can be calculated as

Till now, the stagewise scenario tree is compressed into a one-stage scenario set .

3.2. Transformation of the MSSP Model

The above work simplifies the scenario tree by discounting the random variables. Accordingly, with the compression of the scenario tree, the deterministic parameters and the decision variables should also be aggregated, and thus, the MSSP model would be simplified. The transformation process also includes two steps.

3.2.1. Step 1 of Dimensionality Reduction

At first, we decompose the operational stages of MSSP model into multiple submodels according to each stagewise scenario s (∈S). Then, such submodel is not only multistage but also deterministic because it is given based on the independent scenario. For the submodel, we show how to simplify the variables and the model along with the reduction of the stages.

To facilitate the implementation of DE, we first reformulate the submodel as the following equivalent:

Step 1. Here, we focus on the objective. Specifically, similar to formula (2), the objective values at stage 4τ + j (τ = 0, 1, …, (T/4)–1, j = 1, 2, 3, 4) also should be compressed into the corresponding stage j. Thus, with the introduction of the discount factor δ, (6a) can be reformulated as (7):

Step 2. Along with simplification of the scenario tree, the decision variables and the deterministic parameters should also be aggregated. Because they exist in both the objective and the constraints, to facilitate our transformation, we integrate the constraints into the objective. Specifically, we use dual multipliers βk,4τ+j (k = 1, …, 7) to relax the k-th one of Constraints (6b)∼(6h) of stage 4τ + j (τ = 0, 1, …, (T/4)–1, j = 1, 2, 3, 4). Then, the optimization problem ((6b)∼(6h), (7)) can be transformed into the unconstrainted problem (8):The DE approach assumes the dual multipliers have the linear relationship with the discount factor δ [42]. Thus, let the dual multipliers here take the form βk,4τ+j = δ4τ·βk(j). Then, problem (8) can further be reformulated as

Step 3. Thus, similar to the discounting way of the random variables in Section 3.1.1, we can integrate the decision variables and the parameters under the same season together by δ. To make the above model concise, the so-called ‘integrated’ primal variables are introduced to replace the corresponding discounted values. For example,Other integrated primal variables can be given similarly, and all of them take an asterisk at their top right corner. Thus, with the introduction of the ‘integrated’ primal variables, the symbol in Objective (9) could be removed, and then, the operational stages 5∼T in the submodel could be compressed into the stages 1∼4, respectively. Thus, we have

Step 4. Notice that the above transformation is for the submodel which is under a specific scenario. Specifically, Step 2 relaxes the constraints to get a unconstrainted model, and then, Step 3 aggregates the multistage for this model. Here, we need to extract the relaxed constraints back at first and then combine the model with each scenario (∈) (see Figure 5) and the planning stage. Thus, we can get the following intermediate model:in which the primal variables can be given as follows:in which δT can be approximated as zero when the number of stages T is large enough. Similarly, the parameters could be given asWe can assume the unit operational cost of candidate i at each stage t is the same, i.e., ri. Then, (12m) can be further represented asMoreover, it is easy to know that , , , , , an d are all nonnegative.
Till now, along with the stage reduction as shown in Section 3.1.1, the T-stage operations of the MSSP model is integrated into a four-stage model. The corresponding scenario tree of this intermediate model is shown in Figure 5.

3.2.2. Step 2 of Dimensionality Reduction

Further model transformation is required due to the stage reduction in Section 3.1.2. Notice that j (=1, 2, 3, 4) in the intermediate model (12) refers to the first four operational stages. Here, we use symbol t to replace the corresponding j and then integrate stage t = 2, 3, 4 into the first stage (t = 1) as illustrated in Section 3.1.2. The way how to transform is similar to that in Section 3.2.1.

We still focus on the operational stage under a specific stagewise scenario. Thus, the corresponding submodel can be described as follows:

Step 5. The discount factor δ is still used to aggregate the value of the objective function at stage t (=2, 3, 4). Then, the objective function (13a) is transformed as

Step 6. Similar to Step 2, use dual multipliers β7+k,t (k = 1, …, 7), to relax the k-th one of Constraints (13b)∼(13h) of stage t. Thus, the operational optimization problem under stage t = 1∼4 ((13b)∼(13h), (14)) can be transformed to an unconstrainted one:in which the dual multipliers follow the form β7+k,t = δt−1·β7+k. Hence, problem (15) can further be expressed as

Step 7. Similar to Step 3, the “integrated” primal variables are introduced to replace the corresponding discounted values, respectively. Taking QLi(t) and Qimax as examples, we have:Here, all the primal variables have double asterisks in their top right corner. Thus,in whichand for each i (∈R), is a discounted value that can be expressed asFrom (20), we haveRefer to (20), the discounted value can be denoted asFrom (21) and (22), we haveUsing (23) to replace in (18), we have the discounted objective as follows:Similarly, δT in (24) can be approximated as zero due to the large T. Then, by extracting the relaxed constraints from the above objective function, we have

Step 8. Combine the above model with each scenario (∈), which is given in Section 3.1.2 and then integrate with the planning stage of the original MSSP model. Thus, the final DE model, which is a two-stage stochastic program, is developed as (26a)∼(26k).in which the discounted value of water inflows and demands and (i ∈ R) can be obtained similar to the way used in formula (3). Besides, in (26a) is an integrated variable:Moreover, for simplicity, the double asterisks of the decision variables are omitted.
After transformation, the planning decisions xi and Capi remain unchanged, while the operational stages are aggregated. Therefore, the proposed approach not only keeps the future impacts but also reduces the solving complexity.

4. Experimental Study

To validate the applicability of our proposed approach, we give the data of a case in Section 4.1 at first. Then, based on this case, we compare our approach with the MSSP model in the small-scale setting in Section 4.2. Furthermore, we apply our approach to an overlong-term case in Section 4.3 and finally implement sensitivity analysis in Section 4.4.

4.1. Description of the Case

Yangtze River flows through 11 provinces in China with a drainage area of 1.8 million square kilometers. The whole river can be divided into the upstream, midstream, and downstream, among which the basin with a length of 1030 km in upstream is studied here. According to Wang et al. [45], five hydropower station candidates could be considered in this basin. Its schematization is shown in Figure 6 which also involves the main tributaries.

Among the five hydropower station candidates, at least two of them should be chosen in the plan. Unit construction fee (e) and unit profit of power generation (f) in Table 1 are set to be 5.09 (RMB/m3) and 0.25 (RMB/KWh), respectively [46]. Related parameters regarding five candidates are displayed in Table 3. Once the hydropower station candidate is chosen, the planned lowest water storage and upper and lower limit of water discharge are shown in Table 4. The initial water flows from upstream to each hydropower station candidate is set to be 0.2141, 0.0988, 0.0942, 0.0429, and 0.1080 (m3 × 108/h), respectively. And the initial water storage of each chosen hydropower station is assumed as the twice of the lowest water storage.

4.2. Comparison with the MSSP Model by Small-Scale Cases

The developed DE model is an approximate version of the MSSP model (1). In this section, we aim to evaluate the performance of our approach by comparing the planning decisions as well as the operational performance with these of the MSSP model. Due to the curse of dimensionality faced by the MSSP model, we consider small-scale cases based on Section 4.1.

4.2.1. Parameters in Small-Scale Experiments

To limit the dimensionality here, we set the planning horizon as 1, 2, and 3 years, respectively. According to the seasonal features of which, one year is divided into four stages: spring (March to May), summer (June to August), autumn (September to November), and winter (December to February). Then, each stage has 2160 (h) (i.e., Δt). Thus, the number of operational stages T is 4, 8, and 12, respectively. Besides, we merely consider the uncertainty caused by water inflows. Two kinds of stochastic outside water inflows (H, L), as well as the corresponding probabilities (p1H, p1L), in each seasonal stage are given in Table 5. The constant water inflows and water demands used in the simplification are displayed in Tables 6 and 7, respectively. The acceptance level of the water spill is assumed as twice of the demands.

Because the random inflows fluctuate with seasons, we assume the water inflows of each candidate take the same uncertain level in each season. Thus, for the MSSP model, 16(=24), 256(=28), and 4096(=212) stagewise scenarios are constructed, respectively. Furthermore, according to the generation and simplification process of the scenario set displayed in Section 3.1, 16 scenarios are generated for the two-stage DE model. Assume that spring is the first operational season. The discount factor δ used in the DE model is set to be 0.99, 0.95, …, 0.60, respectively.

4.2.2. Computational Results

We optimally solve the DE model and the MSSP model by IBM ILOG CPLEX 12.6.3 on a PC with 8 GB memory and a CPU at 2.5 GHz.

It is found that the DE model can generate the same planning results with the MSSP model for all these small-scale cases. Besides, in terms of the operations, the DE model considers the multistage operational impacts by the aggregated way. Hence, it is interesting to observe how accurate the DE model is in the estimation of the future operational performance of the cascade hydropower system. We use DE_2nd and MSSP_2nd to denote the operational performance generated by the DE model and the MSSP model, respectively. Table 8 displays the estimation gap (=(DE_2nd-MSSP_2nd)/MSSP_2nd) under different discount factors.

It is found that the operational performance generated by the DE model significantly varies with the discount factors. Specifically, when δ is large, more future impacts would be taken into account which results in the overestimation. Hence, the estimation gap is positive. With the decrease of δ, less future performance would be discounted. Thus, the estimation gap tends to be negative gradually. It also means that the DE model can generate accurate estimation of the future operational performance by choosing the appropriate discount factor.

Moreover, the computational time of the DP model outperforms that of the MSSP model greatly especially when the horizon is large, as shown in Table 9.

In summary, the DP model can yield the same location decisions with the MSSP model and the accurate estimation of the future operational performance by choosing a suitable discount factor, while less calculational time is required.

4.3. Real Case Study

Section 4.2 validates the effectiveness of our proposed DE model in the small-scale setting. In this section, we further apply our approach to a large-scale case to illustrate its applicability.

4.3.1. Parameters in the Large-Scale Case

Based on the case given in Section 4.1, we consider the 40-year planning horizon (i.e., T = 160). Three kinds of stochastic water inflows (H, M, and L) and two kinds of stochastic water demands (H, L) are taken into account in each seasonal stage. The corresponding probabilities of two random variables are (p1H, p1M, p1L) and (p2H, p2L) as shown in Table 10. Refer to Wang et al. [47] and Yang and Zhang [48]. Uncertain water inflows which mainly include precipitation and water from main tributaries (drainage areas larger than ten thousand square kilometers) are given in Figures 7 (precipitation) and 8 (tributary), respectively. Random demands are given in Figure 9. Besides, the predicted water inflows and demands used in the DE model are shown in Table 11, and the random acceptance level of water spill is also assumed as twice of the corresponding demands.

Thus, the number of stagewise scenarios in the MSSP model is up to 6160(≈3.193 × 10124), which is impossible to handle exactly. By the DE model, the number of scenarios is reduced to 1296. The discount factor δ is set to be 0.95 here.

4.3.2. Computational Results of the DE Model

The optimal solution of the DE model can be obtained by within 2047 (ms) by IBM ILOG CPLEX 12.6.3, in which Candidates 1, 4, and 5 are selected. Table 12 shows the capacity decisions and the operational performance, including the operational cost, the penalty cost, and the profit of power generation, of each selected candidate. The total performance value including the planning stage and the operational stages is 1.576 × 1011 (RMB).

We further observe two extreme scenarios. One is scenario 16 (high-level inflows and low-level demands in all four seasons), and another scenario is 1281 (low-level inflows and high-level demands in all four seasons). The corresponding scenario-dependent operational results of five hydropower station candidates under two scenarios are shown in Tables 13 and 14, respectively.

As shown in Tables 13 and 14, the daily scheduling strategies would be adjusted according to different scenarios. First, in terms of loss flow (QL), when scenario 16 happens, the value is relatively high in hydropower station 1 to prevent the flood (see Table 13). While if scenario 1281 occurs, the loss flow merely satisfies demand (see Table 14). Then, regarding water storage (y), the value is high in hydropower station 5 under scenario 16 due to the high-level inflows and the low-level demands. However, scenario 1281, which represents the low-level inflows and the high-level demands, requires hydropower stations to utilize their storage efficiently and then results in low water storage. Furthermore, in the computational results, the scheduling strategies would ensure that the values of water discharge (Q) are between the upper and lower limits in both scenarios, making it capable of preventing flood and drainage. Notice that the water discharges of power generation under two scenarios are the same and the upper limit of output has been reached under both scenarios. Thus, the scheduling strategies can always pursue maximum benefits of power generation under different scenarios. In such a way, the proposed DE model can optimize the global performance in the uncertain setting.

4.4. Sensitivity Analysis

In this section, we carry out the sensitivity analysis to reveal how critical parameters influence the strategic decisions and the performance of the cascade hydropower stations. Because one of the merits is to handle overlong stochasticity by the idea of aggregation, two natural concerns are how long and how much are discounted. Here, we focus on the planning horizon (T) and the discount factor (δ).

4.4.1. Sensitivity Analysis of the Planning Horizon

The planning horizon (T) is one of the main factors causing the curse of dimensionality [19, 41]. The DE model aggregates the multiple operational stages, but T would influence the results of the model transformation. Hence, we focus on the length of the planning horizon here. Considering the long-lasting effects of the location decisions, we set the length of the planning horizon as 10, 20, 30, 40, and 50 years, respectively. That is, we consider T = 40, 80, 120, 160, and 200. Other parameters are the same as those in Section 4.3.

After calculation, it is found that the decisions of location and capacity under different T, which are the planning results we care about in our study, are the same as these of T = 160 (see Table 12). It means these planning horizons would not alter the strategic decisions of the DE model.

Furthermore, the cost terms and the profit of power generation are displayed in Table 15. It is found that the operational cost and the profit of power generation remain unchanged while the penalty approaches to be stable with the increase of T. Specifically, the operational cost is the capacity-related cost within the planning horizon. In the DE model, the impacts of the capacity-related cost are aggregated by δT, respectively. When T is extremely large, δT can be approximated as zero, see (12l) and (12n). Hence, the operational cost keeps unchanged. Besides, to calculate the benefit of the power generation, it is found that the maximum output is reached in our cases. Thus, after aggregation by δT, the benefit of the power generation also remains constant. Moreover, the penalty of hydropower station 4 and 5 are zero because the loss flows are smaller than the corresponding penalty thresholds. However, T would influence (see (20)) at first and then the loss flow by constraints (25e). Hence, the penalty of hydropower station 1 would increase gradually until coming to be stable.

Based on the above three terms, with the increase of T, the total performance also shows a trend of increasing first and then approaches to be stable. This result coincides with the essence of the DE model. Specifically, the farther the future is, the fewer the impacts would be taken into account. Thus, the results would close to be stable finally. It is also in accord with the decision-making in the real-world, i.e., the coming future would be put more emphasis than the superfar future. Hence, when T is large enough, the calculational results of the DE model would tend to be stable. It means that the DE model could describe the steady results of long-term stochastic operations. Thus, for the decision-makers (DMs), it is not necessary to increase the planning horizon infinitely.

4.4.2. Sensitivity Analysis of the Discount Factor

Discount factor (δ) is an exogenous variable introduced by the DE model to help to reduce dimensionality. In Section 4.2, we have shown that the DE model can generate accurate operational performance estimation by selecting appropriate δ. In this section, we further explore the impacts of the discount factor in the large-scale setting. We vary it as 0.99, 0.98, …, 0.88. Other parameters are the same as those of Section 4.3.

The calculational results show the discount factors in the value range above will not influence the final strategic decisions. Besides, the operational cost, the penalty cost, the profit of power generation, and the total performance under different discount factors are shown in Figure 10. Firstly, it is shown that, with the decrease of the discount factor, both the operational cost and the profit decline with a decreasing speed since they are discounted more. Moreover, in terms of the penalty cost, it remains zero when δ is large (0.99). This is because the discounted penalty threshold also decreases but is still larger than the loss flow. Then, the penalty cost increases significantly with the decline of δ to 0.98. The reason is that the discounted penalty threshold begins to be smaller than the loss flow, leading to the larger penalty. Finally, with its continuous decrease (less than 0.98), δ dominates the penalty term which makes the aggregated penalty smaller.

These results indicate different discount factors will not influence the location decisions but result in the variation of the operational performance. Hence, the DMs should carefully choose the discount factor if they want to get an accurate estimation of the operational performance.

4.4.3. Mutual Effects of Two Factors

We further investigate the mutual impacts of two factors on decisions and objective terms. We set T as 40, 80, 120, 160, and 200 and δ as 0.99, 0.98, …, 0.88 and then calculate the model under each combination of two kinds of parameters.

From the calculational results, we find that the locations and capacities of hydropower stations remain unchanged under the different combinations of planning horizon and discount factor. Given a δ, the operational cost and the profit of power generation will not be influenced by the change of T, which is similar to Table 15. Hence, we focus on the penalty and the total operational performance.

Figure 11 shows the varying pattern of the penalty with the decrease of δ under each T. Specifically, the penalty increases first and then decreases. The reason is the same as that in Figure 10. However, with the increase of T, the peak point would appear earlier. It is because bigger T can bring significant discounting effects due to δT.

Furthermore, we show the change of the total performance in Figure 12. The variations under different T are very similar. Specifically, with the decline of δ, the total performance gradually decreases and finally tends to be stable. Besides, as δ becomes smaller, the total performance would reach its steady state within a shorter planning horizon, specifically, around 120 periods when δ = 0.96 and around merely 40 periods when δ = 0.90. Thus, we conclude that when the low δ is considered, the DMs could only consider a relatively shorter planning horizon since the impacts in the long-term future would be significantly discounted.

5. Conclusions and Future Research Directions

The integrated planning of cascade hydropower stations, which takes the operational strategies under future uncertainty into account, can bring global benefit. An MSSP model is developed to deal with the location and capacity decisions of cascade hydropower stations with the consideration of the expected long-term performance. To address the computational issue caused by the overlong stochasticity, a dimensionality reduction approach based on DE is designed to transform the MSSP model into a two-stage stochastic programming model, in which future impacts remain by aggregating the operational stages together. Thus, the optimization of the large-scale hydropower stations could be addressed regardless of the number of operational stages.

Experiments based on Yangtze River in China are implemented. First, by the comparison with the MSSP model under the small-sized cases, it is found that our approach can obtain the same planning results and accurate estimation for future operational performance with a short computational time. Then, a large-scale case is studied to show how the approach is applied to solve a practical problem. Moreover, to evaluate the influence of critical parameters in the proposed DE model, sensitivity analysis is conducted based on the planning horizon and the discount factor. The results reveal that our model is robust to different parameter settings to obtain stable location-related planning results, while the estimation of the future performance would be influenced by the planning horizon and the discount factor. The corresponding observations and implications are also obtained from the results.

In the future, we can extend our study in the following aspects. First, the discount factor plays an important role in our study. Specifically, it not only discounts the future impacts to get the so-called net present value but also has the internal relationship between the dual multipliers. Thus, it influences the accuracy of the future estimation of the DE model. In this paper, we follow the way of Grinold [42] and King and Wallace [43] to give the discount factor directly. But, how to select the appropriate discount factor to get better performance estimation should be considered in the future study. It is a hard issue but deserves further effort. Second, the minimum installed power and the nonlinearity, like the stage-storage curve, are ignored in this paper. The consideration of these requirements and constraints could help to improve the applicability of our approach to practical problems, which leaves certain spaces for future study. Finally, in this paper, the impacts of time-varying stochastic operations are considered in the form of expected value. Another popular way is to formulate the objective of Value-at-Risk or Conditional Value-at-Risk to incorporate the DMs’ risk attitudes. Then, is it possible to utilize the characteristics of Value-at-Risk or Conditional Value-at-Risk to facilitate our scenario tree reduction? It is another interesting research direction.

Data Availability

All data included in this paper are available upon request by contact with the corresponding author.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

The authors wish to express their gratitude to the five anonymous referees for their constructive criticism, which led to substantial improvements of the paper. This research was partly supported by the National Natural Science Foundation of China (Grant no. 71971053, 71832001, 71572033) and the MOE (Ministry of Education in China) Project of Humanities and Social Sciences (Grant no. 18YJA630129).