Abstract

Blackening and odorization of heavily polluted rivers has become a serious concern and threat to ecological and human health. This paper aims to gain a deeper understanding of changes in water pollution and the cause of pollution formation in a heavily polluted river in the upper Hai River. In this study, comprehensive water quality index (CWQI) and multivariate statistical techniques (MSTs) were applied to assess the spatiotemporal variation characteristics of water pollution and to identify potential pollution sources. The seasonal Mann–Kendall (SMK) test and the SMK test of flow-adjusted concentrations were effectively used to explore the temporal variation trends of major pollutants and the causes of their formation. Data of 15 water quality parameters were analyzed during 1980–2018 at 19 monitoring sites in the mainstream and major tributaries of the Xinxiang Section of the Wei River (XSWR). The results showed that the rivers were seriously polluted from 1991 to 2009, but the water quality improved after 2010. Nineteen sampling sites were divided into a low pollution region and a high pollution region. In the flood season, the pollution sources were mainly domestic sewage, industrial wastewater, agricultural runoff, biochemical pollution, and natural sources. In the nonflood season, the pollution sources were mainly domestic sewage and industrial wastewater. In recent years, the water quality of seriously polluted river has generally improved, mainly due to reductions in pollutant discharge from point sources and nonpoint sources.

1. Introduction

A heavily polluted river called a black-odorous river often appears black or dark green and is accompanied by a strong, unpleasant odor, which seriously affects people’s daily lives and threatens the aquatic ecosystem [1]. Eliminating black-odorous rivers is listed as an urgent task promulgated by the State Council of China [2]. Studies have shown that the spatiotemporal variations in water pollution are jointly determined by both natural factors and human activities, such as climate, geography, seasonal variation, land-use patterns, population density, and damming [35]. Human activities such as high-intensity development and utilization, dam and gate adjustment, and sewage discharges contribute significantly to the degree of blackening and odorization of water in heavily polluted rivers [6, 7]. For effective water management, it is very important to collect reliable information about the water quality of heavily polluted rivers to assess spatiotemporal water quality changes, identify pollution sources, determine the status of water quality, and control water pollution in rivers [8].

The Hai River Basin is located in the semiarid region of the Northern Hemisphere. It is the political and cultural center of China, with numerous cities and a developed economy. Since the 1980s, there has been a widespread trend of drought in the upper plain [9, 10]. Many seasonal water-deficient rivers are replenished by unconventional water resources such as sewage and wastewater and interbasin water transfer projects [11]. Point sources such as domestic sewage and industrial effluent as well as nonpoint sources related to agriculture have produced serious pollution loads [12]. River water pollution poses a serious threat to ecological functioning and the health of the local residents, and the issue of water security is very prominent. In recent years, with the addition of effective pollution control measures, the water quality of heavily polluted rivers is currently recovering, and the spatiotemporal variation characteristics of water pollution and the causes of pollution formation have changed. Understanding spatiotemporal variation characteristics of water pollution is very important for solving the pollution problem. However, at present, there are few studies on changes in the water pollution in heavily polluted rivers in the upper reaches of the Hai River. It is urgent and useful to evaluate the spatiotemporal variation characteristics of water pollution in heavily polluted rivers and further analyze the cause of changes in river pollution.

For a small river basin, human activities obviously impact the river water quality due to a river’s sensitivity and vulnerability to external response in a short time [13]. In this study, the Wei River, as an important tributary of the upper Hai River, is selected as the study area for its representativeness in heavy pollution due to human activities in the Hai River Basin. Excessive anthropogenic input results in the blackening and odorization of this river, which seriously affects people’s lives.

Water quality index (WQI) is widely used in the assessment of water quality of rivers and plays an important role in water resource management [8, 14]. To determine the degree of blackening and odorization of surface water, Xu [15] described the comprehensive water quality index (CWQI) of surface water used to convert multiple water quality variables into a single number that reflects the status of water quality. CWQI has been successfully applied to assess river water quality in Shanghai, Nanjing, and other places [1].

In recent years, multivariate statistical methods (MSTs) such as cluster analysis (CA), discriminant analysis (DA), principal component analysis/factor analysis (PCA/FA), and multivariate linear regression (MLR) have been effectively applied to the assessment of surface water quality, evaluation of spatiotemporal variations in water quality, and identification of latent pollution sources by calculating only the observed data of water quality [1619].

The increase in pollutant loadings is an important cause of the degradation of water quality. Quantifying the cause-effect relationship between pollutant sources and water quality variations allows regulators to implement targeted measures to control water pollution. In previous studies, various methods of analysis have been employed to identify river pollution sources, and methods have been classified into two categories: (i) diffusion models, such as QUAL, MIKE, WASP, and EFDC [20, 21], and (ii) receptor models such as the chemical mass balance model (CMB), the positive matrix factorization model (PMF), the UNMIX model, and absolute principal component score-multiple linear regression (APCS-MLR) [2224]. Diffusion models can describe the fate and transport of water quality constituents in receiving water bodies. Therefore, pollutants can be spatially traceable [25]; however, taking the river basin as a unit and requiring a large amount of river basin data (such as meteorology, terrain, hydrology, soil type, and land use), the difficulty in data acquisition and the complexity of modeling is significant. Thus, it is time-consuming and difficult to apply, especially to rivers replenished by interbasin water transfer projects. Receptor models use the physical and chemical characteristics of pollution sources and sampling sites to analyze contributions of the pollution sources to each variable at sampling sites [26]. These methods are simple to operate and widely applied in pollution studies [27]. APCS-MLR, a popular receptor model, has recently been employed in the apportionment of pollution sources [26].

A literature review of these methods is presented below. CA is often used to effectively analyze spatiotemporal variations in water quality. Zhou et al. [27] applied CA to split the year into two periods, June–September and the remaining months, and divided the entire area into two parts representing different pollution levels in eastern Hong Kong. Furthermore, they considered appropriate data pretreatment, including estimation of missing data, examination of normal distributions, and data transformation, which can greatly influence the results of multivariate analysis. DA can confirm clusters determined by CA based on the accuracy of discriminant functions. Chen et al. [28] used DA to construct discriminant functions in three modes (standard mode, forward stepwise mode, and backward stepwise mode), which yielded a classification matrix correctly assigning 91.25%, 90.83%, and 90.83% of the cases, respectively. In addition, Singh et al. [18] showed that DA offers important data reduction by using only six variables discriminating spatial pattern and two variables for temporal variation in the Gomti River in India.

PCA/FA is a dimension-reduction technique that has been used to identify latent factors or pollution sources [29]. Tanriverdi et al. [30] applied PCA to analyze and assess the surface water quality of the Ceyhan River. Three principle components (PCs) were extracted by PCA, which explained 79.14% of the total variation. Stations near cities were strongly affected by household wastewater, while other stations were influenced by agricultural facilities. In order to reduce the impact of less significant variables, PCs were rotated by a maximum variance method to create new variance factors (VFs) [26]. Bu et al. [13] also calculated total factor scores using PCA/FA and identified the water quality at each sampling site in a polluted river near Liaodong Bay in Northeast China. The higher the total factor score, the worse the river water quality.

APCS-MLR can provide quantitative contributions of each source type to each variable [27]. Su et al. [31] found that most variables were related to agricultural runoff (NH3–N, 85.3%; CODMn, 74.2%; DO, 69.9%; and BOD, 70.5%) and soil weathering (F, 78.9%) for Zone D using the APCS-MLR in the Qiantang River in China. Furthermore, they also discovered that the concept of zoning should be taken seriously in water pollution control. Zhou et al. [27] studied source apportionment of coastal water pollution using the APCS-MLR method in eastern Hong Kong.

It is clear from the above literature review that many research studies have focused mainly on water quality variations in river basins or lakes but that attention has rarely been paid to the water pollution of heavily polluted rivers. Moreover, previous studies identified potential pollution sources and calculated the contribution of the potential pollution sources to each water quality parameter but did not further analyze the cause of pollution formation; that is, it is unknown whether pollution is caused by changes in flow or changes in pollution sources and whether pollution sources are mainly from point sources or nonpoint sources; answering these unknowns is key to solving the problem of water pollution. The seasonal Mann–Kendall (SMK) test and the SMK test of flow-adjusted concentrations can be used to analyze the change trend of water quality and the influence of flow and pollution sources on pollutant concentrations [32, 33] while simultaneously avoiding the influence of seasonality, lack of watershed data, and interbasin water transfer. CWQI, MSTs, and SMK test have some limitations when used alone for water quality assessment. Therefore, applying these mathematical instruments together can be more effective to assess water quality of heavily polluted rivers. However, few studies have used these mathematical instruments together for water quality assessment of heavily polluted rivers. It is therefore essential to perform an assessment of water quality and analyze the cause of pollution formation in heavily polluted rivers, especially in densely populated areas lacking data. A timely study would be not only essential but also instrumental in taking appropriate remedial action to improve the quality of the environment for sustainable development of the region.

Considering the above points, this paper took the Wei River as an example and used CWQI, MSTs, and the SMK test and the SMK test of flow-adjusted concentrations to analyze spatiotemporal variation characteristics of water pollution, the potential pollution sources, and the cause of pollution formation in the Xinxiang Section of the Wei River (XSWR) from 2013 to 2018. First, the water quality data were compared with the environmental quality standards for surface water of China to determine the major pollution parameters. Second, we also analyzed black-odorous levels by CWQI and the long-term temporal patterns of major pollutants from 1980 to 2018. Third, the spatial variation characteristics in river water quality were studied by CA and DA, and PCA/FA was used to identify potential pollution sources of rivers and confirm the spatial variation results of CA to determine the most polluted river reaches. The contribution rate of potential pollution sources in the flood season and nonflood season to each water quality parameter was further estimated by APCS-MLR. Finally, the causes of pollution formation in the most polluted river reaches were explored by the SMK test and the SMK test of flow-adjusted concentrations.

2. Materials and Methods

2.1. Study Area

Wei River is the main branch of the Hai River. The Wei River has a length of 388 km and a catchment area of 16578 km2 (Figure 1). The 9 major tributaries of the Wei River were selected for an analysis of their water pollution, including the Communist Canal; the Yu, Shimen, Baiquan, and Cang Rivers in Northern Xinxiang; and the Dashilao, People’s Victory Canal, Ximeng, and Dongmeng Rivers in Southern Xinxiang. The Communist Canal, the largest tributary of the Wei River, draws water from the Yellow River in the southern part of Xinxiang City and joins the Wei River in Hebi City and is responsible for irrigating farmland and draining floodwater from the northern mountain area into the Wei River. The mean annual discharges of the Wei River and the Communist Canal are both less than 15 m3/s. The study area belongs to the continental monsoon climate zone, with a mean annual evaporation of 898.0 mm and an average temperature of 14°C. The mean annual precipitation in the study area is 610.8 mm, and approximately 70% of annual precipitation occurs during the summer season (June–September), with abundant rainstorms. The study area has undergone a reduction of precipitation in recent years. Tributaries such as the Dashilao, Baiquan, Ximeng and Dongmeng Rivers flow through densely populated areas with developed industry and agriculture, which are the major rivers carrying pollutants, and river water pollution is more serious.

2.2. Data Collection

The monthly water quality data and daily flow data were provided by the Henan Hydrology and Water Resources Survey Bureau. There were 9 monitoring sites (G1–G9) along the mainstream and 10 monitoring sites (Z1–Z10) along the tributaries, namely, 6 monitoring sites along the Wei River, 3 monitoring sites along the Communist Canal, 2 monitoring sites along the Yu River, 2 monitoring sites along the Shimen River, and 1 monitoring site along each of the other rivers. Sites Z3, Z4, and Z10 were located in the Baoquan Reservoir, Shimen Reservoir, and Tagang Reservoir, respectively. Water quality data were collected from 2013 to 2018, with additional time series data from 1980 to 2018 at site G2 and from 1982 to 2018 at site G4. Flow data were collected from 2013 to 2018. Table 1 provides information of the major rivers and 19 monitoring sites.

The 15 water quality parameters at each monitoring site included water temperature (Temp), pH, dissolved oxygen (DO), chemical oxygen demand (CODCr), 5-day biochemical oxygen demand (BOD5), ammonium nitrogen (NH3–N), total phosphorus (TP), fluorine (F), cyanide (CN), volatile phenol (VP), cuprum (Cu), zinc (Zn), lead (Pb), cadmium (Cd), and chromium (Cr6+). These parameters reflect the physical properties, organic constituents, biological properties, and nutrient constituents of the rivers and metals in the rivers. Sampling, preservation, transportation, and analysis of the water samples were performed according to the environmental quality standards for surface water of China (GB3838-2002) [34]. This standard divides water quality into five grades (I–V) in accordance with environmental functions and protective objectives. Grades I and V correspond to the best and worst water quality, respectively. The grade III was frequently used in assessing the water quality in China because it indicated the water was suitable for fish, aquaculture, and swimming.

2.3. Statistical Analysis

Cluster analysis (CA) can be used to classify or cluster objects based on their similarity [3, 35]. Hierarchical cluster analysis (HCA) is one of the most common CA methods that could reduce the dimensionality of the original data dramatically [26]. HCA using Ward’s method with the squared Euclidean distance was adopted in this study [36]. The standardized quotient between the linkage distances for a particular case divided by maximal linkage distance is expressed as [37]. Discriminant analysis (DA) is usually used to confirm clusters found by CA and to identify significant parameters [38]. Standard and stepwise modes of DA were used in this paper.

Principal component analysis (PCA) is a dimension-reduction technique that attempts to explain the information of a large set of original variables by transforming them into a smaller set of variables, thus achieving the purpose of data reduction [39]. Factor analysis (FA) further reduces the contribution of these less significant variables through VARIMAX rotation and produces new groups of variation called variance factors (VFs) [28]. VFs were extracted based on the principle of the cumulative contribution of variance ≥80%. The VFs are explained based on the factor loadings, which were classified as “strong,” “moderate,” and “weak,” corresponding to absolute loading values of >0.75, 0.75–0.50, and 0.50–0.30 [40]. To understand the comprehensive pollution status, FA was also performed to calculate the total factor scores. Factor scores for each factor at different sampling sites were calculated based on the Anderson–Rubin method [13]. The total factor scores of different sampling sites were estimated by the factor scores and eigenvalues [13]. To examine the suitability of the data for PCA/FA, the Kaiser–Meyer–Olkin (KMO) test and Bartlett’s sphericity test were applied to the prepared datasets [5]. PCA/FA can be applied to the data to achieve significant reduction in the dimensionality of the original dataset only KMO >0.5 and (at a significance level of 0.05) [41, 42].

A nonparametric method, the seasonal Mann–Kendall (SMK) test, was used to identify whether there was a monotonic trend (i.e., increase or decrease through time) of the water quality parameters by comparing the pollutant concentrations of individual months over the years [43]. The SMK test of the flow-adjusted concentrations was used to further analyze the trends in the water quality parameters after the removal of flow [44]. If the significance level was ≤0.01, the water quality trend was statistically significant. If the significance level fell in the range 0.01 < a ≤ 0.1, the trend was not statistically significant. If the significance level was >0.1, there was no trend in the time series (trendless time series).

2.4. Modeling
2.4.1. CWQI

The CWQI is based on five levels of the environmental quality standard for surface water of China, with CODMn, BOD5, DO, NH3–N, and TP being the primary physiochemical variables [1]. CWQI is calculated according to the dissolved oxygen variable and the undissolved oxygen variables, respectively. In this study, CODCr was used instead of CODMn. The CWQI of DO was calculated as follows:

The CWQI of some variables, such as CODCr, BOD5, NH3–N, and TP, was calculated as follows:where Ci is the observed value of variable i; i= CODCr, BOD5, DO, NH3–N, and TP; Si,j is the j class value of variable i; and m is a correction coefficient. The final CWQI was the average score of all of the variables:

The CWQI index increases with an increasing degree of pollution, and a CWQI value greater than 7.0 indicates black-odorous water.

2.4.2. APCS-MLR

Absolute principal component score-multiple linear regression (APCS-MLR) was used to conduct multiple linear regressions of the water quality parameters and absolute factor scores to identify the contribution rate of the factors to each water quality parameter [45], which can expressed aswhere is the concentration of different water quality variable , is the constant obtained by the MLR analysis of the water quality variable , is the regression coefficient of the water quality variable and the extraction factor , and is the absolute principal components score of extraction factor of the sample . The and represent the total number of samples and extraction factors, respectively. In general, the coefficients of determination R2 > 0.5 and demonstrate that the result of APCS-MLR model was reliable [41].

2.5. Data Treatment

For statistical analysis (CA, DA, and PCA/FA) and APCS-MLR, the median pollutant concentration was determined to represent the overall water quality status from 2013 to 2018. The following data pretreatment methods were used for the water quality dataset: (i) values below the lower bounds were replaced with the midvalue between zero and the detection limit; (ii) the Kolmogorov–Smirnov (K–S) statistic was processed to test the normal distribution of the data since most multivariate statistical analyses require the variables to be normally distributed [46]. The K–S test is simple to use and is computationally efficient. Therefore, data with a nonnormal distribution were logarithmically transformed [27]. To avoid misclassification due to wide differences in data dimensionality, all the log-transformed variables were further z-scale standardized for CA, PCA/FA, and APCS-MLR analyses, whereas the original data were still used for DA analysis [47]. Details of standardization of data can be found in [42].

All mathematical computations were performed using Microsoft Office Excel 2007 (Microsoft Corporation, Redmond, WA, USA) and SPSS 22.0 (IBM Corporation, Armonk, NY, USA). The SMK test developed by the United States Geological Survey (USGS) was used.

3. Results and Discussion

3.1. Evaluation of River Water Quality

The summary characteristics of each parameter from 2013 to 2018 from 19 monitoring sites and from the guidelines of the national standard for surface water quality are shown in Table 2. By comparing each parameter’s mean value and the associated national standard, it can be seen that the overall water quality of the XSWR is poor. The mean temperature reached 18.47 °C, ranging from 1.50 °C to 33.50 °C. Six (CODCr, BOD5, NH3–N, TP, F, and VP) out of 13 parameters (not including Temp and DO) were greater than the requirement of standard III. CODCr, NH3–N, and TP were greater than requirement of standard V, and pollution by organics and nutrients in the study area was serious. Comparing each parameter’s maximum value with the associated national standard, of the 13 parameters, only CN, Cu, and Cr6+ had concentrations that met standard II, and Zn met standard IV, whereas the remaining 9 parameters had concentrations greater than the requirement of standard V. Among the coefficients of variation, the coefficient for pH (6.13%) indicated that the spatiotemporal change of pH is small. The coefficients of variation for NH3–N (100.81%), TP (153.86%), VP (317.45%), Cu (337.36%), Zn (213.96%), Pb (289.85%), Cd (216.61%), and Cr6+ (103.97%) all exceeded 100%, indicating that these monitoring parameters had strong spatiotemporal differences. VP, Cu, Zn, Pb, and Cd even exceeded 200%, indicating that they were greatly influenced by human beings and had strong differences. Water quality parameters that values were above the requirement of standard III were found at 19 monitoring sites, indicating the water was unfit for drinking and direct contact with the human body without any form of treatment.

The organic and nutrient pollution in the study area was serious and may have caused serious disturbances to the river ecosystem, which was related to the high emissions of pollutants in the study area. In 2018, water with concentrations meeting the requirements of standard IV, standard V, and inferior class V was found in 53.8% of the 160 monitoring sections in the Hai River [48]. The key pollution parameters in the Hai River were CODCr, CODMn, and BOD5. Organic pollutants mainly come from domestic sewage, livestock wastewater and wastewater from food, papermaking, the chemical industry, leather making, and the printing and dyeing industries [49, 50]. Among them, persistent organic pollutants (POPs) pose a serious threat to the environment and human health because of their long-term residual activity, bioaccumulation, semivolatility, and highly toxic nature [51]. Enrichment of N and P in rivers can lead to the overgrowth of algae and other aquatic plants, leading to eutrophication [52]. Nutrients mainly come from a variety of artificial and natural sources in the watershed, such as industrial wastewater, domestic sewage, and agricultural runoff [53, 54].

3.2. Long-Term Temporal Patterns of CWQI and Major Pollutants
3.2.1. Major Pollutants

Based on the analysis of the overall water quality in the study area, the long-term (1980–2018) variations and trends of the mean values of an organic matter parameter (CODCr) and a nutrient parameter (NH3–N) in the flood season and nonflood season were selected for further analysis. Site G2 was selected to represent the water quality of the Communist Canal and site G4 to represent the water quality of the Wei River to analyze long-term temporal variations in water quality (Figure 2).

At the two monitoring sites, CODCr mostly exceed the requirement of standard III (20 mg/L), and the NH3–N concentration also mostly exceeded the requirement of standard III (1.0 mg/L). The rivers in the study area have been seriously polluted for many years. The concentrations of the water quality parameters at both sites fluctuated during the research period, with the concentrations generally increasing first and then decreasing. The MK test was conducted on the annual mean concentrations of CODCr and NH3–N over the study period, and the results showed that the trends were all statistically significant during the study period (at a two-tailed confidence level a = 0.05). CODCr at site G2 reached obvious peak values between 1995 and 2002, with a maximum of 935 mg/L in the 2001 flood season and 46.8 times higher than the requirement of standard III. Before 1985 and after 2012, the concentration of NH3–N was relatively low. Obvious peak values in the nonflood seasons occurred in 1988–1995, while obvious peak values in the flood seasons occurred in 2001–2009. In the 2005 flood season, the NH3–N concentration reached a maximum of 64.2 mg/L, 64.2 times higher than the requirement of standard III. From 1990 to 2001, CODCr at site G4 showed obvious peak values: for example, CODCr during the 1995 flood season was as high as 317 mg/L; a decreasing trend was observed after 2009, indicating a reduction of oxygen-consuming organic matter in water body. During the monitoring period, the concentration of NH3–N was generally high between 1990 and 2009 and reached a maximum value (25.9 mg/L) in the 1998 nonflood season.

Generally, the water quality in the flood season mainly reflects the pollution intensity of nonpoint sources, while the water quality in the nonflood season can better reflect the pollution intensity of point sources. If the concentration in the dry season is obviously higher than that in the wet season, then the point source emissions have a strong influence on water quality; if the dry season concentrations are not higher than the wet season concentrations, then the nonpoint sources have a greater influence. The concentrations of the two parameters at the two water quality sites fluctuated over time throughout the flood season and nonflood season, indicating that the pollution sources change constantly.

3.2.2. CWQI

Based on the five water quality variables, the CWQI was calculated at two sites. The CWQI indexes of the water samples are presented in Figure 3. The mean CWQI values at site G2 were 5.3, 8.8, and 5.9 for 1980–1990, 1991–2009, and 2010–2018, respectively. The mean CWQI values at site G4 were 5.6, 7.8, and 5.4 for 1982–1990, 1991–2009, and 2010–2018, respectively. The CWQI values at two monitoring sites for 1991–2009 were both higher than 7.0, and the color and odor of the river exhibited apparent blackening and odorization. The long-term temporal pattern of the CWQI index was in accordance with the major pollutants (CODCr and NH3–N). These results indicated that the Wei River and the Communist Canal were lightly polluted from 1980 to 1990 and severely polluted from 1991 to 2009, with the water quality becoming relatively better after 2010. The water quality of the Wei River and the Communist Canal showed an increasing trend of pollution, followed by a decreasing trend. A study conducted by Wang showed that the overall water quality of the Wei River mainstream improved significantly between 2005 and 2011 [55]. This was due to the fact that the government had addressed these environmental problems and taken effective management measures in recent years, and the river water quality had improved greatly.

3.3. Spatial Variation Characteristics in Water Quality

Based on the analysis of the long-term temporal patterns of the major pollutant concentrations in the study area, data from 2013 to 2018 were used for CA. The K–S statistical test results suggested that pH, DO, CODCr, BOD5, NH3–N, TP, and F were normally distributed with a confidence level of 95%. While CN, VP, Cu, Zn, Pb, Cd, and Cr6+ deviated from a normal distribution, their log-transformed variables also deviated from a normal distribution, and their median values were lower than the requirement of standard III. Therefore, seven parameters (pH, DO, CODCr, BOD5, NH3–N, TP, and F) were used in the following analysis.

The dendrogram of spatial CA is shown in Figure 4. Nineteen sampling sites can be grouped into two statistically meaningful clusters at ; 19 sampling sites can be grouped into three statistically meaningful clusters at . To confirm the clusters found by CA and identify significant parameters, standard and stepwise modes of DA were applied to discriminate the groupings, correctly assigning 100% of sites to both 2 groups and 3 groups (Table 3). The case of 2 groups is further discussed in this study.

Based on the analysis of the original data, cluster A (Z3–Z5, Z8, Z10, and G9) and cluster B (Z1, Z2, Z6, Z7, Z9, and G1–G8) corresponded to a low pollution region and a high pollution region, respectively. The results showed that the sites of the low pollution region were mostly located at tributaries, including the Shimen River, the People’s Victory Canal, the upstream reach of the Yu River, and the upstream reach of the Cang River, while the remaining tributaries and most sections of the mainstream were classified as the high pollution region. River pollution shows an obvious spatial difference. By comparing the median values of the two groups, DO in cluster A was found to be significantly higher than that in cluster B and the pH in cluster A was slightly higher than that in cluster B, while CODCr, BOD5, NH3–N, TP, and F in cluster A were significantly lower than that in cluster B (Figure 5). High priority to water pollution control should be given to the mainstream of the Wei River, the mainstream of the Communist Canal, the Dashilao River, the Baiquan River, the Ximeng River, and the Dongmeng River, and the downstream section of the Yu River.

The upper reach of the Shimen River, the upper reach of the Yu River, and the upper reach of the Cang River, all located in the inaccessible northern mountain area, have better water quality, which represents the natural background level of water quality. The People’s Victory Canal draws water from the Yellow River in the south to provide Xinxiang City with drinking water and landscape-use water. The Qi River is an unpolluted tributary, and dilution by this tributary results in improved water quality for the Communist Channel. Ding et al. [56] also confirmed that the water quality of the Communist Canal was slightly better than that of the Wei River. The other rivers were seriously polluted by domestic sewage, industrial wastewater, and agricultural runoff. In particular, the urban population of Xinxiang City is concentrated, with a population density of approximately 1600 people/m2, and its sewage discharge affects the water quality of the lower part of the Wei River. Therefore, the spatial distribution of water quality is determined by both natural factors and human activities, and similar distribution patterns have been found in other studies. Shrestha and Kazama [17] suggested that their CA results represented the influence of land use, residential sewage, agricultural activities, and industrialization, which may have a significant impact on water quality. Huang et al. [24] used CA to divide the water quality monitoring sites of the Qiantang River into three groups, representing three pollution areas (low, medium, and high).

The results also show that CA offers a reliable classification of surface water, which provides the basis for the regional management of water quality. In addition, by selecting one or more wells within a cluster to represent the water quality of the cluster, CA will help to significantly reduce the number of sampling sites, as well as monitoring cost. For two clusters determined by DA, stepwise mode needed only two significant parameters (CODCr and BOD5) to construct the discriminant functions, which yielded a classification matrix correctly assigning 100% of the case. This means that these two parameters can replace the above seven parameters, reflecting spatial variation in water quality. DA will also help to significantly reduce the number of monitoring parameters and monitoring cost.

Since most multivariate statistical methods require variables to be normally distributed, some data were discarded in specific analyses, which could have affected the results of multivariate analysis to a certain extent [27]. This directly leads to the limitation of this paper in that it cannot fully reflect the water quality situation, which could be alleviated by increasing the number of future sampling sites and monitoring parameters.

3.4. Source Identification and Comprehensive Assessment of Water Pollution Status
3.4.1. Source Identification and Apportionment

The water quality parameters from 2013 to 2018 were divided into the flood season and nonflood season for PCA/FA to identify potential sources of water pollution. Both in the flood season and nonflood season, PCA/FA could be applied because it had a KMO test value of 0.7–0.8 and Bartlett’s sphericity test value of 0.00, which were greater than the critical values. Three variance factors (VFs) were obtained for the flood season and nonflood season with the cumulative contribution of variance ≥80%, summing almost 89.92% and 91.31% of the total variance in the dataset, respectively. The multivariate analysis results, including the factor loadings and variance contribution, are shown in Tables 4 and 5. The bold and italic values indicate strong factor loading and moderate factor loading of each factor, respectively.

In the flood season, VF1 (34.57% of the total variance) had a strong positive loading on TP and a strong negative loading on pH. In general, TP usually originates from domestic sewage, industrial effluent, and agricultural runoff [39, 57]. In densely populated areas, due to inadequate or absence of wastewater treatment facilities, huge amounts of domestic and industrial effluents were discharged directly into rivers [13]. Fertilizers and pesticides were intensively applied in rural areas, and the application of synthetic P fertilizers (e.g., nitrophosphate fertilizer) played an important role in fertilizer input. Thus, large amounts of nutrient losses generated by excess fertilizers and pesticides quickly entered rivers through leaching and runoff during the flood season [58]. Positive correction between pH and DO can be explained as follows: as the amount of available DO decreases, an aerobic fermentation process occurs, organic acids are produced, and then, the pH value falls [28]. VF2 (31.35% of the total variance) had strong positive loadings on BOD5 and F. In the flood season, due to sufficient nutrients and an appropriate temperature, a large number of organisms grow, causing biochemical pollution [39]. F mostly derives from industrial sewage, such as cement plants, fluorine chemical factories, and smelters [24]. In addition, Xinxiang City is a natural fluoride enrichment area, and the fluoride component of soil might increase the discharge of fluorine [59]. VF3 (24.00% of the total variance) had a strong positive loading on NH3–N, which originates mainly from domestic wastewater, industrial effluent, and agricultural runoff [18]. Untreated wastewater discharge also led to the increase in nitrogen input. Significant relationships between fertilizer and pesticide consumption and nitrogen variables were shown in rivers [13].

In the nonflood season, pollution sources might be from either point sources or nonpoint sources, but due to the low rainfall during this period (22 mm per month), it was presumed that pollution sources were unlikely to be from nonpoint sources (soil erosion, farmland, and urban runoff) [60]. VF1 (38.79% of the total variance) had strong positive loadings on CODCr and BOD5 and a strong negative loading on DO. CODCr and BOD5 represent the level of organic pollution in water, which mainly comes from the discharge of untreated domestic sewage and industrial effluent [27]. Degradation of highly concentrated organic matter consumes DO in the river, resulting in a decrease in the DO concentration [8]. VF2 (32.28% of the total variance) had a strong positive loading on NH3–N and strong negative loading on pH. This VF represents the contribution of nitrogen inputs from domestic wastewater and industrial effluent. VF3 (20.24% of the total variance) represents the level of the F concentration in water. F originated from the same point sources as VF2 in the flood season. The study area has a large number of population and developed industry and population centralized areas characterized by significantly elevated COD, N, and P concentrations [13]. At present, only part of the urban sewage in the region is being treated, and sewage treatment capacity is far from enough, due to the dispersion of pollution sources in rural areas. Consequently, domestic sewage and industrial effluents were the predominant sources of the deterioration of the river water quality.

APCS-MLR was used to further calculate the contribution percentage of each VF to each water quality parameter in the flood season and nonflood season. The regression result was reliable because the regression model had a good fitted receptor (R2 > 0.80, ). In the flood season, VF1 mainly affected pH and TP (contribution rates: 57.68% and 53.40%). VF2 mainly affected BOD5 and F (contribution rates: 81.09% and 79.38%). VF3 mainly affected the concentration of NH3–N (contribution rate: 65.80%). In the nonflood season, VF1 mainly affected CODCr and BOD5 (contribution rates: 60.78% and 73.59%). VF2 mainly affected pH and NH3–N (contribution rates: 69.81% and 56.26%). VF3 mainly affected the concentration of F (contribution rate: 63.85%).

The results showed that the pollution sources were different in the flood season and nonflood season. During the flood season, rivers were polluted by domestic sewage, industrial wastewater, agricultural runoff, biochemical pollution, and natural sources. In the nonflood season, rivers were mainly polluted by point sources, such as domestic sewage and industrial wastewater. For watercourses that are highly disturbed by human activities, point sources such as domestic sewage and industrial wastewater are the main pollution sources, consistent with previous studies [61]. Wang et al. [60] divided the Qin River Basin into a severe pollution area (cluster A) and a light pollution area (cluster B). The pollution in cluster A mainly came from industrial activities, and the pollution in cluster B mainly came from urban point sources and agricultural and rural nonpoint sources. Putri et al. [5] applied PCA–MLR to analyze and assess the surface water quality of the Erren River in Southern Taiwan and suggested that industrial emissions contributed 72% of pollution, domestic black water contributed 16%, and natural source and runoff contributed 12%.

In addition, this study failed to quantify the cause-effect relationship between pollutant sources and water quality variations. Thus, the potential pollution sources could be determined by previous experience only, as we wanted to identify pollution sources quickly.

3.4.2. Comprehensive Assessment of Water Pollution Status

The total factor score for each monitoring site is shown in Figure 6. The higher the total factor score, the worse the river water quality. In the flood season, the scores of site G7, site G5, and site G8 were 0.69, 0.64, and 0.51, respectively, indicating that water quality at these three monitoring sites was the most seriously polluted. Combined with the investigation of local sewage discharge outlets, the water pollution of site G7 and site G8 was found to be mainly caused by upstream water from site G5. In the nonflood season, the scores of site Z7, site G5, and site G7 were 1.13, 0.78, and 0.71, respectively, indicating that these three monitoring sites were the most seriously polluted. The Ximeng River flows through the plain area, which has a dense population, developed industry, and agriculture. These industries discharged a great deal of sewage and water with high concentrations of pollutants, resulting in serious pollution in this tributary. In the flood season and nonflood season, the water pollution of the mainstream of the Wei River after passing Xinxiang City was relatively serious. Site G9 in the Communist Canal had good water quality due to the dilution effect of the Qi River, a tributary of the Communist Canal. The total factor scores of the three reservoirs were the lowest found, indicating that the water quality was best in the reservoirs.

It is generally believed that the further the downstream, the greater the cumulative pollution [62]. The three reservoirs located in the upper reaches of rivers, where human activities are limited, have the best water quality. The water quality in the lower reaches of the mainstream was seriously polluted in the flood season and nonflood season. The results of FA also confirmed the clusters found by CA [36], which further suggests that surface water quality is directly related to human activities. For example, Gridharan et al. [63] found that the upper reach of the Cooum River (southern India) was divided into an unpolluted cluster, while the polluted middle and lower reaches of the river had a high population density along the watercourses and domestic sewage was discharged directly into the water bodies.

3.5. The Cause of Pollution Formation

The SMK test was conducted on the water quality of the water at site G5 during 2013–2018 (Table 6), the most polluted reach of the Wei River according to the total factor score. R2 is a parameter reflecting the correlation between flow and pollutant concentration. The concentrations of CODCr, NH3–N, TP, and F all showed a statistically significant downward trend, whereas the concentration of BOD5 showed a statistically significant upward trend. The decline rate (r) of CODCr was the fastest, with an annual average decrease in 4.300 mg/L. The annual average decrease rate in F was relatively slow, at 0.067 mg/L per year.

To explore the causes of the changes in water quality, the SMK test of the flow-adjusted concentrations was used for each monitoring parameter. The residual series of flow-adjusted concentrations of each parameter showed a trend basically consistent with the measured water quality data, indicating that flow was not the principal cause of the changes in water quality, which were mainly caused by reductions in pollutant emissions. The flow-adjusted concentrations of CODCr, NH3–N, and TP showed a downward or statistically significant downward trend, indicating a decrease in pollution sources. The flow-adjusted concentration of BOD5 showed an upward trend, so its pollution sources increased. The pollution sources of F show no obvious change. Generally speaking, if pollutants in a river mainly come from point sources, when the river flow increases, the concentrations of pollutants in the river may decrease due to the increasing effects of dilution and self-purification [33]. Using the synchronous flow and water quality data, the relationship between each pollutant concentration variable and discharge variable was plotted. The relationship between each pollutant concentration variable and discharge variable at the monitoring site did not conform with the above rule, indicating that the water quality changes were influenced by point sources and nonpoint sources (Figure 7).

The results indicated that the water quality of seriously polluted river reaches showed an improving trend from 2013 to 2018 due to a series of measures for pollution prevention and control taken by the local government. For example, CODCr and NH3–N were used as the main parameters for pollutant cap control; construction and renovation of municipal sewage treatment facilities were speeded up to control urban domestic sewage; industrial sewage was strictly controlled, especially along the Ximeng and Dongmeng Rivers; the distribution of rain and sewage was promoted; comprehensive measures were taken to control substandard reaches of rivers; and ecological water diversion measures were adopted, such as drawing water from the Yellow River and the South-to-North Water Transfer Project and so on. From sewage outlet surveys along the rivers, the waste discharges in 2013 and 2018 in the study area were 2.95 × 108 tons and 2.32 × 108 tons, respectively, with an average annual decline of 3.56% (Figure 8). Among these, the NH3–N discharge into the river decreased fast. In 2013 and 2018, NH3–N discharges were 4010.20 tons and 479.91 tons, respectively, with an average annual decline of 14.67%. Annual average decline of CODCr, TP, and TN was 7.60%, 11.19%, and 7.09%, respectively. The reduction of point sources should be one of the main reasons for the decrease in the concentrations of major pollutants.

The study also showed that the SMK test of flow-adjusted concentrations can successfully analyze the cause of pollution formation in heavily polluted rivers in densely populated areas lacking data. This method needs to calculate only the flow and water quality data to simply identify the causes of water pollution changes and quickly determine a response to management pollution sources. Therefore, managers do not need to allocate large amounts of human and material resources to collect extensive river basin data (such as meteorology, terrain, hydrology, soil type, and land use) and the water volume diverted from interbasin water transfer projects. This can save a lot of time and is computationally efficient. In particular, it is a challenge to accurately quantify the cause-effect relationship between pollutant sources and water quality variations in regions with rapid changes in pollutant sources. Although this method was enabled to draw a rough conclusion only, it can quickly solve the practical problem.

4. Conclusions

Comprehensive water quality index (CWQI), multivariate statistical techniques (MSTs), and seasonal Mann–Kendall (SMK) test and the SMK test of flow-adjusted concentrations together were applied to assess water quality of a heavily polluted river. The overall water quality in the study area indicated that water quality of the Wei River was poor and unfit for drinking purposes and for direct contact with the human body without any form of treatment. The Wei River and the Communist Canal were lightly polluted from 1980 to 1990 and severely polluted from 1991 to 2009, with the water quality becoming relatively better after 2010. The water quality showed obvious spatial differences, which can be divided into a low pollution region (group A) and a high pollution region (group B). In the flood season, the pollution sources were mostly from domestic sewage, industrial wastewater, agricultural runoff, biochemical pollution, and natural sources. In the nonflood season, the pollution sources were mostly from domestic sewage and industrial wastewater. In recent years, the overall water quality of seriously polluted rivers has shown improvement. The concentrations of CODCr, NH3–N, TP, and F showed statistically significant downward trends, while the concentration of BOD5 showed a statistically significant upward trend. The main cause of water quality changes was reduction in pollutant emissions from point sources and nonpoint sources, and water quality was relatively little affected by flow. Thus, this study demonstrated that CWQI, MSTs, and SMK test and SMK test of flow-adjusted concentrations are effective approaches to assess water quality of heavily polluted rivers and can be used together as useful tools for water quality management in the future.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This research was funded by the Government Financial Grants Project (CXGG17023) and the National Key Research and Development Program of China (2016YFC0400301).