Abstract

The truck operation of freeway has an impact on traffic safety. In particular, the gradually increasing in truck proportion will inevitably affect the freeway traffic operation of different traffic volume. In this paper, VISSIM simulation is used to supply the field data and orthogonal experimental is designed for calibrate the simulation data. Then, SSAM modeling is combined to analyze the impact of truck proportion on traffic flow parameters and traffic conflicts. The serious and general conflict prediction model based on the Adaptive Network-based Fuzzy Inference System (ANFIS) is proposed to determine the impact of the truck proportion on freeway traffic safety. The results show that when the truck proportion is around 0.4 under 3200 veh/h and 0.6 under 2600 veh/h, there are more traffic conflicts and the number of serious conflicts is more than the number of general conflicts, which also reflect the relationship between truck proportion and traffic safety. Under 3000 veh/h, travel time and average delay increasing while mean speed and mean speed of small car decreases with truck proportion increases. The mean time headway rises largely with the truck proportion increasing above 3000 veh/h. The speed standard deviation increases initially and then fall with truck proportion increasing. The lane-changing decreases while truck proportion increasing. In addition, ANFIS can accurately determine the impact of truck proportion on traffic conflicts under different traffic volume, and also validate the learning ability of ANFIS.

1. Introduction and Literature Review

With the prosperity of logistics industry in China, the volume of trucks on freeway has been increasing and the freeway traffic safety is more and more obvious [11, 30]. The factors impacting traffic safety on freeway are numerous, including drivers, vehicles, environment, and road alignment. According to statistics from the Traffic Management Bureau of the Public Security Ministry [25], a total of 50,400 road accidents involving trucks were reported, causing 25,000 deaths and 46,800 injuries, which accounted for 30.5%, 48.23%, and 27.81% of the total number of auto liability accidents, respectively. It is much higher than the truck proportion to total number of vehicles. And a total of 5088 traffic crashes occurred in Shanxi province in China, causing 2131 deaths and 5278 injuries. More than 50% of these crashes were involving trucks. The high truck proportion on freeway is an important reason for this phenomenon. Therefore, how to alleviate and eliminate excessive influence of trucks and truck proportion on freeway operational safety is an urgent problem needed to be solved.

A number of studies had examined the impact of trucks on traffic safety based on history crashes data [26] by using various statistical models such as univariate Poisson-lognormal (UVPLN), multivariate Poisson (MVP), and multivariate Poisson-lognormal (MVPLN) regression [5]. It suggested that car crashes involvement rate decreased while the frequencies of truck involved crashes and car–truck involved crashes increased as the truck percentage increasing. Two random parameters ordered probit models [33] were used to explore the influencing factors of single-vehicle and multi-vehicle crashes separately. The multinomial logit (MNL) and negative binomial (NB) models [7] were used to analyze the influences of risk factors on frequency and severity of large truck–involved crashes. A mixed (random parameters) logit model [16] was used to study the injury-severity distributions of accidents on highway segments combined with traffic flow characteristics, which all showed that the presence of higher truck percentages and the sheer number of trucks might have a slowing effect on travel speeds which would tend to decrease the injury-severity of accidents. The literatures mentioned the impact of traffic safety combined with truck proportion via traffic crashes data a little rather than center on the safety via truck proportion.

Then, some literatures analyzed the impact of trucks on traffic flow parameters in order to explore the relationship between truck proportion and traffic safety on freeway. Islam and El-Basyouny [12] quantified the effect of posted speed limit (PSL) reductions in an urban then revealed that there was a high proportion of vans, buses, and trucks on nighttime and weekends under PSL. Li et al. [15] concluded that as the heavy truck proportion grows, the flow and velocity would decrease. The empirical Bayes observational before-and-after study also carried out that the decrease in the standard deviation of speed was 26% while the proportion of light and heavy vehicles exceeding the speed limits more than 20 km/h was reduced respectively by 84% and 77% [18]. The percentage of trucks in the traffic contributed to the average speed.

Apart from that, traffic conflict technic is another useful method for analyzing traffic safety on freeway when traffic crashes data are limited [29, 32]. It showed that the conflict risk degree can reveal the actual safety level of different types of conflicts more comprehensive to some extent. Moreover, the Surrogate Safety Assessment Model (SSAM) [20] also could be used to analyzed the safety performance of conflicts. That the truck volume percentage and the traffic distribution have the highest impact on the conflict frequency is given. Taking the measures of lane-changing, merging and rear-end conflicts as evaluation indicators, the restricted truck lanes and dedicated truck lanes via VISSIM were studied [8]. The proportion of conflicts involving trucks increases as the truck percentage increases and truck lane strategies are most effective when truck percentage exceeded 15%. Time-to-collision (TTC) is also a surrogate safety measure of the safety risk [19] which can predict the probability of safety. Bachmann et al. [3] redefined the algorithm previously implemented for conducting conflict analysis in freeway studies. And, trucks and different vehicles were involved to verify the modified results. It showed that while providing a separate highway for trucks does reduce truck-related conflicts but car lane change conflicts increase. Different literatures tell that the traffic conflict could be surrogate safety measure of traffic safety, so most of conflict techniques could be used for indicating traffic safety under lacking of historical crashes data.

Above all, some literatures analyzed the influential factors such as the weight of truck based on traffic accident data, otherwise, statistic and simulation methods were used to analyze the impact of traffic parameters such as truck performance or conflicts on traffic safety. However, there are a few literatures that discuss the impact of truck proportion on freeway traffic safety and traffic crashes data are used for explore the relationship between truck proportion and traffic crashes mostly. And truck proportion is also the one of most important factors of traffic safety. As a consequence, the purpose of this paper is to explore the impact of truck proportion on traffic flow parameters and traffic conflicts and lay the foundation of studying the correlation between safety and truck proportion. First, the field data extracting from traffic video monitoring system of Shanxi freeway will be used to build simulation model to obtain the simulation data. And then, the traffic conflicts data will be got from SSAM to analyze the relationship between traffic conflicts and truck proportion and the Adaptive Network-based Fuzzy Inference System (ANFIS) will be used for establishing the model of traffic conflicts related to truck proportion in different traffic volume. Therefore, the impact of truck proportion on freeway traffic safety can be reflected.

2. Data Description

First, according to previous literature, traffic flow parameters and traffic conflict parameters were chosen to explore the impact of truck proportion on traffic flow. Then, the traffic parameters such as the mean speed, the average speed difference of truck and small car and the speed standard deviation were selected to explore the impact of truck proportion on traffic conflict and freeway traffic safety.

2.1. Traffic Flow Parameters

Traffic data were collected from surveillance video of three freeway in Shanxi province. There are 7 selected sections and each section is approximately 100 m, linear and straight. Also, the sunny weather was chosen as study condition. The type of vehicles was classified into two categories, small car (include pickup truck) and truck whose body length is more than 6 meters (include passenger car for about 40 passengers) due to specificity of high proportion of trucks on Shanxi freeway. All data that we collected and calculated include truck proportion, traffic volume, average speed, standard deviation, coefficient of variation, number of lane-changing, time headway, and time to collision. And the basic information is as follows (Table 1).

A few common traffic parameters are listed in (Table 1), which are related to truck proportion more or less. The different truck proportion was collected per hour, which is the value that the number of trucks divided by the total number of vehicles. The range of truck proportion covers from 0.176 (17.6%) to 0.905 (90.5%) per hour in different volume on different section, which are mostly in low traffic volume and most focus on the range from 0.2 to 0.4. The total sample collected from the surveillance video is 81 hours. Besides, the standard deviation of the traffic volume per hour is relatively large, which means a greater degree of dispersion between individuals within the group. According to the basic theory, when analyzing the correlation among these parameters, traffic volume was divided into five kinds: low, slightly low, medium, slightly high, high.

Standard deviation was defined as the arithmetic square root of variance, reflecting the degree of dispersion among individuals in the group. Truck proportion would affect the traffic volume, the mean speed, and the speed standard deviation even further, which represents the influence of truck proportion on the stability of speed standard deviation under a certain flow rate. In most cases, the speed standard deviation was estimated by taking a random sample from overall samples and calculated by the sample. It lays the foundation for the further study of the relationship between traffic safety and the truck proportion.

2.2. Traffic Conflict Parameters

Under observable conditions, traffic conflicts are the approach of each other spatially that two or more road users at the same time. If one of them takes abnormal traffic behavior, such as changing lane, changing speed, stopping suddenly and so on, a collision will occur unless the other takes the corresponding safety behaviors. This phenomenon is traffic conflicts. Because the object of this paper is the truck on freeway, so there mainly are two types of conflicts on freeway. These are collisions with rear-end on the same lane and lane changing conflicts between vehicles.

The sizes and operating performance of truck have a significant impact on road traffic safety. For larger trucks, the permissible speed is lower than those smaller cars of smaller mass. Larger trucks will affect the visibility of the rear vehicles as well as the lateral distance of the vehicles on adjacent lane, which makes the driver feel oppressed. Most of parameters involved in conflicts are the time to collision (TTC), post encroachment time (PET), deceleration to avoid crash (DRAC), and so on [31]. The truck proportion also has a great impact on traffic conflicts. Jeong and Oh [14] used the TTC, the warning index (WI) thresholds and market penetration rate (MPR) to explore active vehicle safety in a certain truck proportion. TTC is the most frequently used surrogate measure and usually utilized as a benchmark on SSAM imported into [9]. TTC will be utilized in this paper to explore the relationship between truck proportion and traffic conflict parameters. Because the angle of vehicle is not convenient to observe and measure, so the TTC is calculated by velocity and vehicle length (25 frames represent for 1 second). So, the Equation (1) is as follows [28]:

—The collision time (s) of vehicle collide to the vehicle ; —the position of vehicle on the road (m) at time ; —the position of vehicle on the road (m) at time ; —the spot speed of vehicle at time ; —the spot speed of vehicle at time ;—the length of front vehicle .

And, , is the headway distance of vehicle and vehicle respectively. So, the Equation (2) is as follows.

The indicator can be used as both the calculation of the rear-end collision (collision angle ) and the lane-changing collision (collision angle ).

3. Method

Due to the actual conditions, the traffic data obtained through field surveys can hardly cover the traffic flow status under different traffic conditions of the freeway. The main purpose of this paper is to study the influence of truck proportion on freeway traffic safety, so the more and more traffic parameters under different traffic conditions are needed to be obtained. Therefore, modeling of data validation and supplement via VISSIM simulation and SSAM are necessary [1, 21].

3.1. Simulation Modeling

The study segment was simulated using VISSIM, a stochastic, behavior based microscopic simulation platform. To capture the effect of truck’s proportion on traffic flow parameters, a basic scenario was constructed according to the characteristics of freeway in Shanxi. In the basic scenario, the divided freeway has two 3.5 m lanes in each direction and the length of the road section is 1000 m. To capture the traffic characteristics, data collection points were set up every 100 m in each direction. As vehicle in China is a right-hand drive, so the driver could only overtake from the left side of the vehicle in the model. Besides, the vehicle parameters derived from the video were set as (Table 2).

Ten different traffic flow rates were considered, i.e., 2200 veh/h, 2400 veh/h, 2600 veh/h, 2800 veh/h, 3000 veh/h, 3200 veh/h, 3400 veh/h, 3600 veh/h, 3800 veh/h, 4000 veh/h. The truck percentage was divided by 10% into 9 grades, 10–90% respectively. Different traffic flow states were combined with different traffic flow and different mixing rate of vehicle in field survey. Vehicles were generated randomly according to Poisson distribution in VISSIM. The traffic flow would have little difference in simulation model and field survey, since the relative error is within 0.1%, the simulation result will not be affected. Vehicle speed design principles are based on the actual situation of the normal distribution, and in accordance with the distribution of the interval via 3σ principle. Wiedemann 99 model was selected as following car behavior parameters which is suitable for intercity road (freeway) traffic.

Many parameters were used in car-following model and lane-changing model of VISSIM. In order to simulate the traffic behavior accurately, it needs to adjust the parameters for the study section [17]. Besides, these parameters have a direct effect on the interactions of vehicles and result in significantly different simulation results even with small adjustments to the parameter values, including the parameters of car-following and lane-changing behavior [2, 23]. Therefore, the average speed of vehicles on freeway was selected as the index to evaluate whether the simulation model meets the field traffic conditions. The calibrated parameters are shown in Table 3.

Four parameters, each of which took 4 different levels of value, were calibrated, and 256 experiments were required if each level of each factor was tested on a case-by-case basis. In order to reduce the number of tests effectively and obtain the required results at the same time, the parameters of the model were calibrated by orthogonal experiment in this simulation. In general, orthogonal table recording experimental programs and results should meet two basic requirements. First, different numbers in each column appear the same number of times. This characteristic indicates that each level of each factor is exactly the same as the probability of participating in the test at each level of the other factors, thereby it ensures that interference from other levels is maximally excluded at each level and it can effectively compare the test results and find the optimal test conditions. Second, in any two pairs of their horizontal composition of pairs, each number of pairs appear equal. This feature ensures that the test points are evenly dispersed in the complete combination of factors and levels, and therefore they have a strong representation. In order to reduce the number of experiments, smaller orthogonal tables are generally chosen. If the experiment requires a higher accuracy, a larger orthogonal table could be chosen according to the experimental conditions. Therefore, in this paper, the design of experimental program was designed as follows.

From Table 4, the influence degree of each factor is ranked as . However, the experimental scheme with the smallest error value may not be the optimal scheme. The optimal scheme should be a combination of the optimal value of each factor. Experimental indicators determine the optimal level of each factor. If the purpose of the experiment is to obtain the maximum experimental index value, the corresponding level will be the largest value among the factors, and vice versa. The model being consistent with the actual traffic conditions is the purpose of paper, that is, to make the absolute difference between the experimental results and the actual data as small as possible. Therefore, the appropriate level for each factor is , the simulation error of which is less.

The Surrogate Safety Assessment Model (SSAM) is a traffic safety evaluation software developed by the Federal Highway Administration (FHWA) based on surrogate model for traffic safety evaluation [6, 20, 32]. It can be used to analyze traffic conflicts in vehicle output track files from microscopic traffic simulations. In the paper, the number of TTC was used for exploring the effect of truck proportion on traffic safety. Analyzing the cumulative frequency distribution curve of the TTC sample data, the 85% quantile is taken as the traffic conflict threshold, which means the general conflict (TTC = 6 s) and through literature review [4, 14], TTC less than 2 s were taken as serious conflict, and TTC greater than 2 s and less than 6 s were regarded as general conflict. Apart from that, the conflict angle of rear end and lane change are defined as [0, 30] and [30, 85], respectively. Then, the track files from VISSIM simulations will be imported to SSAM to determine the severity of traffic conflict and the relationship between number of TTC and truck proportion. Thus, the number and the value of TTC will be calculated for analyzing.

In order to ensure the consistency between the simulation data and field data, the root mean square error () was used to verify the error. Taking the volume of 2200 veh/h as an example, the number of lane-changing was selected as the evaluation parameters to determine the root mean square error (Equation (3)). The observed data, simulation results and root mean square error are shown in Table 5.

where, is the field data and is the simulation result. The deviation between the observed value and the simulated value is evaluated by . The can reflect the well precision of the measurement. The smaller the is, the smaller the difference between the observed value and the simulated value is, and the closer the difference between observed data and the field value is.

Therefore, according to Equation (3), the is 10.1882 under the volume of 2200 veh/h, 5.2536 under the volume of 2400 veh/h, 8.4794 under the volume of 2600 veh/h, 15.6525 under the volume of 2800 veh/h and the average error between the observed value and the simulated value is under 20%, which is acceptable. The observed data is incomplete under the high volume, but the error between the measured number of lane-changing and the simulated simulation value is still within 20%, and the error is in the acceptable range. Therefore, the VISSIM simulation model data and observed data are consistency.

3.2. Adaptive Network-Based Fuzzy Inference System

Because there is no direct relationship between the truck proportion and traffic conflicts, the application of traditional mathematics methods is time-consuming and laborious to establish the relationship between the two, and it is not well adapted to modeling under various assumptions. The ANFIS (Adaptive Network-based Fuzzy Inference System) proposed by Jang [13] is a new type of fuzzy inference system that combines fuzzy logic and neuron networks. ANFIS has the advantages of expression of fuzzy logic and self-learning ability of neural network easily, which has gradually become an important research direction of computational intelligence in recent years. And it has the characteristics of simple calculation and mathematical analysis, which provides an effective tool for the modeling and control of complex systems. This also provides a good reference for the use of transportation field. The back-propagation algorithm and the least-squares method are used to adjust the precondition parameters and conclusion parameters, and the If-Then rules can be automatically generated.

ANFIS uses fuzzy neural network to realize the three basic processes of fuzzy control, fuzzy inference and anti-fuzzification. It uses neural network learning mechanism to automatically extract rules from the input and output sample data to form an adaptive neuro-fuzzy controller. Through offline training and online learning algorithms, the fuzzy inference control rules are self-adjusted, and the system itself develops in the direction of self-adaptation, self-organization, and self-learning. Rahimi [22] studied the quantitative assessment of the effects of intelligent transportation systems and technologies on road fatalities. Modification of user behavior, exposure, modal choice, and accident consequences were used for explore the effect of ITS technologies on road safety. Hosseinpour et al. [10] presented the application of ANFIS technique to estimate road accident frequencies as a function of road geometric and environmental characteristics. Compared with other regression models, it verified that the proposed model had higher prediction performance than the other traditional models. Besides, ANFIS combined with Global Satellite Navigation System also can be used for real-time car-following status identification [24] and lane changing maneuver prediction [27] to make sure the traffic safety. Therefore, ANFIS will be used to model the relationship between truck proportion and traffic conflict. The structure diagram is shown in Figure 1. Each node in the same layer has a similar function ( denotes the output of the-th node in the first layer).

Layer 1. Fuzzing the input variables, then outputting the degree of membership of the corresponding fuzzy set (the fuzzification layer). The input variables and are the input of node , represent for traffic volume and truck proportion, respectively. or is the linguistic variable associated with the node function value, That is, is the membership function of fuzzy set and . The membership function is determined by the actual input variables (Equation (3)). There, truck proportion and traffic volume are set as , . or denotes that fuzzy classification of different variables.

Layer 2. The node is represented by II. Each one is the fixed node. The input signal is multiplied and the output of the node represents a rule’s excitation intensity (Equation (4)), which denotes that each input component belongs to a membership function of a fuzzy set of linguistic variable values.

Layer 3. The node is denoted by , which is also a fixed node. The incentive intensity of each rule is normalized. The ratio that the sum of the of the -th rule and the sum of all the rules is calculated at the node(Equation (5)).

Layer 4. Each node is an adaptive node with a node function that calculates the output of each rule (Equation (6)), which denotes that fuzzy rule of these two variables is normalized.

Layer 5. A single node is a fixed node. The total output of all input signals is as follows. And the total output is traffic conflict in this paper (Equation (7)).

Through actual investigation and experimental simulation results, it is concluded that the factors affecting traffic collisions are mainly traffic volume and vehicle type. Therefore, different traffic volume and different truck proportion are used as input variables, and the number of traffic conflicts is for output variables. The fuzzy inference system considered in this paper has two inputs the traffic volume and truck proportion and a single output traffic conflict.

4. Result and Discussion

4.1. Simulation Results

In the simulation experiment, traffic flow is divided into several kinds of flow, which are the volume of 2200 veh/h, 2400 veh/h, 2600 veh/h, 2800 veh/h, 3000 veh/h, 3200 veh/h, 3400 veh/h, 3600 veh/h, 3800 veh/h, 4000 veh/h, and the relationship of truck proportion with other parameters are shown in Figure 2.

As it can be seen, with the increasing of truck proportion in different traffic volume, different parameters show different trends. In Figure 2(a), travel time rises below the volume of 2800 veh/h due to the truck proportion increasing and slightly decreases above 3400 veh/h while average delay first increases and then stabilize (Figure 2(b)), because with the increase of the traffic volume and truck proportion, the driver’s judgment on the road safety depends on driving speed and the road alignment. Based on the safety operation, the driver will choose the relatively safe speed. Therefore, when the truck proportion reaches around 0.6, the delay time tends to be stable. Undoubtedly, the mean speed decreases with the increase in the traffic volume and the truck proportion in whole (Figure 2(c)). By contrast, the mean speed of small car slightly reduces while the truck proportion is less than 0.5 and the volume above 3400 veh/h. And the truck proportion is greater than 0.5, the mean speed of small car is significantly descending (Figure 2(d)). The Figure 2(e) shows that the mean time headway of small car is relatively stable and then fluctuates around 0.6. Unlike the mean time headway of small cars, the trend of trucks is exactly the opposite for trucks (Figure 2(f)). Although the numerical value is fluctuated, the trends increasing overall. The reason why it is opposite is that the vehicle driving is relatively free and driver behavior is not constrained due to truck proportion is small, so they maintain normal driving behavior that the trend stays constant. But when the volume of trucks goes up and affects the driving operation of small car drivers, the mean time headway of small car begins to fluctuate. And in the high volume, the fluctuation is relatively steady. The principle of the mean time headway of trucks is the same. When the volume of small cars and the proportion of trucks are small, the interaction and interference between small cars and trucks is much less, so the mean speed of small cars is not marked reduced. When the volume increases, the interaction between the small cars and the trucks rises due to the difference in the vehicle performance. To ensure safety driving, the speed of small cars gradually approaches the speed of trucks to reach equilibrium in the end. When the state is reached, the drivers of small cars will increase the speed overtaking the truck by the judgment on safety. Therefore, when the proportion of trucks is the largest, there will be a slight increase.

The mean time headway increases following the increase of truck proportion from truck proportion at 0.5 (Figure 2(g)) in general. Below the volume of 2600 veh/h and less than truck proportion about 0.6, the mean time headway keeps steady. Furthermore, the speed standard deviation (Figure 2(h)) and the degree of dispersion (Figure 2(i)) are gradually increasing with the increase in the truck proportion in different traffic volume and then falling due to the difference in vehicle performance between small cars and trucks. This is consistent with the actual situation. The number of lane-changing (Figure 2(j)) is declining with the increase of truck proportion. Above all, at the volume of 3000 veh/h and 3200 veh/h, the all parameters except mean time headway of small cars, mean time headway and speed standard deviation are fluctuated. Besides, the parameters are slightly changed when the volume above 3400 veh/h while changed a lot below 2800 veh/h. The reason why it is fluctuated is that the volume at 3000 veh/h is the stable flow at the designed speed 80 km/h on the freeway.

The results of traffic conflict are shown in Figure 3. Generally, serious conflicts(Figure 3(b)) are more than general conflicts (Figure 3(a)), which indicates that the truck proportion has an impact on traffic safety to some degree. When TTC is less than 6 s but more than 2 s, the number of TTC increases with the truck proportion at 0.4 under different traffic flow while the number of serious conflicts is large at 0.5 of truck proportion. Similarly, with the increasing truck proportion, the number of general conflict and serious conflict are rising initially and then fall around 0.6 of the truck proportion in general. As the traffic volume and truck proportion increase, the interaction between different vehicles increases, resulting in limited freedom of travel. Moreover, when the traffic volume and truck proportion continue to increase, the driving speeds of the vehicles tend to be the same, which makes the speed dispersion small, and the vehicles are in a car-following state, leading to the reduction of conflict. Due to the traffic bottleneck, the volume around 3000 veh/h is tending to the stabilizing flow state and the parameters of traffic flow and other parameters are little fluctuated in the period.

Above all, truck proportion has a great impact on traffic flow parameters and traffic conflict. With increasing truck proportion in different traffic volume, travel time and average delay are increasing; the mean speed and time headway are generally decreasing; speed standard deviation, degree of desperation, the number of lane-changing and the number of serious and general conflicts are increasing initially and then decreasing. These all do a great impact on traffic safety.

4.2. ANFIS Modeling Results

The first step in ANFIS modeling is fuzzification. The purpose is to establish an initial fuzzy inference system, determine the number of membership functions and types of each input variable and select the structural rules of the ANFIS model. Because the data distribution is much obvious, the grid partition method is used to divide the loaded data. Besides, the fuzzy system is established according to the set parameters according to the fuzzy C-means clustering method. The purpose of clustering is to determine the minimum number of fuzzy rules required for FIS construction and their associated member parameters. The dataset will be input into multiple cluster groups by cluster partitioning so that the similarity among members of the group is higher and the similarity of members within the group is lower. The input membership function is a Gaussian function, and the number of fuzzy subsets is 3 for truck proportion and 5 for traffic volume to cover the input variables, and the membership function of the output variables is selected as a linear function, so the initial FIS is generated. In the learning process, both training and inspection datasets are used to avoid overfitting. The margin of error is set closely to 0 and the number of iterations is 50. The selection of the best ANFIS model is based on achieving a minimum for both the controlled and nonincremented training and testing datasets. The number of membership functions in this paper are truck proportion and traffic volume while the truck proportion type is divided into three sets and traffic volume is divided into five sets. For the rules : if (traffic volume) is (, 2, 3, 4, 5) and (truck proportion) is (, 2, 3), then . The ANFIS network structure of fuzzy inference system is shown in Figure 4. In Figure 4, the input variables are the traffic volume and truck proportion.

The fuzzy membership function subset with sharper shape function curve has higher resolution and higher control sensitivity. On the contrary, the membership function curve has a flatter shape, the control characteristics are more gradual, and the stability performance is better. Therefore, when selecting the membership function of the fuzzy variable, the low-resolution fuzzy set is used in the region with large error, and the high-resolution fuzzy set is selected in the region with small error. When the error is close to zero, the higher control selection is used. The blurring of the resolution makes it possible to achieve a control effect with high control precision and good stability. So the Guass membership function is used for fuzzing these two input variables. Traffic volume and truck proportion are fuzzed as five subsets and three subsets, respectively. Then 15 rules have been constructed to put out 15 membership functions and the output is the number of general conflicts. Thus, the for the training datasets converged to a minimum of 1.2457 ultimately, which led to the completion of the learning process. Then the testing data is loaded, which means the remaining data in the dataset is used as testing data. The for the testing data converged to a minimum of 6.6159 ultimately. The training data, testing data are as following Figure 5. Therefore, according to the parameters of each function, the output function of the system can be summarized.

The relationship among traffic volume, truck proportion and serious conflict is established as the same method as above. And the Gaussian membership function expression covering the fuzzy subset can be written and the estimated parameters is shown in Table 6 according to the Gaussian membership function formula . Similarly, the Gaussian membership function covering the fuzzy subset can be written as well as traffic volume.

Taking serious conflict as an example, it can be seen intuitively that the number of serious conflicts occurring under different traffic volume and different truck proportion in Figure 6. The output is the last line shown in Figure 6. Besides, the output of training data is approximately close to the output of simulation data.

In general, when the traffic volume is around 3000 veh/h, the number of general conflicts is large in general. When the truck proportion is below 0.5, the number of general conflicts is initially increasing then falling. While the truck proportion above 0.5, the number of serious conflicts is fluctuated but overall it is lower when the traffic volume is high. Meanwhile, when the traffic volume is around 2600 veh/h, the number of serious conflicts is large in general. When the truck proportion is below 0.5, the number of general conflicts is initially increasing then falling. While the truck proportion above 0.5, the number is lower than the truck proportion is above 0.5. When the truck proportion is constant, the number of serious conflicts decreases as the traffic volume increasing in general. When the traffic volume is low, the interaction between the vehicles is small because of the free driving, and the impact of the truck proportion, whether large or small, on traffic conflicts is not obvious. However, when the traffic volume is high or stable, the interaction between vehicles increases, especially the interaction between trucks and small cars. Trucks could narrow down the vision of other vehicles, especially small ones. The low speed of trucks and the decrease of sight distance lead to the overtaking behavior of small cars. This may have an impact on traffic safety and traffic conflicts, leading to an increase in serious conflicts. And when the truck proportion is more than 50%, the traffic flow is gradually stable, and the mean time headway has also reached a stable state.

5. Conclusion

This paper mainly studies the impact of truck proportion on traffic safety by analyzing its effect on traffic flow parameters and traffic conflicts. The traffic data of the straight section of freeway in Shanxi province was collected, and simulation data was corrected by orthogonal experiment and supplemented by VISSIM to obtain the trajectory of the vehicle. Then, the number of traffic conflicts was calculated through SSAM and the impact of truck proportion on traffic conflicts was established through ANFIS. Finally, the impact of the truck proportion on traffic flow parameters and traffic conflicts under different traffic flows was analyzed. The results mainly concluded that with the increase of traffic volume and truck proportion, travel time and average delay are increasing whilst mean speed and mean time headway are decreasing. What’s more, with the increase of truck proportion, speed standard deviation, degree of dispersion, the number of lane changing and the number of traffic conflict are initially increasing then falling down and the higher the traffic volume is, the larger these parameters are. These all explain the impact of the truck proportion on freeway traffic safety. The traffic state will be in danger when truck proportion is around 0.6 at traffic volume 2600 veh/h while the traffic state will be in danger when truck proportion is around 0.4 at traffic volume 3200 veh/h, where the number of traffic conflicts is large.

Actually, little works have been done in traffic data analysis using ANFIS. But in this paper, ANFIS establishes the relationship among parameters quite well, and also indicates the influence of truck proportion on traffic safety by the severity of surrogate indicators indirectly. ANFIS is the control system that combines the functional advantages of fuzzy system and neural network, and overcomes the traditional fuzzy logic reasoning process, which relies on operator thinking and human intervention to design the membership function graph, so as to reduce the weakness of system efficiency, and uses the system’s own training and learning process to solve complex adaptive problems. The algorithm is high efficiency, fast convergence and high model accuracy. It has the advantages of expression of fuzzy logic and self-learning ability of neural network easily, which has gradually become an important research direction of computational intelligence in recent years. And it has the characteristics of simple calculation and mathematical analysis, which provides an effective tool for the modeling and control of complex systems. This also provides a good reference for the use of transportation field. The back-propagation algorithm and the least-squares method are used to adjust the precondition parameters and conclusion parameters, and the If-Then rules can be automatically generated. From the typical representative input and output data, the mapping rules of input and output are directly concluded, that is, given some sample points of input and output, the model’s input and output characteristics are obtained by learning. This method greatly simplifies the modeling problem, avoids the modification of the original model in theory, and directly uses the trained ANFIS model to describe the law of input and output, which is its greatest advantage. What’s more, ANFIS is not based on experience or knowledge, but obtains the membership function and fuzzy rules of the system through learning a large number of sample data, and adjusts and optimizes the parameters of the front and back parts according to the self-learning characteristics of the neural network, so as to improve the performance of the fuzzy system. This laid the foundation for the analysis of traffic safety data.

Data Availability

The data (excel) used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This research is supported by the Department of Shanxi Communications Science and Technology (Project No.: 16002380370): Research in the Impact of Truck Proportion on Highway Traffic Safety, the People’s Republic of China. This project is also supported by the China Scholarship Council.