Prediction of Information Talent Demand Based on the Grayscale Prediction Model and the BP Neural Network

Sun, Chan; Lu, Yixia

doi:https://doi.org/10.1155/2022/4050502

Mobile Information Systems

Special Issue: Online Processing and Analyzing of IoT Data Streams in Intelligent Mobile Edge Computing

Research Article | Open Access

Volume 2022 | Article ID 4050502 | https://doi.org/10.1155/2022/4050502

Chan Sun¹ and Yixia Lu²

Academic Editor: Le Sun

Received 21 Jun 2022

Revised 03 Jul 2022

Accepted 26 Jul 2022

Published 24 Aug 2022

Abstract

With the increasing popularity of Internet technology, people are increasingly accustomed to obtaining information or help through the Internet. Meanwhile, the rapid development of the information service industry has led to explosive growth in the demand for information service talents. In recent years, many information service talent demand reports have been released in China, and they provide important guidance for information service industry planning. However, these reports currently have three problems. First, the relevance and support of talent demand analysis and forecasting to information service industry planning need to be clarified. Second, the coordination and cooperation involved in preparing the reports need to be improved. Third, scientific and reasonable information service personnel demand forecasting models need to be applied more widely. In the future, more reasonable forecasting models should be developed and used to improve the quality of information service personnel demand reports, and the supporting role of the information service industry in scientific planning should be strengthened continuously. Information service industry talent demand forecasting is therefore of great significance. In this paper, a prediction model of information service talent demand is established by using gray system theory. To correct the deviations of the GM (1, 1) model, a combined GM (1, 1)-BP neural network prediction model is proposed. The simulation results show that the prediction results of the proposed model are satisfactory. Therefore, the GM (1, 1)-BP model proposed in this paper can be used as a reference for government decision-making and information service personnel training.

1. Introduction

Talent is the primary resource to support China’s innovative development [1]. Rational use of talents and accurate prediction of the demand for talents affect the rapid development of a country. Industrial talent demand forecasting analyzes and forecasts the number of talents needed for the high-quality development of an industry, in line with the new round of scientific and technological revolution and industrial change, and it is an important basis for the scientific development of industrial talent training. With the vigorous development and wide application of new technologies such as artificial intelligence, cloud computing, mobile Internet, and big data in China, information technology services have become the strategic engine of the comprehensive digitalization, informatization, and intelligence of the economy and society. Information technology services are becoming a key force driving technological innovation, economic and social transformation and upgrading, and high-quality development.

All walks of life are vigorously developing and using information industry applications [2]. Some studies show that 33.5% of enterprises have already applied information technology in their work and production processes and have successful products and cases, 32.6% are setting up their own information service platforms, and 25.6% have conducted sufficient research and are ready for development.

At present, the supply of professionals in information technology cannot keep pace with the rapid growth of enterprises [3]. Statistics show that there are currently only 4.7 million IT professionals in the country. In the next 3–5 years, the gap in big data talents is expected to exceed 15 million, and the information service industry is bound to face a global shortage of professional talents [4].

At present, information service talents are mainly distributed in mobile Internet [5]. This is followed by O2O, financial Internet, games, social networking, education, enterprise services, and other fields, as shown in Figure 1.

Information service talent demand forecasting can provide a reliable basis for talent planning in the information service industry and real-time data for compiling talent demand reports. Wide application of information service talent demand models can serve various industries synergistically and allow talents to be used more fully and reasonably. A good talent demand forecasting model can not only accurately predict the total talent demand but also predict its future growth trend. Accurate prediction results can promote the growth of talents and ensure their full and reasonable use.

The development of talent demand forecasting can be divided into three stages [6]. The initial stage is the manual method: researchers collect talent data manually and then estimate talent demand based on experience. The workload is heavy, the process is complex, and the prediction deviates far from the actual value when errors occur, so the limitations are obvious. Subsequently, talent demand prediction models based on mathematical statistics emerged, such as linear regression and the gray model (GM), which mainly use mathematical modeling to analyze talent demand [7]. These methods can only describe a linear or upward trend, whereas talent demand may also fall or change nonlinearly [8]. Models built with these methods therefore cannot accurately describe how talent demand changes, and their prediction accuracy is sometimes very low. In the third stage, modern statistical methods such as the back propagation neural network (BPNN) have been applied to talent demand prediction [9]. They have strong nonlinear modeling ability and can be adjusted adaptively, so they have become the most commonly used methods of talent demand prediction. However, since talent demand prediction is a complex system with many characteristics, a single neural network cannot comprehensively track the characteristics of talent demand change [10].

With the rise of the information service industry, more and more scholars have studied the demand for information service talents [11]. As early as 2005, literature [12] carried out a forecast study on the demand for information service professionals. The research methods and models used in this field are now gradually maturing. Literature [13] analyzed and predicted the demand for information service talents in China by using the change index of information service talent demand (LPDIC). Literature [14] used a nonlinear regression model to predict the demand for information service talents in Guangdong province. Literature [15] predicted the total number of information service talents in Shandong province using binary linear regression prediction and the gray prediction group method. Literature [16] used exponential smoothing and time series prediction methods to forecast the demand for property information service talents in Zhejiang province.

The above literature shows that Chinese scholars use diverse models to predict the demand for information service talents, including the exponential change method, the nonlinear regression model, the binary linear regression model, the GM (1, 1) model, and the exponential smoothing method [17]. The gray GM (1, 1) forecasting model is a common single forecasting method with good forecasting accuracy, and it has been widely used in demand forecasting research in many fields [18]. GM has good periodic and linear modeling capabilities [19]. The BP neural network has a powerful nonlinear modeling ability, but any single method can only describe part of the information in talent demand. Based on combinatorial optimization theory and aiming at the shortcomings of current talent demand forecasting methods, this paper proposes a GM (1, 1)-BP neural network model, built on GM (1, 1) and BPNN, for information service talent demand forecasting.

This paper consists of four main parts: the first part is the introduction, the second part is the methodology of the algorithm, the third part is the result analysis and discussion, and the fourth part is the conclusion.

2. Methodology

Gray theory provides a dynamic fuzzy prediction model. Compared with curve fitting, gray model prediction has the advantage that it does not need a large amount of original data and places no requirement on the distribution of the data. In practical applications, the system can be monitored by extracting valuable content from known information. The prediction model adopted in this paper is GM (1, 1).

2.1. Construction of the Gray Prediction Model

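The GM (1, 1) model requires the monitoring data to be sampled at equal time intervals, but the information service personnel demand data are not equally spaced. Therefore, the unequally spaced demand data are converted to equal intervals by cubic interpolation to form the data sequence of information service talent demand $x^{(0)}$, as shown in the following formula:

$$x^{(0)} = \left(x^{(0)}(1), x^{(0)}(2), \ldots, x^{(0)}(n)\right).$$

In order to fully reveal the development trend of the information service personnel demand data, the data are accumulated once to generate the (1-AGO) sequence $x^{(1)}$, as shown in the following formula:

$$x^{(1)}(k) = \sum_{i=1}^{k} x^{(0)}(i), \quad k = 1, 2, \ldots, n.$$

The adjacent-mean sequence $z^{(1)}$ is generated from $x^{(1)}$, as shown in the following formula:

$$z^{(1)}(k) = \frac{1}{2}\left(x^{(1)}(k) + x^{(1)}(k-1)\right),$$

where $k = 2, 3, \ldots, n$.

Then, the GM (1, 1) model can be expressed as the following formula:

$$x^{(0)}(k) + a z^{(1)}(k) = b,$$

where $a$ is the development coefficient, whose effective interval is (−2, 2), and $b$ is the gray action quantity. The data matrix $B$ and the constant vector $Y$ are generated as

$$B = \begin{pmatrix} -z^{(1)}(2) & 1 \\ -z^{(1)}(3) & 1 \\ \vdots & \vdots \\ -z^{(1)}(n) & 1 \end{pmatrix}, \quad Y = \begin{pmatrix} x^{(0)}(2) \\ x^{(0)}(3) \\ \vdots \\ x^{(0)}(n) \end{pmatrix}.$$

MATLAB is used to solve the least squares problem, and the gray coefficients are calculated as shown in the following formula:

$$(\hat{a}, \hat{b})^{T} = \left(B^{T} B\right)^{-1} B^{T} Y.$$

Finally, the parameters $a$ and $b$ to be estimated are obtained.

The information service talent demand prediction model is finally calculated, as shown in the following formula:

$$\hat{x}^{(1)}(k+1) = \left(x^{(0)}(1) - \frac{b}{a}\right) e^{-ak} + \frac{b}{a}, \quad \hat{x}^{(0)}(k+1) = \hat{x}^{(1)}(k+1) - \hat{x}^{(1)}(k).$$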
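The GM (1, 1) steps of this subsection (accumulation, adjacent means, least squares estimation of the development coefficient and gray action quantity, and the time-response forecast) can be sketched in Python as follows. This is a minimal illustration using only the standard library, not the authors' MATLAB code; the function names `gm11_fit` and `gm11_predict` and the synthetic input series are assumptions made for the example.

```python
import math

def gm11_fit(x0):
    """Estimate the GM(1,1) parameters (a, b) by ordinary least squares."""
    n = len(x0)
    # 1-AGO: cumulative sums of the original sequence
    x1, total = [], 0.0
    for value in x0:
        total += value
        x1.append(total)
    # Adjacent means z(k) = 0.5 * (x1(k) + x1(k-1)) for k = 2..n
    z = [0.5 * (x1[k] + x1[k - 1]) for k in range(1, n)]
    y = x0[1:]
    # Fit y = slope * z + intercept; the model x0(k) + a*z(k) = b
    # gives a = -slope and b = intercept.
    m = len(z)
    sz, sy = sum(z), sum(y)
    szz = sum(v * v for v in z)
    szy = sum(zi * yi for zi, yi in zip(z, y))
    slope = (m * szy - sz * sy) / (m * szz - sz * sz)
    intercept = (sy - slope * sz) / m
    return -slope, intercept

def gm11_predict(x0, a, b, steps):
    """Fitted values for k = 1..n followed by `steps` forecasts.

    Assumes a != 0 (a perfectly flat series would need special handling).
    """
    n = len(x0)
    c = x0[0] - b / a
    # Time response of the accumulated series, then inverse accumulation
    x1_hat = [c * math.exp(-a * k) + b / a for k in range(n + steps)]
    return [x0[0]] + [x1_hat[k] - x1_hat[k - 1] for k in range(1, n + steps)]

# Synthetic series growing 10% per step (illustrative only, not the paper's data)
demand = [100.0 * 1.1 ** k for k in range(8)]
a, b = gm11_fit(demand)
fitted = gm11_predict(demand, a, b, steps=3)
```

The two-parameter least squares problem is solved here through the closed-form simple regression formulas, which is equivalent to evaluating the normal equations with the data matrix and constant vector described above.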

2.2. Accuracy Test of Calculation Results

The accuracy of the results of modeling calculation is tested to verify whether the model meets the prediction accuracy requirements.

The original information service talent demand sequence is $x^{(0)}(k)$, $k = 1, 2, \ldots, n$. The simulation sequence of information service talent demand is $\hat{x}^{(0)}(k)$. The residual is shown in the following formula:

$$e(k) = x^{(0)}(k) - \hat{x}^{(0)}(k).$$

The mean value of the original information service talent demand sequence is shown in the following formula:

$$\bar{x} = \frac{1}{n} \sum_{k=1}^{n} x^{(0)}(k).$$

The variance of the original information service talent demand sequence is shown in the following formula:

$$S_1^2 = \frac{1}{n} \sum_{k=1}^{n} \left(x^{(0)}(k) - \bar{x}\right)^2.$$

The mean value of the residuals is shown in the following formula:

$$\bar{e} = \frac{1}{n} \sum_{k=1}^{n} e(k).$$

The variance of the residuals is shown in the following formula:

$$S_2^2 = \frac{1}{n} \sum_{k=1}^{n} \left(e(k) - \bar{e}\right)^2.$$

The mean square error ratio is shown in the following formula:

$$C = \frac{S_2}{S_1}.$$

The small error probability is shown in the following formula:

$$P = P\left\{\left|e(k) - \bar{e}\right| < 0.6745\, S_1\right\},$$

where the smaller the mean square error ratio $C$, the higher the prediction accuracy, and the higher the small error probability $P$, the higher the prediction accuracy. Specific discrimination criteria are shown in Table 1.
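The accuracy test described in this subsection can be sketched as a short Python function. This is a hedged illustration: the name `posterior_error_test` and the sample numbers are assumptions, the variances use the divisor n (some texts use n − 1), and 0.6745 is the constant conventionally used in the gray-system small error probability test.

```python
import math

def posterior_error_test(actual, fitted):
    """Return (C, P): the mean square error ratio and the small error probability."""
    n = len(actual)
    residuals = [x - f for x, f in zip(actual, fitted)]
    mean_x = sum(actual) / n
    mean_e = sum(residuals) / n
    # Standard deviations of the original data (s1) and of the residuals (s2)
    s1 = math.sqrt(sum((x - mean_x) ** 2 for x in actual) / n)
    s2 = math.sqrt(sum((e - mean_e) ** 2 for e in residuals) / n)
    c = s2 / s1
    # Share of residuals whose deviation from the mean residual is "small"
    small = sum(1 for e in residuals if abs(e - mean_e) < 0.6745 * s1)
    return c, small / n

# Illustrative numbers only (not taken from the paper's tables)
c_ratio, p_small = posterior_error_test([10.0, 12.0, 14.0, 16.0],
                                        [10.0, 12.5, 13.5, 16.0])
```

The resulting pair is then compared against the grade thresholds of the discrimination table.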

2.3. Improved GM (1, 1) Model

The BP network model is a multilayer feedforward network trained by the error backpropagation algorithm. It is capable of self-training, self-adaptation, classification, and multidimensional function mapping, and it can fit nonlinear continuous functions with high precision. During training, the BP network model constantly adjusts the weights of the connections between nodes (neurons) to achieve the purpose of information processing. The common BP network model includes input, hidden, and output layers. Adjacent layers are fully connected, and neurons within the same layer are not connected to each other. Its structure is shown in Figure 2. In the forward pass of training, the input training samples are mapped to the hidden layer, and the actual output values are calculated and transmitted to the output layer. During this pass, the weights of the network are unchanged, and the neurons of each layer are affected only by the previous layer. If the output error is too large, the error is propagated backward through the network and the weights are adjusted accordingly. Its working principle is shown in Figure 3.

Gray system theory requires equally spaced data in the modeling process. In practice, however, information service personnel demand data are often unequally spaced, so the data must be interpolated, and the interpolation affects the prediction accuracy of the model. Therefore, a way to compensate for the error is needed. The BP network model is widely used and can be trained by taking the residuals of the GM (1, 1) fitted data as the training source. Residual correction is then performed using the trained model, and the gray GM (1, 1)-BP tandem combination model is established. The model construction process is as follows. First, the GM (1, 1) gray model is constructed, and the demand for information service personnel is predicted as a time series, giving the predicted value $\hat{x}^{(0)}(k)$ of the GM (1, 1) gray model and the prediction residual $e(k) = x^{(0)}(k) - \hat{x}^{(0)}(k)$. Then, the residual sequence is constructed, the BP network model is created, and residual correction is carried out. Finally, the predicted value of the GM (1, 1) model and the predicted residual value are superimposed to obtain the information service talent demand data revised by the BP network model. The specific process is shown in Figure 4.
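The residual-correction idea can be sketched with a very small BP network written in plain Python. This is only an illustration under stated assumptions: the source does not specify the network inputs, layer sizes, or training settings, so the choice of a scaled time index as the single input, four tanh hidden neurons, the learning rate, and the numbers in `gm_fitted` and `residuals` are all hypothetical.

```python
import math
import random

def train_bp(inputs, targets, hidden=4, lr=0.05, epochs=3000, seed=0):
    """Train a 1-input, 1-output network with one tanh hidden layer
    by full-batch gradient descent (error backpropagation)."""
    rng = random.Random(seed)
    w1 = [rng.uniform(-0.5, 0.5) for _ in range(hidden)]  # input -> hidden
    b1 = [0.0] * hidden
    w2 = [rng.uniform(-0.5, 0.5) for _ in range(hidden)]  # hidden -> output
    b2 = 0.0
    n = len(inputs)
    history = []  # mean squared error measured at the start of each epoch

    for _ in range(epochs):
        gw1, gb1, gw2 = [0.0] * hidden, [0.0] * hidden, [0.0] * hidden
        gb2, loss = 0.0, 0.0
        for x, t in zip(inputs, targets):
            # Forward pass: weights stay fixed while the output is computed
            h = [math.tanh(w1[j] * x + b1[j]) for j in range(hidden)]
            y = sum(w2[j] * h[j] for j in range(hidden)) + b2
            err = y - t
            loss += err * err / n
            # Backward pass: propagate the output error to every weight
            gb2 += 2.0 * err / n
            for j in range(hidden):
                gw2[j] += 2.0 * err * h[j] / n
                dh = 2.0 * err * w2[j] * (1.0 - h[j] * h[j]) / n
                gw1[j] += dh * x
                gb1[j] += dh
        history.append(loss)
        for j in range(hidden):
            w1[j] -= lr * gw1[j]
            b1[j] -= lr * gb1[j]
            w2[j] -= lr * gw2[j]
        b2 -= lr * gb2

    def predict(x):
        h = [math.tanh(w1[j] * x + b1[j]) for j in range(hidden)]
        return sum(w2[j] * h[j] for j in range(hidden)) + b2

    return predict, history

# Hypothetical GM(1,1) fitted values and residuals (illustrative numbers only)
gm_fitted = [100.0, 110.2, 121.3, 133.3, 146.2, 160.0, 174.7, 190.3]
residuals = [0.0, -0.2, -0.3, -0.3, -0.2, 0.0, 0.3, 0.7]
steps = [k / 7.0 for k in range(8)]  # time index scaled to [0, 1]

predict_residual, history = train_bp(steps, residuals)
# Corrected value = GM(1,1) prediction + residual predicted by the network
corrected = [g + predict_residual(s) for g, s in zip(gm_fitted, steps)]
```

With only a handful of samples such a network can easily overfit, so in practice the hidden layer size and the number of epochs would need to be chosen with care.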

2.4. Combination Model

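The combined prediction model refers to the combination of different single models with different weights for the same prediction object. By weighing the advantages and disadvantages of different prediction models, different weight coefficients are given to the selected curve model and gray model. The models are combined to optimize the predicted results, and the specific process is as follows:

(1) The demand data of $n$ groups of information service talents are denoted as $y_t$ ($t = 1, 2, \ldots, n$). The prediction result of the GM (1, 1) gray model is $\hat{y}_{1t}$. The predicted result of the curve model is $\hat{y}_{2t}$. The corresponding prediction error of the gray model at phase $t$ is $e_{1t} = y_t - \hat{y}_{1t}$. The prediction error of the curve model is $e_{2t} = y_t - \hat{y}_{2t}$. Assume that the weight coefficient of the gray model is $w_1$ and that of the curve model is $w_2$; then $w_1 + w_2 = 1$. $w_1$ and $w_2$ are the optimal solutions that satisfy the prediction model.

(2) The predicted value of the combined model is $\hat{y}_t = w_1 \hat{y}_{1t} + w_2 \hat{y}_{2t}$. The prediction error is $e_t = y_t - \hat{y}_t = w_1 e_{1t} + w_2 e_{2t}$. $Q$ is the sum of squares of the prediction errors of the combination of the two models, as shown in the following formula:

$$Q = \sum_{t=1}^{n} e_t^2 = \sum_{t=1}^{n} \left(w_1 e_{1t} + w_2 e_{2t}\right)^2.$$

(3) To minimize the prediction error, the following conditions must be met:

$$\min Q \quad \text{subject to} \quad w_1 + w_2 = 1.$$

Let $T$ be the weight coefficient vector of the combined prediction model, $T = (w_1, w_2)^{\top}$. The sum of squares of the prediction errors of the gray GM (1, 1) model is $E_{11} = \sum_{t=1}^{n} e_{1t}^2$. The sum of squares of the prediction errors of the curve model is $E_{22} = \sum_{t=1}^{n} e_{2t}^2$, whereas the covariance term between the prediction errors of the gray model and the curve model is $E_{12} = E_{21} = \sum_{t=1}^{n} e_{1t} e_{2t}$,

where $E = \begin{pmatrix} E_{11} & E_{12} \\ E_{21} & E_{22} \end{pmatrix}$ and $R = (1, 1)^{\top}$. The minimization problem can then be expressed as the following formula:

$$\min Q = T^{\top} E\, T \quad \text{subject to} \quad R^{\top} T = 1,$$

whose solution is $w_1 = \dfrac{E_{22} - E_{12}}{E_{11} + E_{22} - 2E_{12}}$ and $w_2 = \dfrac{E_{11} - E_{12}}{E_{11} + E_{22} - 2E_{12}}$.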
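The weight calculation for two models can be sketched in Python. The closed form below is the standard solution of minimizing the combined sum of squared errors under the constraint that the two weights sum to one; the function name `optimal_weights` and the sample error series are assumptions for illustration.

```python
def optimal_weights(errors_1, errors_2):
    """Weights (w1, w2), w1 + w2 = 1, minimizing the sum of squared
    errors of the combined forecast w1 * model_1 + w2 * model_2."""
    e11 = sum(e * e for e in errors_1)                     # squared errors, model 1
    e22 = sum(e * e for e in errors_2)                     # squared errors, model 2
    e12 = sum(p * q for p, q in zip(errors_1, errors_2))   # cross term
    denom = e11 + e22 - 2.0 * e12  # zero only if both error series are identical
    w1 = (e22 - e12) / denom
    return w1, 1.0 - w1

# Illustrative error series (not the paper's data)
w_gray, w_curve = optimal_weights([1.0, -1.0, 1.0, -1.0], [2.0, 2.0, -2.0, -2.0])
```

Note that the unconstrained closed form can return a negative weight when the two error series are strongly correlated; whether to clip the weights to the interval [0, 1] is a separate modeling choice.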

3. Result Analysis and Discussion

3.1. Data Sources

This paper mainly predicts the national-level demand for information service talents during the 14th Five-Year Plan period. Therefore, the input value is the number of information service talents under the national statistical scope. The two prediction methods involved in the combined model have different requirements for the sample length. The gray model adopts small-sample prediction, and data within 10 years are usually used in the literature. Some studies have shown that good results can be obtained when the sample size of gray prediction is about 8. Therefore, considering the trend of national economic development and referring to similar literature, 8 samples were selected as input values in this paper. This paper selects data from the China Statistical Yearbook (1999–2020) and takes the full-time equivalent number of R&D personnel from 2012 to 2019 as the input value of the gray forecasting model. The data from 1995 to 2019 were used as the input values of the exponential smoothing model. In this way, long-period and short-period predictions are combined to ensure prediction accuracy. Owing to the statistical scope defined by the relevant policy, the above data do not include data from Hong Kong, Macao, and Taiwan.

3.2. Gray Prediction of Information Service Talent Demand

The full-time equivalent number of information service personnel from 2013 to 2020 is shown in Table 2.

According to the data, the original data sequence $x^{(0)}$ is constructed, and the cumulative data sequence $x^{(1)}$, the data matrix $B$, and the constant vector $Y$ are obtained after processing, as shown in the following formula.

The original data sequence is regarded as a smooth sequence when every stage ratio $\sigma(k) = x^{(0)}(k-1)/x^{(0)}(k)$ falls within the admissible interval $\left(e^{-2/(n+1)}, e^{2/(n+1)}\right)$. According to the calculation, all stage ratios fall within this interval, indicating that the original data are a smooth sequence and that GM (1, 1) prediction can be made. The parameters $a$ and $b$ are then estimated, and the gray differential equation is calculated as the following formula:

On further solving, the prediction model is obtained.
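The stage-ratio check used above to decide whether GM (1, 1) is applicable can be sketched as follows. The admissible interval depends only on the sample length n and is the one conventionally used in gray modeling; the function name `stage_ratio_check` and the two sample series are assumptions for illustration.

```python
import math

def stage_ratio_check(x0):
    """Return the stage ratios, the admissible interval, and whether
    every ratio x0(k-1) / x0(k) lies strictly inside that interval."""
    n = len(x0)
    lower = math.exp(-2.0 / (n + 1))
    upper = math.exp(2.0 / (n + 1))
    ratios = [x0[k - 1] / x0[k] for k in range(1, n)]
    passed = all(lower < r < upper for r in ratios)
    return ratios, (lower, upper), passed

# Illustrative series: steady 10% growth passes, doubling does not
_, interval_8, smooth_ok = stage_ratio_check([100.0 * 1.1 ** k for k in range(8)])
_, _, doubling_ok = stage_ratio_check([1.0, 2.0, 4.0, 8.0])
```

For eight samples the interval is roughly (0.80, 1.25), so a series whose year-on-year change stays below about 25% passes the check.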

In order to analyze the prediction accuracy of the model, the demand for information service personnel from 2013 to 2020 was estimated by the prediction model and compared with the actual values. The predicted results are shown in Table 3. The year 2013 is the base period, so its predicted value is the same as the actual value. By calculating the relevant model indices, the mean relative error, the correlation degree, and the posterior error ratio $C$ were obtained, and the small error probability $P$ was then computed. The evaluation results of the model are shown in Table 4.

The evaluation results show that the gray prediction model has passed three kinds of tests and performs well in the posterior difference test, which proves that the model has good prediction accuracy. The calculated value is less than the critical value, indicating that the model can be used for medium and long-term prediction. This model is used to predict the demand scale of scientific and technological talents in China during the 14th five-year plan period, and the predicted results are shown in Table 5.

3.3. Time Series Prediction of Information Service Talent Demand

The time series model is a general name for a family of models, among which the most commonly used are the autoregressive moving average model and the exponential smoothing model. The former is generally used to predict stationary data, while the latter can deal with nonstationary data. In this paper, three common models are used for prediction, and the model with the minimum root mean square error (RMSE) is selected as the time series model. Figure 5 shows the scatter chart of the full-time equivalent data of national information service personnel from 1995 to 2019. The scatter chart shows that the number of information service personnel has an overall upward trend. The autocorrelation analysis shows that the data series is not white noise and exhibits temporal correlation, so a time series model is suitable for the analysis.

In addition, by fitting the points in Figure 5, it was found that the F value was largest when an exponential curve was used for fitting, indicating that this group of data follows an exponential trend and is suitable for prediction by the exponential smoothing method. In time series analysis, seasonality is generally considered. Since the data obtained in this paper are annual and have no seasonal periodicity, only nonseasonal analysis is made. Using the four common models, SPSS software was used to predict and analyze the full-time equivalent number of information service personnel from 1995 to 2019, and the corresponding root mean square error (RMSE) values are shown in Table 6. The comparison shows that the Brown linear trend model in the exponential smoothing method has the best prediction effect, and this model is then used to predict the total number of information service talents from 2021 to 2025. The results are shown in Table 7.
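Brown's linear trend model applies exponential smoothing twice and derives a level and a trend from the two smoothed values. A minimal Python sketch is given below; it is not a reproduction of the SPSS procedure, and the function name `brown_forecast`, the initialization with the first observation, and the smoothing constant are assumptions.

```python
def brown_forecast(series, alpha, horizon):
    """Brown's linear (double) exponential smoothing.

    Returns forecasts for 1..horizon steps beyond the end of `series`.
    Requires 0 < alpha < 1.
    """
    s1 = s2 = series[0]  # a common, simple initialization
    for x in series[1:]:
        s1 = alpha * x + (1.0 - alpha) * s1    # first smoothing
        s2 = alpha * s1 + (1.0 - alpha) * s2   # second smoothing
    level = 2.0 * s1 - s2
    trend = alpha / (1.0 - alpha) * (s1 - s2)
    return [level + trend * m for m in range(1, horizon + 1)]

# Illustrative check on a straight line y = 2k (not the paper's data)
line_forecast = brown_forecast([2.0 * k for k in range(60)], alpha=0.5, horizon=3)
flat_forecast = brown_forecast([5.0] * 6, alpha=0.3, horizon=2)
```

On a long, exactly linear series the method recovers the line, which is why it suits annual data with a steady trend and no seasonality.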

3.4. Combination Forecast of Information Service Talent Demand

The results of the GM (1, 1) prediction model and the BP neural network prediction model are combined by weighted averaging, with the weights determined by the optimal weighted combination method. The prediction result of the combined model is the weighted average under these weights. To compare the validity of the models, the forecast results of the three models for the demand for information service talents from 2013 to 2020 are shown in Figure 6. Among the three prediction models, the MAE and MAPE of the combined model are 7.15 and 1.77%, respectively, both lower than those of the single models, showing that the combined model has a smaller error and a better prediction effect.
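The two error measures used for this comparison, and the weighted averaging of two forecasts, can be sketched as follows. The function names and the sample numbers are assumptions for illustration and are unrelated to the values reported in the paper.

```python
def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be nonzero)."""
    total = sum(abs(a - p) / abs(a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)

def combine(pred_1, pred_2, w1):
    """Weighted average of two forecasts with weights w1 and 1 - w1."""
    return [w1 * p + (1.0 - w1) * q for p, q in zip(pred_1, pred_2)]

# Illustrative values: two models that err in opposite directions
actual = [100.0, 200.0]
model_1 = [110.0, 190.0]
model_2 = [90.0, 210.0]
combined = combine(model_1, model_2, w1=0.5)
```

When the single models err in opposite directions, as in this toy case, the combined forecast can have a lower MAE and MAPE than either model alone.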

This model is used to predict the demand for information service talents in China during the 14th five-year plan period, and the predicted results are shown in Table 8.

4. Conclusion

Demand forecasting of information service personnel is of great significance for scientific planning. Curve fitting on the raw statistics predicts the information service talent demand data well. The information service industry talent prediction algorithm based on the GM (1, 1) gray model requires little sample data and has high prediction accuracy. However, for data with complex features, the prediction results are not satisfactory, and the prediction curve gradually deviates from the measured curve as the prediction period increases. Therefore, the BP network model can be used to update and optimize the GM (1, 1) model to improve the prediction accuracy. By constructing a combined forecasting model, the advantages of both models are combined to improve the forecasting effect. When analyzing and processing a large amount of discrete and stochastic data, multiple models need to be selected for prediction comparison. When prediction accuracy is high, the selection of models should follow the principle of minimizing the average relative error of the residuals. The experimental results show that the combined prediction algorithm in this paper achieves lower errors than the single models it was compared with. The combined prediction model formed by assigning unequal weight coefficients improves the prediction accuracy but still inherits the gray model's large prediction error at later stages. In future work, the combined prediction model can be trained by the neural network after wavelet denoising to improve the prediction accuracy.

Data Availability

The labeled dataset used to support the findings of this study is available from the corresponding author upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] J. Wang and J. M. Sun, “Talent development in China: current practices and challenges ahead,” Advances in Developing Human Resources, vol. 20, no. 4, pp. 389–409, 2018.
[2] D. Jiang, “The construction of smart city information system based on the Internet of Things and cloud computing,” Computer Communications, vol. 150, pp. 158–166, 2020.
[3] V. S. Litvinenko, “Digital economy as a factor in the technological development of the mineral sector,” Natural Resources Research, vol. 29, no. 3, pp. 1521–1541, 2020.
[4] F. Gurcan and N. E. Cagiltay, “Big data software engineering: analysis of knowledge domains and skill sets using LDA-based topic modeling,” IEEE Access, vol. 7, pp. 82541–82552, 2019.
[5] K. Wang, H. Yin, W. Quan, and G. Min, “Enabling collaborative edge computing for software defined vehicular networks,” IEEE Network, vol. 32, no. 5, pp. 112–117, 2018.
[6] F. Dong, Y. Wang, B. Su, Y. Hua, and Y. Zhang, “The process of peak CO2 emissions in developed economies: a perspective of industrialization and urbanization,” Resources, Conservation and Recycling, vol. 141, pp. 61–75, 2019.
[7] Q. Qiang, “Analysis of debt-paying ability of real estate enterprises based on fuzzy mathematics and K-means algorithm,” Journal of Intelligent and Fuzzy Systems, vol. 37, no. 5, pp. 6403–6414, 2019.
[8] L. A. Wegman, B. J. Hoffman, N. T. Carter, J. M. Twenge, and N. Guenole, “Placing job characteristics in context: cross-temporal meta-analysis of changes in job characteristics since 1975,” Journal of Management, vol. 44, no. 1, pp. 352–386, 2018.
[9] L. Pan, X. Feng, F. Sang, L. Li, M. Leng, and X. Chen, “An improved back propagation neural network based on complexity decomposition technology and modified flower pollination optimization for short-term load forecasting,” Neural Computing & Applications, vol. 31, no. 7, pp. 2679–2697, 2019.
[10] Q. Chen, Y. Song, and J. Zhao, “Short-term traffic flow prediction based on improved wavelet neural network,” Neural Computing & Applications, vol. 33, no. 14, pp. 8181–8190, 2021.
[11] C. C. Kuo, J. Z. Shyu, and K. Ding, “Industrial revitalization via industry 4.0 – a comparative policy analysis among China, Germany and the USA,” Global Transitions, vol. 1, pp. 3–14, 2019.
[12] A. Tall, J. Y. Coulibaly, and M. Diop, “Do climate services make a difference? A review of evaluation methodologies and practices to assess the value of climate information services for farmers: implications for Africa,” Climate Services, vol. 11, pp. 1–12, 2018.
[13] M. I. Aziz, A. A. Adnan, A. Afthanorhan, H. Foziah, S. I. Ishak, and N. Rashid, “The influence of employer value proposition in talent demand towards talent shortage in the Malaysian Islamic banking institutions: a SEM approach,” Management Science Letters, vol. 9, no. 6, pp. 843–850, 2019.
[14] Y. He, Y. Qin, S. Wang, X. Wang, and C. Wang, “Electricity consumption probability density forecasting method based on LASSO-Quantile Regression Neural Network,” Applied Energy, vol. 233-234, pp. 565–575, 2019.
[15] A. Yang, W. Li, and X. Yang, “Short-term electricity load forecasting based on feature selection and Least Squares Support Vector Machines,” Knowledge-Based Systems, vol. 163, pp. 159–173, 2019.
[16] S. Smyl, “A hybrid method of exponential smoothing and recurrent neural networks for time series forecasting,” International Journal of Forecasting, vol. 36, no. 1, pp. 75–85, 2020.
[17] S. Tang, S. Xu, and J. Gao, “An optimal model based on multifactors for container throughput forecasting,” KSCE Journal of Civil Engineering, vol. 23, no. 9, pp. 4124–4131, 2019.
[18] Q. Wang, S. Li, R. Li, and M. Ma, “Forecasting U.S. shale gas monthly production using a hybrid ARIMA and metabolic nonlinear grey model,” Energy, vol. 160, pp. 378–387, 2018.
[19] Z. X. Wang, Q. Li, and L. L. Pei, “A seasonal GM(1, 1) model for forecasting the electricity consumption of the primary economic sectors,” Energy, vol. 154, pp. 522–534, 2018.
[20] R. F. Sholeh, “Peramalan harga emas di Indonesia menggunakan algoritma double exponential smoothing damped trend,” INTECOMS: Journal of Information Technology and Computer Science, vol. 4, no. 2, pp. 328–339, 2021.
[21] H. Yonar, A. Yonar, and M. A. Tekindal, “Modeling and forecasting for the number of cases of the COVID-19 pandemic with the curve estimation models, the box-jenkins and exponential smoothing methods,” EJMO, vol. 4, no. 2, pp. 160–165, 2020.
[22] K. Suresh and O. Praveen, “Extracting of patterns using mining methods over damped window,” in Proceedings of the 2020 Second International Conference on Inventive Research in Computing Applications (ICIRCA), pp. 235–241, Coimbatore, India, July 2020.

Copyright

Copyright © 2022 Chan Sun and Yixia Lu. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
