Abstract

Mine microseismic signal denoising is a basic and crucial link in microseismic data processing, which influences the accuracy and reliability of the monitoring system, and is of great significance with regard to safety during mining. Therefore, this study introduces a deep learning method to improve the mapping function and sparsity of signals in the time-frequency domain and constructs a denoising framework based on a deep convolutional autoencoder to address the denoising problem of mine microseismic signals. First, all noisy microseismic signals are normalized to ensure the nonlinear expression ability of the constructed denoising framework. Then, the normalized signals are transformed into the time-frequency domain using the short-time Fourier transform (STFT), and the real and imaginary parts of time-frequency coefficients serve as the input of the deep convolutional autoencoder to output the masks of the effective and noise signals. Next, these masks are applied to the time-frequency coefficients of the noisy microseismic signals, and the time-frequency coefficients of the potentially effective and noise signals are estimated. Finally, inverse STFT is used to transform these time-frequency coefficients to the time domain to obtain the final denoised effective and noise signals. The constructed framework automatically learns rich features from synthetic data to separate the effective and noise signals, thereby achieving the purpose of fast and automatic denoising. The experimental results show that compared with the wavelet threshold and ensemble empirical mode decomposition, the denoising framework considerably improves the signal-to-noise ratio of mine microseismic signals with less waveform distortion. Moreover, it can achieve a better denoising effect efficiently even in the case of a low SNR, which has obvious advantages. The constructed denoising framework is suitable for microseismic monitoring signals of various mine dynamic disasters and provides strong technical support for intelligent monitoring and early warning concerning production risks in mines.

1. Introduction

With the rapid development of China’s economy, the demand for various mineral resources has sharply increased. Therefore, it is imperative to expand the scale of mining and strengthen the exploration and exploitation of deep resources [1]. However, the security issues caused by complex, difficult, and deep mining have become increasingly prominent with the continuation of mining. Serious mine dynamic disasters and hidden dangers, such as roof fall, collapse, and rock burst, cause huge losses to people and their properties [2, 3]. Thus, continuously monitoring the rupture of rock masses, quantitatively evaluating the stability of underground surrounding rocks, and accurately predicting mine dynamic disasters have become crucial to mine safety [4]. Previous studies showed that microseismic monitoring is an effective technology for mine dynamic disaster monitoring and early warning [5, 6]. The main task in microseismic monitoring is to timely delineate dangerous areas by locating hypocenters, evaluating the stability of rocks, and predicting possible geological disasters [7]. Owing to the low signal-to-noise ratio (SNR) of microseismic signals, they must first be denoised to ensure the accuracy of first arrival picking and hypocenter location [8, 9].

Denoising is a classic problem in geophysics. Mine microseismic signals usually have nonlinear and nonstationary characteristics. Notably, traditional denoising methods, such as Fourier transform [10], can suppress noise to a certain extent, but they are more suitable for dealing with stationary periodic signals. When processing nonstationary signals with spikes or sudden changes, such as microseismic signals, their denoising effect is not very satisfactory. For the denoising of nonlinear signals, researchers have proposed several methods, such as wavelet transform (WT) [11, 12], S transform [13], curvelet transform [14], dreamlet transform [15], contourlet transform [16], shearlet transform [17], and empirical mode decomposition (EMD) [18, 19]. These methods can improve the quality of microseismic signals; however, they have certain limitations in terms of their validity and practicality. In practical application, they generally require more manual participation, such as the basis function selection of WT and the intrinsic mode function (IMF) selection of EMD, considerably limiting their application scope [20, 21]. Furthermore, the denoising effect of these methods is not satisfactory under a low-SNR condition. After denoising, the effective signals are distorted, destroying the original amplitude characteristics of microseismic signals and reducing the fidelity of effective signals [22, 23]. Therefore, their denoising performance needs further improvement.

In all these methods, microseismic signals are first transformed from the time domain to the time-frequency domain using a time-frequency analysis method. Then, the time-frequency coefficients are modified (usually thresholded) to attenuate noise-related coefficients, and the values of the effective signal coefficients are estimated. Finally, the modified coefficients are inverse-transformed to the time domain to reconstruct the denoised signals [24, 25]. They are referred to as time-frequency domain denoising methods, which basically involve improving the sparsity of signals by transforming microseismic data to other spatial domains, wherein the signals can be represented by a set of sparse features, making it easier to separate the effective and noise signals [26, 27]. Time-frequency domain denoising methods are the main focus of current research on microseismic signal denoising. On the basis of considering the time-frequency representation of multiple traces as a 3-D tensor, Iqbal et al. [28] introduced an approach for detecting and denoising microseismic events using tensor decomposition, where the higher order and tensor singular value decomposition were used for detecting and denoising, respectively. To obtain a sparse time-frequency representation and have a higher energy concentration, Zhang et al. [29] proposed a denoising framework based on the cumulative distribution function thresholding in the wavelet domain using synchrosqueezed continuous wavelet transform (SS-CWT). To suppress isolated noise on the time-frequency plane using SS-CWT with hard thresholding, Zeng et al. [30] introduced an image processing approach called pixel connectivity thresholding to further improve the SNR of microseismic signals. More studies on time-frequency domain denoising of microseismic signals can be seen in the literature [12, 23, 3133]. Generally, these methods can be used to improve the denoising effect in two ways: use a more flexible and powerful mapping function or use a more efficient sparse representation of the data [3436]. However, how to map microseismic data into an optimal denoising space and choose an appropriate threshold function to separate the effective and noise signals is a great challenge [37].

Recently, with the development of deep learning technology, deep neural networks have been gradually applied to signal denoising. For the denoising of 1-D seismic signals, Iqbal [38] proposed a denoising method based on an intelligent deep convolutional neural network, which learns sparse representation of the data simultaneously in the time-frequency domain and adaptively captures seismic signals corrupted with noise. Othman et al. [39] proposed a fully automated event detection and denoising method for seismic data, where a residual neural network and an IIR Wiener filter were used to detect seismic events and denoise the seismic data, respectively. These networks can improve the denoising effect by using convolution to extract the features of signals and training to obtain their better sparse representations [4042]. Therefore, a deep learning method is primarily used in this study to improve the mapping function and sparsity of signals to construct a novel time-frequency domain denoising framework, namely, a microseismic signal denoising method based on a deep convolutional autoencoder. This method can learn the sparse representations of input signals from the training set and generate two separate masks for each noisy microseismic signal according to these representations; the generated masks are then used to extract the effective and noise signals, thereby achieving the purpose of mine microseismic signal denoising. The main contributions of this study are outlined as follows:(1)A fully automated denoising framework based on a deep convolutional autoencoder is constructed for mine microseismic signals.(2)Despite only being trained on synthetic data, the denoising framework can be generalized to practical denoising tasks, and test results on a real dataset show better performance, denoising effect and computational efficiency, in comparison with two common denoising methods.(3)Because it does not require any prior knowledge or preprocessing, the constructed denoising framework is suitable for microseismic monitoring signals of various mine dynamic disasters.

2. Methodology

In the time-frequency domain, each noisy microseismic signal can be expressed as the superposition of an effective microseismic signal and a noise signal (a collective term for various noises or nonmicroseismic signals):

The objective of denoising is to estimate the potential effective microseismic signal from the polluted signal under the condition of minimizing the error between the estimated and real signals [37], which is as follows:where represents the L2 norm, indicates inverse time-frequency transform, and represents the mapping function, which maps the noisy microseismic signal to the time-frequency representation of the potential effective signal. The results of the study showed that this mapping can be obtained by using a simple threshold in a sparse representation, and its magnitude can be estimated based on the noise level [43].

2.1. Autoencoder

An autoencoder is an unsupervised feature extraction and data dimensionality reduction method [44]. It is a symmetric network consisting of an encoder, encoding, and decoder. The schematic of its network structure is shown in Figure 1.

Starting from the leftmost input layer, the input sample is processed layer by layer through the hidden middle layers and finally reaches the output layer. The output of the decoder can be considered as a prediction value of the input . The encoding and decoding processes can be expressed as formulas (3) and (4), respectively.where and are the power matrix and bias between the input and encoding layers, respectively; and are the power matrix and bias between the encoding and output layers, respectively; is the hidden variable; and is the activation function. The objective of network training is to minimize the loss function , namely,where and . In an autoencoder, since the number of nodes in the input and output layers is equal, it attempts to approximate an identity function through the encoding and decoding operations (so that the output value corresponds to the input value) to determine the hidden association structure in original data.

2.2. Convolutional Autoencoder

In a convolutional neural network, different features can be extracted from different convolutional layers [45], and both local and global features are beneficial to obtain clean effective microseismic signals. Therefore, in order to introduce a convolutional neural network into the denoising of microseismic signals, this study used the neural network to learn the sparse representations of the signals, generated an optimal mapping function for each signal according to the effective and noise signals in training samples, and defined the mapping functions as two separate masks, and [46], which represent the effective and noise signals, respectively:

The size of each mask is the same as that of the input time-frequency microseismic signal , with the value being between 0 and 1, which indicates the proportion of corresponding signals, effective microseismic signal or noise signal, in a noisy microseismic signal.

The network structure of the deep convolutional autoencoder designed in this study according to the characteristics of the autoencoder and convolutional neural network is shown in Figure 2.

In the deep convolutional autoencoder network (Figure 2), the input of the first layer comprises the real and imaginary parts of the time-frequency representations for microseismic signals. The output comprises two masks, and , respectively, of the effective and noise signals. The blue rectangles represent the inner layers of the network. In the first half of the network, the input time-frequency coefficients are first downsampled using multiple 3 × 3 convolutional layers with a stride of 2 × 2 and then processed using the activation layer of the rectified linear unit (ReLU) function after each convolutional layer. These convolutional layers act as feature extractors, compressing the feature maps and quickening the learning of the sparse representation of the input data to the bottleneck layer. In the second half of the network, the deconvolution layer upsamples to generate a high-dimensional nonlinear mapping of the sparse representations to the effective and noise signal masks, and the softmax function is used in the last layer to predict the difference between the effective and noise signal masks. These masks are considered the targets in optimizing the convolutional autoencoder network during training. Through training, the network learns how to construct a sparse representation of the data as well as how to optimize the cross-entropy loss function to obtain optimal masks that separate the effective and noise signals. Meanwhile, in the convolutional autoencoder network structure, skip connections are utilized to improve the convergence of training and prediction [47]. Based on an autoencoder structure, the convolutional autoencoder performs convolutional operations instead of the inner product of ordinary matrices, which improves the feature extraction ability of the network and provides higher computational efficiency.

2.3. Denoising Framework Based on a Convolutional Autoencoder

For mine microseismic signal denoising, a denoising framework based on the deep convolutional autoencoder is constructed in this study, and the calculation process is shown in Figure 3.

First, in order to ensure the nonlinear expression ability of the constructed denoising framework and accelerate the convergence speed during network training, each noisy microseismic signal is normalized by subtracting its minimum value and dividing the resulting expression by the difference between its maximum and minimum values.

Then, all normalized signals are transformed into the time-frequency domain using the short-time Fourier transform (STFT), and the real and imaginary parts of time-frequency coefficients serve as the input of the convolutional autoencoder to output the effective and noise signal masks. These masks are considered the targets in optimizing the convolutional autoencoder network during training. The loss function used in the network is the mean squared error loss, and the corresponding formula is as follows:where N indicates the number of training samples, represents the L2 norm, is the label (or mask) of an effective signal, and is the output of the convolutional autoencoder network (i.e., the predicted time-frequency mask of an effective signal).

Next, the effective and noise signal masks are applied to the time-frequency coefficient of each noisy microseismic signal, and the time-frequency coefficients of the potential effective signal and noise signal can be estimated according to the following formula:

Finally, inverse STFT is used to transform and to the time domain to obtain the final denoised effective and noise signals.

As can be seen from the entire computational process, the denoising framework based on the deep convolutional autoencoder considers microseismic signal denoising as separating the effective and noise signals. Taking the three-component noisy microseismic signals as the input and the time-frequency masks as the optimization targets, the denoising framework obtains the denoised effective and noise signals by applying the predicted masks to the noisy microseismic signals in the time-frequency domain. Instead of manually defining different features and thresholds to enhance the effective signals and attenuate the noise signals, it automatically learns rich features from training set to separate the effective and noise signals and realizes fast and automatic denoising. Therefore, the constructed denoising framework can reduce the effects of human parameter adjustment and threshold selection, accelerate the processing of microseismic data, and help the automation and intelligence of microseismic data processing.

3. Experiments and Results

To test the accuracy and efficiency of the constructed denoising framework, it was used to denoise real mine microseismic signals on the basis of analyzing the characteristics of noise signals. The results were compared with those obtained using WT and ensemble empirical mode decomposition (EEMD) to quantitatively evaluate the denoising effect of the constructed framework.

3.1. Data

In order to demonstrate the denoising effect of the constructed framework, we took the microseismic monitoring signals of a mine in Gansu Province as an example. This mine is a copper-nickel sulfide mine with a mining depth of more than 1,000 m. As mining deepens, the mining conditions and operating environment deteriorate; particularly, the problem of dynamic disasters caused by rock mass excavation becomes more and more prominent, which further intensifies the difficulty of mining. Therefore, to analyze and predict dynamic disasters caused by the disturbance from deep mining and ensure the safety of mine operators’ lives and properties, a set of microseismic monitoring system was set up for the mining area, and 16 three-component geophones (sampling rate is 1,000 Hz) were arranged to dynamically and continuously monitor various mine dynamic disasters, such as roof fall, collapse, and rock burst, in real time. The training and testing of the model require a large amount of sample data. However, clean effective microseismic signals cannot be obtained in practical application. Thus, this study selected 32,317 microseismic events with an extremely high SNR for clean effective microseismic signal samples and 51,251 nonmicroseismic event signals for noise signal samples and randomly combined these signals to generate noisy microseismic signals with different SNR levels. All the samples were randomly divided into training, validation, and test sets with proportions of 80%, 10%, and 10%, respectively. These sets were used to train the model, adjust the model hyperparameters, and test the denoising effect, respectively. Furthermore, 1,000 real mine microseismic signals with a low SNR were selected to test the generality of the model.

3.2. Analysis of Noise Signal Characteristics

Usually, signals received by a mine microseismic monitoring system can primarily be divided into three categories: effective microseismic signals, blast signals, and noise signals caused by various reasons [48]. Mining environments are complex, and various influencing factors lead to noise signals with a wide frequency band and rich spectrum components. The characteristics of various noise signals are clearly different in terms of waveform and spectrum. Even the same type of noise shows different characteristics owing to different generated conditions and environments [49]. Based on its source, noise can be roughly classified into the following types:(1)Mechanical operation noise: It mainly refers to the noise generated by mine machinery and equipment, such as road headers, drills, and fans (Figure 4(i)). The frequency of this noise mostly lies in the range of 100–300 Hz, and the amplitude variation is small, mostly in the range of 10−7–10−4 m/s; the waveforms are relatively dense, all of which are pulse burrs without clear attenuation and onset vibration. Being related to the working time of machinery, its duration is generally long, reaching tens of seconds. Moreover, some noise signals are periodic.(2)Electrical noise: It mainly refers to the electromagnetic interference generated by electrical equipment (e.g., fans, road headers, and other high-power mechanical equipment) in a mine and the electrical noise generated by cables and sensors (Figure 4(ii)). The frequency of this noise is relatively concentrated, with the main frequency being 50 Hz or an integer multiple of 50 Hz. The amplitude slightly changes, mostly in the range of 10−6–10−4 m/s. Its waveform is generally in the shape of pulse burrs without clear onset vibration and attenuation, and the duration is also generally long.(3)Human activity noise: It mainly refers to the noise generated by underground work, such as roadway repair, support, and manual equipment loading and unloading (Figure 4(iii)). This type of noise greatly changes in frequency range and amplitude; therefore, it is difficult to remove.(4)Random noise: It mainly refers to the noise near sensors, including noise from external environments (e.g., from the mass caving, collapse, and spalling of rocks) or caused by instruments (Figure 4(iv)). The amplitude of this noise greatly varies, mostly in the range of 10−8–10−3 m/s, and the frequency components are complex and distributed in high-, medium-, and low-frequency bands.

Generally, the complex waveform components of noise signals, various amplitudes, extremely irregular waveforms, and the complete spectrum aliasing of the noise and effective signals result in the effective microseismic signals being submerged into various noises, greatly increasing the difficulty of mine microseismic signal denoising.

3.3. Evaluation Indicators

Herein, three indicators, namely, SNR, correlation coefficient (CCR), and root mean squared error (RMSE), are used to quantitatively evaluate the denoising effect [50]. The formula for calculating the SNR iswhere σS is the standard deviation of an effective signal, while σN is that of a noise signal. The SNR represents the ratio of the effective signal to the noise signal in a noisy microseismic signal, and its unit is dB. It is a crucial indicator for evaluating signal quality. The larger the SNR, the lower the noise content and the better the denoising effect.

The formula for calculating the CCR iswhere and represent the signals before and after denoising, respectively; represents the covariance of the signals before and after denoising; and and represent the variance of the signals before and after denoising, respectively. The CCR indicates the degree of correlation between the denoised signal and the clean effective microseismic signal. Its value is between 0 and 1. The closer the value is to 1, the more similar the signal waveform is to the ideal signal waveform and the better the denoising effect is.

The formula for calculating the RMSE iswhere represents the L2 norm; and represent the signals before and after denoising, respectively; and n is the number of signal sampling points. The RMSE is the evaluation indicator for the deviation of the denoised signal from the mean value of the clean effective microseismic signal. Similar to the CCR, the smaller the value is, the closer the denoised signal is to the ideal signal and the better the denoising effect is.

3.4. Results and Discussion

The test environment included Intel(R) Core(TM) i7-8550U CPU @ 1.80 GHz, 16.0 GB RAM, and NVIDIA GeForce MX150 graphics card, with a Microsoft Windows 10 Home 64 bit operating system, Python 3.7.10, and TensorFlow 2.3.0. The experiment with the constructed denoising framework in this study was divided into two stages: model training and model denoising. In the model training stage, the effective and noise signals were used as input, and corresponding labels, the effective and noise signal masks, were used as the target output of the network. The cross-entropy loss function was used to calculate the distance between the output masks and labels, while the stochastic gradient descent algorithm was used to minimize the loss function. For the convenience of training, the sampling rate of the mine microseismic signals was resampled from 1,000 to 250 Hz. After repeated training and tuning experiments, the error of the training result was minimized (i.e., the training of the model was completed). The model training results are presented in Figure 5.

As shown in Figure 5, the average value of the cross-entropy loss function gradually decreased with an increase in the number of epochs during the model training, which showed a clear convergence trend. Moreover, the convergence speed of the model was fast. After 100 epochs of training, the average value of the cross-entropy loss function no longer decreased, and the model became stable.

After the model training was completed, it entered the model denoising stage. In this stage, the noisy microseismic signals were input into the model, and the separated effective and noise signals were output to realize the denoising of mine microseismic signals. In order to test the denoising effect of the proposed method, the noise and effective signals were combined into noisy microseismic signals with different SNR levels to form a test set for model testing. Furthermore, to quantitatively analyze the denoising effect of the proposed method, the denoising results of two common denoising methods, WT and EEMD, were compared with the processing results of the method based on the deep convolutional autoencoder. The experimental results of denoising under different SNR levels (from −6 to 8 dB) are presented in Table 1 and Figure 6.

In Figure 6, (i) and (ii) represent the clean effective and noise signals, respectively, (iii) represents noisy microseismic signals, and (iv) and (v) represent the denoising results obtained using the proposed method, namely, the effective microseismic and noise signals. Figure 6 illustrates two denoising examples. For the first one (Figures 6(a) and 6(b)), the SNR, CCR, and RMSE before denoising are −4.07, 0.68, and 11.43, respectively, while these values improve to 6.95, 0.86, and 6.08, respectively, after denoising. For the other one (Figures 6(c) and 6(d)), the SNR, CCR, and RMSE are −0.94, 0.79, and 8.55, respectively, before denoising, and 8.93, 0.90, and 5.27, respectively, after denoising. Taking Figures 6(a) and 6(b) as an example, on the one hand, the time-frequency planes before and after denoising (Figure 6(a)) show that the energy of low- and high-frequency noise was completely suppressed after denoising in two time periods of approximately 0 to 20.96 s and 24.12 to 45 s, whereas the energy of effective microseismic signal was clearly evident in 20.96 to 24.12 s. Meanwhile, some low-frequency noises around the effective signal were suppressed to varying degrees. Based on the whole time-frequency plane (Figure 6(a)(iv)), only the effective signal part of the microseismic signal had energy, showing that the proposed method had achieved a satisfactory noise suppression of the mine microseismic signal. On the other hand, it can be seen intuitively from the time-domain planes (Figure 6(b)) that the proposed method could effectively remove the noise in the mine microseismic signal and restore the effective microseismic signal.

WT and EEMD are two common methods in mine microseismic signal denoising. The basic principle of WT is that after a microseismic signal is decomposed by the Mallat algorithm, the wavelet coefficient amplitude of the effective signal is greater than that of the noise signal. If an appropriate threshold is selected to remove the wavelet coefficient smaller than the threshold and retain those greater than the threshold, the denoising task can be completed [51, 52]. To achieve the best denoising effect for WT, repeated tests were conducted. The results show that when the db4 wavelet basis function is used, the number of layers in the wavelet decomposition is set to five, and when the Heursure threshold function is adopted [53], the denoising effect is the best. EMD is based on the Hilbert–Huang transform to decompose a microseismic signal into a sum of IMFs. The noise signal is mainly concentrated in high-frequency components (low-sequence IMFs), while the effective signal is mainly concentrated in low-frequency components (high-sequence IMFs). Therefore, reconstruction using low-frequency IMFs can achieve the purpose of denoising [54, 55]. Although EMD decomposes each signal according to its time-scale characteristics without presetting any basis functions, modal aliasing is prone to appear. EEMD utilizes the uniform distribution and zero-mean characteristics of white noise spectrum to solve the problem of modal aliasing in EMD by adding white noise and averaging it multiple times [19]. According to the test results, twelve IMFs are generated after a microseismic signal is adaptively decomposed using EEMD. Among them, the first six IMFs are noisy components, and the energy ratio of the last six IMFs is much higher than that of others, which contain more information about the effective microseismic event. Therefore, the denoising task can be completed well by extracting the last six IMFs to reconstruct the effective microseismic signal. Table 1 and Figure 6 show that although WT and EEMD can improve the SNR of each microseismic signal to a certain extent, the degree of improvement is limited. However, the proposed method can learn the characteristics of the effective and noise signals through training, and the SNR of each effective microseismic signal after denoising is higher. Moreover, the CCR exhibits considerable improvement at different SNR levels, and the corresponding RMSE is smaller. The higher SNR and CCR and the smaller RMSE indicate that the proposed method has a strong denoising ability and a small degree of waveform distortion after denoising, which is close to the ideal signal. The experimental results show that the overall SNR for the proposed method is considerably improved, CCR is substantially improved, RMSE is smaller, and denoising ability is the best compared with those for WT and EEMD. In addition, the more serious the microseismic signal pollution is, the more obvious the denoising advantage of our method is.

To visually demonstrate the overall denoising effect of the constructed framework, the statistical histograms of the SNR, CCR, and RMSE for the test set before and after denoising are presented in Figure 7. It can be clearly seen from Figure 7(a) that the SNR for the test set before denoising is mainly distributed between −5.99 and 7.92 dB, with the average SNR being 1.03 dB. The SNR after denoising is mainly distributed between −1.94 and 13.77 dB, with the average SNR being 5.68 dB. Therefore, the average SNR of the denoised signals is considerably improved. As shown in Figure 7(b), the CCR before denoising is mainly between 0.51 and 0.78, with an average of 0.65. The number of noisy microseismic signals distributed in each interval of the CCR is relatively uniform. After denoising, the CCR is mainly between 0.63 and 0.98, the number of denoised microseismic signals in each interval of the CCR gradually increases with an increase of its value, and the average CCR after denoising is 0.92. These findings indicate that the denoised microseismic signals can effectively preserve the corresponding waveforms of the effective signals. In Figure 7(c), the RMSE for the test set before denoising is mainly distributed between 5.12 and 15.87, whose average RMSE is 10.43. The RMSE after denoising is mainly distributed between 1.25 and 9.76, whose average RMSE is 4.98. The RMSE is greatly reduced after denoising, indicating closeness to the ideal signal.

The denoising framework based on the convolutional autoencoder was trained on synthetic microseismic data that superimposed the real noise signals on the real high-SNR effective microseismic signals. To test the generality of the constructed denoising framework, we applied it to 1,000 real mine microseismic signals. These signals (Figure 8(i)) are severely polluted by noise, and the SNR is low. Moreover, to intuitively compare the denoising results of the proposed method with those of other methods, we used the constructed framework, WT, and EEMD to denoise mine microseismic signals with different SNR levels. The comparison is presented in Figure 8.

Figure 8 presents the denoising results of different methods under different SNR levels. As can be seen from (iv) and (v) in Figure 8, at a high-SNR level, WT and EEMD can achieve a relatively good denoising effect, and the waveform distortion is relatively small. However, with the decrease in the SNR of mine microseismic signals, the denoising ability of WT and EEMD gradually decreases. Although the denoising effect of EEMD is better than that of WT, the denoising results of these two methods still contain much noise, the distortion of waveforms after denoising is large, and the denoising effect is poor. In comparison, the denoising effect of the proposed method is obviously better, and the overall waveform of each effective microseismic signal after denoising is clearer. After denoising, the P- and S-waves of each effective microseismic signal are more obvious, leading to better recognition of P- and S-wave phases. In particular, the noise before the arrival of P-wave is well suppressed, and the noise residue is approximately zero (Figure 8(iii)), which is beneficial for accurately detecting microseismic events and phases in the subsequent microseismic data processing. The experimental results show that even in the case of a low SNR, the proposed method can better suppress the noise in mine microseismic signals and realize the separation of the effective and noise signals. Therefore, although our method is trained on synthetic data, it can be generalized to practical denoising tasks and applied to the denoising of real mine microseismic signals. Moreover, it does not require any prior knowledge or preprocessing and has strong practicability.

In real mine microseismic signal processing, given the extended period of microseismic monitoring and the numerous data involved, computational efficiency also represents a key concern for denoising methods. To evaluate the computational efficiency of the constructed denoising framework, we recorded the time taken by the proposed method to process different number of microseismic signals and compared them with those of WT and EEMD. The results are presented in Table 2 and Figure 9. As can be seen from the comparison (Table 2 and Figure 9), our method takes a far shorter time to denoise microseismic signals, and this merit will become even more prominent with an increase in the data processed. Of course, as our method is a supervised learning method, the model has to be trained before use—in our experiment, it takes approximately 9,627 s to train the model on the constructed training samples—but once the model is trained, it denoises signals efficiently and quickly.

In summary, the common denoising methods of mine microseismic signals, such as WT and EEMD, can improve the quality of signals to a certain extent, but the ability to denoise is limited, especially under a low-SNR condition. Moreover, in practical application, these methods generally require more manual participation, which greatly limits their application scope. For instance, to have a good denoising effect, WT needs to select the best basis function and decomposition layer according to the real signals. Although EEMD solves the problem of signal distortion caused by modal aliasing in EMD and does not need to preset any basis functions, it requires multiple iterations during decomposition, which is very time-consuming. Furthermore, there is a lack of uniform criteria for stopping iterations, and different conditions may lead to different results. By comparison, instead of manually defining different features and thresholds to improve the SNR, our data-driven method can automatically learn rich features from synthetic data to separate the effective and noise signals and realize fast and automatic denoising. The performance, denoising effect and computational efficiency, of the constructed framework outperforms these two denoising methods even in the case of a low SNR. As a supervised learning method, it has to be trained before use, and the quality and quantity of training data will influence the denoising effect. It is sometimes difficult to obtain large, diverse, and reliable training samples in practical application. Using microseismic signals with a high SNR instead of clean effective signals to train the model will more or less bring some interference to it and affect its generality. This may be the main reason why the waveform amplitude preservation of the denoised signal is not so great. In the process of denoising, our method applies the predicted masks to noisy microseismic signals in the time-frequency domain to separate the effective and noise signals. Because the range of the value for each mask is between 0 and 1, the constructed denoising framework cannot recover a signal with the value larger than that of the input noisy microseismic signal.

Blasting is a common construction method in mining. Despite providing convenience for mining, it may cause some potential hazards. Therefore, researching blasting signals has a crucial engineering application value. To accurately extract the subsequent blasting features, each blasting signal first needs to be denoised. The source of blast signals is of an expansive type. The energy generated is instantaneously released, compressing the surrounding rock masses and primarily propagating outwards in the form of a P-wave. This type of signal has a high energy and large waveform amplitude. Its waveform decays fast at the beginning and slows down later, forming a “dovetail” waveform shape. The duration of blast signal is generally longer than that of effective microseismic signal (rock rupture signal). When making the model training set, we also added a certain number of blasting signals so that the constructed framework can also denoise blasting signals. The denoising results of blasting signals are presented in Figure 10. The blasting signal waveforms after denoising using the constructed framework are obvious, the characteristics of “dovetail” waveforms are clear (Figure 10(ii)), noise is well suppressed, and blasting events can be effectively distinguished.

4. Conclusion

Microseismic monitoring is an integral part of mine safety management and an effective means to realize mine dynamic disaster monitoring and early warning. Microseismic signal denoising is a basic and crucial link in data processing, which influences the accuracy and reliability of the monitoring system. Therefore, to address the denoising problem of mine microseismic signals, a denoising framework based on a deep convolutional autoencoder was constructed in this study. The constructed denoising framework can learn the sparse representations of the input signals in the time-frequency domain and generate two separate masks for each noisy microseismic signal based on these representations to extract the corresponding signals (one is used to extract the effective signal, while the other is used to extract the noise signal), thereby achieving the purpose of fast and automatic denoising. To verify the effectiveness of the denoising framework, we applied it to real mine microseismic signals and compared the denoising results with those of two methods. The experimental results show that the denoising framework can considerably improve the SNR of each microseismic signal as well as reduce its waveform distortion degree after denoising. Compared with WT and EEMD, even in the case of a low SNR, the denoising framework can better suppress the noise in mine microseismic signals, preserve the waveform shape of effective signals to the greatest extent, and efficiently separate the effective and noise signals. Moreover, the constructed denoising framework does not need to understand and identify the complex relationship between attributes. Because it does not require any prior knowledge or preprocessing and has strong practicability, the denoising framework can also be utilized for microseismic monitoring signals in other fields, such as for the stability monitoring of surrounding rocks in tunnel excavations and chamber surrounding rocks in hydropower stations.

Data Availability

Some or all data, models, and codes used in this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This work was supported by the Department of Science and Technology of Sichuan Province (grant no. 2022YFS0521) and National Natural Science Foundation of Sichuan Province (grant no. 2023NSFSC0020).