Abstract

In this paper, we explore the potential of using the multivoxel proton magnetic resonance spectroscopy (1H-MRS) to diagnose neuropsychiatric systemic lupus erythematosus (NPSLE) with the assistance of a support vector machine broad learning system (BL-SVM). We retrospectively analysed 23 confirmed patients and 16 healthy controls, who underwent a 3.0 T magnetic resonance imaging (MRI) sequence with multivoxel 1H-MRS in our hospitals. One hundred and seventeen metabolic features were extracted from the multivoxel 1H-MRS image. Thirty-three metabolic features selected by the Mann-Whitney test were considered to have a statistically significant difference (). However, the best accuracy achieved by conventional statistical methods using these 33 metabolic features was only 77%. We turned to develop a support vector machine broad learning system (BL-SVM) to quantitatively analyse the metabolic features from 1H-MRS. Although not all the individual features manifested statistics significantly, the BL-SVM could still learn to distinguish the NPSLE from the healthy controls. The area under the receiver operating characteristic curve (AUC), the sensitivity, and the specificity of our BL-SVM in predicting NPSLE were 95%, 95.8%, and 93%, respectively, by 3-fold cross-validation. We consequently conclude that the proposed system effectively and efficiently working on limited and noisy samples may brighten a noinvasive in vivo instrument for early diagnosis of NPSLE.

1. Introduction

Systemic lupus erythematosus (SLE) is an autoimmune disease involving multiple organs or systems, such as the central nervous system (CNS), peripheral nervous system (PNS), skin, joints, and kidneys [1]. Up to 75% of SLE patients suffer from CNS and PNS disorder [2]. The neuropsychiatric systemic lupus erythematosus (NPSLE) is closely related to a worse prognosis and a serious mortality [35]. In 1999, a classification criterion for NPSLE has been developed by the American College of Rheumatology (ACR), which included case definitions for 19 neuropsychiatric syndromes, significant exclusions, and recommendation of ascertainment [6]. It is still a tough task to ascribe a specific symptom or sign to NPSLE.

Magnetic resonance imaging (MRI) is widely considered a promising noninvasive tool for SLE diagnosis [7]. Conventional MRI sequences, including T1-weighted, T2-weighted, T2 fluid-attenuated inversion recovery (T2-FLAIR) images, and diffusion-weighted imaging (DWI) can sensitively reveal abnormal changes caused by axonal damage, cortical damage, cerebral atrophy, cerebral infarctions, inflammatory-like lesions, and small vessel disease [810]. However, there were about 50% of NPSLE patients that had normal intensities in the structural MRI [11], displaying the limitation of conventional MRI.

Proton magnetic resonance spectroscopy (1H-MRS) is accessible to the levels of tissue metabolites including N-acetylaspartate (NAA), creatine (Cr), choline (Cho), glutamate (Glu), Glu-glutamine (Gln), myo-inositol (mI), and lactate (Lac) [1214].

N-Acetylaspartate (NAA) is a special character of neural cells, and a decreased peak of NAA in MRS spectra represents the reduction of neurons [1517]. Choline (Cho) plays an important role in generating the phospholipid of the cell membrane. An elevated Cho peak shows the increased cell membrane synthesis, which reflects that the cell structure is in a mess [16]. Glutamate (Glu) and Glu-glutamine (Gln) are related to glutamatergic neurotransmitters, and increased Glu and Gln represent a high risk in psychosis [17]. Myo-inositol (mI) is involved in glial metabolism, and increased mI reflects glial involvement [18]. Moreover, 1H-MRS enables quantifying metabolic concentration noninvasively in vivo [19]. Thus, it has been widely explored in pioneering researches of noninvasively diagnosing NPSLE [9, 20, 21]. In previous studies, a reduction in NAA was observed, whereas total Cho (tCho) and mI are raised, in normal appearing brain tissue of NPSLE patients [20], which demonstrated that neural biomarkers were able to predict the early involvement of the central nervous system in SLE. However, Zimny et al. found that the levels of mI and Cho were almost normal in their patients with SLE or NPSLE [22].

Therefore, these changes detected by single-voxel MRS in the above studies may not be specific enough, since there is limitation of one voxel which is not included in all the regions that may have pathological changes. In this regard, the conventional statistical methods easily fail to distinguish NPSLE due to the individual difference of metabolic features among limited samples. The accuracy of diagnosing NPSLE by 1H-MRS required further improvement by emerging machine learning techniques.

Deep neural networks have been successfully applied in a great number of applications [23], including medical image processing [2426]. However, there are some unavoidable systematic errors, such as relaxation and partial-volume effects, which resulted in missing metabolic values. Consequently, a sufficient and consistent training set can hardly be constructed, which makes a great challenge to apply deep techniques to this task, since the samples are too small to train a deep structure with a great number of parameters. Moreover, the metabolic features mingling with missing values and noise also burden the classifier to make a correct judgement.

To develop a robust and effective model to quantitatively analyse the metabolic features or the nonlinear combination of metabolic features of the NPSLE patients, we rethink the potential of a support vector machine (SVM), which has been regarded as a succinct model to separate complicated data in limited samples and able to optimize convexly [27]. We also draw the idea of the broad learning system [28] to construct a shallow but effective learning system to extract discriminable features by shallow structures in a layer-wise mechanism. The proposed support vector machine broad learning system (BL-SVM) was applied to a retrospective analysis of the metabolic features screened by 1H-MRS quantitatively. Although the samples in the training set were limited, the SVMs embedded in the BL-SVM can still learn from the metabolic features layer-wise optimally. The diversity of each SVM is increased by resampling the training using the bootstrap method to enhance the robustness of the learning system. The results have confirmed that the metabolic features screened by multivoxel magnetic resonance spectroscopy can be used to quantitatively distinguish NPSLE patients from the healthy controls. Our findings may brighten an automatic and noninvasive computer-aided diagnostic instrument for NPSLE at an early stage.

2. Materials and Methods

2.1. Patients and Controls

This retrospective study has been approved by the Research Ethics Committee of the 2nd Affiliated Hospital, Shantou University Medical College. Informed consent was obtained from all subjects previously. The identifiers of the subjects were removed before analysis. The 1H-MRS data from 23 NPSLE patients and 16 age-matched healthy controls (HCs) were obtained at the Department of Rheumatology and Immunology of Shantou Central Hospital and the Department of Endocrinology and the Medical Examination Center of the 2nd Affiliated Hospital, Medical College of Shantou University, during April 2014 to March 2015. The inclusion criteria were as follows: (1) The group of NPSLE patients was diagnosed according to the revised 1997 American College of Rheumatology (ACR) criteria and the 1999 ACR definitions for NPSLE. All clinical manifestations were obtained at the baseline visit by a careful medical record review. In this group, patients had at least one neuropsychiatric complaints. (2) HCs did not have any neurologic, psychiatric, or systemic diseases, which would influence the results of multivoxel 1H-MRS, and none of them uses any psychoactive medication. (3) All the subjects underwent both conventional MRI examination and multivoxel 1H-MRS examination in our hospitals. (4) The clinical characteristics of all patients were available.

2.2. Magnetic Resonance Imaging

All subjects underwent MR imaging using a 3.0 T system (SIGNA, General Electric Medical Systems) with an eight-channel standard head coil. The repetition time (TR) of T2-weighted imaging was 4420 ms. The echo time (TE) was 112.1 ms. The slice thickness was 5 mm with a 1 mm gap. The matrix size was . The field of view (FOV) was mm. The parameters of T2-weighted imaging are listed in Table 1.

The multivoxel 1H-MRS was based on a point-resolved spectroscopy sequence (PRESS) with a two-dimensional multivoxel technique. The TR of the multivoxel 1H-MRS was 1500 ms. The TE was 35 ms. The number of excitations (NEX) was 1. The , and the . The VOIs of 1H-MRS were placed on the T2-weighted images including the entire basal ganglion level. The parameters of multivoxel 1H-MRS are listed in Table 2.

2.3. Imaging Processing

The acquired spectroscopy data were firstly preprocessed by a SAGE software package (GE Healthcare) to correct the phase and frequency. Then, commercially available automatic LCModel software (LCModel Inc., Canada, version 6.2-2B) was used to fit the spectra, correct the baseline, relaxation, and partial-volume effects, and quantify the concentration of metabolites. Furthermore, we used the absolute NAA concentration in single-voxel MRS as the standard to gain the absolute concentration of metabolites. After that, the NAA concentration of the corresponding voxel of multivoxel MRS was collected consistently. The spectra would be accepted if the signal-to-noise ratio (SNR) is greater than or equal to 10 and the metabolite concentration with standard deviations (SD) is less than or equal to 20%. Notice that every individual metabolite has its basis spectra, even if the metabolites are hardly separated, such as NAA and N-acetylaspartylglutamate (NAAG). However, the linear combination of similar spectra of metabolic concentrations is more accurate than the individual concentrations. In this regard, we list the linear combination, together with their %SD values, in the concentration table. Concentration ratios are not easily affected by water scaling and less sensitive to relaxation and partial-volume effects. Thus, we extracted the absolute metabolic concentrations, the corresponding ratio, and the linear combination of the spectra from different brain regions, which were RPCG, LRCG, RDT, RDT, LDT, RLN LLN, RPWM, and LPWM.

2.4. Quantitative Analysis via a SVM-Based Broad Learning System

The computer-aided analysis not only is user-friendly, rapid, and low-cost for learning and operation but also avoids clinical subjective judgment [25]. Building deeper neural networks has attracted increasing interests from academies and industries [23]. However, the neural network-based deep models still suffer from nonconvex optimization, unfriendly paralleling, and uninterpretable issues [27]. In particular, when the training samples are limited, the neural network-based deep models tend to overfit the training set; e.g., the model remembers what all training samples are exactly alike but fails to distinguish the samples which they have never seen. In this regard, the conventional deep neural networks are not compatible with this task. Thus, we formulate this retrospective analysis as a classification problem, so we develop a support vector machine broad learning system (BL-SVM) for quantitative analysis to distinguish the NPSLE patients from control ones. Unlike backpropagate deep stacked architecture, the BL-SVM organizes support vector machines (SVMs) in a shallow but broad scheme. The SVMs in each layer optimally extract the data representation layer-wise even if the training samples are limited, which ensures the antisaturation property. Different from a BP-like tuning scheme developed by Wang et al. [27], our BL-SVM enables fast learning for each SVM in each layer simultaneously and without time consumption for backpropagating iteratively.

The features involved in this model were selected by recursive feature elimination (see [29] for more details). Then, we construct a training set , where , , , and is the total number of training samples. We denote the th SVM in the th layer as , which can be trained by optimizing

For each SVM, the further away from a sample from its corresponding hyperplane, the more confident the SVM makes the classification decision. Different from Wang et al. [27], we design a new confidence function, e.g., equation (2), for the th SVM in the th layer, since is continuously differentiable everywhere.

The input of the SVMs in th layer is the initial input concatenating the confidence values of all SVMs in previous layers.

The th SVM in the th layer takes as input to calculate the confidence value . Then, we train the SVMs layer-wise. In each layer, we resample the training set for each SVM to increase the diversity of the individual SVM using the bootstrap method proposed by Zhou et al. [30].

In this study, the number of layers is set to 5. We use 3-fold cross-validation, which is a common resampling procedure in machine learning, to evaluate the performance of our metabolism-based diagnosis model, since it generally results in a less biased or less optimistic estimate of the model skill than other methods, such as a simple train/test split, especially on a dataset with limited samples. Twenty-three NPSLE patients and sixteen healthy controls were randomly divided into 3-fold. The cross-validation was conducted 50 times. For each run, 2 out of 3 subjects were selected for training, and the rest was used for testing, using different random seeds. We calculated the accuracy, sensitivity, and specificity to evaluate the performance of our model.

3. Results

3.1. Demographics

Twenty-three NPSLE patients and sixteen healthy controls met the inclusion criteria and satisfied the spectra quality detailed in our previous study [9]. Table 3 summarizes the number of NPSLE patients presenting with neuropsychiatric manifestations, including myelitis, seizure disorder, severe headache, stroke, peripheral polyneuropathy, acute confusional state, and anxiety in this study.

There was no significant difference in age between NPSLE patients and the HC set (). Obviously, SLE was closely related to gender (), and 79% of the patients were females in our study. Although there was a significant difference between NPSLE and HCs, the performance of predicting NPSLE would not be influenced. As we used 3-fold cross-validation to evaluate the accuracy of our BL-SVM system, every subject had the chance to be one of the training set. Results for their demographic characters are summarized in Table 4.

3.2. Metabolic Features from Multivoxel 1H-MRS

We collected metabolic data from the bilateral posterior cingulate gyrus (PCG), dorsal thalamus (DT), lentiform nucleus (LN), and posterior paratrigonal white matter (PWM), as well as from the right insula (RI) in all subjects. The metabolic features include creatine (Cr), phosphocreatine (PCr), Cr+PCr, NAA, N-acetylaspartylglutamate (NAAG), NAA+NAAG, NAA+NAAG/Cr+PCr ratio, myo-inositol (mI), mI/Cr+PCr, glycerophosphocholine (GPC/Cho)+phosphocholine (Pch), Cho+Pch/Cr+PCr, glutamate (Glu)+glutamine (Gln), and Glu+Gln/Cr+PCr. All brain regions and metabolic features were combined into 117 metabolic features as shown in Table 5. Thirty-three features were found with significant difference () between NPSLE patients and HCs: PCr and Cho+PCh in the right PCG; NAA+NAAG, NAA+NAAG/Cr+PCr, and mI in the left PCG; PCr, Cr+PCr, NAA, NAA+NAAG/Cr+PCr, mI/Cr+PCr, and Cho+PCh in the right DT; NAAG, NAA+NAAG/Cr+PCr, mI, mI/Cr+PCr, Cho+PCh, and Glu+Gln in the left DT; Cr, PCr, mI, Cho+PCh, and Cho+PCh/Cr+PCr in the right LN; PCr, mI/Cr+PCr, Cho+PCh, and Cho+PCh/Cr+PCr in the left LN; Cr and NAA in RI; PCr and Cho+PCh/Cr+PCr in the right PWM; and Cr, NAAG, and mI in the left PWM. The corresponding AUC values using these features for quantitative analysis are listed in Table 5. The AUC values generated by mI/Cr+PCr in LDT and the mI/Cr+PCr in RDT are 0.77 and 0.76, respectively, which achieve the best performance for diagnosing NPSLE among the evaluated features. Obviously, as shown in Figure 1, it is hard to distinguish the NPSLE patients and the HCs, whether by structure images or MRS alone.

3.3. Metabolic Features for the BL-SVM System

We employed a feature selection method, e.g., recursive feature elimination [29], to analyse which metabolite or combination of metabolites was closely related to NPSLE and filter out weak features to avoid overfitting. We first built the model on the entire set of metabolite features and computed an importance score for each feature. Then, the least important feature was removed from the current feature set. We retrained the model and computed the important score again. We repeat this step on the feature set until the specified number of features were selected. In the end, we found 26 features that were of the highest importance, as shown in Figure 2. The 26 features were as follows: NAAG, mI/Cr+PCr, and Glu+Gln/Cr+PCr in the right PCG; Cr+PCr, NAA+NAAG, NAA+NAAG/Cr+PCr, mI/Cr+PCr, and Glu+Gln in the left PCG; NAA, NAAG, and Cho+PCh in the left DT; PCr, Cr+PCr, Cho+PCh, Cho+PCh/Cr+PCr, and Glu+Gln/Cr+PCr in the right LN; mI/Cr+PCr, Cho+PCh, and Cho+PCh/Cr+PCr in the left LN; NAA+NAAG/Cr+PCr and Cho+PCh in RI; Cho+PCh/Cr+PCr and Glu+Gln/Cr+PCr in the right PWM; and PCr, NAAG, and NAA+NAAG/Cr+PCr in the left PWM.

However, these features had a complex nonlinear relationship, which made our diagnosis quite challenging. This motivated us to leverage the kernel tricks of the SVM classifier to map the features into a higher dimensional space to make the samples linearly separable. With the selected features, we evaluated the performance of our BL-SVM system on a ROC curve as shown in Figure 3. The AUC, sensitivity, and specificity were 95%, 95.8%, and 93%, respectively.

To estimate the generalization capacity of the proposed model, we perform the cross-validation for 50 times. In each run, we feed a different random seed for the resampling procedure. Two-thirds of samples are for training, and the rest is for testing. The scores of the AUC, sensitivity, and specificity are plotted in a box diagram in Figure 4. The box plot demonstrates that the model is capable of unseeing samples in each run.

4. Discussions

In this study, we retrospectively analysed the diagnosis of the NPSLE using multivoxel 1H-MRS. Each metabolic feature hardly identified the NPSLE patients from the HCs precisely and was even insignificant to NPSLE. We introduced a support vector machine-based deep-stacked network to quantitatively analyse the metabolic features. The result has shown that this model has good robustness, even there were several missing values of metabolic features caused by the noise of spectra. Furthermore, this model can accurately distinguish NPSLE patients from the HCs, although individual feature does not even manifest statistics significantly, which is better than any single metabolic feature. The results indicated that this model can be a helpful noninvasive computer-aided diagnostic tool for quantitative analysis of NPSLE.

The clinical complications of NPSLE severely affect the patients in their quality of work and life, which also consume a large amount of money. Therefore, the examination methods of early diagnosis and prediction of NPSLE have aroused wide attention; increasingly, laboratory biomarkers and neuroimaging tools have been proposed [3134].

The production of autoantibodies was used to be a diagnostic biomarker of SLE, and 116 autoantibodies had been found in a literature review using the keywords autoantibodies and systemic lupus erythematosus [35]. Various autoantibodies are reported that can be used as diagnostic biomarkers, since one of these autoantibodies had a significant difference between NPSLE and SLE patients and healthy controls. Most studies were explorative studies, which were short of repeatability. Segovia-Miranda et al. [36] have confirmed that the antiribosomal P (RP) antibody is related to cognitive dysfunction and other diffuse neuropsychiatric manifestations of NPSLE by altering glutamatergic synaptic transmission in the hippocampus. However, there was a recent study conflicted with their results, which suggested that the anti-P ribosomal antibodies have limited diagnostic value for NPSLE [37].

In this case, advanced neuroimaging technologies were needed urgently. In vivo multivoxel MRS allows simultaneously measuring the level of metabolites in several brain regions within a single slice [8] However, a standard for the choice of metabolites and brain regions is not available until now. Single metabolite or the ratio between two metabolites have been used to diagnose NPSLE in most studies [22, 3840]. There will be numerous exploratory biomarkers for diagnosing NPSLE by applying the previous laboratory and neuroimaging methods. However, accuracy needs to be further improved. Thus, more advanced machine learning methodologies are urgently required.

Broad learning systems are an alternative way to address the time-consuming training process and nonconvex issues, especially when the structure is insufficient to model the system [28]. Support vector machines are a succinct model with convexity optimization property to learn sample-limited data with complicated features [27]. We rethink the potential of SVM in diagnosing NPSLE, and we reconstruct the broad learning system to increase the diversity of features that the SVMs learned. To demonstrate the advantage of SVM-based broad learning systems for diagnosing NPSLE, we compared the BL-SVM with traditional statistical methods that were frequently used for diagnosing NPSLE. Guillen-Del Castillo et al. [41] suggested that the increased mI in normal parietal white matter and parietal white matter demonstrates a strong relationship to the deteriorated prognosis in NPSLE. In our study, the best accuracy of mI in parietal white matter was only 51.6%. For other single metabolites described in previous studies, such as NAA [42], Cho, and Cr [43], used in our model, the best accuracy was 75% in the right DT, 72% in the left PWM, and 77% in the right DT, respectively. It is also not known whether metabolite ratios could improve diagnostic accuracy. Cagonoli et al. proposed that NAA/Cr ratios and Glu/Cr ratios in RI might be biomarkers for NPSLE patients [44]. However, in our study, the accuracy of NAA/Cr ratios and Glu/Cr ratios in RI was both only 50%. Overall, thirty-three features were selected using conventional statistical methods, and the best accuracy among them was only 77%, whereas the BL-SVM system with the metabolic features from multivoxel 1H-MRS achieved 95% AUC, 95.8% sensitivity, and 93% specificity, respectively. To confirm that no overfitting occurred in our experiment, we performed 3-fold cross-validation to demonstrate the generalization ability of our BL-SVM system. It is worth noting that there were 26 features of eight brain regions of NPSLE patients that showed the optimal performance to diagnose NPSLE. So we should realize that single 1H-MRS may not suitably be used to diagnose NPSLE, because it was restricted to one small region, which missed important pieces of information. What is more, our study confirmed that not only should the absolute concentration of metabolites be considered but also the combination between them, such as NAA+NAAG, Glu+Gln/Cr+Pcr, and mI/Cr+Pcr. The BL-SVM system with the metabolic features from multivoxel 1H-MRS as a novel tool should be popularized in the diagnosis of NPSLE, though some studies have combined machine learning and 1H-MRS to increase the sensitivity and specificity for distinguishing diseases [14, 45, 46]. However, to the best of our knowledge, the current study is the first to use machine learning-based metabolic features to improve the accuracy of 1H-MRS to diagnose NPSLE. Our BL-SVM has achieved a specific performance for the following reasons. First, the kernels map the features into a higher dimensional space to empower the SVMs to split the samples linearly [47], which enables the BL-SVM system to distinguish NPSLE patients from HCs. Secondly, the BL-SVM system can learn to deal with the unavoidable absent metabolic feature value caused by patients moving, partial-volume effect, and overlapping among metabolites, which demonstrated the good robustness of our model. The BL-SVM learning system is general and can be extended to other applications, such as intelligent transportation systems [4855], intelligent computing [56, 57], and emotion computing [5860].

4.1. Limitations

However, there were some limitations in our study. First, the samples are limited. Although the number of samples is important for evaluating the generalization ability of a model, cross-validation is one of the alternative techniques by resampling to evaluate the generalization capacity of a machine learning model, when we have limited samples. We apply 3-fold cross-validation to evaluate the performance of the presented model 50 times. The results indicate the proposed model capable of unseeing samples. Besides, we have been collecting new samples to evaluate our model and develop new models. Secondly, other advanced medical imaging technologies should be considered to combine with 1H-MRS in this system, such as voxel-based morphometry, diffusional kurtosis imaging, and chemical exchange saturation transfer [6165].

5. Conclusion

In this retrospective study, we confirm that the metabolic features obtained by multivoxel proton magnetic resonance spectroscopy can be used to diagnose neuropsychiatric systemic lupus erythematosus by a well-trained support vector machine broad learning system. The support vector machine broad learning system achieves satisfactory AUC, sensitivity, and specificity as 95%, 95.8%, and 93%, respectively. We have also found that the support vector machine broad learning system can even leverage the metabolic features that were not regarded as statistically significant to distinguish the NPSLE patients from HC ones. Furthermore, our support vector machine broad learning system overcame the situation of limited samples with missing metabolic feature values.

In conclusion, the multivoxel proton magnetic resonance spectroscopy enhanced by our support vector machine broad learning system may brighten the computer-aided noninvasive diagnostic instrument for neuropsychiatric systemic lupus erythematosus in vivo.

Data Availability

The 1H-MRS data used to support the findings of this study have not been made available for the private protection of patient information.

Conflicts of Interest

The authors declare that they have no conflict of interest.

Authors’ Contributions

Yan Li and Zuhao Ge contribute equally to this work.

Acknowledgments

This work was supported by the NSFC (Nos. 81471730, 82020108016, 61902232 and 31870981), the Natural Science Foundation of Guangdong Province (No. 2018A030313291), the Education Science Planning Project of Guangdong Province (2018GXJK048), the Grant for Key Disciplinary Project of Clinical Medicine under the Guangdong High-Level University Development Program (002-18120302), the 2020 Li Ka Shing Foundation Cross-Disciplinary Research Grant (No. 2020LKSFG05D), the STU Scientific Research Foundation for Talents (NTF18006), and the Guangdong Special Cultivation Funds for College Students’ Scientific and Technological Innovation (No. pdjh2020b0222).