Abstract

The spread of the COVID-19 epidemic, since December 2019, has caused much damage around the world, disturbed every aspect of daily life, and has become a serious health threat. The COVID-19 epidemic impacted nearly 150 countries around the globe between December 2019 and March 2020. Since December 2019, researchers have been trying to develop new suitable statistical models to adequately describe the behavior of this deadly pandemic. In this paper, a flexible statistical model has been proposed that can be used to model the lifetime events associated with this deadly pandemic. The new distribution is derived from the combination of an extended Weibull distribution and a trigonometric strategy referred to as the arcsine-X approach. Hence, the new model may be referred to as the arcsine new flexible extended Weibull model. The proposed model is capable of capturing five different behaviors of the hazard rate function. The model parameters are estimated via the maximum likelihood approach. Furthermore, a Monte Carlo study is conducted to assess the behavior of the estimators. Finally, the applicability of the new model is demonstrated using the data of fifty-three patients taken from a hospital in China.

1. Introduction

COVID-19 (coronavirus disease 2019) was first spotted in China in December 2019 and then spread worldwide exponentially. The main reasons for the exponential growth of COVID-19 include unawareness and ignorance of this deadly virus, inadequate tools, low efficiency in detection, and delays in formulating appropriate and effective policies during the initial stage of the epidemic [1]. As of April 04, 2021, 08 : 47 GMT, 131,437,582 cases have been confirmed, 2,860,569 deaths have occurred, and 105,852,256 patients have been luckily recovered.

The top ten countries with the higher number of total confirmed COVID-19 cases include the USA with 32,669,121 cases, India with 16,263,695 cases, Brazil with 14,172,1399 cases, France with 5,408,606 cases, Russia with 4,744,961 cases, Turkey with 4,501,382 cases, the UK with 4,398,431 cases, Italy with 3,920,945 cases, Spain with 3,456,886 cases, and Germany with 3,238,054 cases.

The comparison of the COVID-19 epidemic between different countries is worth studying and is of great concern. In this regard, researchers are devoting serious efforts to make comparisons between different countries. The problems related to the COVID-19 epidemic in Italy have been discussed in [2]. Comparison of COVID-19 in some European countries, South Korea, and the USA has been done in [3]. The COVID-19 pandemic in Australia has been discussed in [4]. The phenomena of the spread of COVID-19 in Lebanon are provided in [5]. A case study from Spain has been discussed in [6]. Mathematical analysis of COVID-19 in Mexico is provided in [7]. A case study from Brazil is discussed in [8]. The issues related to the COVID-19 epidemic in Pakistan are provided in [9]. A mathematical model developed for assessing the transmission of the COVID-19 epidemic in India is introduced in [10]. Among the Asian countries, the phenomena of the spread of the COVID-19 epidemic are discussed in [11]. The comparison between Iran and mainland China has appeared in [12]. The comparison between two neighbor countries Iran and Pakistan has appeared in [13]. The problems related to the COVID-19 pandemic in Indonesia have been discussed in [14]. From the literature cited above, we see that there is a great interest to learn and know more about the COVID-19 epidemic. In the field of big data science and other related sectors, providing the best description of the real phenomena is a prominent research topic (refer [1520] for details). Recent studies have pointed out the potentiality of the statistical models in different sectors of applied sciences. The goal of this article is to carry out this research area of distribution theory and propose a new statistical model to provide a better fit to the survival times data.

Recently, Liao et al. [21] introduced a modification of the Weibull distribution called a NFEW (new flexible extended Weibull) and suggested its application in modeling lifetime events. Let X have the NFEW distribution with two shape parameters and two scale parameters ; then, its DF (distribution function) is as follows:where . The corresponding PDF (probability density function) is as follows:

In this work, we propose a new modification of the NFEW model called ASNFEW (arcsine new flexible extended Weibull) distribution using the arcsine-X strategy [22], which can be obtained as a subcase of [23]. The DF and PDF of the arcsine-X distributions are given byrespectively.

The DF of the proposed ASNFEW distribution is obtained by inserting equation (1) in (3). In the next section, we introduce the proposed model and sketch some possible behaviors of the PDF and HRF (hazard rate function) of the ASNFEW distribution.

2. The Arcsine New Flexible Extended Weibull Model

A random variable X has the ASNFEW model if its DF is given bywith SF (survival function) denoted by , and it can be expressed as follows:

For , and , the plots for the DF and SF of the ASNFEW model are sketched in Figure 1.

The PDF corresponding to equation (5) is given by

Some possible behaviors of the PDF of the ASNFEW model are presented in Figure 2. The plots in Figure 2 are sketched for (red line), (green line), (black line), and (blue line). From the plots sketched in Figure 2, we can see that the ASNFEW distribution is capable of capturing four different patterns of the PDF including bimodal (red line), modified unimodal (green line), reverse J-shaped (black line), and unimodal (blue line).

The HRF of the ASNFEW distribution is given by

Some possible behaviors of the HRF of the ASNFEW distribution are shown in Figure 3. The plots in Figure 3 are sketched for (red line), (green line), (blue line), (magenta line), and (black line). From the plots sketched in Figure 3, we can see that the ASNFEW distribution is capable of capturing five different patterns of the HRF including increasing (red line), decreasing (green line), unimodal (blue line), modified unimodal (magenta line), and bathtub (black line).

3. Basic Mathematical Properties

In this section, we will establish some statistical properties of the ASNFEW distribution.

3.1. Quantile Function

Let X denote the ASNFEW random variable with DF given by equation (5); then, the QF (quantile function) of X, denoted by Q(u), is given by

The QF (also called inverse DF) can be used to generate random numbers. Later, in Section 5, we will use the inverse DF method to carry out the simulation study.

3.2. Moments

This section deals with the derivation of the moment of the ASNFEW distribution that can be further used to obtain important characteristics. The moment of the ASNFEW distribution can be obtained as follows:

Using binomial series that is convergent when (see https://socratic.org/questions/how-do-you-use-the-binomial-series-to-expand-f-x-1-sqrt-1-x-2), we have

Using equation (11), we have

Using expression (12) in (10), we have

Using the series , we have

Let ; then, using expression (14), we get

Using expression (15) in (13), we get

3.3. Moment-Generating Function

This section offers the MGF (moment generating function) of the ASNFEW model. Let has the ASNFEW model; then, the MGF of X is derived as

Using equations (16) in (17), we get the MGF of the proposed model.

3.4. Skewness and Kurtosis

Using in equation (16), we get the first four moments of the ASNFEW distributions. These moments can be used to derive the following important characteristics of the ASNFEW distribution:(i)The CV (coefficient of variation):(ii)The CS (coefficient of skewness):(iii)The CK (coefficient of kurtosis):For and different values of and , plots of the important characteristics of the ASNFEW distribution are sketched in Figure 4.

4. Maximum Likelihood Estimation

This section deals with the computation of the MLEs (maximum likelihood estimators) of the ASNFEW model parameters. Let be the observed values of a sample randomly selected from the ASNFEW with parameters . The log-likelihood function of the respective sample denoted by iswhere . By computing the partial derivatives of on behalf of the model parameters , we have

The expressions , and do not have closed forms. Henceforth, the numerical estimates of the parameters can be obtained via an iterating process using computer software (see Supplementary Materials (available (here))). In order to show the uniqueness of the MLEs of the ASNFEW distribution, the profiles of the log-likelihood functions of the parameters are sketched in Figure 5.

5. Simulation Study

This section deals with the Monte Carlo simulation study to evaluate the MLEs of the ASNFEW distribution parameters. The ASNFEW distribution is easily simulated by inverting equation (5) as follows:

The simulation results are obtained for two different sets of parameters: (i) and (ii) .

The random number generation is done using the inverse DF approach. Note that the inverse process and simulation results are obtained using a kind of well-known statistical software with a library through the command . Moreover, we consider the sample size as , and the Monte Carlo replications were made 1000 times. For the maximization of the log-likelihood function, the algorithm “” is used with . For , the MLEs of are obtained for each set of simulated data. The bias and mean square error (MSE) were then computed for accessing the model adequacy. These quantities are calculated as follows:where .

Corresponding to the first set of parameter values , the simulation results are provided in Table 1, whereas for the second set of parameter values , the simulation results are provided in Table 2.

6. An Application to the Survival Times of COVID-19 Patients in China

A clear motivation behind proposing new statistical distributions is to increase the level of flexibility and applicability of the existing distributions. The main objective of proposing the ASNFEW distribution is its utilization in describing different types of time-to-event data. In this section, the respective fact is demonstrated using the data of fifty-three patients taken from a hospital in China. The data set is given as follows: 0.054, 0.064, 0.704, 0.816, 0.235, 0.976, 0.865, 0.364, 0.479, 0.568, 0.352, 0.978, 0.787, 0.976, 0.087, 0.548, 0.796, 0.458, 0.087, 0.437, 0.421, 1.978, 1.756, 2.089, 2.643, 2.869, 3.867, 3.890, 3.543, 3.079, 3.646, 3.348, 4.093, 4.092, 4.190, 4.237, 5.028, 5.083, 6.174, 6.743, 7.274, 7.058, 8.273, 9.324, 10.827, 11.282, 13.324, 14.278, 15.287, 16.978, 17.209, 19.092, and 20.083. The basic summary measures of the data are given in Table 3.

The histogram and box plot of COVID-19 data along with the total time test (TTT) plot are sketched in Figure 6.

We illustrate the best fitting power of the ASNFEW model and compare its fitting results with the other (i) three-parameter distributions such as Marshall–Olkin–Weibull (MOW) [24], (ii) four-parameter distributions such as Kumaraswamy–Weibull (Ku-W) [25], odd log-logistic modified Weibull (OLL-MW) [26], transmuted modified Weibull (T-MW) [27], and Frechet–Weibull (FrW) [28], and (iii) five-parameter model called beta modified Weibull (B-MW) [29]. The PDFs of the competitive models are as follows:(i)MOW distribution:(ii)Ku-W distribution:(iii)OLL-MW distribution:(iv)T-MW distribution:(v)FrW distribution:(vi)B-MW distribution:

The decision about the best fitting of the competing distributions is made by considering certain criteria selected for comparison. These criteria consist of some discrimination and goodness-of-fit measures. The expressions of the discrimination measures (DM) are given as follows:(i)The AIC (Akaike information criterion):(ii)The BIC (Bayesian information criterion):(iii)The HQIC (Hannan–Quinn information criterion):(iv)The CAIC (corrected Akaike information):

The expressions of the goodness-of-fit measures are given as follows:(i)The AD (Anderson–Darling) test statistic:(ii)The CM (Cramer–von Mises) test statistic:(iii)The KS (Kolmogorov–Smirnov) test statistic:

The term “best fitting” is used in the sense of a model having smaller values of the considered criterion. The MLEs and DM of the fitted distributions are provided in Tables 4 and 5, respectively, whereas the goodness-of-fit measures are provided in Table 6.

From Tables 5 and 6, we can see that corresponding to the COVID-19 data set, the values of the DM for the ASNFEW distribution are and the values of the goodness-of-fit quantities with the corresponding value are with value , whereas the values of these measures for the second best model (MOW distribution) are , , with value . From the above reported results, we can see that the ASNFEW distribution has smaller values of the analytical measures and a high value indicating a closer fit to data. Furthermore, for the best description of the results provided in Tables 5 and 6, a graphical display of these results is provided in Figures 7 and 8.

Based on the numerical results in Tables 5 and 6 as well as the graphical display of these results in Figures 7 and 8, we can conclude that the ASNFEW is a useful and suitable candidate distribution for modeling COVID-19 and other related data sets. For further confirmation of the best fitting capability of the ASNFEW distribution, the plots of the estimated PDF, DF, SF, and PP (probability-probability) are sketched in Figure 9. The plots sketched in Figure 9 show a closer fit of the ASNFEW distribution to the data under consideration.

7. Concluding Remarks

The statistical distributions have proven to be of great importance and attracted the attention of researchers to use them for modeling data, particularly, in the fields related to lifetime events. In this work, a generalization of the new flexible extended Weibull model is proposed. An example of real-life data related to the survival times of COVID-19 patients in China is used to demonstrate the applicability of the ASNFEW distribution. Comparisons of the proposed model with the other competitors, including four-parameter and five-parameter models, are provided. Certain analytical tools, including four discrimination measures and three goodness-of-fit measures, are considered to compare the fitted distributions. The numerical results of these analytical measures showed that the ASNFEW model provides a better fit, supported by the graphical sketching and numerical tools. In summary, the proposed model has a wide range of applications because of its flexibility in modeling different types of hazard functions. A simulation study also reveals that the proposed model can be valuable in adequately describing different types of time-to-event data. We hope that beyond the scope of this paper, the ASNFEW distribution can be applied to analyze other forms of the data related to COVID-19 events.

Data Availability

The data set used to support this study is included within the article. However, additional data (if required) will be provided upon request to the corresponding author.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Supplementary Materials

The R-code used for analysis in this paper is provided as a supplementary file. (Supplementary Materials)