In recent years, queueing models with service interruption have been studied extensively due to their widespread applications. This paper is devoted to the study of an M/M/1 queue with a particular service interruption discipline under two types of the maintenance activities, namely the reactive maintenance and the planned maintenance. In the process of serving customers, when the system receives a negative feedback from customers, and in the next N customers to be served, if a negative feedback from the next N customers is received again, the service is interrupted and a reactive maintenance will be triggered immediately; otherwise, if all of the next certain amount of feedbacks are positive, the system administrator has a reason to believe that the occurrence of the negative feedback is an accidental phenomenon, and the reactive maintenance will not be initiated. By using the matrix analytic approach and spectral expansion method, we derive the steady-state probabilities, which are then used to compute the performance measures of the queueing system. Sensitivity analyses of parameters are included in this work for illustrative purposes, and from the perspective of queueing model, we also show that designing the queue size in terms of the steady-state probabilities is a more reasonable and rational method than the expected queue length.

1. Introduction

Queueing models with service interruption have been studied extensively due to their widespread applications including production systems, transportation systems, complex modern communication systems and service systems. So far, according to the emerged reasons, the literature on the service interruption can be roughly divided into two categories, depending on whether the interruption is caused by external factors (e.g., due to unexpected failures) or if it is caused by internal factors (e.g., due to the gradual deterioration of the servers (facilities)).

The category of service interruption, due to external factors, occurs in queueing systems that are subject to unexpected failure or breakdown. In previous studies, a large number of queueing researchers have investigated the queueing systems with repairable servers, where the maintenance action is carried out once the systems are breakdown. For example, Wartenhorst [1] considered an parallel queueing system with server breakdown and repair. Gray et al. [2] studied a queueing model with multiple types of server breakdowns. Lam et al. [3] discussed a geometric process model for an M/M/1 queueing system with a repairable service station. By using the supplementary variable technique, the authors obtained some queueing characteristics as well as reliability measures of the service station. Yechiali [4] studied a queueing system with system disasters and impatient customers when the system is down. For more complex queueing systems with breakdown, Li et al. [5] studied a BMAP/G/1 retrial queue with a server subject to breakdowns and maintenances and derived the stationary availability, failure frequency, and queue length. Liu et al. [6] studied an M/G/1 retrial G-queue with preemptive resume and feedback under N-policy, where the server subjects to breakdowns and repairs. Cordeiro and Kharoufeh [7] examined an M/M/1 retrial queue with an unreliable server whose arrival, service, failure, repair, and retrial rates are all modulated by an exogenous random environment. Jain et al. [8] studied a fault-tolerant system with general distributed repair time, server vacation, and server breakdown. Ammar et al. [9] provided the transient analysis of impatient customers in an M/M/1 disasters queue in random environment. Recently, while the server in a queueing system suffers from unexpected failures and becomes defective, the system can be equipped with a substitute server which continues to provide service for the arriving customers, instead of stopping service completely. This phenomenon is also referred to working breakdown. Kalidass and Kasturi [10] first introduced the concept and applied the strategy to the facility replacement problems. Later, a variety of research efforts have contributed to the topic and its applications in manufacturing systems. For example, Kim and Lee [11] investigated an M/G/1 queue with disasters and working breakdown, where the system can provide service by an auxiliary machine while the main machine is in maintenance period. Liu and Song [12], Jiang and Liu [13], respectively, considered a batch arrival /M/1 queueing system and a GI/M/1 queueing system with working breakdown. Meanwhile, in [13], the authors gave the applications of working breakdown in an unreliable multiproduct manufacturing network and the wireless sensor networks. Recently, Yang and Wu [14] investigated a finite-capacity Markovian queueing system with working breakdowns, reneging, and retention of impatient customers. Ye and Liu [15] studied an MAP/M/1 queue with working breakdowns. Jiang and Xin [16] investigated a queueing system with working breakdowns and delaying repair under a Bernoulli-schedule-controlled policy by the matrix analytic method and the spectral expansion method. Ammar and Rajadurai [17] investigated a preemptive priority retrial queueing system with disaster under working breakdown services.

The category of service interruption, due to internal factors, is triggered by the gradual deterioration of the servers. In this category, the so-called condition-based maintenance, which utilizes the finished item quality as the intermediate to monitor the server condition, is often used in the production systems. Under the condition-based maintenance, maintenance action is implemented by setting a control chart to monitor the operating condition of the servers. Once a defective item is detected, the control chart will generate an alert signal; then, the service is interrupted and the server will experience a maintenance period immediately. In recent decades, this type of service interruption with condition-based maintenance has been extensively investigated in the literature. For example, Zhou and Zhu [18] gave an economic design of the integrated model of the control chart and maintenance management. Liu et al. [19] applied the condition-based maintenance to give the economic and economic-statistical designs of a control chart for two-unit series systems. Zhou et al. [20] considered two types of maintenance activities and studied a trade-off between energy consumption and customer delay based on the queueing approach. For more related papers on the maintenance method, the interested readers are referred to Cha et al. [21], Yang et al. [22], and Alaswad and Xiang [23], and the references therein.

The underlying queueing model is motivated by a problem that involves the maintenance planning in real service systems. For example, in some consulting service centers, customers may be asked for taking a few moments of their time to fill out a Customer Service Feedback Form about their service. If a customer is satisfied with the service, he/she will respond positively; otherwise, he/she will make a negative response. If the consulting service center receives a negative response on the service, in order to ensure whether the negative response is an accidental phenomenon or not, the consulting service center should continue to check the next certain amount of responses from customers. During the next certain amount of responses, if a negative response appears again, the consulting service center will do some adjustments and a reactive maintenance should be scheduled; otherwise, if all of the next certain amount of responses are positive, the consulting service center has a reason to believe that the occurrence of the negative response is an accidental phenomenon, and the adjustments will not be initiated. In addition, the underlying queueing model can be also applied to production systems. For example, in a real production system, equipment may deteriorate gradually due to the accumulation of wear and corrosion. In order to identify the operating condition of the facility, the system usually employs the quality characteristic of items as the intermediate to monitor the facility condition. Then, the system is equipped with a quality control detector and monitored using the Shewhart control charts to check the quality of the finished items. If the quality characteristic of a finished item falls outside the so-called “in-control” region, i.e., a defective item is detected, then, the control chart will generate an alert signal. However, in order to ensure the efficiency of the system, the inspector should estimate the defective item is whether an accidental phenomenon or not, i.e., the inspector should continue to check the next certain amount of finished items. During the next certain amount of finished items, if a defective item is detected again, it is reasonable to believe that the operating condition of the equipment is in a danger state and a reactive maintenance should be scheduled; otherwise, if all of the next certain amount of finished items are acceptable, the inspector has a reason to believe that the occurrence of the defective item is an accidental phenomenon, and the reactive maintenance will not be initiated.

Different from a recent study [24], in which the author considers a clearing queueing model with server maintenance after N negative feedbacks, in this paper, we describe the service process of the queueing system and the maintenance policies as a single-server queue with two types of maintenance activities for the server, i.e., planned maintenance and reactive maintenance. To the best of our knowledge, the underlying queueing model with service interruption has not appeared in the literature. The derived results of the queueing model could be applied to deal with more practical queueing problems and provide important reference value for system administrator to scientifically design the scheduled maintenance policy. For more details of the common methods on solving the queueing models, the interested readers are referred to [25, 26], which give a detailed introduction on the idea and application of the methods. Next, the main contributions of the paper are as follows:(i)Model: We consider a novel and special service interruption discipline with two types of maintenance activities for the underlying queueing system. The model is suitable for reflecting the characteristics of customers’ services for the modern service systems and production systems.(ii)Methodology and results: The matrix analytic approach and spectral expansion method are applied to analyze the special service interruption discipline in queueing problems. Compared to other methods (e.g., probability generating functions), the two methods could effectively give a detailed computational procedure to derive the steady-state probabilities and the performance measures. Furthermore, there are always finite number of solution techniques for solving any given problem mathematically; therefore, we feel that the considered model extends the analytical literature of queueing theory by the intricate successful formulation of the problem.(iii)Numerical illustrations: Sensitivity analysis of parameters is included in this work for illustrative purpose, and some qualitative take-away messages are also provided and intuitively explained. Moreover, by some numerical examples, we show that designing the queue size in terms of the steady-state probabilities is a more reasonable and rational method than the expected queue length.

The underlying article is organized as follows. Section 2 describes the proposed queueing model. In Section 3, we present the stability analysis of the underlying queueing system. Some numerical examples are conducted to show the impacts of system parameters on the performance measures in Section 4. Section 5 concludes the paper.

2. Model Description

In practice, it is more common to model the queueing system as a queue with multiple servers. To simplify the model, in this paper, we will study the queueing model based on an M/M/1 queue with two types of maintenance activities for the server (the reactive maintenance and the planned maintenance), where the reactive maintenance is triggered by a particular service interruption discipline, which will be described in the following section, and the frequency of planned maintenance is determined by the system administrator, which can be represented by a proportion parameter at the beginning of each idle period. We believe the simplified model could also derive similar results. Below, we use the related parameters of queueing system to describe the underlying queueing model. Our assumptions are driven by the actual problem that motivates the analysis and our desire for keeping the model as simple as possible.(i)Customers arrive according to a Poisson process with rate and customers are served based on a first-come, first-served (FCFS) discipline.(ii)Uncertain service times at server are exponentially distributed with parameter .(iii)Once a customer is served, the served customer is required to give a response (feedback) on the quality of customer service. If he/she is satisfied with the service, he/she will respond positively on his/her service; otherwise, he/she will make a negative response for his/her horrible or unpleasant service. To this end, we consider as the probability that a served customer gives a negative feedback and as the probability that a served customer gives a positive feedback on the service. Next, we describe the particular service interruption discipline and show that when the reactive maintenance will be triggered. In the process of serving customers, when the system is in the normal working state and a negative feedback is received from the served customers, an alert signal is immediately generated; meanwhile, the system goes into a warning working state and checks the next customers to be served, so as to ensure the negative feedback is whether an accidental phenomenon or not. In the next customers to be served, if a negative feedback from the customers is received again, the reactive maintenance will be scheduled immediately; otherwise, if the next feedbacks from the customers are positive, the system administrator believes that the occurrence of the negative feedback in the normal working state is an accidental phenomenon; therefore, the reactive maintenance will not be initiated, and the system returns to the normal working state.(iv)Beside the reactive maintenance, planned maintenance is also adopted according to a frequency parameter at the beginning of an idle period to ensure the system in the normal operation. For analytical simplicity, we use a proportion parameter to denote the frequency parameter (i.e., represents that the planned maintenance is carried out every 5 idle periods). We also assume that if there is no customer while the system is in the warning working state, the server will also schedule a planned maintenance.(v)The reactive maintenance time is assumed to follow an exponential distribution with a rate , the planned maintenance time is assumed to follow an exponential distribution with a rate .(vi)In addition, we further assume that the arrival process, the service process, and the maintenance process are mutually independent. Since the checking time is relatively short compared with the expected service time, then the checking time can be negligible.

3. Model Analysis

Based on the model description, we will first give the stability analysis for the underlying queueing system, and then we use the matrix analytic method and spectral expansion method to analyze the queueing model. Actually, the other possible method of analysis is via probability generating functions which, in this case, will be more complex. Moreover, because of the complicated structure of the queueing model, the steady-state probability vector may be difficult to obtain in a closed form. However, the two methods in consideration could effectively give a detailed computational procedure to derive the steady-state probabilities and the performance measures of the underlying queueing model.

3.1. Stability Condition

Let denote the number of customers in the system at time , denotes the state of the server at time , which is defined as

As the server is in the warning state at time , i.e., , then, denotes that when the system goes into the warning working state, the next feedbacks from customers are positive. Obviously, is a continuous-time Markov chain with the state space

Considering the variety of system states, the transition rate diagram of the continuous-time Markov chain is shown in Figure 1.

Next, according to the transition rate diagram and using the lexicographical sequence for the aforementioned states, the infinitesimal generator matrix of the continuous-time Markov chain in the block-partitioned form can be obtained as follows:whereand the other partitioned matrices are square ones with orders,

In order to analyze the queueing model, we first study the stability condition. According to the aforementioned state-transition-rate matrix, the stability condition can be derived in the following lemma.

Lemma 1. The underlying queueing system is stable if and only if

Proof. Based on Theorem 1.7.1 in Neuts [25], the underlying queueing system is stable if and only ifwhere is the invariant probability vector of , which satisfies and , is a column vector with dimensions and all its elements are equal to one. From the expressions of matrices , we find thatis irreducible, then, solving the equations and , we haveTherefore, from , the sufficient and necessary stability condition can be further derived byNext, in order to analyze the queueing model in stability condition, we first denote the steady-state vector of the queueing model as , where

3.2. Matrix Analytic Method

From the special structure of matrix , we find that is a quasi-birth-and-death (QBD) process, whose state-transition-rate matrix is block tridiagonal. Therefore, in this subsection, we first use the matrix analytic method to derive the steady-state probabilities for the underlying queueing system. The most important step in applying this method is to obtain the rate matrix , which is the minimal nonnegative solution of

According to the structure of matrices , we find that it is difficult to obtain the explicit expression of rate matrix . However, based on the theory in [25], the matrix equation can be typically solved numerically using the following iteration procedure. Consider the sequence of matrices , obtained by successive substitutions, starting with and then . If there exist , such that , then we stop the iterative procedure. Because of the convergence of the iterative algorithm, i.e., , then the obtained is approximately equal to . After is obtained, according to the matrix analytic method, we have

Next, the boundary vector can be obtained by solving the equationswhere

Then, the steady-state probabilities can be obtained by the following theorem.

Theorem 1. If the stability condition is satisfied, then, the steady-state probabilities can be obtained as followswhere satisfies the following set of equations:

3.3. Spectral Expansion Method

Next, we turn to use an alternate method called spectral expansion method to derive the steady-state probabilities for the underlying queueing system. From , we haveassociated with it is the characteristic matrix polynomial, which has the following structure:where

From the expression of , we have the determinant of :

Referring to Mitrani and Chakka [26], if the underlying queueing system is stable, then the number of eigenvalues in the unit disk of is equal to the degree of ; that is, eigenvalues should lie inside the unit disk. Then, the steady-state probabilities have the following form:where are the eigenvalues inside the unit disk, are the corresponding left eigenvectors, coefficients can be determined from the balance equations, and the eigenvalue-eigenvector pairs of satisfy

Solving the two matrix equations, the eigenvectors corresponding to can be obtained aswhere

After deriving the eigenvalues and their corresponding left eigenvectors, the coefficients can be directly computed by the balance equations and the normalization condition. From , can be obtained in terms of . Then, combining the expression of and solving the set of equations unknown coefficients can be derived, and hence the steady-state probabilities.

3.4. Performance Measures

From the derived steady-state probabilities, in this subsection, we will obtain some performance measures. Let and denote the expected number of customers in the system and the mean sojourn time of a customer needing to be served in the system. Then, some performance measures could be obtained as follows:(i)The probability that the server is idle: .(ii)The probability that the server is busy in the normal working state: .(iii)The probability that the server is in the warning working state: .(iv)The probability that the server is in the planned maintenance state: .The probability that the server is in the reactive maintenance state: .

The expected number of customers in the system isor

By using Little’s law, the mean sojourn time of a customer needing to be served is

4. Numerical Results

In this section, we would provide a set of numerical examples to give a comparison between the underlying queueing model and its special case (). Moreover, sensitivity analysis of parameters is included in this section for illustrative purpose, and some qualitative take-away messages are provided and intuitively explained.

We first derive the steady-state probabilities for the special case of the underlying queueing model (queueing model with the condition-based maintenance). If we assume that , this means that, once the system receives a negative feedback, the reactive maintenance will be triggered immediately. Let denote the number of customers in the system at time and denote the state of the server at time ; then the state-transition-rate matrix of this model can be obtained as follows:where all partitioned matrices are square ones with orders

In this case, the sufficient and necessary stability condition is

Similar to the analysis in Section 3, if the stability condition is satisfied, we can obtain the steady-state probabilities.

Next, based on the theoretical framework given by the above analysis, we present some figures below to investigate the impact of the parameters on the expected number of customers . We first assume that , , , and and then provide a comparison between the cases and .

From Figure 2, we find that is increasing in , while decreasing in . It is noteworthy that, for the curves in Figure 2, if values and are fixed, the value of in the case is greater than the case . It is reasonable that, as , once the system receives a negative feedback from customers, the reactive maintenance will be triggered immediately; therefore, the case could increase the reactive maintenance frequency. However, high-frequency maintenance will lead to a much longer queue length to the system. On the contrary, as , the system will verify whether the negative feedback is an accidental phenomenon or not. Compared to the case , the case will lower the reactive maintenance frequency, which could satisfy the short queue length requirement. Thus, from the numerical result, to reduce the holding cost for customers and shorten queue length (or the customer sojourn time), the system administrator should verify whether the negative feedback is an accidental phenomenon or not, so that the reactive maintenance frequency could be properly lowered, and the cost could be reduced.

Next, we assume that , , , , and and pay attention to the curves of with the change of from 1 to 20 for the different values of . Some interesting managerial insights can be derived from Figure 3. First, first increases significantly and then tends to be smooth. In the author’s opinion, the phenomenon may be caused by the value of , i.e., the negative feedback percentage (meaning statistically the event may happen every experiments on average). For a fixed , if , the system will be more likely to receive a negative feedback in the next feedbacks, which means that the increase of leads to a high-frequency reactive maintenance and a much longer queue length to the system. However, when , the designed inspections/detections do not have chance to be realized, i.e., the reactive maintenance will always be triggered; then, in this case, tends to be smooth as increases, and the parameter has a weak impact on . According to our model description, we assume that remains unchanged while the system is in the warning working state. Thus, from the numerical result, the maximum value of the next number of feedbacks needs to be checked in the warning working state is a certain value; when exceeds this value, the curve becomes smooth. Moreover, is increasing in . The reason is that high-frequency planned maintenance could also contribute a much longer queue length. Thus, according to the negative feedback percentage, the system administrator needs to choose a suitable number of feedbacks from customers to check while the system is in the warning working state. Moreover, the frequency of planned maintenance should be properly lowered, so that the queue length of the system could be reduced, the small sojourn time requirement could be satisfied, and the holding cost for customers could be saved.

Finally, we assume that , , , and and explore the impact of the parameters and on for the different values of . As expected, Figure 4 reveals that for the fixed values and , increases with the increase of . Thus, from the numerical results, the system administrator should control the negative feedback percentage so as to reduce the queue length of the system and shorten the customer sojourn time. Also, from Figure 4(a), the smaller is, the smoother of the curve becomes. Moreover, from Figure 4(b), we also find that increases with the increase of . The reason is that as increases, the system has a higher probability to proceed a planned maintenance process when the system is empty, i.e., the system has a higher frequency to carry out the planned maintenance, which could contribute a much longer queue length to the system. Thus, to satisfy the short queue length requirement, the system administrator should lower the frequency of the planned maintenance.

In short, these numerical results can provide important reference value for system administrator to scientifically design the parameter of the maintenance policy to save cost and keep the server staying in an efficient operation state.

From the perspective of the queueing model, it is well known that if the queue capacity design is too large, the construction and operation costs of the system will be too high. On the contrary, if the capacity of the queue is too small, the arriving customers will be lost due to insufficient capacity, which will cause an economic loss and bring a negative effect. Therefore, there is a great significance in designing the size of queue capacity. Next, we discuss the capacity setting of the underlying queueing model by numerical examples and analyze the importance of steady-state probabilities in designing the queue capacity. We assume that , , , , , , and .

We first provide a table to show the steady-state probabilities of the queueing model, i.e., the probability that there are customers in the system. In Table 1, the expected number of customers in the system is . From Table 1, we find that is greater than a certain number, , so it is not necessary to design the system capacity to be infinite. According to , we have

That is, if we use the mean value to design the queue size, the loss probability of arriving customers due to the finite capacity reaches 30.94%, even if we expand a unit of mean value, the loss probability is also up to 21.38%. Therefore, at this point, it is not advisable to design the queue size according to the expected number of customers in the system.

Next, we consider the designing the queue size by controlling the loss probability. For example, we assume that the loss probability is less than 0.0001. Then, from , we have ; that is, we can set the queue size as 28. Therefore, designing the queue capacity based on the steady-state probabilities is a more reasonable and rational method than the expected number of customers in the system.

5. Conclusion and Future Work

This paper is devoted to the study of the performance of the queueing system with service interruptions under a reactive maintenance policy combining the planned maintenance for the server. We dealt with the problem by constructing a queueing model. Using the matrix analytic approach and spectral expansion method, we respectively obtained the steady-state probabilities. Based on the theoretical framework given by the queueing analysis, we derived some performance measures and provided some numerical examples to show the impacts of system parameters on some key performance measures. Moreover, from the perspective of queueing model, we discussed the capacity setting of the underlying queueing model by numerical examples and analyzed the importance of steady-state probabilities in designing the queue capacity. We hope that our results can be applied to more practical queueing problems, and the results can provide important reference value for system administrator to scientifically design the maintenance policy.

Although in this paper, we just considered a single-server queueing system instead of a multiserver queueing system. In the further work, we could also consider the more realistic queueing system with multiple servers and use a Phase-type (PH) distribution for modelling the service times, which needs to develop new methods, and it will be an interesting direction for future research. Another interesting direction for future research is that, based on the steady-state analysis, we could investigate a real-world production system (service system) by using the real data and verify the validity of the underling queueing model. Finally, we could also consider pricing problem in this system, i.e., we could investigate the strategic behavior of customers, and then obtain the optimal pricing decision on the basis of the equilibrium strategies of customers.

Data Availability

All data included in this study are available upon request by contact with the corresponding author.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported by National Natural Science Foundation of China (Grant no. 12001329), Shandong Provincial Natural Science Foundation (China) (Grant no. ZR2019BG014), and Scientific Research Foundation of Shandong University of Science and Technology for Recruited Talents (Grant no. 2019RCJJ016).