Abstract

We address the performance analysis of the maximum likelihood (ML) direction-of-arrival (DOA) estimation algorithm in the case of azimuth/elevation estimation of two incident signals using the uniform circular array (UCA). Based on the Taylor series expansion and approximation, we get explicit expressions of the root mean square errors (RMSEs) of the azimuths and elevations. The validity of the derived expressions is shown by comparing the analytic results with the simulation results. The derivation in this paper is further verified by illustrating the consistency of the analytic results with the Cramer-Rao lower bound (CRLB).

1. Introduction

There have been many studies on the direction-of-arrival (DOA) estimation [15]. Our concern in this paper is the performance analysis of the maximum likelihood- (ML-) based DOA estimation algorithm.

Many studies have been conducted on asymptotic performance analysis of maximum likelihood DOA estimation algorithm [68]. The shortcoming of the asymptotic performance analysis is that it is only applicable to high SNR region since it is based on approximation which is valid only for small amount of noise. To circumvent this problem, many studies on nonasymptotic performance analysis have been conducted for performance analysis which is valid for low SNR as well as high SNR.

In [912], the authors dealt with threshold performance analysis for single incident signal. On the other hand, our scheme presented in this paper is applicable to multiple incident signal as well as single incident signal.

In this paper, we address the performance analysis of the ML algorithm for the estimation of the DOA of incident signals. Our interest is in the case of multiple incident signals. The ML algorithm exploits the fact that, in the absence of the noise, the incident signal on the array elements can be expressed as a linear combination of the array steering vectors corresponding to the true incident angles.

Based on this observation, in the ML algorithm, the estimate of the DOA is obtained from the angles whose steering vectors can span the subspace to which the signal on the array elements belongs.

In practical situation, the noise is inevitable. Therefore, the noisy incident signal on the array elements cannot be expressed as a linear combination of the array steering vectors associated with true incident angles, which accounts for why estimation error occurs for noisy array response.

To quantify the bias of the estimate due to the noise on the antenna elements, we adopt the Taylor series approximation around the true incident angle. We derive the closed-form expression of the biases of the estimates, where different estimates are due to some approximations. For one of the many estimates, we derive the closed-form expression of the MSE of the estimate.

In Figure 1, the mathematical and statistical theories used for the performance analysis of the ML algorithm are enumerated.

Novelty of the proposed method over the existing methods is as follows.

Performance Analysis for Multiple Incident Signals and Simultaneous Estimation of Azimuth/Elevation. In [911], the authors made performance analysis of ML DOA algorithm for single incident signals. On the other hand, the scheme presented in this paper handles multiple incident signals as well as single incident signal. In addition, the studies presented in [911] consider the estimation of azimuth, not azimuth and elevation. Note that, in this paper, we consider the estimation of azimuth and elevation of multiple incident signals.

Derivation of Explicit Expressions of Bias and MSE, Not the Lower Bound of MSE. In [13], the authors derived the CRLB of the DOA algorithm, which is a lower bound of the variance of the DOA algorithm. Note that the CRLB is the lower bound of the variance, not the variance itself of the DOA algorithm. In this paper, explicit expression of the bias of the estimate and explicit expression of the MSE itself, not the CRLB, have been derived.

Intuition on the Effect of Each Approximation on the Estimation Accuracy. In this paper, we obtain, for , the expressions of [, ], [, ], and [, ]. The superscripts and denote approximation and approximation, respectively. Therefore, [, ], [, ], and [, ] are the estimates with no approximation, approximation, and approximations, respectively. By comparing these three values, we can intuitively recognize which approximation of approximation and approximation results in larger error. The importance of this observation is that the proposed scheme gives us an intuition on which approximation of all the approximations results in large approximation error. The approach presented in [14] does not show any intuition on which approximation results in large approximation error, since in [14] the authors only explicitly consider Taylor series expansion for general problem. Since the approach in [14] is a general approach applicable to many estimation problems, it does not give explicit expressions specific to DOA estimation problem. Since the scheme presented in this paper only considers the estimation of DOA estimation, rather than general estimation problem, the expressions presented in this paper are more explicit than the results presented in [14].

Explicit Expressions of Various Estimates for Each Step of Successive Approximations. To the best of our knowledge, there has been no study where explicit expressions of successive estimates have been derived [613]. Successive estimates imply various estimates obtained as each approximation has been successively applied. In [14], no explicit expression of estimate for each step of successive approximations has been presented. Although the study presented in [14] deals with the estimation of vector parameter as well as scalar parameter, the results in [14] for vector parameter estimation, equations and in [14], are quite implicit, rather than explicit, and it is very difficult to get intuition on how the bias and the covariance matrix are dependent on the amount of an additive noise.

Search-Free Approximate ML DOA Estimation and Derivation of Closed-Form Estimation Error. For getting and in (13) for , four-dimensional search with respect to , , , and has to be performed. In general, for estimation of incident signals, dimensional search has to be performed for simultaneous estimation of , which is computationally very intensive. In addition, in practical implementation of (13), the estimates are highly dependent on the search range and search step, which implies that the estimates in (13) may be wrong if search step and search range are not properly chosen. In summary, the original ML estimate obtained from implementation of (13) has two demerits of computational cost and dependence on search step and search range.

On the other hand, and are obtained from (42) and (45), respectively. Note that and are obtained in closed-form and that evaluation of and is search-free, which implies that evaluation of and is much less computationally intensive than evaluation of . In addition, since evaluation of and is search-free, ] and are not dependent on search parameters such as search step and search range.

[] and [] in (42) and (45) can be regarded as estimation errors for approximate ML DOA algorithm. [] and [] can be obtained in closed-form, although [] and [] are less accurate than [] in the sense that approximation is applied in getting [] and that approximations are applied in getting [].

Derivation of Analytic Expression of Closed-Form MSE of . Many evaluations of (42) and (45) are required for getting empirical performance analysis. Single evaluation of (42) and (45) is not computationally intensive since they are search-free closed-form solutions. Note that main computational cost of evaluating (42) and (45) is inversion of matrices, which is not computationally intensive at all. For estimation of incident signals, matrices have to be inverted to implement (42) and (45). But for empirical performance analysis based on Monte-Carlo simulation, (42) and (45) have to be evaluated many times. Note that the number of repetitions should be large for getting reliable empirical performance analysis, which implies that the empirical performance analysis based on many evaluations of (42) and (45) is computationally intensive. In this paper, it has been shown that empirical performance of [, ] can be obtained from (47), without many evaluations of (45). Note that single evaluation of (47) results in analytic performance analysis, which should be equivalent to empirical performance based on many repetitions of (45). Therefore, although it is not necessary to repeatedly evaluate (45) to get empirical MSE, (45) is evaluated many times for validating the derived expression (47). Also, note that since analytic expression of the MSE of [, ] is not available, the MSE of [, ] has to be empirically obtained via many evaluations of (42) with different realizations of additive noise for each evaluation.

In and , the superscripts and denote approximation and approximation, respectively. On the other hand, in [14], no explicit expression of MSE is presented, and only the implicit expression of final estimate is presented. Note that, in this paper, explicit expressions of all the estimates associated with each successive approximation have been presented.

2. ML DOA Algorithm

For simplicity, we present performance analysis for two incident signals on the UCA in this paper. But, extension to more incident signals and general planar array structure is straightforward. Let , , , and representThe number of the antenna elements is denoted by . When there is no noise, the received signals on the antenna elements are expressed aswhere is given byArray matrix in (2) is written aswhere array vector is defined as

The corresponding expression for the noisy signals iswhere and denote the azimuth and the elevation, respectively.

Expression (7) for a general planar array with elements can be written aswhere and denote the element spacing in the -direction and -direction, respectively, and and denote the index in the -direction and -direction, respectively.

Also, (8) for general planar array is written asWe have explicitly shown how (7) and (8) should be modified for a two-dimensional planar array. Note that, for an arbitrary array structure, (7) and (8) are modified appropriately to apply the proposed algorithm to the specific array structure. (7) for an arbitrary array structure can be easily obtained from the phase difference between antenna elements, given the azimuth and the elevation of the th incident signal. Note that the phase difference is both dependent on the array geometry for a specific array structure and the direction of the incident signal.

The projection matrix onto the column space of can be expressed as [15]. It is assumed that the entries of the Gaussian vector are independent and identically distributed Gaussian random variables with zero-mean and the same variance. Note that the noise is complex-valued and that the real part and the imaginary part of the noise are independent Gaussian random variables with zero-mean. The variance of the real part is denoted by , which is equal to the variance of the imaginary part, :Note that, in this paper, the formulation is for the case where there are two incident signals. But, it can be easily extended to the case of more incident signals. Using the maximum likelihood (ML) algorithm, the estimates are given by [16]More specifically, (12) for is written asFor notational simplicity, is used to denote : is defined fromThe explicit expression of the entries of the matrix in (11) is given byUsing (4) and (5), in (16) is given byLet represent :

Using (16) and (17), can be expressed as

Using (17), can be written as

The explicit expression of the entries of the matrix in (11) is given by

Using (16)–(21), in (22) is defined as

Note that denotes the entry at the th row and the th column of . The explicit expression of the entries of the matrix in (12) is given by

Using (24) in (12), we get . The biases of the estimates are defined as

3. Closed-Form Expression of Estimation Error

Since , , , and should be the maximizing arguments of , the first-order derivative of with respect to each argument should be zero at where the first-order partial derivatives of in (26) with respect to each argument are expressed as, using (22),The first-order partial derivatives of in (20) and in (23) with respect to each argument in (27) are expressed in the Appendices A and B. Denominator of equation (27) can be written asUsing (21), is defined asWe define in (26) aswhere is given in (2). Let represent the difference between and :Substituting (32) in (26) and applying the first-order Taylor expansion of , , , and yieldLet , , , , , and be defined aswhere denotes the th component of and and denote the th component of and , respectively. Using (34)–(36), (33) can be expressed in matrix formwhere is defined asNote that in (34) denotes the entry at the th row and the th column of in (38), not at the th row and the th column. should be maximized at , since, with the sample covariance matrix from noiseless response, the ML algorithm estimates the DOA accurately. Therefore, the partial derivatives of with respect to the each entry of should be zero at The last equalities of (39) can be expressed asSubstituting (40) in (35), is simplified to Using (38) and (35), in (37) is defined asSince is true in (34), (34) can be approximated aswhere the superscript denotes that the approximation based on is applied. Accordingly, is defined asThe first-order derivatives of , and in (34) are derived in Appendix G. Applying -approximation to (37), is given bywhere an explicit expression of the entries of is

4. Closed-Form Expression of Mean Square Error

Using (45), an analytic MSE of can be expressed asThe entry at the th row and the th column of can be expressed as

in (48) is derived in Appendix P. Entries of in (47) are written asFrom (49), it is clear that the MSEs of , , , and are given bywhere and are true since and are real-valued. Mathematical details on the derivation are summarized as follows.

Mathematical Details on the Performance Analysis of ML DOA Estimation. Number of snapshots is , number of arrays is , and number of signals is 2.Projection matrix of The derivatives of the ML cost function, based on the covariance matrix associated with noisy array response, with respect to each incident signal, are not identically zero at true incident angles. From the angles at which the derivative is identically zero, the expression of the estimation error can be derivedThe derivative of the ML cost function, based on the covariance matrix associated with noiseless array response, with respect to each incident signal, is identically zero at true incident anglesSubstituting (55) and (56) in (58) results in

Using in (59) results inAn analytic expression of can be derived by exploiting statistical distribution of .

5. Numerical Results

MSEs of , , and are defined aswhere denotes the number of repetitions. The subscript denotes the estimate associated with the th repetition out of repetitions. , , and in (61) are given by (42), (45), and (13), respectively.

In Figure 2(a), we illustrate how accurate the MSEs of , , and are. The results with “Simulation ,” “Simulation ,” “Simulation ,” and “Analytic ” are obtained using (61) and (50), respectively. It has been clearly shown that the results with “Simulation ” show good agreement with those with “Analytic ”. In addition, the results with “Simulation ” get closer to those with “Simulation ” as SNR increases. Therefore, at high SNR, the results with “Analytic ” can be used for determining how accurate the results with “” are. The lower bound of the MSE for an unbiased estimator is called the CRLB. The CRLB for is also illustrated in Figure 2(a), where it is verified that the CRLB is actually below the analytic MSEs. Generally, the CRLB of each parameter can be obtained from the diagonal entry of :where is defined in [13]. The CRLB of is obtained from element of in (63).

MSEs of , , and are defined as, , and in (64) are given by (13), (42), and (45), respectively. In Figure 2(b), we illustrate how accurate the MSEs of , , and are. The results with “Simulation ,” “Simulation ,” “Simulation ,” and “Analytic ” are obtained using (64) and (51), respectively. We can easily see that the observation for and in Figure 2(a) is also true for and in Figure 2(b). The CRLB of is obtained from the element of the in (63).

MSEs of the , , and are defined as, , and in (65) are given by (13), (42), and (45), respectively.

In Figure 2(c), we illustrate how accurate the MSEs of , , and are. The results with “Simulation ,” “Simulation ,” “Simulation ,” and “Analytic ” are obtained using (65) and (51), respectively. It is clear in Figure 2(c) that can be used to approximate and that empirical MSE of is equal to an analytic MSE of in (51). The CRLB of is obtained from the element of in (63).

MSEs of , , and are defined as, , and in (66) are given by (13), (42), and (45), respectively. In Figure 2(d), we illustrate how accurate the MSEs of , , and are. The results with “Simulation ,” “Simulation ,” “Simulation ,” and “Analytic ” are obtained using (66) and (51), respectively. It is clear that the observation on the various estimates of , in Figure 2(c), also applies to the various estimates of in Figure 2(d). The CRLB of is obtained from the element of in (63).

In Figures 35, the results with respect to SNR are shown. It has been shown in the new numerical results that a gap between the MSE of the MLE and the Cramer-Rao lower bound gets smaller as the SNR varies from −20 dB to 20 dB in increments of 10 dB. Note that, in Figure 3, the gap for SNR = 20 dB is much smaller than that for SNR = −20 dB, which implies that the MSE of the MLE attains CRLB as SNR increases. Similar observation can be found in Figures 4 and 5.

Closed-form expression is only available for the estimate both with -approximation and -approximation. -approximation in our scheme is based on the first-order Taylor series expansion, which implies that -approximation becomes less accurate when the variance of the noise is large since the first-order Taylor series expansion does not take into account higher-order terms. Note that high variance of noise corresponds to low SNR. -approximation in our scheme is based on the fact that the difference between the covariance matrix with noisy array response and that with noiseless antenna response is much smaller than the covariance matrix with noiseless antenna response. -approximation is accurate when the noisy antenna response is similar to the noiseless antenna response, which is true for high SNR. That is, -approximation is valid for high SNR.

In conclusion, both -approximation and -approximation are valid for high SNR, and therefore the closed-form expression for the estimate with -approximation and -approximation is also valid for high SNR.

6. Summary and Conclusion

In Figures 6 and 7, we illustrate the outline of our derivation and how the performance analysis of the ML algorithm method is conducted.

In the case of azimuth/elevation estimation of multiple incident signals using the UCA, it has been shown that the bias of the estimate associated with the ML algorithm can be expressed in the closed-form at high SNR where the estimation error is small. The closed-form expression of the MSE of the one of the estimates has been derived, which implies that the MSE of the estimates can be obtained without computationally intensive Monte-Carlo simulation. The derivation is based on the fact that, for small estimation error, we can make the Taylor series approximation of the array vector, which makes it possible for us to get a closed-form expression of the MSE of the estimate.

We assume that an additive noise on the antenna elements is complex Gaussian distributed and that noise at each antenna is spatially white. Future works include how to extend the results in this paper to the case of non-Gaussian noise and spatially correlated noise. Although the formulation and the numerical results for two incident signals are presented in this paper, an extension to multiple incident signals is quite intuitively clear and straightforward. The number of columns in (4) is equal to the number of incident signals. For incident signals, in (14) is matrix and in (17) can be explicitly expressed using the adjoint method. More specifically, in (18), which is a determinant of , can be obtained from the cofactor expansion along any row or any column of . The entry at the th row and the th colmn of , in (21), is the entry at the th row and the th colmn of the adjoint matrix of . Note also that is matrix, where is the number of antenna elements. Therefore, the number of rows and the number of columns of are not dependent on the number of incident signals, . Therefore, simple modification of the scheme presented in this paper results in more general scheme, which can be used irrespective of the number of incident signals.

Equation (7) of this manuscript is valid only for the uniform circular array. For different array structure, (7) should be modified appropriately. In that case, we only have to simply modify(i)the expressions associated with the first derivatives of (7), which are (D.1),(ii)the expressions associated with the second derivatives of (7), which are (K.1).

Simply modifying (D.1) and (K.1) for specific array structure is not so hard. Therefore, although we presented the formulation for the UCA structure, it can be very easily extended to any array structure.

Appendix

A. The First-Order Derivatives of with respect to , , , and

For use in (27), Appendices F and G, the first-order partial derivatives of in (20) with respect to each argument are expressed as

B. The First-Order Derivatives of with respect to , , , and

For use in (27), Appendices E and G, the first-order partial derivatives of in (23) with respect to each argument are expressed as

C. The First-Order Derivatives of with respect to , , , and

For use in Appendices B and L, the first-order partial derivatives of in (17) with respect to each argument are expressed as

D. The First-Order Derivatives of with respect to and

For use in Appendices A, B, C, E, F, I, J, K, and L, the first-order partial derivatives of in (18) with respect to each argument are expressed as

E. The Second-Order Partial Derivatives of

For use in Appendix G, the second partial derivatives of in (23) with respect to each argument are expressed as

F. The Second-Order Partial Derivatives of

For use in Appendix G, the second-order partial derivatives of in (20) with respect to each argument are expressed as

G. The First-Order Derivatives of , , , and

The first partial derivatives of , , , and in (27) with respect to each argument can be expressed as

H. The First-Order Derivatives of

For use in Appendix G, the first partial derivatives of in (28) with respect to each argument are expressed as

I. The First-Order Derivatives of

For use in Appendix H, the first-order partial derivatives of in (21) with respect to each argument are expressed as

J. The First-Order Derivatives of

For use in Appendix H, the first-order partial derivatives of in (29) with respect to each argument are expressed as

K. The Second-Oder Derivatives of

For use in Appendices E, F, and L, the second partial derivatives of in (18) with respect to each argument are expressed as

L. The Second-Order Derivatives of

For use in Appendix F, the second partial derivatives of in (17) with respect to each argument are expressed as

M. Second-Order Central Moment of Zero-Mean Complex Gaussian Random Variable with Variance

Depending on how , , , and are related, we define four cases.

Case 1. and .

Case 2. and .

Case 3. and .

Case 4. and .  (i): For Case  1, is given bySimilarly, it can be shown that is identically zero for Cases  2–4:Note that, in deriving (M.1)–(M.3), we used the fact that the real part and the imaginary part of noise are independent and identically distributed with . (ii):Using the same algebraic manipulation used in evaluating , it can be shown that is equal to zero for Cases  1–4: (iii):Using the same algebraic manipulation used in evaluating , it can be shown that is equal to zero for Cases  1–4:

N. Third-Order Central Moment of Zero-Mean Complex Gaussian Random Variable with Variance

We define ten cases depending on how , , , , and are related.

Case 1. and and .

Case 2. and and .

Case 3. and and .

Case 4. and and .

Case 5. and and .

Case 6. and and .

Case 7. and and .

Case 8. and and .

Case 9. and and .

Case 10. and and . (i): for Case  1 is identically zero:Note that, in deriving (N.1)-(N.2), we used the fact that the real part and the imaginary part of noise are independent and identically distributed with .Using the same manipulation used in obtaining (N.2), it can be shown that is identically zero for Cases  2–10:From (N.2) and (N.3), is zero for all ten cases defined in Case  1–Case  10: (ii):In a similar way to get (N.4), we get

O. Fourth-Order Central Moment of Zero-Mean Complex Gaussian Random Variable with Variance

(a) :

For , (O.1) can be written as

The first term of (O.2) is given bywhere the results in Appendices M and N are used.

Using the same scheme in getting (O.3), the other terms of (O.2) are given byFrom (O.3)–(O.10), in (O.2) is given by

(b) :Using a similar way to get (O.11), for , is given by

P. Derivation of

From (P.3), is given bywhere the last equality follows since the noiseless signals are not stochastic.

In Appendices M and N, it is shown that the followings are identically zero:Using (P.5) in (P.4), (P.4) is simplified to ,where the second moments and the fourth moment in (P.6) are derived in Appendices M and O, respectively. Finally, is given by

Competing Interests

The authors declare that they have no competing interests.