Abstract

A new nonlinear adaptive control law for a class of uncertain nonlinear systems is proposed. The proposed control law is designed by a modified adaptive control Lyapunov function (ACLF) which satisfies a Hamilton-Jacobi-Bellman (HJB) equation. The modified ACLF is derived from transformation of an ACLF. The proposed control law is different from the inverse optimal one in decreasing the value of a cost function specified by a designer. In this paper, we show a transformation coefficient for an ACLF and a design method of a nonlinear adaptive controller. Finally, it is shown by a numerical simulation that the proposed control law decreases the value of a given cost function and achieves the desirable trajectory.

1. Introduction

Design of control laws considering stability and optimality is the central issue in control theory [1]. For stability, Lyapunov theory is a strong tool to design controllers and to assure the stability of systems. For optimality, a value function which is the solution to a Hamilton-Jacobi-Bellman (HJB) equation is derived from dynamic programming. If a value function and an optimal control law can be found, then the closed system possesses robustness such as gain margin, phase margin, and low sensitivity against parameter variations [2, 3].

However, a general approach to find the value function has not been shown and it is not easy to design the optimal control. Due to the difficulty, the inverse optimal control problem which minimizes a meaningful cost function was proposed by Freeman and Kokotovic. If the inverse optimal problem is solved, namely, a control Lyapunov function (CLF) is found, it is possible to design a control law with the good characteristics mentioned above by applying a CLF to the Pointwise Min-Norm (PMN) control law [4]. But the minimized cost function and the trajectory may not be desirable. In order to improve this problem, a locally approximate approach around the origin by numerical calculation and transformation of a CLF was proposed. The approach gives characteristics of local optimality without loss of characteristics of the global inverse optimality [5, 6], and it is based on the fact that the PMN control law can minimize the desired cost function if a CLF has the same level sets as the value function [7].

The inverse optimal control law was applied to robust control and adaptive control [8, 9]. Moreover, a control law in which a Sontag type control law and a PMN control law were generalized has been proposed [10ā€“12]. Also, many approaches to approximate the value function have been proposed. They are based on numerical calculation and focus on improvement of calculation speed and accuracy of approximation [5, 13].

Recently, a new approach has been provided by [14]. The new approach in [14] is different from the inverse optimal and approximate approaches in directly considering a cost function specified by a designer and provides a modified ACLF by introducing a transformation coefficient. This paper expands [14] and proves a modified ACLF and a control law designed by a modified ACLF minimize the value of a cost function specified by a designer in the set of control laws based on an ACLF. Also, the condition under which a controller and a transformation coefficient are continuous is provided and the problem of overparameterization which is to increase the number of parameter estimation according to the order of a system is solved.

In this paper, the design of a modified ACLF and a nonlinear adaptive control law is shown in Sections 3 and 4 and effectiveness of the proposed controller is shown by a numerical simulation in Section 5.

The following notations are used in this paper. denotes the sets of real value. denotes the sets of positive real value. denotes the derivative of with respect to . For a matrix , denotes that a matrix is positive definite (semipositive definite) matrix. denotes transpose of matrix .

2. Preliminaries

We deal with a nonlinear strict-feedback system with unknown parameters described by where , , , and denote the state, the control input, and a vector of unknown constant parameters, respectively. It is assumed in this paper that all information of the state can be used. and satisfy and , respectively. is not equal to 0 for all . In compact notation, the system (1) is described as where , , and are smooth and , , so that is an equilibrium of the uncontrolled plant. In order to estimate the unknown parameter , the following adaptive law is introduced: where is the estimation of the parameter , is a constant parameter specified by a designer, and is the tuning function. The construction of the tuning function is explained in Section 4.

The cost function is defined as where and are weighting functions and satisfy for all , , and for all .

It is well known that the optimal control problem which minimizes the cost function (4) is reduced to the problem to find the value function satisfying the following HJB equation [10]: where the optimal control law is given by However, it is generally difficult to solve the HJB equation and to find the value function. In this paper, assuming that there exists an ACLF for the system (1), a certain transformation coefficient is introduced and a nonlinear adaptive control law is constructed by a modified ACLF based on an original ACLF.

3. Main Results

In this section, we show the controller based on a modified ACLF by a certain transformation coefficient.

First, an ACLF with unknown parameters and an adaptive law is defined as the following definition.

Definition 1. is called an ACLF for the system (2) and (3) if it is a positive definite function which is continuous and radially unbounded and satisfies the following inequality: The following theorem gives the transformation coefficient for the modified ACLF and the control law based on an ACLF.

Theorem 2. It is assumed that there exists an ACLF for the system (2) and (3). Let be where is defined by and and in (9a)ā€“(9c) are scalar functions defined by respectively. denotes a constant and is sufficiently large so that is a positive definite function. Then, for all and satisfies HJB equation (5) with the following control input: and the closed system with the control input (12) is asymptotically stable to the origin.

Proof. First, we show that is given by (9a)ā€“(9c) so that satisfies HJB equation and for all . In HJB equation (5), when is applied instead of the value function , the quadratic equation in terms of is derived by where input is given by (12) from . Equation (13) can easily be solved with respect to ; namely, in the case of and in the case of where , , and and the solution becomesThough there are two solutions in (14a) when , we select one of them as (9a). Then, for all because, in the case that , for all and, in the case that , since is an ACLF which satisfies (7), for all . Therefore, we obtain for all .
Next, we will show that the system is asymptotically stable by the control law (9a)ā€“(9c) and (12). In the case that , the time derivative of is given by (16a). Since , we obtain . In the case that , the time derivative of is given by (16b). Consider because is an ACLF which satisfies (7). Therefore we also obtain in the case of (16b) and is satisfied for all . As a result, is the ACLF for the system (2) and (3) with the input (9a)ā€“(9c) and (12). We show that is also an ACLF. It is trivial that the time derivative of is for all because and for all . Since changes its value with similarly as and and there always exists to make , we can have be positive definite for all . Therefore, is an ACLF for the system. Therefore, the solution of the system (2) and (3) converges to the maximum invariant sets included in the set by LaSalleā€™s invariance principle [15]. Since the invariant sets consist of the origin only, the closed system is asymptotically stable to the origin.

In Theorem 2, the ACLF is modified by the transformation coefficient and the modified is an ACLF and satisfies HJB equation.

Theorems about the cost function (4), the transformation coefficient , and the control input (12) are given as follows.

Corollary 3. The value of cost function is given by the following equation if the transformation coefficient (9a)ā€“(9c) and control law (12) are applied:

Proof. Equation (18) is given from (13): By integrating of (18) from 0 to , we obtain

Theorem 4. The control law (12) with (9a)ā€“(9c) minimizes the cost function (4) in the set of control laws based on the ACLF .

Proof. It is assumed that the control (20) stabilizes the system: By substituting (20) for the cost function (4) and applying (13), we obtain Since the first term in (21) is positive, (21) is minimized when . Therefore, the cost function (4) is minimized by the control law (12) using the ACLF .

Theorem 5. If , , , and are continuous and differentiable for all , then transformation coefficient (9a)ā€“(9c) is continuous and differentiable for all .

Proof. of (13) can be regarded as the function of and , namely, . The derivative of the function with respect to is given by where we used (9a)ā€“(9c). In (22a), for all . In (22b), for all since is an ACLF. Therefore, for all . From implicit function theorem, is continuous and differentiable for all if , , , and are continuous and differentiable for all .

Remark 6. It is assumed that the weighting function for all in this paper. But even if when for all , the proposed approach can also be applied (refer to Section 5).

Remark 7. The control law includes the unknown parameters in (10). Therefore, it is required to estimate them by an adaptive law. The adaptive law is given in Section 4.

Remark 8. The proposed approach does not provide the value function but the transformation coefficient is determined such that the modified ACLF (8) satisfies the HJB equation. Since the modified ACLF (8) depends on the original ACLF, the value of the cost function (17) varies depending on the selection of the original ACLF.

4. Application of Theorem 2 to ACLF Obtained by Backstepping

In this section, the transformation coefficient, the modified ACLF, and a nonlinear adaptive control law are designed by applying Theorem 2 to the ACLF which is chosen in the following backstepping. Backstepping can be applied to the strict-feedback system (1) by the following procedure [16].

Step ā€‰ā€‰. We introduce the variables of , , and defined by (23) and (24): where , are the virtual inputs and and denotes the control law derived in the step of backstepping. Differentiating (23) with respect to and using (1), system is written as where and is given by If the system (25) is asymptotically stable, then the system (1) is also asymptotically stable. Therefore, we solve the problem of stabilization of the system by Lyapunov approach considering the Lyapunov function candidate as follows (in this paper, denotes the ACLF derived by backstepping. , denote the control law and the transformation coefficient based on , respectively): The virtual input is chosen as where and is a design parameter and The time derivative of along the solution of (25) in the Step is given by

If the tuning function is chosen as then we get Thus, is an adaptive Lyapunov function for the system, the system is asymptotically stable, and the estimation error is stable; then system is asymptotically stable. Since the adaptive Lyapunov function and the asymptotic stabilizing control law are given, is an ACLF.

Applying Theorem 2 to ACLF (27), we obtain the control law as where the transformation coefficient is given byand and are given by from the following which is rewritten by (23) and (27):

Now, we show the stability of the system (2). The time derivative of (36) along the solution of (2) with the control input (33) yieldsSince and , (37b) becomes from (32). Now, it is shown that (36) is an adaptive Lyapunov function and the system is, therefore, asymptotically stable to the origin.

Finally, we show that which is related to by (8) satisfies HJB equation. Using instead of , we obtain HJB equation (39) for the system (1) and (3) with the cost function (4): By substituting (34a)ā€“(34c) into (39), it is shown that HJB equation is satisfied.

From the above, it is confirmed that Theorem 2 is satisfied.

5. Numerical Example

The effectiveness of the proposed control law is shown by a numerical example of a system with an unknown parameter. The numerical simulation is done for the second order system (40) and the cost function (41) [10]: where the true value of the unknown parameter is .

The value function of the optimal control problem for the system (40) with the cost function (41) is given by the following equation if the true value of in (40) is 0.5 (the value function of the HJB equation is given in the existing result [10]): and the optimal control law is from (6).

On the other hand, using the ACLF which is given by (43) from (27), the control law is derived as (44) by backstepping: Then, applying Theorem 2 to ACLF (43), the proposed control is and the transformation coefficient is

It should be noted that if and , then . In this case, we need to show that mentioned in Remark 6. Actually, we can obtain (48) from (47) when : Therefore, the proposed approach can be applied to ACLF (43).

The simulation results are shown in Figures 1, 2, 3, and 4. The initial state is and design parameters are and . The value of the cost function is shown in Table 1.

Figures 1ā€“3 show that the trajectories by the proposed control law are close to the optimal trajectories and Table 1 shows that the value of the cost function by the proposed control law is smaller than the one by backstepping and close to the optimal value for the system of which parameter is known.

6. Conclusions

In this paper, a nonlinear adaptive control law is proposed. The control law is designed by a modified ACLF which satisfies the HJB equation. It is proved that a cost function specified by a designer is minimized in the set of the control laws based on an ACLF for the strict-feedback system. The condition under which the proposed control law and the transformation coefficient are continuous is provided. The effectiveness of the proposed control law is shown by the simple numerical simulation.

If an ACLF is given, then the proposed approach enables designing a control law. Therefore, the proposed approach can also be applied to other nonlinear systems compared to strict-feedback systems.

Research subject in the future is extension to systems in which all the states cannot be accessed.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.