In this post, we explain how to simulate a model reference adaptive controller in MATLAB. The motivation for creating this post comes from the fact that MATLAB implementation of model reference adaptive controllers is usually omitted in control theory books. However, implementation is far from trivial and there are a number of pitfalls that can potentially lead us to misleading conclusions. In this post, we focus on the adaptive control algorithm whose structure is determined using the so-called MIT rule. An excellent reference to adaptive control is
Åström, K. J., & Wittenmark, B. (2013). Adaptive Control. Courier Corporation.
This post is largely based on Chapter 5 from the above reference. In this post, we numerically simulate Example 5.2 from the above reference. A YouTube video accompanying this post is given here:
Copyright Notice and Code License
Copyright notice and code license: this tutorial, webpage, video lesson, developed code files, and documents should not be copied, redistributed, or posted on public or private websites, public or private code repositories, or social media platforms. This lesson and all the provided study materials are strictly for personal use. Without the permission of the author, this lesson and the provided material should not be used for commercial purposes and for training of engineers working in companies or in government or privately owned entities or institutions. This lesson and the provided material should not be used as lecture materials on online learning platforms and in university courses. This lesson and the provided material should not be used to train an AI algorithm or a large language model. Without the permission of the author, the provided code should not be used for academic research or for commercial purposes.
A GitHub page with all the codes used in this post can be accessed here.
We focus on a simple first-order system, represented by the differential equation
(1)
where is an output of the system that we want to control, is an output of a controller (referred to as the control input), and and are constants determining the system dynamics. We assume that constants and are unknown during the control design procedure.
The goal of the Model Reference Adaptive Controller (MRAC) is to compute the control input such that the system output is as close as possible to an output of a reference model. The reference model is a model that specifies the desired system behavior. That is, the output of the reference model is the desired system output that we want to achieve in practice. During the design process of MRAC, it is assumed that the reference model is known. Accordingly, for the model (1), the reference model is defined by the following equation
(2)
where is the output of the reference model, is the input of the reference model, and and are the parameters of the reference model. We assume that and are known. The signal is known. Since is known, we can simulate the reference model (2) to generate the output of the reference model . The control input structure is given by the following equation
(3)
where and are the adaptive control parameters that are determined by an adaptive control algorithm.
The goal of the adaptive control algorithm is to determine the parameters and such that the system output is as close as possible to the output of the reference model for some selected reference input function .
For the development of the control algorithm, it is instructive to perform the following analysis. Namely, by substituting the control law (3) in (1), we obtain
(4)
The model (4) is a closed-loop control system. Let us now compare (4) with the reference model (2). If the parameters and take the following constant values
(5)
then, we can conclude that the dynamics of the closed-loop system (4) is identical to the reference model (2). However, in practice, we do not know the constants and of the system, and consequently, we cannot determine the parameters and in this way. Despite this, the above presented analysis will help us to validate the convergence of the adaptive controller that will be developed in the sequel.
Loosely speaking, the MIT adaptive control rule aims at minimizing the following cost function
(6)
where is the control error defined as follows
(7)
To decrease the value of is reasonable to adjust the parameters and in the direction of the negative gradient of J. Using this idea, we obtain
(8)
To compute these partial derivatives, we need to express from (4). This can be done by using a derivative operator , that acts on a function in the following way
(9)
Consequently, from (4), we obtain
(10)
Consequently, using (10), the error can be written as follows
(11)
By taking the partial derivatives of (11) with respect to and , we obtain
(12)
The partial derivatives in (12) depend on the unknown system parameters and and consequently, we cannot explicitly compute them. In order to perform computations, we need to introduce an approximation that is physically justified. Basically, in the ideal scenario, the parameters and should converge to the values and that render the closed-loop dynamics to be equal to the reference model dynamics. In this ideal case, from (5) we have
(13)
Consequently, we can use this approximation to approximate the unknown terms in the denominators of the expressions in (12). Accordingly, we have
(14)
Next, we substitute this approximation in (12) and as the result, we obtain
(15)
Finally, by substituting these partial derivatives in (8) we obtain the final expressions describing parameter changes in time
(16)
where is a new gain. Note that we have normalized the filter
(17)
in this way
(18)
This filter will now have a unit steady-state gain.
The equations in (16) and the last equation in (10) describe the dynamics of the closed-loop system. These equations are written in the mixed time-domain and differential operator forms. For further developments, it is instructive to represent these equations in pure time-domain. Using the definition of the differential operator (9) and by performing a few algebraic steps, we obtain
(19)
where
(20)
This system of differential equations is coupled with the reference model equation (2) that we rewrite here for clarity.
(21)
For selected we can simulate the reference model (21). In this way, we obtain the reference model output that is a known function of time. The time series are used as known “inputs” to steer the dynamics given by (19).
To simulate the model (19), we need to transform it into a state-space model. First, we need to introduce state-space variables:
(22)
The state-space model is
(23)
Next, we explain the code for simulating the closed-loop system behavior. We assume the following parameters
(24)
This combination of parameters gives the following values of the converged parameters (see equations in (5))
(25)
These values of parameters will be used to verify the convergence of the algorithm.
The following code lines are used to define the reference model and to generate the reference input and reference output signals.
% MATLAB simulation of a model reference adaptive controller
% Author: Aleksandar Haber
% Date: April 21, 2021
% reference model parameters
am=2
bm=2
% plant parameters (that are unknown during control design)
a=1
b=0.5
% final values of the parameters for verification of convergence
theta1final=bm/b
theta2final=(am-a)/b
% reference model
Wm=tf([bm],[1 am])
tmax=100 % max simulation time
time=0:0.001:tmax; %time vector
% define a reference input signal
pulsew = 10; %pulse width
delayop= pulsew/2:pulsew*2:tmax; %delay vector
% reference input
ur=2*pulstran(time,delayop,'rectpuls',pulsew)-1;
% input reference signal
figure(1)
plot(time,ur);
set(gca,'Ylim',[-1.5 1.5]);
% output reference signal
yr=lsim(Wm,ur,time)
figure(2)
plot(time,yr,'r')
hold on
plot(time,ur,'k')
Code line 36 in the above code is used to generate the reference output signal by simulating the reference model.
The reference input and reference output signals are shown in Fig. 1 below.
The closed loop system dynamics is defined by the following MATLAB function.
function dxdt = dynamics_adaptive(t, x, um, ym, time_um,am,bm,a,b,gamma)
um_interp = interp1(time_um, um, t); % Interpolate the data set (time_um, um) at time t
ym_interp = interp1(time_um, ym, t); % Interpolate the data set (time_um, ym) at time t
dxdt(1,1)=x(2);
dxdt(2,1)=-am*x(2)-gamma*am*x(5)*um_interp+gamma*am*ym_interp*um_interp;
dxdt(3,1)=x(4);
dxdt(4,1)=-am*x(4)+gamma*am*x(5)^2-gamma*am*x(5)*ym_interp;
dxdt(5,1)=-a*x(5)+b*x(1)*um_interp-b*x(3)*x(5);
This function takes as input the current time “t”, current state “x”, reference model control input vector “um”, reference model output vector “ym”, time vector “time_um”, parameters “am”,”bm”,”a”,”b”, and the control gain “gamma”. This function is a MATLAB implementation of the closed-loop system dynamics given by (23).
We use an interpolation approach for dealing with time-varying inputs “um” and “ym” in ode45 function simulations. More details about this approach can be found in this and in this posts.
The state trajectories are simulated by executing these lines of code.
% initial condition
x0=zeros(5,1);
% gain
gamma2=0.5;
[time1 state_trajectories] = ode45(@(t,x) dynamics_adaptive(t, x, ur, yr, time,am,bm,a,b,gamma2), time, x0);
theta1=state_trajectories(:,1)
theta2=state_trajectories(:,3)
y=state_trajectories(:,5)
figure(3)
plot(time,yr,'k')
hold on
plot(time,y,'m')
figure(4)
plot(time,theta1,'k')
hold on
plot(time,theta2,'m')
We simulate the system response for three values of the controller gain , , and .
The reference model output and the controlled output of the plant are shown in Fig. 2 for .
The convergence trajectories of the parameters and are shown in Fig. 3 for .
The reference model output and the controlled output of the plant are shown in Fig. 4 for .
The convergence trajectories of the parameters and are shown in Fig. 5 for .
The reference model output and the controlled output of the plant are shown in Fig. 6 for .
The convergence trajectories of the parameters and are shown in Fig. 7 for .
From figures 2-7, we can conclude the following
- The convergence of the parameters increases for larger values of . For the parameters and approximately converge in 20 seconds to the values and . This shows that the controller works well and that the equations are properly implemented.
- After the initial transients difference between the controlled output and the reference model output decreases as we increase . However, for larger values of the control transients start to exhibit oscillatory behavior. This can be seen during the first 10 seconds in Figs. 4 and 6. Increased overshoot is the price we need to pay for achieving a better tracking performance.