首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 718 毫秒
1.
This paper considers a trilayer Stackelberg game problem for nonlinear system with three players. A novel performance function is defined for each player, which depends on the coupling relationships with the other two players. The coupled Hamilton–Jacobi–Bellman (HJB) equations are built from the performance functions, and the optimal control polices of three players are obtained based on the Bellman’s principle of optimality. Because of the nonlinearity and coupling characteristics, a policy iteration (PI) algorithm with a three-layer decision-making framework is developed to online learn the coupled HJB equations. In order to implement the algorithm, we construct a critic-action neural network (NN) structure and design a NN approximation-based iteration algorithm. Finally, a simulation example is presented to verify the effectiveness of the proposed method.  相似文献   

2.
The purpose of this paper is to present an iterative algorithm for solving the general discrete-time periodic Sylvester matrix equations. It is proved by theoretical analysis that this algorithm can get the exact solutions of the periodic Sylvester matrix equations in a finite number of steps in the absence of round-off errors. Furthermore, when the discrete-time periodic Sylvester matrix equations are consistent, we can obtain its unique minimal Frobenius norm solution by choosing appropriate initial periodic matrices. Finally, we use some numerical examples to illustrate the effectiveness of the proposed algorithm.  相似文献   

3.
By means of the real linear operator, we establish an iterative algorithm for solving a class of complex generalized coupled Sylvester matrix equations. The finite termination of the proposed algorithm is proved. By representing a complex matrix as a larger real matrix, we present a new method to prove that the minimum-norm solution or minimum-norm least squares solution of the complex generalized coupled Sylvester matrix equations can be obtained by an appropriate selection for the initial matrices, which has not been found in the existing work. Numerical experiments on some randomly generated data and practical image restoration problem show that the proposed algorithm is feasible and effective.  相似文献   

4.
This paper focuses on the numerical solution of a class of generalized coupled Sylvester-conjugate matrix equations, which are general and contain many significance matrix equations as special cases, such as coupled discrete-time/continuous-time Markovian jump Lyapunov matrix equations, stochastic Lyapunov matrix equation, etc. By introducing the modular operator, a cyclic gradient based iterative (CGI) algorithm is provided. Different from some previous iterative algorithms, the most significant improvement of the proposed algorithm is that less information is used during each iteration update, which is conducive to saving memory and improving efficiency. The convergence of the proposed algorithm is discussed, and it is verified that the algorithm converges for any initial matrices under certain assumptions. Finally, the effectiveness and superiority of the proposed algorithm are verified with some numerical examples.  相似文献   

5.
Optimal parametrization in numerical construction of curve   总被引:1,自引:0,他引:1  
The application of the optimal parametric continuation method to constructing a solution set curve for a system of nonlinear algebraic or transcendental equations depending on a parameter is considered. There are discussed two approaches to solving this problem—the use of iterative methods and reduction to an initial value problem for a system of ordinary differential equations. The algorithm suggested in this paper can also be used for finding an appropriate initial approximation when solving a system of nonlinear algebraic or transcendental equations not depending on a parameter by an iterative method.  相似文献   

6.
In this paper, the linear quadratic (LQ) optimal decentralized control and stabilization problems are investigated for multi-sensors networked control systems (MSNCSs) with multiple controllers of different information structure. Specifically, for a MSNCS, in view of the packet dropouts and the transmission delays, each controller may access different information sets. To begin with, the sufficient and necessary solvability conditions for the LQ decentralized control problems are developed. Consequently, for the purpose of deriving the optimal decentralized control strategy, an innovative orthogonal decomposition method is proposed to decouple the forward and backward stochastic difference equations (FBSDEs) from the maximum principle. In the following, we show that the optimal decentralized controller can be calculated according to a set of Riccati-type equations. Finally, a stabilizing controller is derived for the stabilization problem.  相似文献   

7.
In this study, a robust fractional-order controller design methodology for a type of fractional-order or integer-order model with dead time is proposed using phase and gain margin specifications. The delayed Bode’s ideal transfer function is used as a reference model to design the controller analytically. The delay term in delayed Bode’s ideal transfer function provides the exact determination of these frequency domain specifications when the system owns a dead time. The analytical robust controller design problem is transformed to solving four nonlinear equations with four unknown variables, two of which are the desired specifications; namely, phase and gain margins. The remaining two are the phase and gain cross-over frequencies. Next, some conditions are set based on the desired specifications so that nonlinear equations provide a unique solution. The proposed method is compared with the other existing robust controller methods based on the same frequency domain specifications. The simulation results reveal that the proposed method outperforms the other methods and also gives closer outcomes to the desired specifications.  相似文献   

8.
The paper studies the iterative solutions of the generalized coupled Sylvester transpose matrix equations over the reflexive (anti-reflexive) matrix group by the generalized conjugate direction algorithm. The convergence analysis shows that the solution group can be obtained within finite iterative steps in the absence of round-off errors for any initial given reflexive (anti-reflexive) matrix group. Furthermore, we can get the minimum-norm solution group by choosing special kinds of initial matrix group. Finally, some numerical examples are given to demonstrate the algorithm considered is quite effective in actual computation.  相似文献   

9.
In this paper, a novel iterative approximate dynamic programming scheme is proposed by introducing the learning mechanism of value iteration (VI) to solve the constrained optimal control problem for CT affine nonlinear systems with utilizing only one neural network. The idea is to show the feasibility of introducing the VI learning mechanism to solve for the constrained optimal control problem from a theoretical point of view, and thus the initial admissible control can be avoided compared with most existing works based on policy iteration (PI). Meanwhile, the initial condition of the proposed VI based method can be more general than the traditional VI method which requires the initial value function to be a zero function. A general analytical method is proposed to demonstrate the convergence property. To simplify the architecture, only one critic neural network is adopted to approximate the iterative value function while implementing the proposed method. At last, two simulation examples are proposed to validate the theoretical results.  相似文献   

10.
At present, gradient iteration methods have been used to solve various Sylvester matrix equations and proved effective. Based on this method, we generalize the factor gradient iterative method (FGI) for solving forward periodic Sylvester matrix equations (FPSME) and backward periodic Sylvester matrix equations (BPSME). To accelerate the convergence of the iterative method, we refer to Gauss-Seidel and Jacobi iterative construction ideas and use the latest matrix information in the FGI iterative method to obtain the modified factor gradient iterative (MFGI) method. Then, the convergence of the proposed methods and the selection of optimal factors are proved. The last numerical examples illustrate the effectiveness and applicability of the iterative methods.  相似文献   

11.
Riccati differential equations are a class of first-order quadratic ordinary differential equations and have various applications in systems and control theory. In this study, we analyzed a switched Riccati differential equation driven by a Poisson-like stochastic signal. We specifically focused on computing the mean escape time of the switched Riccati differential equation. The contribution of this study is twofold. We first show that, under the assumption that the subsystems described as deterministic Riccati differential equations escape in finite time regardless of their initial state, the mean escape time of the switched Riccati differential equation admits a power series expression. To further expand the applicability of this result, we then present an approximate formula to compute the escape time of deterministic Riccati differential equations. Numerical simulations were performed to illustrate the obtained results.  相似文献   

12.
An Impact Angle, Speed and Acceleration Control Guidance (IASAG) law against the stationary target is proposed, which is critical for the effectiveness of the air-to-surface guided weapons. It is hard to address multiple terminal constraints problem for unpowered missile, especially including terminal speed constraint, which is uncontrollable state. Based on Line-of-Sight (LOS) angle, a fourth-order polynomial function is designed to make the number of coefficients of the function equal to number of boundary conditions. Through analytic calculation and transformation, the relation between the specified boundary conditions and the coefficients are established. The coefficient equations are reduced to a univariate nonlinear equation whose solution is determined by terminal speed constraint. Based on the characteristic of the nonlinear equation, we propose a Particle Swarm Optimization(PSO) method to find the coefficient that satisfies terminal speed constraint. According to Lyapunov stability theory, an asymptotically stable trajectory tracking controller is designed to track the reference leading angle with respect to range-to-go to guarantee the impact angle, speed and acceleration constraints. The effectiveness of the proposed guidance law is verified through numerical simulations.  相似文献   

13.
There are many hybrid stochastic differential equations (SDEs) in the real-world that don’t satisfy the linear growth condition (namely, SDEs are highly nonlinear), but they have highly nonlinear characteristics. Based on some existing results, the main difficulties here are to deal with those equations if they are driven by Lévy noise and delay terms, then to investigate their stability in this case. The present paper aims to show how to stabilize a given unstable nonlinear hybrid SDEs with Lévy noise by designing delay feedback controls in the both drift and diffusion parts of the given SDEs. The controllers are based on discrete-time state observations which are more realistic and make the cost less in practice. By using the Lyapunov functional method under a set of appropriate assumptions, stability results of the controlled hybrid SDEs are discussed in the sense of pth moment asymptotic stability and exponential stability. As an application, an illustrative example is provided to show the feasibility of our theorem. The results obtained in this paper can be considered as an extension of some conclusions in the stabilization theory.  相似文献   

14.
This paper deals with the privacy-preserving average consensus problem for continuous-time multi-agent network systems (MANSs) based on the event-triggered strategy. A novel event-triggered privacy-preserving consensus algorithm is designed to achieve the average consensus of MANSs while avoiding the disclosure of the agents’ initial states. Different from the approaches incorporating stochastic noises, an output mask function in the proposed algorithm is developed to make initial state of each agent indiscernible by the others. Particularly, under the output mask function, all agents can exactly tend to the average value of initial states rather than the mean square value. Under the proposed algorithm, detailed theoretical proof about average consensus and privacy of the MANSs are conducted. Moreover, the proposed algorithm is extended to nonlinear continuous-time MANSs, and the corresponding results are also derived. A numerical simulation eventually is performed to demonstrate the validity of our results.  相似文献   

15.
Nonlinear characteristic widely exists in industrial processes. Many approaches based on kernel methods and machine learning have been developed for nonlinear process monitoring. However, the fault isolation for nonlinear processes has rarely been studied in previous works. In this paper, a process monitoring and fault isolation framework is proposed for nonlinear processes using variational autoencoder (VAE) model. First, based on the probability graph model of VAE, a uniform monitoring index can be calculated by the probability density of observation variables. Then, the fault variables are estimated with normal variables by a missing value estimation method. The optimal fault variable set can be searched by branch and bound (BAB) algorithm. The proposed method can resolve the ”smearing effects” problem existing in traditional fault isolation methods. Finally, a numerical case and a hot strip mill process case are used to verified the proposed method.  相似文献   

16.
This study focuses on the research of the globally asymptotic tracking problem of unknown nonlinear reaction-diffusion equations with time-varying coefficients and uncertain external disturbance. Firstly, fuzzy logic systems and adaptive bounding technique are used to deal with nonlinear reaction-diffusion equations with time-varying coefficients and uncertain external disturbance. Secondly, a novel global state feedback adaptive fuzzy control algorithm is proposed to make the nonlinear reaction-diffusion equations track the target systems globally and asymptotically. In addition, the globally asymptotic tracking condition can be obtained, which overcomes the semi-global results in the existing literatures. Finally, three simulation examples are given to illustrate the feasibility and effectiveness of the proposed control protocols.  相似文献   

17.
《Journal of The Franklin Institute》2023,360(14):10564-10581
In this work, we investigate consensus issues of discrete-time (DT) multi-agent systems (MASs) with completely unknown dynamic by using reinforcement learning (RL) technique. Different from policy iteration (PI) based algorithms that require admissible initial control policies, this work proposes a value iteration (VI) based model-free algorithm for consensus of DTMASs with optimal performance and no requirement of admissible initial control policy. Firstly, in order to utilize RL method, the consensus problem is modeled as an optimal control problem of tracking error system for each agent. Then, we introduce a VI algorithm for consensus of DTMASs and give a novel convergence analysis for this algorithm, which does not require admissible initial control input. To implement the proposed VI algorithm to achieve consensus of DTMASs without information of dynamics, we construct actor-critic networks to online estimate the value functions and optimal control inputs in real time. At last, we give some simulation results to show the validity of the proposed algorithm.  相似文献   

18.
Within the framework of numerical modelling and multi-objective control of partial differential equations, in this work we deal with the problem of determining the optimal location of a new industrial plant. We take into account both economic and ecological objectives, and we look not only for the optimal location of the plant but also for the optimal management of its emissions rate. In order to do this, we introduce a mathematical model (a system of nonlinear parabolic partial differential equations) for the numerical simulation of air pollution. Based on this model, we formulate the problem in the field of multi-objective optimal control from a cooperative viewpoint, recalling the standard concept of Pareto-optimal solution, and pointing out the usefulness of Pareto-optimal frontier in the decision making process. Finally, a numerical algorithm – based on a characteristics/Galerkin discretization of the adjoint model – is proposed, and some numerical results for a hypothetical situation in the region of Galicia (NW Spain) are presented.  相似文献   

19.
This paper proposes a privacy-preserving consensus algorithm which enables all the agents in the directed network to eventually reach the weighted average of initial states, and while preserving the privacy of the initial state of each agent. A novel privacy-preserving scheme is proposed in our consensus algorithm where initial states are hidden in random values. We also develop detailed analysis based on our algorithm, including its convergence property and the topology condition of privacy leakages for each agent. It can be observed that final consensus point is independent of their initial values that can be arbitrary random values. Besides, when an eavesdropper exists and can intercept the data transmitted on the edges, we introduce an index to measure the privacy leakage degree of agents, and then analyze the degree of privacy leakage for each agent. Similarly, the degree for network privacy leakage is derived. Subsequently, we establish an optimization problem to find the optimal attacking strategy, and present a heuristic optimization algorithm based on the Sequential Least Squares Programming (SLSQP) to solve the proposed optimization problem. Finally, numerical experiments are designed to demonstrate the effectiveness of our algorithm.  相似文献   

20.
In this paper, we considered a time-optimal control problem for a new type of linear parameter varying (LPV) system which is obtained through data identification in the process of dealing with actual problems. The addition of non-linear terms is compensation for the method that does not require linear expansion at the equilibrium point. Since the objective function is the terminal time which is an implicit function concerning decision variables, it is a non-standard optimal control problem with uncertain terminal time. To find the global optimal solution to this problem, firstly, the control parameterization method is used to transform it into a nonlinear optimization problem of parameter selection, and then the modifed particle swarm optimization (PSO) algorithm is combined to solve the equivalent nonlinear programming problem. Numerical examples are used to illustrate the effectiveness of the proposed algorithm.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号