期刊文献+
共找到85篇文章
< 1 2 5 >
每页显示 20 50 100
Value Iteration-Based Cooperative Adaptive Optimal Control for Multi-Player Differential Games With Incomplete Information
1
作者 Yun Zhang Lulu Zhang Yunze Cai 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第3期690-697,共8页
This paper presents a novel cooperative value iteration(VI)-based adaptive dynamic programming method for multi-player differential game models with a convergence proof.The players are divided into two groups in the l... This paper presents a novel cooperative value iteration(VI)-based adaptive dynamic programming method for multi-player differential game models with a convergence proof.The players are divided into two groups in the learning process and adapt their policies sequentially.Our method removes the dependence of admissible initial policies,which is one of the main drawbacks of the PI-based frameworks.Furthermore,this algorithm enables the players to adapt their control policies without full knowledge of others’ system parameters or control laws.The efficacy of our method is illustrated by three examples. 展开更多
关键词 Adaptive dynamic programming incomplete information multi-player differential game value iteration
在线阅读 下载PDF
NUMERICAL METHOD BASED ON HAMILTON SYSTEM AND SYMPLECTIC ALGORITHM TO DIFFERENTIAL GAMES
2
作者 徐自祥 周德云 邓子辰 《Applied Mathematics and Mechanics(English Edition)》 SCIE EI 2006年第3期341-346,共6页
The resolution of differential games often concerns the difficult problem of two points border value (TPBV), then ascribe linear quadratic differential game to Hamilton system. To Hamilton system, the algorithm of s... The resolution of differential games often concerns the difficult problem of two points border value (TPBV), then ascribe linear quadratic differential game to Hamilton system. To Hamilton system, the algorithm of symplectic geometry has the merits of being able to copy the dynamic structure of Hamilton system and keep the measure of phase plane. From the viewpoint of Hamilton system, the symplectic characters of linear quadratic differential game were probed; as a try, Symplectic-Runge-Kutta algorithm was presented for the resolution of infinite horizon linear quadratic differential game. An example of numerical calculation was given, and the result can illuminate the feasibility of this method. At the same time, it embodies the fine conservation characteristics of symplectic algorithm to system energy. 展开更多
关键词 differential game Hamilton system algorithm of symplectic geometry linear quadratic
在线阅读 下载PDF
Open Loop Saddle Point on Linear Quadratic Stochastic Differential Games
3
作者 WANG JUN 《Communications in Mathematical Research》 CSCD 2014年第1期11-22,共12页
In this paper, we deal with one kind of two-player zero-sum linear quadratic stochastic differential game problem. We give the existence of an open loop saddle point if and only if the lower and upper values exist.
关键词 stochastic differential game saddle point open loop strategy
在线阅读 下载PDF
Stochastic Differential Games of Mean-Field Dynamics and Second-Order Bellman–Isaacs Equations on the Wasserstein Space
4
作者 Tao Hao Jie Xiong 《Acta Mathematica Sinica,English Series》 2025年第3期873-907,共35页
This paper concerns two-player zero-sum stochastic differential games with nonanticipative strategies against closed-loop controls in the case where the coefficients of mean-field stochastic differential equations and... This paper concerns two-player zero-sum stochastic differential games with nonanticipative strategies against closed-loop controls in the case where the coefficients of mean-field stochastic differential equations and cost functional depend on the joint distribution of the state and the control.In our game,both the(lower and upper)value functions and the(lower and upper)second-order Bellman–Isaacs equations are defined on the Wasserstein space P_(2)(R^(n))which is an infinite dimensional space.The dynamic programming principle for the value functions is proved.If the(upper and lower)value functions are smooth enough,we show that they are the classical solutions to the second-order Bellman–Isaacs equations.On the other hand,the classical solutions to the(upper and lower)Bellman–Isaacs equations are unique and coincide with the(upper and lower)value functions.As an illustrative application,the linear quadratic case is considered.Under the Isaacs condition,the explicit expressions of optimal closed-loop controls for both players are given.Finally,we introduce the intrinsic notion of viscosity solution of our second-order Bellman–Isaacs equations,and characterize the(upper and lower)value functions as their viscosity solutions. 展开更多
关键词 Stochastic differential game Wasserstein space viscosity solution dynamic programming principle mean-field stochastic differential equation Riccati equation
原文传递
Constructive Approximate Nash Equilibrium for In-Orbit Target Enclosing with Collision Avoidance and Full-state Constraint via Nonzero-Sum Differential Games
5
作者 Bosong Wei Xiaokui Yue +1 位作者 Zhiwei Hao Zongcheng Liu 《Guidance, Navigation and Control》 2024年第3期166-190,共25页
The problem of in-orbit cooperative target enclosing involving N thrust-limited satellites under collision avoidance and maneuver amplitude constraints is studied. In order to find global optimal trajectories for targ... The problem of in-orbit cooperative target enclosing involving N thrust-limited satellites under collision avoidance and maneuver amplitude constraints is studied. In order to find global optimal trajectories for target enclosing task with all constraints above, by integrating the collision threat and maneuver boundaries into a novel nonlinear cost functional, the studied target enclosing problem is described as a nonlinear nonzero-sum differential game. Further, to avoid iterative calculations caused by traditional nonlinear-game-solving methods, an approximate solution which can be constructed directly is designed. Then the approximate Nash equilibrium strategies can be educed by the constructive approximate solution for the proposed nonzero-sum game. Analysis shows that the proposed control strategies can asymptotically approach the exact one and can ensure a zero-error tracking of the enclosing configuration.Simulation results illustrate lower time costs and better enclosing accuracy while the collision avoidance and maneuver amplitude constraints are satisfied simultaneously. 展开更多
关键词 Approximate Nash equilibrium differential game invariant manifold asymptotic stability target enclosing
原文传递
Two-to-one differential game via improved MOGWO
6
作者 BAI Yu ZHOU Di +2 位作者 ZHANG Bolun HE Zhen HE Ping 《Journal of Systems Engineering and Electronics》 2025年第1期233-255,共23页
When the maneuverability of a pursuer is not significantly higher than that of an evader,it will be difficult to intercept the evader with only one pursuer.Therefore,this article adopts a two-to-one differential game ... When the maneuverability of a pursuer is not significantly higher than that of an evader,it will be difficult to intercept the evader with only one pursuer.Therefore,this article adopts a two-to-one differential game strategy,the game of kind is generally considered to be angle-optimized,which allows unlimited turns,but these practices do not take into account the effect of acceleration,which does not correspond to the actual situation,thus,based on the angle-optimized,the acceleration optimization and the acceleration upper bound constraint are added into the game for consideration.A two-to-one differential game problem is proposed in the three-dimensional space,and an improved multi-objective grey wolf optimization(IMOGWO)algorithm is proposed to solve the optimal game point of this problem.With the equations that describe the relative motions between the pursuers and the evader in the three-dimensional space,a multi-objective function with constraints is given as the performance index to design an optimal strategy for the differential game.Then the optimal game point is solved by using the IMOGWO algorithm.It is proved based on Markov chains that with the IMOGWO,the Pareto solution set is the solution of the differential game.Finally,it is verified through simulations that the pursuers can capture the escapee,and via comparative experiments,it is shown that the IMOGWO algorithm performs well in terms of running time and memory usage. 展开更多
关键词 differential game improved multi-objective grey wolf optimization(IMOGWO) cooperative pursuit optimal game point
在线阅读 下载PDF
Cooperative differential games guidance laws for multiple attackers against an active defense target 被引量:11
7
作者 Fei LIU Xiwang DONG +1 位作者 Qingdong LI Zhang REN 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2022年第5期374-389,共16页
This paper is concerned with a scenario of multiple attackers trying to intercept a target with active defense.Three types of agents are considered in the guidance:The multiple attackers,the target and the defender,wh... This paper is concerned with a scenario of multiple attackers trying to intercept a target with active defense.Three types of agents are considered in the guidance:The multiple attackers,the target and the defender,where the attackers aim to pursuit the target from different directions and evade from the defender simultaneously.The guidance engagement is formulated in the framework of a zero-sum two-person differential game between the two opposing teams,such that the measurements on the maneuver of the target or estimations on the defending strategy of the defender can be absent.Cooperation of the attackers resides in two aspects:redundant interception under the threat of the defender and the relative intercept geometry with the target.The miss distances,the relative intercept angle errors and the costs of the agents are combined into a single performance index of the game.Such formulation enables a unitary approach to the design of guidance laws for the agents.To minimize the control efforts and miss distances for the attackers,an optimization method is proposed to find the best anticipated miss distances to the defender under the constraint that the defender is endowed with a capture radius.Numerical simulations with two cases are conducted to illustrate the effectiveness of the proposed cooperative guidance law. 展开更多
关键词 Active defense Cooperative guidance differential games Minimum effort optimization Relative intercept angle
原文传递
Satellite proximate interception vector guidance based on differential games 被引量:10
8
作者 Dong YE Mingming SHI Zhaowei SUN 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 2018年第6期1352-1361,共10页
This paper studies the proximate satellite interception guidance strategies where both the interceptor and target can perform orbital maneuvers with magnitude limited thrusts. This problem is regarded as a pursuit-eva... This paper studies the proximate satellite interception guidance strategies where both the interceptor and target can perform orbital maneuvers with magnitude limited thrusts. This problem is regarded as a pursuit-evasion game since satellites in both sides will try their best to capture or escape. In this game, the distance of these two players is small enough so that the highly nonlinear earth-centered gravitational dynamics can be reduced to the linear Clohessy-Wiltshire(CW) equations. The system is then simplified by introducing the zero effort miss variables. Saddle solution is formulated for the pursuit-evasion game and time-to-go is estimated similarly as that for the exoatmospheric interception. Then a vector guidance is derived to ensure that the interception can be achieved in the optimal time. The proposed guidance law is validated by numerical simulations. 展开更多
关键词 differential games Saddle solution Satellite interception Time-to-go estimation Zero effort miss trajectory
原文传递
Infinite horizon linear quadratic differential games for discrete-time stochastic systems 被引量:2
9
作者 Huiying SUN Liuyang JIANG Weihai ZHANG 《控制理论与应用(英文版)》 EI 2012年第3期391-396,共6页
This paper deals with the infinite horizon linear quadratic (LQ) differential games for discrete-time stochas- tic systems with both state and control dependent noise. The Popov-Belevitch-Hautus (PBH) criteria for... This paper deals with the infinite horizon linear quadratic (LQ) differential games for discrete-time stochas- tic systems with both state and control dependent noise. The Popov-Belevitch-Hautus (PBH) criteria for exact observability and exact detectability of discrete-time stochastic systems are presented. By means of them, we give the optimal strategies (Nash equilibrium strategies) and the optimal cost values for infinite horizon stochastic differential games. It indicates that the infinite horizon LQ stochastic differential gaines are associated with four coupled matrix-valued equations. Further- more, an iterative algorithm is proposed to solve the four coupled equations. Finally, an example is given to demonstrate our results. 展开更多
关键词 Discrete-time stochastic systems Exact observability Exact detectability differential games Nash equi-librium
原文传递
Infnite time horizon nonzero-sum linear quadratic stochastic differential games with state and control-dependent noise 被引量:2
10
作者 Huainian ZHU Chengke ZHANG 《控制理论与应用(英文版)》 EI CSCD 2013年第4期629-633,共5页
This paper discusses the infinite time horizon nonzero-sum linear quadratic (LQ) differential games of stochastic systems governed by Itoe's equation with state and control-dependent noise. First, the nonzero-sum L... This paper discusses the infinite time horizon nonzero-sum linear quadratic (LQ) differential games of stochastic systems governed by Itoe's equation with state and control-dependent noise. First, the nonzero-sum LQ differential games are formulated by applying the results of stochastic LQ problems. Second, under the assumption of mean-square stabilizability of stochastic systems, necessary and sufficient conditions for the existence of the Nash strategy are presented by means of four coupled stochastic algebraic Riccati equations. Moreover, in order to demonstrate the usefulness of the obtained results, the stochastic H-two/H-infinity control with state, control and external disturbance-dependent noise is discussed as an immediate application. 展开更多
关键词 Stochastic system differential games Stochastic H-two/H-infinity control
原文传递
A BSDE Approach to Stochastic Differential Games Involving Impulse Controls and HJBI Equation 被引量:1
11
作者 ZHANG Liangquan 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2022年第3期766-801,共36页
This paper focuses on zero-sum stochastic differential games in the framework of forwardbackward stochastic differential equations on a finite time horizon with both players adopting impulse controls.By means of BSDE ... This paper focuses on zero-sum stochastic differential games in the framework of forwardbackward stochastic differential equations on a finite time horizon with both players adopting impulse controls.By means of BSDE methods,in particular that of the notion from Peng’s stochastic backward semigroups,the authors prove a dynamic programming principle for both the upper and the lower value functions of the game.The upper and the lower value functions are then shown to be the unique viscosity solutions of the Hamilton-Jacobi-Bellman-Isaacs equations with a double-obstacle.As a consequence,the uniqueness implies that the upper and lower value functions coincide and the game admits a value. 展开更多
关键词 Dynamic programming principle(DPP) forward-backward stochastic differential equations(FBSDEs) Hamilton-Jacobi-Bellman-Isaacs(HJBI) impulse control stochastic differential games value function viscosity solution
原文传递
Nash and Stackelberg Differential Games 被引量:1
12
作者 Alain BENSOUSSAN Jens FREHSE Jens VOGELGESANG 《Chinese Annals of Mathematics,Series B》 SCIE CSCD 2012年第3期317-332,共16页
A large class of stochastic differential games for several players is considered in this paper.The class includes Nash differential games as well as Stackelberg differential games.A mix is possible.The existence of fe... A large class of stochastic differential games for several players is considered in this paper.The class includes Nash differential games as well as Stackelberg differential games.A mix is possible.The existence of feedback strategies under general conditions is proved.The limitations concern the functionals in which the state and the controls appear separately.This is also true for the state equations.The controls appear in a quadratic form for the payoff and linearly in the state equation.The most serious restriction is the dimension of the state equation,which cannot exceed 2.The reason comes from PDE(partial differential equations) techniques used in studying the system of Bellman equations obtained by Dynamic Programming arguments.In the authors' previous work in 2002,there is not such a restriction,but there are serious restrictions on the structure of the Hamiltonians,which are violated in the applications dealt with in this article. 展开更多
关键词 Stochastic games Bellman equation Nonlinear elliptic and parabolicequations Stochastic differential games HAMILTONIANS
原文传递
Stochastic Differential Games with Reflection and Related Obstacle Problems for Isaacs Equations
13
作者 Rainer BUCKDAHN 《Acta Mathematicae Applicatae Sinica》 SCIE CSCD 2011年第4期647-678,共32页
In this paper we first investigate zero-sum two-player stochastic differential games with reflection, with the help of theory of Reflected Backward Stochastic Differential Equations (RBSDEs). We will establish the d... In this paper we first investigate zero-sum two-player stochastic differential games with reflection, with the help of theory of Reflected Backward Stochastic Differential Equations (RBSDEs). We will establish the dynamic programming principle for the upper and the lower value functions of this kind of stochastic differential games with reflection in a straightforward way. Then the upper and the lower value functions are proved to be the unique viscosity solutions to the associated upper and the lower Hamilton-Jacobi-Bettman-Isaacs equations with obstacles, respectively. The method differs significantly from those used for control problems with reflection, with new techniques developed of interest on its own. Further, we also prove a new estimate for RBSDEs being sharper than that in the paper of E1 Karoui, Kapoudjian, Pardoux, Peng and Quenez (1997), which turns out to be very useful because it allows us to estimate the LP-distance of the solutions of two different RBSDEs by the p-th power of the distance of the initial values of the driving forward equations. We also show that the unique viscosity solution to the approximating Isaacs equation constructed by the penalization method converges to the viscosity solution of the Isaacs equation with obstacle. 展开更多
关键词 stochastic differential games value function reflected backward stochastic differential equations dynamic programming principle Isaacs equations with obstacles viscosity solution
原文传递
Policy Iteration Algorithms for Zero-Sum Stochastic Differential Games with Long-Run Average Payoff Criteria
14
作者 JoséDaniel López-Barrientos 《Journal of the Operations Research Society of China》 EI 2014年第4期395-421,共27页
This paper studies the policy iteration algorithm(PIA)for zero-sum stochastic differential games with the basic long-run average criterion,as well as with its more selective version,the so-called bias criterion.The sy... This paper studies the policy iteration algorithm(PIA)for zero-sum stochastic differential games with the basic long-run average criterion,as well as with its more selective version,the so-called bias criterion.The system is assumed to be a nondegenerate diffusion.We use Lyapunov-like stability conditions that ensure the existence and boundedness of the solution to certain Poisson equation.We also ensure the convergence of a sequence of such solutions,of the corresponding sequence of policies,and,ultimately,of the PIA. 展开更多
关键词 Ergodic payoff criterion Zero-sum stochastic differential games Policy iteration algorithm Nondegenerate diffusions Poisson equation Schäl convergence Bias game
原文传递
FORWARD-BACKWARD STOCHASTIC DIFFERENTIAL EQUATIONS, LINEAR QUADRATIC STOCHASTIC OPTIMAL CONTROL AND NONZERO SUM DIFFERENTIAL GAMES 被引量:13
15
作者 WUZhen 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2005年第2期179-192,共14页
In this paper, we use the solutions of forward-backward stochastic differential equations to get the explicit form of the optimal control for linear quadratic stochastic optimal control problem and the open-loop Nash ... In this paper, we use the solutions of forward-backward stochastic differential equations to get the explicit form of the optimal control for linear quadratic stochastic optimal control problem and the open-loop Nash equilibrium point for nonzero sum differential games problem. We also discuss the solvability of the generalized Riccati equation system and give the linear feedback regulator for the optimal control problem using the solution of this kind of Riccati equation system. 展开更多
关键词 stochastic differential equations stochastic optimal control riccatiequation nonzero sum stochastic differential game
原文传递
LINEAR QUADRATIC NONZERO-SUM DIFFERENTIAL GAMES WITH RANDOM JUMPS 被引量:5
16
作者 WU Zhen(吴臻) YU Zhi-yong(于志勇) 《Applied Mathematics and Mechanics(English Edition)》 SCIE EI 2005年第8期1034-1039,共6页
The existence and uniqueness of the solutions for one kind of forward- backward stochastic differential equations with Brownian motion and Poisson process as the noise source were given under the monotone conditions. ... The existence and uniqueness of the solutions for one kind of forward- backward stochastic differential equations with Brownian motion and Poisson process as the noise source were given under the monotone conditions. Then these results were applied to nonzero-sum differential games with random jumps to get the explicit form of the open-loop Nash equilibrium point by the solution of the forward-backward stochastic differential equations. 展开更多
关键词 stochastic differential equation Poisson process stochastic differential game
在线阅读 下载PDF
A STACKELBERG DIFFERENTIAL GAMES SOLUTION TO A PROBLEM OF OPTIMAL INTERTEMPORAL INVESTMENT AND TAX RATE DESIGN 被引量:2
17
作者 ZHANGRong LIUXing 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2004年第2期253-261,共9页
Using the Stackelberg differential games(SDG) theory,we quantitatively study a problem of optimal intertemporal investment and tax rate design.Under some appropriate assumptions,the open-loop Stackelberg equilibrium s... Using the Stackelberg differential games(SDG) theory,we quantitatively study a problem of optimal intertemporal investment and tax rate design.Under some appropriate assumptions,the open-loop Stackelberg equilibrium solutions are obtained.Equilibrium solutions show that:1.The optimal strategies derived from differential game and unilateral optimal control approaches are different;2.It is not always the best strategy for the government to use a constant tax rate over the whole time period;3.The admissible size of tax rate adjustment may have great effect on the government's optimal strategy;4.SDG approach has no significant effect on the firm's optimal investment strategy. 展开更多
关键词 stackelberg differential game tax rate design optimal investment
原文传递
Pareto Efficiency of Finite Horizon Switched Linear Quadratic Differential Games 被引量:3
18
作者 HUANG Yabing ZHAO Jun 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2018年第1期173-187,共15页
A switched linear quadratic(LQ) differential game over finite-horizon is investigated in this paper. The switching signal is regarded as a non-conventional player, afterwards the definition of Pareto efficiency is e... A switched linear quadratic(LQ) differential game over finite-horizon is investigated in this paper. The switching signal is regarded as a non-conventional player, afterwards the definition of Pareto efficiency is extended to dynamics switching situations to characterize the solutions of this multi-objective problem. Furthermore, the switched differential game is equivalently transformed into a family of parameterized single-objective optimal problems by introducing preference information and auxiliary variables. This transformation reduces the computing complexity such that the Pareto frontier of the switched LQ differential game can be constructed by dynamic programming. Finally, a numerical example is provided to illustrate the effectiveness. 展开更多
关键词 Dynamic optimization linear quadratic problems Pareto efficiency switched differential game
原文传递
Capturability of Pseudo Differential Games of Pursuit 被引量:1
19
作者 YAN Yangkui Department of Mathematics, Shanxi University, Taiyuan 030006, Shanxi, P.R.China 《Systems Science and Systems Engineering》 CSCD 1993年第4期354-356,共3页
This paper discusses the capturability with fixed time of a pseudo-linear differential game of pursuit. The pursuit set in which the pursuit will end once the initial state lies in this set is given by the method of i... This paper discusses the capturability with fixed time of a pseudo-linear differential game of pursuit. The pursuit set in which the pursuit will end once the initial state lies in this set is given by the method of integration of multi-valued function. The results obtained here solve an open problem of Pontrjagin’s on the linear differential game of pursuit. Meanwhile, the requirement of the convexity of the control set and other related Pontrjagin’s conditions are removed. 展开更多
关键词 pursuit differential game pseudo-linear differential game.
原文传递
Linear Quadratic Leader-Follower Stochastic Differential Games:Closed-Loop Solvability
20
作者 LI Zixuan SHI Jingtao 《Journal of Systems Science & Complexity》 SCIE EI CSCD 2023年第4期1373-1406,共34页
In this paper,a leader-follower stochastic differential game is studied for a linear stochastic differential equation with quadratic cost functionals.The coefficients in the state equation and the weighting matrices i... In this paper,a leader-follower stochastic differential game is studied for a linear stochastic differential equation with quadratic cost functionals.The coefficients in the state equation and the weighting matrices in the cost functionals are all deterministic.Closed-loop strategies are introduced,which require to be independent of initial states;and such a nature makes it very useful and convenient in applications.The follower first solves a stochastic linear quadratic optimal control problem,and his optimal closed-loop strategy is characterized by a Riccati equation,together with an adapted solution to a linear backward stochastic differential equation.Then the leader turns to solve a stochastic linear quadratic optimal control problem of a forward-backward stochastic differential equation,necessary conditions for the existence of the optimal closed-loop strategy for the leader is given by a Riccati equation.Some examples are also given. 展开更多
关键词 Backward stochastic differential equation closed-loop solvability leader-follower stochastic differential game linear quadratic control Riccati equation Stackelberg equilibrium
原文传递
上一页 1 2 5 下一页 到第
使用帮助 返回顶部