Revisiting Lossless Convexification: Theoretical Guarantees for Discrete-time Optimal Control Problems

Abstract

Lossless Convexification (LCvx) is a modeling approach that transforms a class of nonconvex optimal control problems, in which the nonconvexity arises primarily from control constraints, into convex problems through convex relaxation. After discretization, which converts the original infinite-dimensional problem into a finite-dimensional one, these convex problems can be solved with polynomial-time numerical methods. However, existing LCvx theory is limited to continuous-time optimal control problems, because the equivalence between the relaxed convex problem and the original nonconvex problem has been established only in continuous time. This paper extends LCvx to discrete-time optimal control problems by classifying them into normal and long-horizon cases. In the normal case, after an arbitrarily small perturbation of the system dynamics (the recursive equality constraints), applying the existing LCvx method to the discrete-time problem yields an optimal control that violates the original nonconvex constraints at no more than $n_x - 1$ temporal grid points, where $n_x$ is the state dimension. In the long-horizon case, the existing LCvx method fails; we resolve this by combining it with a bisection search that exploits the continuity of the value function of the relaxed convex problem, which gives results similar to those of the normal case. This paper strengthens the theoretical foundation of LCvx and extends its applicability to real-world discrete-time optimal control problems.
