Abstract

This paper presents a hierarchical nonlinear control algorithm for the real-time planning and control of cooperative locomotion of legged robots that collaboratively carry objects. An innovative network of reduced-order models subject to holonomic constraints, referred to as interconnected linear inverted pendulum (LIP) dynamics, is presented to study cooperative locomotion. The higher level of the proposed algorithm employs a supervisory controller, based on event-based model predictive control (MPC), to effectively compute the optimal reduced-order trajectories for the interconnected LIP dynamics. The lower level of the proposed algorithm employs distributed nonlinear controllers to reduce the gap between reduced- and full-order complex models of cooperative locomotion. In particular, the distributed controllers are developed based on quadratic programing (QP) and virtual constraints to impose the full-order dynamical models of each agent to asymptotically track the reduced-order trajectories while having feasible contact forces at the leg ends. The paper numerically investigates the effectiveness of the proposed control algorithm via full-order simulations of a team of collaborative quadrupedal robots, each with a total of 22 degrees-of-freedom. The paper finally investigates the robustness of the proposed control algorithm against uncertainties in the payload mass and changes in the ground height profile. Numerical studies show that the cooperative agents can transport unknown payloads whose masses are up to 57%, 97%, and 137% of a single agent's mass with a team of two, three, and four legged robots.

1 Introduction

Legged robots can form collaborative robot (corobot) teams that assist humans in labor-intensive tasks such as construction, manufacturing, and assembly. The evolution of legged robots that cooperatively manipulate/transport objects can be described by high-dimensional and inherently unstable complex systems. Although powerful computational approaches have allowed the deployment of distributed control algorithms for complex robot systems, state-of-the-art techniques are tailored to the control of multirobot systems (MRSs) (see, e.g., Refs. [1] and [2]) composed of collaborative robotic arms and multifingered hands [3], aerial vehicles [4,5], and ground vehicles [68], but not sophisticated legged machines that cooperatively transport objects.

The overarching goal of this paper is to develop a hierarchical computational algorithm to enable the real-time planning and control of cooperative locomotion for multi-agent legged robotic systems that carry objects. The higher level of the proposed algorithm employs a supervisory control, based on event-based model predictive control (MPC), to generate optimal trajectories for individual agents. In particular, the MPC is formulated for the optimal control of an interconnected network of holonomically constrained reduced-order systems, developed based on linear inverted pendulum (LIP) models, subject to having feasible individual ground reaction forces (GRFs). To reduce the gap between the network of reduced- and full-order complex models of cooperative locomotion, distributed nonlinear controllers, based on quadratic programing (QP) and virtual constraints, are implemented at the lower level of the proposed algorithm to impose the full-order dynamics of each agent to asymptotically track the optimal trajectories while keeping the GRFs at all contacting leg ends in the friction cone. It is shown that the proposed control approach can generate and robustly stabilize cooperative locomotion patterns for multi-agent quadrupedal robotic systems in the presence of model uncertainties arising from unknown payloads and ground height variations (see Fig. 1).

Fig. 1
Illustration of a team of two Vision 60 robots augmented with Kinova arms for cooperative locomotion. Full-order dynamical models of these robots are used for the numerical simulation.
Fig. 1
Illustration of a team of two Vision 60 robots augmented with Kinova arms for cooperative locomotion. Full-order dynamical models of these robots are used for the numerical simulation.
Close modal

1.1 Related Work, Motivation, and Challenges.

Gait planning for complex dynamical models of cooperative locomotion is a significant challenge arising from the hybrid nature of models, nonlinearities, high dimensionality, and strong interactions amongst the agents. Hybrid systems theory has provided powerful techniques for modeling and analyzing dynamic locomotion of single legged machines [918]. Advanced nonlinear control algorithms have been developed to address the hybrid nature of locomotion such as hybrid reduction [19], controlled symmetries [15], transverse linearization [16], and hybrid zero dynamics (HZD) [10,20]. The HZD approach considers a set of kinematic constraints, referred to as virtual constraints, to coordinate the links of the robots during locomotion. Virtual constraints are asymptotically imposed by the action of a feedback control law (e.g., input–output (I-O) linearization [21]) and have been validated for stable locomotion of bipedal robots [10,11,2225] and powered prosthetic legs [26,27]. The HZD approach formulates the gait planning problem as an offline nonlinear programing problem [24] which cannot address the real-time planning for cooperative locomotion.

Various powerful MPC-based approaches have been introduced for the real-time planning and robust control of solitary legged machines, including the LIP-based approach [2834], single rigid body dynamics approach [3537], nonlinear MPC [38], policy-regularized MPC [39], and QP-based whole-body control [40,41]. Quadrupedal robots that cooperatively transport an object can be described by a set of legged agents that are coupled to each other and the object via a set of holonomic constraints. The challenge is to develop real-time optimal control algorithms for such a complex and inherently unstable robotic system that control locomotion with many degrees-of-freedom (DOFs). Existing MPC approaches for legged robots are typically formulated as QPs to be solved every time sample—this makes the extension of these MPC-based techniques for composite mechanical systems arising from cooperative locomotion of quadrupedal robots computationally intensive. We would like to deploy innovative MPC techniques in the context of networked systems that can reduce the computational burden to allow real-time planning and coordination of sophisticated corobot teams. One approach to tackle this challenge is through the development of event-based MPC techniques [33,42], in which MPC problems are solved at particular time samples, referred to as events (e.g., beginning of each domain), rather than every time sample. Our previous work [33] presented a nonlinear control approach, based on event-based MPC, for the robust and stable locomotion of single quadrupedal robots. We would like to extend this framework for the motion control of cooperative locomotion.

1.2 Objectives and Contributions.

The objectives and contributions of this paper are as follows. We present an innovative and interconnected network of reduced-order models to address the real-time planning for cooperative locomotion of quadrupedal robots that carry an object. The proposed network of reduced-order models is nonlinear and developed based on the interconnection of LIP models subject to holonomic constraints. We then present a hierarchical nonlinear computational algorithm for the motion control of legged corobots (see Fig. 2). At the higher level of the control scheme, we present a supervisory predictive control algorithm, based on event-based MPC, to compute the optimal center of mass (COM) trajectories for the network of reduced-order models subject to the feasibility of the individual net GRFs. To address the nonlinearity of the interconnected reduced-order models and to formulate a convex optimal control problem, the event-based MPC is employed using the linearized dynamics at the beginning of each continuous-time domain (i.e., event samples). At the lower level of the control scheme, distributed nonlinear controllers, based on QP and virtual constraints, are developed to impose the full-order dynamical model of each agent to asymptotically track the prescribed optimal COM trajectories while keeping all individual GRFs at the contacting leg ends feasible. To demonstrate the effectiveness of the proposed control algorithm, a series of extensive and full-order numerical simulations is presented for cooperative locomotion of a team of two, three, and four advanced quadrupedal robots, Vision 60, augmented with Kinova arms, each with a total of 22DOFs (see Fig. 1). It is numerically shown that the proposed control scheme can generate and robustly stabilize locomotion patterns for a team of quadrupedal robots that carry different objects in the presence of model uncertainties in terms of payloads and unknown ground height variation. The numerical studies show that the agents can cooperatively transport unknown payloads whose masses are up to 57%, 97%, and 137% of a singles agent's mass with a team of two, three, and four legged corobots.

Fig. 2
Overview of the proposed hierarchical nonlinear control algorithm for cooperative locomotion of legged agents subject to holonomic constraints. The figure also illustrates the concept of the interconnected LIP dynamics.
Fig. 2
Overview of the proposed hierarchical nonlinear control algorithm for cooperative locomotion of legged agents subject to holonomic constraints. The figure also illustrates the concept of the interconnected LIP dynamics.
Close modal

The approach of this paper completely differs from Ref. [43] in that this paper presents an innovative network of reduced-order models together with a supervisory MPC for the real-time planning of cooperative locomotion, whereas [43] only studied the stabilization of preplanned and offline trajectories. In addition, the approach of Ref. [43] studied the locomotion of two agents via the hybrid systems formulation that cannot be easily extended to the locomotion of multiple agents. We finally remark that the objective of this paper is to study the development of hierarchical control algorithms for stable and robust cooperative locomotion subject to the holonomic constraints that can arise from cooperative transportation problems. Hence, the main focus of the paper is on cooperative locomotion, and robotic manipulators are being used toward this goal.

2 Interconnected Linear Inverted Pendulum Dynamics

The objective of this section is to derive an interconnected network of reduced-order models for the cooperative locomotion of N2 legged corobots that carry an object. The reduced-order network, referred to as the interconnected LIP dynamics, will be utilized for the real-time trajectory planning in Sec. 3. Here, we consider an open path graph for the network of LIP dynamics with N vertices and N – 1 edges. In particular, all inner vertices have degree 2 except the end vertices 1 and N that have degree 1 (see Fig. 2). The vertices of the graph represent the agents, and the edges represent interconnection between agents. In our notation, N(i) denotes the set of all agents that are adjacent to the agent iV:={1,,N}.

Remark 1 (Path graphs). The reason for the assumption of open path graph is to simplify the presentation of the interaction forces and the interconnected LIP dynamics in Theorem 1. This also allows us to easily present the proposed control scheme.

We consider the following LIP dynamics [28] for the locomotion of the agent iV:
(1)

where ri:=col(rix,riy)2 represents the Cartesian coordinates of the COM of the agent i in the horizontal plane with respect to the inertial world frame, denotes the height of the COM, g is the gravitational constant, ui:=col(uix,uiy)2 represents the Cartesian coordinates of the center of pressure (COP), fi2 denotes the external force on the COM of the agent i, and m is the total mass of the agent

Assumption 1 (Rigidity). We suppose that there are holonomic constraints amongst the adjacent agents as follows:
(2)

for alliVandjN(i)and some constants cij > 0. The forces between agents i and j are further assumed to befij=fji=(rirj)λij, where λij denotes the Lagrange multipliers with the symmetry propertyλij=λji(see Fig. 2).

We remark that from Assumption 1 as well as the symmetry condition, there are N – 1 independent Lagrange multipliers λij to be determined. For future purposes, we show these independent Lagrange multipliers as a vector λ:=col(λij|i=1,,N1,j=i+1)N1, where “col” represents the column operator. Throughout this paper, the boldface variables will correspond to the global variables of the interconnected LIP network.

Remark 2. Assumption 1 considers the holonomic constraints amongst the COMs of the adjacent agents as the LIP dynamics cannot address the moments about the COM. In particular, the addition of robotic manipulators to the reduced-order model can result in moments around the COM generated by the arms. Hence, the interconnected LIP dynamics do not include the manipulator models. However, we remark that the full-order dynamical model of cooperative locomotion in Sec. 5 will consider the holonomic constraints amongst the manipulators' end effectors (EEs). We further assume that the EE's motion with respect to the body is almost static. The numerical results of Sec. 5 will show the adequacy and validity of this assumption for the development of the supervisory MPC. The numerical results will also show that the proposed control algorithms can bridge the gap between the developed interconnected LIP model and the detailed full-order model. Section 6.2 will discuss this assumption and the results with more details.

Using this assumption, the interconnected network of LIP dynamics can be expressed as
(3)
Subject to the holonomic constraints (2). For future purposes, we define the augmented position, velocity, and control input vectors as r:=col(ri|iV)2N,r˙:=col(r˙i|iV)2N, and u:=col(ui|iV)2N. By differentiating the holonomic constraint (2) twice along the trajectories of Eq. (3), we get
(4)
Combining Eqs. (3) and (4) then results in
for all i=1,,N1 and j=i+1 which can be written in a compact form to solve for the Lagrange multipliers λ, i.e.,
(5)
Here, ΛN(N1)×(N1) is a symmetric matrix as follows:

in which eij:=rirj2. In addition, b:=col(bi|i=1,,N1)N1, where bi:=mg(rirj)(uiuj)mg||rirj||2m||r˙ir˙j||2 with j=i+1. In what follows, we study the conditions under which there is a unique solution λ for the algebraic Eq. (5).

Theorem 1 (Uniqueness of λ). Suppose that N2 and Assumption 1 is met. Then, the matrix ΛN(r) is positive definite if rirj (or, equivalently, eij0) for all iV and jN(i).

Proof. See Appendix  A.1.▪

Using Theorem 1, the Lagrange multipliers can be solved as λ=ΛN1(r)b(r,r˙,u) which in combination with Eq. (3) results in the following compact and nonlinear equations of motion:
(6)
where L(λ):=[Lij]2N×2N is a weighted Laplacian matrix with the blocks Lij2×2 for 1i,jN such that Lii:=(kN(i)λik)I2,Lij:=λijI2 for jN(i), and Lij:=02 for jN(i). Here, I2 and 02 denote the identity and zero matrices of order two, respectively. We remark that according to the construction procedure, the state manifold for the interconnected LIP dynamics can be expressed as

for some cij > 0 which is invariant under the flow of Eq. (6). In addition, we can show that M is a 2N+2-dimensional embedded submanifold of 4N.

Example 1. For the case of two agents, the interconnected LIP dynamics can be expressed as the following nonlinear system:
By defining the augmented state vector x:=col(r,r˙)4N, the nonlinear state equation for the coupled LIP dynamics can be expressed as x˙=f(x,u), where f:M×2NTM is differentiable and TM denotes the tangent bundle of the manifold M. In addition, the continuous-time dynamics can be discretized using the Euler approach as follows:
(7)

in which Ts denotes the sampling time and x[k] and u[k] represent the state vector and control inputs at the time sample kZ0:={0,1,}, respectively.

3 Supervisory Predictive Control

The objective of this section is to develop a supervisory control algorithm, based on the interconnected LIP dynamics, MPC, and convex optimization, to effectively plan and coordinate multi-agent legged robots in real-time.

Models of legged locomotion are hybrid and can be illustrated as directed graphs. In this representation, continuous-time dynamics are represented by vertices of the graph to describe the evolution of the system by the Lagrangian dynamics. The edges of the graph then represent the instantaneous and discrete-time transitions amongst the continuous-time dynamics to model the possible and abrupt changes in the state vector according to the rigid impacts of the leg ends with the environment. In this paper, we consider a general locomotion (walking) pattern for the quadrupedal agents with start and stop conditions as a directed graph Gw=(Vw,Ew) (see Fig. 3), where the vertices set Vw represents the continuous-time domains (e.g., double-, triple-, and quadruple-contact domains) and edges set EwVw×Vw denotes the discrete-time transitions (e.g., impacts and take-offs) (see Fig. 3(b)). We further suppose that there are md1 continuous-time domains and each continuous-time domain consists of ng1 grid points (i.e., time samples) (see Fig. 3(a)). In this paper, we consider a general aperiodic locomotion pattern. Hence, domains are enumerated to show the successive continuous-time domains from start to stop. Consequently, there can be two distinct domains with the same stance legs (e.g., domains 2 and md1 in Fig. 3(b)). The domain indicator function is then defined as ζ:Z0{1,2,,md} by ζ(k):=kng+1 for 0k<mdng and ζ(k):=md for kmdng to assign the domain index for every time sample. Here, · represents the floor function.

Fig. 3
(a) Illustration of the proposed supervisory predictive control. Here, agents share their actual and reduced-order states with the higher-level supervisory control. The supervisory control then optimizes for the COM motions subject to the interconnected LIP dynamics and feasibility conditions. (b) Illustration of the directed cycle to represent the locomotion pattern of each agent with different continuous-time domains. Snapshot of the cooperative locomotion highlights different domains for each agent.
Fig. 3
(a) Illustration of the proposed supervisory predictive control. Here, agents share their actual and reduced-order states with the higher-level supervisory control. The supervisory control then optimizes for the COM motions subject to the interconnected LIP dynamics and feasibility conditions. (b) Illustration of the directed cycle to represent the locomotion pattern of each agent with different continuous-time domains. Snapshot of the cooperative locomotion highlights different domains for each agent.
Close modal
For the feasibility of the interconnected LIP model, we assume that all local control inputs (i.e., COPs) ui[k] for iV lie in a time-varying support polygon which is defined as the convex hull of the contacting points with the ground. That is
(8)
in which Uζ(k)i2 is the corresponding support polygon for the agent i in the domain ζ(k) (see Fig. 2). In addition, the net GRF acting on the agent i must be in the friction cone. These friction cone conditions together with the dynamics (6) can be expressed as the following nonlinear inequality constraints:
(9)

Problem 1 (Real-time planning of agents). Let us consider the locomotion pattern Gw with a given set of desired footholds encoded in the convex hulls (i.e., support polygons) Uζ(k)i for all agents iV. For a given initial state x0M and final state xfM, the planning problem consists of finding an optimal augmented control input u[k] in real-time that steers the interconnected LIP dynamics (7) from x0 to xf subject to the constraints (8) and (9).

Remark 3 (Computation of footholds). We remark that the desired footholds are computed at the beginning of the locomotion and are used during the locomotion to form the support polygons in Problem 1. One way to compute the desired footholds is as follows. We can first consider a straight line connecting the initial position of each agent to its final position in the horizontal plane. We then generate a sequence of footholds along this line via a proper step length.

To address Problem 1, we consider a supervisory predictive control that has access to the global positions (i.e., reduced-order sates) of all agents (i.e., ri for iV) via a direct communication network [1]. We then extend the event-based MPC approach of [33]—that generates optimal trajectories for locomotion of a single agent—to address the motion planning problem for cooperative locomotion of multi-agent robots. In the proposed approach, the supervisory predictive control is solved at the event samples, taken at the beginning of continuous-time domains, to reduce the computational burden of the networked system.

Remark 4 (Supervisory MPC). The supervisory predictive control can be either solved on one of the agents' onboard computers or all agents' computers. The first approach would result in a heterogeneous team with a leader, and the latter one would result in a homogeneous team. In the first approach, the supervisory MPC is solved at the beginning of each continuous-time domain for the leader. In contrast, in the second approach, the MPC is solved at the beginning of continuous-time domains for each agent. Although the first approach generally requires less computation burden than the second one, our numerical results show that both of these techniques are computationally tractable for cooperative locomotion of a team of legged robots with up to four agents. In particular, the computation time for the supervisory MPC in these techniques takes less than 1 (ms) (see Sec. 5 for details).

The nonlinear interconnected LIP dynamics in Eq. (7) are then linearized at the event samples to formulate a convex optimization problem. More specifically, we consider an affine approximation of Eq. (7) at the event sample k=lng for some integer l0 to estimate the future states as follows:
(10)
where Nh=nhng denotes the control horizon for some positive integer nh1,xk+j|k represents the estimated state of the interconnected LIP network model at time k + j predicted at time k, and uk+j|k denotes the input of the LIP network (i.e., COPs) at time k + j computed at time k. In addition, the Jacobian matrices and affine term are computed from Eq. (7) and updated at every event sample according to A:=Fx(x[k],u[k1])4N×4N,B:=Fu(x[k],u[k1])4N×2N, and d:=F(x[k],u[k1])Ax[k]Bu[k1]4N. An analogous technique can be used to estimate (9) as the following affine inequality:
(11)
We then formulate a convex MPC problem over the control horizon Nh to steer (10) from x0M to xfM subject to Eqs. (8) and (11), that is
(12)

subject to Dynamics (10) and in equalities (8) and (11),

where Uk+Nh1|k:=col(uk|k,,uk+Nh1|k)2NNh. Here, the terminal and stage cost functions are given by p(xk+Nh|k):=||xk+Nh|kxk+Nh|kdes||P2 and (xk+j|k,uk+j|k):=||xk+j|kxk+j|kdes||Q2+||uk+j|k||R2, respectively, for some positive definite matrices P4N×4N,Q4N×4N, and R2N×2N. In addition, xk+j|kdes denotes a desired state trajectory and ||z||P2:=zPz. We remark that the supervisory event-based MPC can be translated into QP. Let (xk+j|k,uk+j|k) denote the optimal solution over the control horizon. Then the optimal COM trajectory of the agents over one continuous-time domain (i.e., xk+j|k for j=0,,ng1) will be utilized as the reference trajectory to be tracked by the low-level distributed controllers in Sec. 4. The MPC problem will be solved again at the beginning of the next continuous-time domain.

4 Distributed Controllers

The objective of this section is to present the low-level distributed controllers to impose the full-order dynamical models of individual agents to asymptotically track the optimal reduced-order trajectories prescribed by the supervisory predictive control while having feasible contact forces. Here, we extend the virtual constraints controller of Ref. [33] for the development of distributed controllers for multi-agent systems. More specifically, the distributed low-level controller formulates the I-O linearization problem as a QP that addresses the feasibility of each agent's individual GRFs at the contacting leg ends while tracking the optimal COM trajectories for cooperative locomotion and desired swing leg path of each agent. We remark that, unlike the supervisory MPC, the distributed low-level controller only considers the full-order dynamical model of each agent. Hence, it does not require the full state measurements of the other agents.

In this paper, each legged agent is assumed to consist of the 18-DOF quadrupedal robot Vision 60, manufactured by Ghost Robotics (Philadelphia, PA),2 plus a 4DOF Kinova arm (see Fig. 4) for the locomotion and manipulation purposes. The total mass of this augmented agent is 35 (kg). The DOFs on Vision 60 are composed of six underactuated DOFs for the absolute position and orientation of the robot plus 12 actuated DOFs associated with the legs. More specifically, each leg of the robot consists of six actuated DOFs for the hip roll, hip pitch, and knee joints. All DOFs of the Kinova arm are further assumed to be actuated. The detailed view of the joint arrangement and DOFs of the robot are represented in Fig. 4. In our notation, the local configuration vector and local control inputs (i.e., joint-level torques) for the agent iV are denoted by qi:=col(ptorso,i,ϕtorso,i,qbody,i)22 and τi16, respectively, where ptorso,i3 and ϕtorso,i3 describe the absolute position and orientation of the torso for the agent i with respect to the inertial frame (see Fig. 4). The remaining portion, qbody,i16 then represents the body variables of the robot that form the shape of the robot. Finally, let zi=col(qi,q˙i)44 and Fi3c,i denote the local full-order states and contact forces at the leg ends of the agent. Here, c,i represents the number of contacting legs with the ground.

Fig. 4
Illustration of 22DOFs for the full-order model of each robotic agent. The agents are composed of the 18DOF quadrupedal robot Vision 60 plus the 4DOF Kinova arm. Six unactuated DOFs are associated with the absolute position and orientation of the torso frame with respect to an inertial world frame. Each leg of the robot then consists of three actuated joints as hip roll, hip pitch, and knee joints. The arm is finally composed of four actuated joints. The axis of actuation for actuated joints are shown with dashed lines, where the axes with circle ends, axes with square ends, and axes with triangle ends represent the x, y, and z directions, respectively.
Fig. 4
Illustration of 22DOFs for the full-order model of each robotic agent. The agents are composed of the 18DOF quadrupedal robot Vision 60 plus the 4DOF Kinova arm. Six unactuated DOFs are associated with the absolute position and orientation of the torso frame with respect to an inertial world frame. Each leg of the robot then consists of three actuated joints as hip roll, hip pitch, and knee joints. The arm is finally composed of four actuated joints. The axis of actuation for actuated joints are shown with dashed lines, where the axes with circle ends, axes with square ends, and axes with triangle ends represent the x, y, and z directions, respectively.
Close modal
We now define the following local holonomic outputs to be regulated for the motion control of the agent i:
(13)

where h0(qi) represents the set of holonomic quantities to be controlled, referred to as the controlled variables, and hd,i(t) denotes the desired evolution of the controlled variables. The controlled variables, h0(qi), are chosen as the orientation of the agent (i.e., roll, pitch, and yaw) together with its COM position, the Cartesian coordinates of the swing leg ends, and the Cartesian coordinates of the manipulator's EE in the inertial world frame. The desired evolution of the COM position in hd,i(t) is defined as a Bézier polynomial [44] that passes through the discrete and optimal reduced-order trajectory generated by the supervisory predictive control. In particular, we consider a Bézier polynomial whose coefficients are solved via least-squares at the beginning of each domain such that the polynomial has the best fit to the optimal COM trajectory of the agent i for the current domain. For the swing leg ends, hd,i(t) is taken as a Bézier foot trajectory in the task space starting from the previous foothold with zero velocity and ending at the next preplanned foothold with zero velocity. Finally, the desired evolution for the EE's Cartesian coordinates is chosen as the desired COM trajectory plus a constant vector representing the EE's position with respect to the torso.

To compute the local control torques τi, we consider the full-order and floating-base dynamics of the agent i without considering the interaction forces arising from manipulation. Although the low-level distributed controllers do not consider the interaction forces amongst the EE and objects for simplifying the controller synthesis, the full-order simulation models of the cooperative locomotion in Sec. 5 will consider these interacting forces to illustrate the validity of this assumption and robustness of the proposed control algorithms. We now consider the following local dynamics for the controller synthesis:
(14)
where Di(qi)22×22 denotes the positive definite mass-inertia matrix of the agent i, Hi(qi,q˙i)22 represents the Coriolis, centrifugal, and gravitational forces of the agent i, and ϒi22×16 is the input distribution matrix. Furthermore, Jc,i(qi)3c,i×22 represents the contact Jacobian matrix. For future purposes, the local dynamics (14) can be written in the state-space form as follows:
(15)
Differentiating the local output yi in Eq. (13) along the full-order dynamics of the agent i described in Eq. (15) results in the following output dynamics:
(16)
where LgiLfiyi,LwiLfiyi, and Lfi2yi are Lie derivatives that are used for I–O linearization [21], and KP and KD are positive definite matrices. Closed form expressions for the Lie derivatives can be found in Appendix  A.2. In addition, the local controller assumes a rigid contact model between the stance leg ends of the agent and the ground. In particular, the acceleration of the stance leg ends is assumed to be zero which can be expressed as
(17)
where pi denotes the Cartesian coordinates of the stance leg ends. The condition in Eq. (17) along with the local dynamics (14) yields the following affine condition in (τi,Fi):
(18)
The closed‐form of the Lie derivatives in Eq. (18) can be found in Appendix  A.2. In order to solve for the local torques τi, we are interested in solving for (τi,Fi) subject to Eqs. (16) and (18) such that (1) the contact forces belong to the friction cone (i.e., FiFC) while having feasible torques (i.e., τminττmax), and (2) the local torques are minimum two-norm. Hence, we setup the following set of distributed real-time QPs that can be solved at 1 kHz:
(19)

Here, δ is a defect variable added to the output dynamics (16) to guarantee the existence of a feasible solution in two different scenarios. (1) If the coefficient matrix loses rank at particular configurations, there may not be a pair of control torques and GRFs, i.e., (τi,Fi), that satisfies (16) and (18). To tackle this issue, we introduce the defect variable δ to make the equality constraints feasible. (2) If the torques and GRFs, i.e., (τi,Fi), do not belong to the admissible sets (i.e., the inequality constraints are violated), the defect variable δ can again help us to find a feasible solution. To reduce the effect of the defect variable δ on the output dynamics, we minimize its two-norm via a large weighting factor in the cost function. More specifically, the cost function (19) tries to minimize a weighted sum of the two-norms of the local torques and the defect variable, where ω>0 is the weighting factor. We remark that using the defect variable δ, the output dynamics become y¨i+KDy˙i+KPyi=δ(t), which is input-to-state stable [45]. Hence, if δ(t) remains bounded, the output profile y(t) will be also bounded. This will be analyzed more in the numerical simulations of Sec. 5. The optimal solutions of these QPs are finally denoted by τi=Γi(t,zi) for iV and are employed as local whole-body motion controllers.

5 Numerical Simulations

The objective of this section is to numerically verify the effectiveness of the proposed hierarchical control algorithm for cooperative transportation of objects by a team of composite robotic agents as shown in Fig. 1. We study both reduced- and full-order coupled models of legged agents to show the stability of locomotion patterns for the closed-loop system. We further investigate the robustness of the closed-loop system in the presence of unknown payloads and uncertainty in the ground height profile.

Control parameters: We consider the cooperative locomotion of two and three agents with trot gaits including start and stop domains. We have observed that for every sampling time Ts in [60,80] (ms) with ng = 4 grids per domain, the proposed control scheme can stabilize the locomotion patterns. Here, we choose Ts = 80 (ms). The control horizon for the supervisory predictive control is taken as Nh=nhng=4 which considers one domain ahead. The other parameters for the supervisory predictive control are tuned as P=103I4N×4N,Q=I4N×4N,R=1010I2N×2N which stabilize the cooperative motion. We have numerically observed that for l[0.35,0.55] (m), the robots behave safely, and the joint-level torques remain in an acceptable range. For the purpose of this paper, we choose l =0.5 (m). The friction coefficient is assumed to be μ=0.6. The supervisory predictive control is solved in an event-based manner (i.e., at the beginning of each domain), that is approximately every ngTs=0.32 s. Analogous to Ref. [33, Remark 1], we make use of a sparse QP structure to effectively solve the MPC (12). We can show that the number of decision variables for the sparse QP are 8NNh and 10NNh during the middle and start/stop domains, respectively.

Reduced-order coupled models: The evolution of the COM and COP for forward trot gaits of the individual agents in the interconnected LIP dynamics with N =2 and N =3 agents is depicted in Figs. 5(a)5(b) and Figs. 5(c)5(e), respectively. Here we make use of matlab for simulating the interconnected LIP dynamics (7) subject to the holonomic constraints and the supervisory predictive control (12). The initial configurations of the LIP models are chosen as r1[0]=(0,0)(m) and r2[0]=(0,1)(m) for N =2 agents. Moreover, the initial positions of the LIP models are taken as r1[0]=(0.2,1)(m),r2[0]=(0,0)(m), and r3[0]=(0.5,1)(m) for N =3 agents. The step length for N =2 and N =3 is chosen as (0.05,0)(m) and (0.03,0)(m) in 2, respectively. The target points are taken as the geometric center of the contact points in the last (i.e., stop) domain. Convergence to the target points with different number of agents and after md = 20 continuous-time domains is clear.

Fig. 5
COM and COP trajectories of the individual agents in the interconnected LIP dynamics (7) during forward trot gaits with N = 2 agents ((a) and (b)) and N = 3 agents ((c)-(e)). Here, the optimal control inputs (i.e., COPs) are computed via the supervisory predictive control (12). The sampling time for the supervisory predictive control in (a)–(e) are assumed to be Ts = 80 (ms).
Fig. 5
COM and COP trajectories of the individual agents in the interconnected LIP dynamics (7) during forward trot gaits with N = 2 agents ((a) and (b)) and N = 3 agents ((c)-(e)). Here, the optimal control inputs (i.e., COPs) are computed via the supervisory predictive control (12). The sampling time for the supervisory predictive control in (a)–(e) are assumed to be Ts = 80 (ms).
Close modal

Full-order coupled models: Next, we study the full-order complex model of cooperative locomotion with the proposed hierarchical control algorithm over md = 50 domains in RaiSim [46]. Here, we assume massless bars to be carried by the EEs of Vision 60 agents augmented with Kinova arms as shown in Fig. 1. The contact between the bar and the EE of Kinova arm is considered as a point contact. Based on this contact condition, the wrench between the object and EE of the arm only consists of the interaction forces. The QP arising from the supervisory predictive control is solved with qpSWIFT [47]. The average computation time of the higher-level QP on a laptop computer with an Intel® Core™ i7-10750H CPU 2.60 GHz and 16 GB RAM is 0.35 (ms) and 0.59 (ms) for N =2 and N =3 agents, respectively. The distributed and low-level controllers of Eq. (19) are also solved with qpSWIFT in 1 kHz and the weighting factor ω is chosen as 107.

The numerical simulation results for the stable cooperative locomotion of robots are provided in Figs. 6 and 7. Figures 6(a)6(d) and 7(a)7(c) illustrate the evolution of the virtual constraints and torque inputs for the individual agents during collaborative and forward trot gaits with N =2 and N =3 agents, respectively. Here, the speed of cooperative locomotion for two and three agents is 0.15 (m/s) and 0.1 (m/s). From these figures, we observe that the control inputs (joint-level torques) for all agents are bounded. In addition, the outputs (i.e., virtual constraints) remain bounded during the cooperative locomotion. We remark that Figs. 6 and 7 depict the first three components of the virtual constraints (i.e., yx, yy, and yz) that represents the COM tracking error. In particular, these figures show that the COM of the full-order dynamical model of each agent tracks the optimal and reduced-order COM trajectory generated by the supervisory MPC. We also remark that the range of the control inputs (torques) is bounded between –5 (N·m) and 5 (N·m) by the low-level nonlinear controller. Finally, the control inputs in Figs. 6 and 7 depict the motor torques before the gearbox system. Figure 8 depicts the evolution of the two-norm of the defect variable δ for the cooperative locomotion of two agents. From this figure, we observe that δ remains very small.

Fig. 6
Evolution of the virtual constraints and torque inputs in RaiSim for stable forward trot gait with N = 2 agents. Subplots (a) and (c) illustrate the evolution of outputs whereas subplots (b) and (d) depict the evolution of torque inputs. Here, yx, yy, yz denote the virtual constraints corresponding to the absolute position (i.e., x, y, and z) of the agent. In addition, the subscript “rFHip,” “pFHip,” and “FKnee” in the torque plots represent the roll torque of the front hip, pitch torque of the front hip, and pitch torque of the front knee for the left side of the robot, respectively.
Fig. 6
Evolution of the virtual constraints and torque inputs in RaiSim for stable forward trot gait with N = 2 agents. Subplots (a) and (c) illustrate the evolution of outputs whereas subplots (b) and (d) depict the evolution of torque inputs. Here, yx, yy, yz denote the virtual constraints corresponding to the absolute position (i.e., x, y, and z) of the agent. In addition, the subscript “rFHip,” “pFHip,” and “FKnee” in the torque plots represent the roll torque of the front hip, pitch torque of the front hip, and pitch torque of the front knee for the left side of the robot, respectively.
Close modal
Fig. 7
Plot of the virtual constraints and torque inputs in RaiSim for stable forward trot gait with N = 3 agents. Subplots (a), (b), and (c) correspond to the agents 1, 2, and 3, respectively. Subplots in the first and second rows correspond to the virtual constraints and torque inputs of each agent, respectively.
Fig. 7
Plot of the virtual constraints and torque inputs in RaiSim for stable forward trot gait with N = 3 agents. Subplots (a), (b), and (c) correspond to the agents 1, 2, and 3, respectively. Subplots in the first and second rows correspond to the virtual constraints and torque inputs of each agent, respectively.
Close modal
Fig. 8
Evolution of the two-norm of the defect variable δ in RaiSim for N = 2 agents. Subplots illustrate the evolution for each agent.
Fig. 8
Evolution of the two-norm of the defect variable δ in RaiSim for N = 2 agents. Subplots illustrate the evolution for each agent.
Close modal

Robustness analysis: To demonstrate the robustness of the proposed control algorithm against uncertainties, we assume that the mass of the bars between the adjacent agents' EEs is increased to 0.5 (kg) which is unknown for the controller. We further assume that additional unknown payloads of 20 (kg), 34(kg), and 48(kg) are cooperatively transported by two, three, and four agents on their torsos, respectively. Figures 9(a)9(d) and Figs. 10(a)10(c) depict the evolution of the virtual constraints and torque inputs for N =2 and N =3 agents, respectively. Furthermore, Figs. 11(a)11(d) shows the evolution of the virtual constraints and torque inputs for N =4 agents. From these figures, it is observed that the control inputs (i.e., torques) and outputs (i.e., virtual constraints) remain bounded during the cooperative locomotion with uncertainties. Hence, the proposed control algorithm is capable of addressing the uncertainty arising from the payload mass. To demonstrate the effectiveness of the proposed control algorithms based on the interconnected LIP dynamics, we study the same numerical simulations with the MPC control algorithm of Ref. [33], in which the MPC is designed for individual robots without considering the interconnected LIP dynamics. Snapshots of the simulation results for cooperative locomotion of agents with and without the proposed approach of this paper are depicted in Figs. 1214 to visualize the successes and failures. It is clear that the agents cannot have robustly stable cooperative locomotion while using their own MPC without considering the interconnected LIP dynamics.

Fig. 9
Evolution of the virtual constraints and torque inputs in RaiSim for robust trot gait subject to a payload with N = 2 agents. Subplots (a) and (c) illustrate the evolution of outputs whereas subplots (b) and (d) depict the evolution of torque inputs.
Fig. 9
Evolution of the virtual constraints and torque inputs in RaiSim for robust trot gait subject to a payload with N = 2 agents. Subplots (a) and (c) illustrate the evolution of outputs whereas subplots (b) and (d) depict the evolution of torque inputs.
Close modal
Fig. 10
Plot of the virtual constraints and torque inputs in RaiSim for robust forward trot gait subject to a payload with N = 3 agents. Subplots (a), (b), and (c) correspond to the agents 1, 2, and 3, respectively. Subplots in the first and second rows correspond to the virtual constraints and torque inputs of each agent, respectively.
Fig. 10
Plot of the virtual constraints and torque inputs in RaiSim for robust forward trot gait subject to a payload with N = 3 agents. Subplots (a), (b), and (c) correspond to the agents 1, 2, and 3, respectively. Subplots in the first and second rows correspond to the virtual constraints and torque inputs of each agent, respectively.
Close modal
Fig. 11
Plot of the virtual constraints and torque inputs in RaiSim for robust trot gait subject to a payload with N = 4 agents. Subplots (a), (b), (c), and (d) correspond to the agent 1, 2, 3, and 4, respectively. Subplots in the first and second rows correspond to the virtual constraints and torque inputs of each agent, respectively.
Fig. 11
Plot of the virtual constraints and torque inputs in RaiSim for robust trot gait subject to a payload with N = 4 agents. Subplots (a), (b), (c), and (d) correspond to the agent 1, 2, 3, and 4, respectively. Subplots in the first and second rows correspond to the virtual constraints and torque inputs of each agent, respectively.
Close modal
Fig. 12
(a) Snapshots of the unstable cooperative locomotion of two agents with the individual predictive control of Ref. [33] for agents in RaiSim. Here, each agent makes use of MPC for its own LIP dynamics without considering the interaction forces. (b) Snapshots of the robustly stable cooperative locomotion of two agents with the proposed supervisory predictive control in the presence of a 20 (kg) payload.
Fig. 12
(a) Snapshots of the unstable cooperative locomotion of two agents with the individual predictive control of Ref. [33] for agents in RaiSim. Here, each agent makes use of MPC for its own LIP dynamics without considering the interaction forces. (b) Snapshots of the robustly stable cooperative locomotion of two agents with the proposed supervisory predictive control in the presence of a 20 (kg) payload.
Close modal
Fig. 13
(a) Snapshots of the unstable cooperative locomotion of three agents in RaiSim, in which each agent makes use of its own MPC without considering the interaction forces between agents in the interconnected LIP dynamics. (b) Snapshots of the robustly stable cooperative locomotion of three agents with the proposed supervisory predictive control in the presence of a 32 (kg) payload.
Fig. 13
(a) Snapshots of the unstable cooperative locomotion of three agents in RaiSim, in which each agent makes use of its own MPC without considering the interaction forces between agents in the interconnected LIP dynamics. (b) Snapshots of the robustly stable cooperative locomotion of three agents with the proposed supervisory predictive control in the presence of a 32 (kg) payload.
Close modal
Fig. 14
(a) Snapshots of the unstable cooperative locomotion of four agents in RaiSim, in which each agent makes use of its own MPC without considering the interaction forces in the interconnected LIP dynamics. (b) Snapshots of the robustly stable cooperative locomotion of four agents with the proposed supervisory predictive control in the presence of a 48 (kg) payload.
Fig. 14
(a) Snapshots of the unstable cooperative locomotion of four agents in RaiSim, in which each agent makes use of its own MPC without considering the interaction forces in the interconnected LIP dynamics. (b) Snapshots of the robustly stable cooperative locomotion of four agents with the proposed supervisory predictive control in the presence of a 48 (kg) payload.
Close modal

To show the robustness of the controller against the change in the ground height profile, we study the cooperative locomotion with N =2 and N =3 agents on uneven terrain. Here, we assume that the ground height profile changes in a random manner in the discrete set {±1,±2} (cm). The evolution of the virtual constraints and torque inputs together with the convergence of the robots to the target points is depicted in Figs. 15 and 16 for two and three agents, respectively. From the figures, it is observed that the control inputs and outputs remain bounded during the cooperative locomotion. Figures 17 and 18 depict the snapshots of the cooperative locomotion patterns with the proposed control algorithm. In addition, Figs. 17 and 18 compare the robustness and performance of the proposed control solutions with the individual MPC algorithms that do not consider the interaction forces for the path planning. Animations of these simulations can be found online.3

Fig. 15
Evolution of virtual constraints and control inputs for locomotion of N = 2 agents over an unknown terrain. Plots show the inputs and outputs for individual agents.
Fig. 15
Evolution of virtual constraints and control inputs for locomotion of N = 2 agents over an unknown terrain. Plots show the inputs and outputs for individual agents.
Close modal
Fig. 16
Evolution of virtual constraints and control inputs for locomotion of N = 3 agents over an unknown terrain
Fig. 16
Evolution of virtual constraints and control inputs for locomotion of N = 3 agents over an unknown terrain
Close modal
Fig. 17
(a) Snapshots of the unstable cooperative locomotion of two agents over an unknown terrain, where each agent makes use of its own MPC algorithm without considering the interaction forces. (b) Snapshots of the robustly stable cooperative locomotion of two agents over the same terrain with the proposed supervisory predictive control based on the interconnected LIP dynamics.
Fig. 17
(a) Snapshots of the unstable cooperative locomotion of two agents over an unknown terrain, where each agent makes use of its own MPC algorithm without considering the interaction forces. (b) Snapshots of the robustly stable cooperative locomotion of two agents over the same terrain with the proposed supervisory predictive control based on the interconnected LIP dynamics.
Close modal
Fig. 18
(a) Snapshots of the unstable cooperative locomotion of three agents, where each agent makes use of its own MPC algorithm without considering the interaction forces. (b) Snapshots of the robustly stable cooperative locomotion of three agents over an unknown terrain with the proposed supervisory predictive control.
Fig. 18
(a) Snapshots of the unstable cooperative locomotion of three agents, where each agent makes use of its own MPC algorithm without considering the interaction forces. (b) Snapshots of the robustly stable cooperative locomotion of three agents over an unknown terrain with the proposed supervisory predictive control.
Close modal

6 Discussion

The numerical simulations of the reduced- and full-order models show the effectiveness of the proposed supervisory predictive control algorithm in generating stable cooperative locomotion patterns for multi-agent legged robots. The proposed hierarchical control algorithm developed based on the interconnected LIP dynamics allows robustly stable cooperative locomotion of multi-agent legged robots subject to holonomic constraints whereas the same legged machines cannot perform stable cooperative locomotion patterns without the proposed algorithm. The objective of this section is to analyze the results and to discuss the limitations of the proposed control approach.

6.1 Robustness Against Uncertainties.

The developed control algorithm enhances the level of robustness of the coupled full-order dynamical systems against uncertainties arising from the unknown payloads as well as ground height profile changes. Here, we make use of the number of steps that the robots can take as metrics to evaluate the performance of the closed-loop system. In particular, success occurs if the robots can reach the target points in a specified number of domains, denoted by md. Otherwise, it is a “failure (instability)” (e.g., the robots may fall before reaching the target point or in a number of domains less than md).

6.1.1 Unknown Payloads. For the payload simulations, we consider cooperative locomotion over md = 50 continuous-time domains. The objective is to evaluate the performance of cooperative locomotion for a team of N{2,3,4} agents. As described in Sec. 5 and Figs. 914, the developed control algorithms allow transporting unknown and much heavier objects than the maximum payload of a single agent (i.e., 12 (kg)). More specifically, legged corobots can cooperatively carry the payloads and arrive at the target positions in 50 domains. In contrast, the same interconnected agents without the supervisory MPC cannot depart from the initial positions due to the lack of consideration of the interaction forces at the planner level. Our numerical studies show that the cooperative system with the supervisory MPC algorithm can transport 20 (kg), 34 (kg), and 48 (kg) with two, three, and four agents, respectively. In other words, the agents can transport unknown payloads whose masses are up to 57%, 97%, and 137% of a singles agent's mass with a team of two, three, and four legged corobots.

In addition to the payloads mentioned above on the agents' torso, we consider payloads amongst the EEs in Sec. 5. The maximum weight for this load follows the Kinova arm's payload limitation (i.e., 0.5 (kg)). If we do not consider this limitation, our numerical simulations show that the proposed control algorithm can transport payloads of 4.5 (kg), 9 (kg), and 13.5 (kg) between EEs with a team of two, three, and four corobots, respectively.

6.1.2 Varying Ground Height Profiles. For this set of simulations, we study cooperative locomotion on uneven terrains over md = 100 continuous-time domains. The objective is to evaluate the performance of cooperative locomotion for a team of N{2,3} agents. Our numerical studies show that the proposed control algorithm can result in stable cooperative locomotion on unknown terrains with ground height changes in the discrete set {±1,±2} (cm). In particular, we simulated 100 different ground height profiles with discontinuities within the above set. In all of these simulations, the agents can successfully reach the final target in the specified number of domains. In contrast, the interconnected system without the supervisory MPC algorithm always fails on these terrains.

6.2 Limitations and Analysis of Results

6.2.1 Linearization of the Interconnected Linear Inverted Pendulum Dynamics. The interconnected LIP dynamics in Eq. (6) are nonlinear. In order to formulate a convex optimal control problem, the supervisory control algorithm first linearizes the dynamics, and then solves an MPC problem for the linearized dynamics. Our numerical simulations in Fig. 5 depict the behavior of the nonlinear dynamics subject to the supervisory MPC. From this figure, we observe that the optimal control problem, formulated for the linearized dynamics, can stabilize the target points for the original and interconnected LIP dynamics. In particular, the states of the nonlinear system remain bounded and asymptotically converge to the target points. We also remark that the supervisory MPC does not use a constant Jacobian linearization for the entire period of locomotion. Instead, it linearizes the dynamics around the current point at the beginning of each continuous-time domain. This makes the linearization error zero (i.e., resets it) at the beginning of each domain, which in turn reduces the gap between the states of the linearized and nonlinear dynamics. Furthermore, the adequacy of this linearization technique is validated in the full-order and complex models of cooperative locomotion. In particular, the virtual constraint plots in Figs. 6 and 7 show that the actual COM positions of the agents follow the desired COM trajectories, based on the linearized dynamics, and the error remains bounded. For future research, we will investigate nonlinear MPC algorithms that can address the path planning problem for the interconnected reduced-order models without linearization.

6.2.2 Limitations of the Reduced-order Linear Inverted Pendulum Dynamics. In this paper, we make use of the LIP dynamics to form the interconnected reduced-order network. One of the limitations of the LIP model is that it cannot capture moments about the COM. Hence, the arms model is not used in the interconnected LIP dynamics as the forces generated by the grippers due to the holonomic constraints can create moments about the COMs. Consequently, the holonomic constraints are represented amongst the COMs in the reduced-order model. However, we remark that the actual holonomic constraints are imposed between the EEs in the full-order simulations of Sec. 5. In the numerical simulations of Sec. 5, the arm joints are not locked. However, we define some virtual constraints to control the EE's Cartesian coordinates in the task space. The desired trajectory for the EE's position is taken as the desired COM trajectory, generated by the supervisory MPC, plus a constant value that represents the relative motion of the EE with respect to the body. Hence, the EE's relative motion with respect to the body becomes almost static. Consequently, the process of initiating the grasping motion is not addressed in this work. This can limit the general problem of loco-manipulation during cooperative locomotion of multi-agent robots. For future research, we will investigate alternative networks of reduced-order models that can be integrated with simple arm models for manipulation purpose.

The alternative limitation of the LIP model is that it cannot address dynamic locomotion. Furthermore, the height of the COM is assumed to be constant in the LIP dynamics that can limit locomotion on rough terrains. This motivates the use of alternative reduced-order models for future research. However, this may also increase the complexity of the reduced-order models and the computational burden of the supervisory MPC.

6.2.3 Considerations for the Real-World Implementation. The proposed controllers of this work assume that the agents can share their “reduced-order” states for the path planning purpose. In particular, the higher level of the control algorithm (i.e., supervisory MPC) is assumed to have access to all the reduced-order states of the agents (i.e., positions of the COMs). This can be realized via a direct communication network [1]. However, the low-level controllers for the whole-body motion control are distributed and do not require full-order state sharing. Our recent preliminary work [48, Chap. 4] shows that the QP-based low-level controllers can stabilize the locomotion of single-agent legged robots in practice. We will experimentally evaluate the performance of the proposed controllers for multi-agent legged robots in future work.

6.2.4. Dynamic Locomotion: The numerical simulations of this paper have shown that the interconnected LIP dynamics and the event-based supervisory MPC are sufficient to have robustly stable cooperative locomotion of legged robots. Although state-of-the-art single quadrupedal robots have dynamic gaits, the nature of single-agent locomotion and collaborative locomotion of multi-agent systems for cooperative transportation is completely different. Hence, the proposed control algorithms are validated for cooperative locomotion with quasi-static gaits but not dynamic gaits. The developed control approach would likely need to be altered to address more agile locomotion patterns in complex environments for future work.

7 Conclusion

This paper presented a hierarchical nonlinear control algorithm for the real-time planning and control of legged robots that collaboratively carry objects. We presented an innovative network of reduced-order models subject to holonomic constraints, referred to as the interconnected LIP dynamics, to address the motion planning problem of cooperative locomotion. The properties of the interconnected LIP dynamics were studied to formulate a supervisory control as the higher-level planner in the proposed control algorithm. The supervisory control is formulated as an event-based predictive control to steer the interconnected LIP dynamics subject to the feasibility of the net GRFs of individual agents. At the lower level of the proposed control scheme, distributed nonlinear controllers, based on QP and virtual constraints, were developed to impose the full-order dynamical model of each agent to asymptotically track the optimal reduced-order trajectories, prescribed by the supervisory predictive control, while having feasible contact forces at the leg ends. The effectiveness and robustness of the proposed nonlinear control scheme were demonstrated and investigated via full-order numerical simulations of a team of two, three, and four collaborative quadrupedal robots, each with a total of 22DOFs, while carrying different objects in the presence of uncertainties.

In this work, we considered path graphs to describe the shape of the objects to be carried. For future work, we will investigate more sophisticated shapes and graphs subject to holonomic constraints. In addition, we will investigate the design of distributed predictive controllers at the higher level of the proposed control scheme to reduce the computational burden further. This work also focused on the cooperative locomotion of legged robots, and the manipulators were used to enable the agents' holonomic constraints. Hence, the robotic manipulators' grasping mechanism was not studied. For future work, we will investigate alternative networks of reduced-order models that can represent the interaction wrenches for the loco-manipulation purpose. Furthermore, we will extend the desired foothold planning framework to more agile cooperative locomotion in aggressive environments.

Acknowledgment

The content is solely the responsibility of the authors and does not necessarily represent the official views of the NSF.

Funding Data

  • Division of Civil, Mechanical and Manufacturing Innovation, National Science Foundation (NSF) (Grant No. CMMI-1923216; Funder ID: 10.13039/100000147).

  • Division of Electrical, Communications and Cyber Systems, National Science Foundation (NSF) (Grant No. ECCS-1924617; Funder ID: 10.13039/100000148).

Nomenclature

A, B, d =

Jacobian linearization of the interconnected LIP dynamics

D(.),H(.) =

mass-inertia matrix, Coriolis, centrifugal, and gravitational terms

F =

discrete-time dynamics for the network of LIPs

Fi =

ground reaction forces on the agent i

Gw,Vw,Ew =

directed graph of the locomotion pattern, vertices set, and edges set

h0(qi),hd,i(t) =

controlled variables for the agent i, and desired evolution of the controlled variables for the agent i

c,i =

number of the contacting legs with the ground for the agent i

LgiLfi,LwiLfi,Lfi2 =

lie derivatives along the full-order floating-base dynamics of the agent i

md, ng =

total number of continuous-time domains, Number of grid points for each continuous-time domain

N =

total number of agent

nh, Nh =

number of continuous-time domains to plan over, control horizon

pi =

Cartesian coordinates of the stance leg ends of the agent i

ptorso,i =

absolute position of the torso of the agent i with respect to the inertial frame

qi =

configuration vector of the agent i consisting of ptorso,i,ϕtorso,i, and qbody,i

qbody,i =

shape (body) variables of the agent i

ri, r =

Cartesian coordinates of the COM of the agent i, Column vector consisting of all ris

Ts =

time step between grid points known as the sampling time

ui, u =

Cartesian coordinates of the COP of the agent i, Column vector consisting of all uis

V =

vertices set of the open path graph for the interconnected LIP dynamics

x =

augmented state vector of the interconnected LIP dynamics consisting of r and r˙

yi =

local outputs of the agent i

zi =

full state variables of the agent i consisting of qi and q˙i

δ =

defect variable for the low-level QP

ζ(k),Uζ(k)i =

current domain indicator function for the sample time k, and Convex hull of the contacting points with the ground for the agent i

λij, λ =

Lagrange multipliers between agents i and j, Column vector consisting of independent λijs

τi =

local torque inputs for the agent i

ϕtorso,i =

absolute orientation of the torso of the agent i with respect to the inertial frame

ω =

weighting factor of the low-level QP

ϒ(.),Jc(.) =

input distribution matrix, and Contact Jacobian matrix

Footnotes

Appendix

A.1 Proof of Theorem 1.
Let us take an arbitrary nonzero vector α:=col(α1,,αN1). Then, αΛNα can be expanded as
(A1)

where in the fifth line, we have made use of the norm property, that is, ek,k+1ek+1,k+2αkαk+1||ek,k+1||||ek+1,k+2|||αk||αk+1|. We remark that the last result in Eq. (A1) is indeed positive. To clarify this point, let us assume that the |αk|=||ek+1,k+2||||ek,k+1|||αk+1| for all k=1,,N2. Then k=1N2(||ek,k+1|||αk|||ek+1,k+2|||αk+1|)2=0. However, because of the term ||e12||2α12+||eN1,N||2αN12, the quadratic function αΛNα is strictly positive which completes the proof.▪

A.2 Closed Form Expressions for the Lie Derivatives

References

1.
Tuci
,
E.
,
Alkilabi
,
M. H. M.
, and
Akanyeti
,
O.
,
2018
, “
Cooperative Object Transport in Multi-Robot Systems: A Review of the State-of-the-Art
,”
Front. Rob. AI
,
5
, p.
59
.10.3389/frobt.2018.00059
2.
Yan
,
Z.
,
Jouandeau
,
N.
, and
Cherif
,
A. A.
,
2013
, “
A Survey and Analysis of Multi-Robot Coordination
,”
Int. J. Adv. Rob. Syst.
,
10
(
12
), p.
399
.10.5772/57313
3.
Murray
,
R.
,
Li
,
Z.
, and
Sastry
,
S.
,
1994
,
A Mathematical Introduction to Robotic Manipulation
,
Taylor & Francis/CRC
, Boca Raton, FL.
4.
Sreenath
,
K.
, and
Kumar
,
V.
,
2013
,
Dynamics, Control and Planning for Cooperative Manipulation of Payloads Suspended by Cables From Multiple Quadrotor Robots
,
Science and Systems (RSS)
,
Robotics
, Berlin, Germany.
5.
Turpin
,
M.
,
Michael
,
N.
, and
Kumar
,
V.
,
2014
, “
Capt: Concurrent Assignment and Planning of Trajectories for Multiple Robots
,”
Int. J. Rob. Res.
,
33
(
1
), pp.
98
112
.10.1177/0278364913515307
6.
Panagou
,
D.
,
Turpin
,
M.
, and
Kumar
,
V.
,
2020
, “
Decentralized Goal Assignment and Safe Trajectory Generation in Multirobot Networks Via Multiple Lyapunov Functions
,”
IEEE Trans. Autom. Control
,
65
(
8
), pp.
3365
3380
.10.1109/TAC.2019.2946333
7.
Mesbahi
,
M.
, and
Egerstedt
,
M.
,
2010
,
Graph Theoretic Methods in Multiagent Networks
,
Princeton University Press
, Princeton, NJ.
8.
Bullo
,
F.
,
Cortés
,
J.
, and
Martinez
,
S.
,
2009
,
Distributed Control of Robotic Networks: A Mathematical Approach to Motion Coordination Algorithms
,
Princeton University Press
, Princeton, NJ.
9.
Grizzle
,
J. W.
,
Chevallereau
,
C.
,
Sinnet
,
R. W.
, and
Ames
,
A. D.
,
2014
, “
Models, Feedback Control, and Open Problems of 3D Bipedal Robotic Walking
,”
Automatica
,
50
(
8
), pp.
1955
1988
.10.1016/j.automatica.2014.04.021
10.
Ames
,
A.
,
Galloway
,
K.
,
Sreenath
,
K.
, and
Grizzle
,
J.
,
2014
, “
Rapidly Exponentially Stabilizing Control Lyapunov Functions and Hybrid Zero Dynamics
,”
IEEE Trans. Autom. Control
,
59
(
4
), pp.
876
891
.10.1109/TAC.2014.2299335
11.
Sreenath
,
K.
,
Park
,
H.-W.
,
Poulakakis
,
I.
, and
Grizzle
,
J.
,
2013
, “
Embedding Active Force Control Within the Compliant Hybrid Zero Dynamics to Achieve Stable, Fast Running on MABEL
,”
Int. J. Rob. Res.
,
32
(
3
), pp.
324
345
.10.1177/0278364912473344
12.
Veer
,
S.
,
Rakesh
., and
Poulakakis
,
I.
,
2019
, “
Input-to-State Stability of Periodic Orbits of Systems With Impulse Effects Via Poincaré Analysis
,”
IEEE Trans. Autom. Control
,
64
(
11
), pp.
4583
4598
.10.1109/TAC.2019.2909684
13.
Saglam
,
C.
, and
Byl
,
K.
,
2013
, “
Switching Policies for Metastable Walking
,”
IEEE Conference Decision Control
, Firenze, Italy, Dec. 10–13, pp.
977
983
. 10.1109/CDC.2013.6760009
14.
Johnson
,
A. M.
,
Burden
,
S. A.
, and
Koditschek
,
D. E.
,
2016
, “
A Hybrid Systems Model for Simple Manipulation and Self-Manipulation Systems
,”
Int. J. Rob. Res.
,
35
(
11
), pp.
1354
1392
.10.1177/0278364916639380
15.
Spong
,
M.
, and
Bullo
,
F.
,
2005
, “
Controlled Symmetries and Passive Walking
,”
IEEE Trans. Autom. Control
,
50
(
7
), pp.
1025
1031
.10.1109/TAC.2005.851449
16.
Manchester
,
I.
,
Mettin
,
U.
,
Iida
,
F.
, and
Tedrake
,
R.
,
2011
, “
Stable Dynamic Walking Over Uneven Terrain
,”
Int. J. Rob. Res.
,
30
(
3
), pp.
265
279
.10.1177/0278364910395339
17.
Smit-Anseeuw
,
N.
,
Gleason
,
R.
,
Vasudevan
,
R.
, and
Remy
,
C. D.
,
2017
, “
The Energetic Benefit of Robotic Gait Selection-A Case Study on the Robot RAMone
,”
IEEE Rob. Autom. Lett.
,
2
(
2
), pp.
1124
1131
.10.1109/LRA.2017.2661801
18.
Akbari Hamed
,
K.
, and
Gregg
,
R. D.
,
2017
, “
Decentralized Feedback Controllers for Robust Stabilization of Periodic Orbits of Hybrid Systems: Application to Bipedal Walking
,”
IEEE Trans. Control Syst. Technol.
,
25
(
4
), pp.
1153
1167
.10.1109/TCST.2016.2597741
19.
Ames
,
A. D.
,
Gregg
,
R. D.
,
Wendel
,
E. D. B.
, and
Sastry
,
S.
,
2007
, “
On the Geometric Reduction of Controlled Three-Dimensional Bipedal Robotic Walkers
,”
Lagrangian and Hamiltonian Methods for Nonlinear Control 2006
,
Springer
Berlin Heidelberg
, Germany, pp.
183
196
.10.1007/978-3-540-73890-9_14
20.
Westervelt
,
E.
,
Grizzle
,
J.
, and
Koditschek
,
D.
,
2003
, “
Hybrid Zero Dynamics of Planar Biped Walkers
,”
IEEE Trans. Autom. Control
,
48
(
1
), pp.
42
56
.10.1109/TAC.2002.806653
21.
Isidori
,
A.
,
1995
,
Nonlinear Control Systems
, 3rd ed.,
Springer
, London, UK.
22.
Chevallereau
,
C.
,
Grizzle
,
J.
, and
Shih
,
C.-L.
,
2009
, “
Asymptotically Stable Walking of a Five-Link Underactuated 3-D Bipedal Robot
,”
IEEE Trans. Rob.
,
25
(
1
), pp.
37
50
.10.1109/TRO.2008.2010366
23.
Ramezani
,
A.
,
Hurst
,
J. W.
,
Akbari Hamed
,
K.
, and
Grizzle
,
J. W.
,
2014
, “
Performance Analysis and Feedback Control of ATRIAS, a Three-Dimensional Bipedal Robot
,”
ASME J. Dyn. Syst. Meas. Control
,
136
(
2
), p.
021012
.10.1115/1.4025693
24.
Hereid
,
A.
,
Hubicki
,
C. M.
,
Cousineau
,
E. A.
, and
Ames
,
A. D.
,
2018
, “
Dynamic Humanoid Locomotion: A Scalable Formulation for HZD Gait Optimization
,”
IEEE Trans. Rob.
,
34
(
2
), pp.
370
387
.10.1109/TRO.2017.2783371
25.
Martin
,
A. E.
,
Post
,
D. C.
, and
Schmiedeler
,
J. P.
,
2014
, “
Design and Experimental Implementation of a Hybrid Zero Dynamics-Based Controller for Planar Bipeds With Curved Feet
,”
Int. J. Rob. Res.
,
33
(
7
), pp.
988
1005
.10.1177/0278364914522141
26.
Gregg
,
R. D.
,
Lenzi
,
T.
,
Hargrove
,
L. J.
, and
Sensinger
,
J. W.
,
2014
, “
Virtual Constraint Control of a Powered Prosthetic Leg: From Simulation to Experiments With Transfemoral Amputees
,”
IEEE Trans. Rob.
,
30
(
6
), pp.
1455
1471
.10.1109/TRO.2014.2361937
27.
Zhao
,
H.
,
Horn
,
J.
,
Reher
,
J.
,
Paredes
,
V.
, and
Ames
,
A. D.
,
2017
, “
First Steps Toward Translating Robotic Walking to Prostheses: A Nonlinear Optimization Based Control Approach
,”
Auton. Robots
,
41
(
3
), pp.
725
742
.10.1007/s10514-016-9565-1
28.
Kajita
,
S.
,
Kanehiro
,
F.
,
Kaneko
,
K.
,
Fujiwara
,
K.
,
Harada
,
K.
,
Yokoi
,
K.
, and
Hirukawa
,
H.
,
2003
, “
Biped Walking Pattern Generation by Using Preview Control of Zero-Moment Point
,”
IEEE International Conference on Robotics and Automation
, Vol.
2
, Taipei, Taiwan, Sept. 14–19, pp.
1620
1626
.10.1109/ROBOT.2003.1241826
29.
Griffin
,
R. J.
,
Wiedebach
,
G.
,
Bertrand
,
S.
,
Leonessa
,
A.
, and
Pratt
,
J.
,
2017
, “
Walking Stabilization Using Step Timing and Location Adjustment on the Humanoid Robot, Atlas
,”
IEEE/RSJ International Conference on Intelligent Robots and Systems
, Vancouver, BC/Canada, Sept. 24–28, pp.
667
673
.10.1109/IROS.2017.8202223
30.
Englsberger
,
J.
,
Ott
,
C.
,
Roa
,
M. A.
,
Albu-Schäffer
,
A.
, and
Hirzinger
,
G.
,
2011
, “
Bipedal Walking Control Based on Capture Point Dynamics
,”
IEEE/RSJ International Conference on Intelligent Robots and Systems
, San Francisco, CA, Sept. 14–19, pp.
4420
4427
.10.1109/IROS.2011.6094435
31.
Pratt
,
J.
,
Carff
,
J.
,
Drakunov
,
S.
, and
Goswami
,
A.
,
2006
, “
Capture Point: A Step Toward Humanoid Push Recovery
,”
Sixth IEEE-RAS International Conference on Humanoid Robots
, Genova, Italy, Dec. 4–6, pp.
200
207
.10.1109/ICHR.2006.321385
32.
Winkler
,
A. W.
,
Farshidian
,
F.
,
Pardo
,
D.
,
Neunert
,
M.
, and
Buchli
,
J.
,
2017
, “
Fast Trajectory Optimization for Legged Robots Using Vertex-Based ZMP Constraints
,”
IEEE Rob. Autom. Lett.
,
2
(
4
), pp.
2201
2208
.10.1109/LRA.2017.2723931
33.
Akbari Hamed
,
K.
,
Kim
,
J.
, and
Pandala
,
A.
,
2020
, “
Quadrupedal Locomotion Via Event-Based Predictive Control and QP-Based Virtual Constraints
,”
IEEE Rob. Autom. Lett.
,
5
(
3
), pp.
4463
4470
.10.1109/LRA.2020.3001471
34.
Fawcett
,
R. T.
,
Pandala
,
A.
,
Kim
,
J.
, and
Akbari Hamed
,
K.
,
2021
, “
Real-Time Planning and Nonlinear Control for Quadrupedal Locomotion With Articulated Tails
,”
ASME J. Dyn. Syst. Meas. Control
,
143
(
7
), p.
071004
.10.1115/1.4049555
35.
Di Carlo
,
J.
,
Wensing
,
P. M.
,
Katz
,
B.
,
Bledt
,
G.
, and
Kim
,
S.
,
2018
, “
Dynamic Locomotion in the MIT Cheetah 3 Through Convex Model-Predictive Control
,”
IEEE/RSJ International Conference on Intelligent Robots and Systems
, Madrid, Spain, Oct. 1–5, pp.
1
9
.10.1109/IROS.2018.8594448
36.
Ding
,
Y.
,
Pandala
,
A.
, and
Park
,
H.
,
2019
, “
Real-Time Model Predictive Control for Versatile Dynamic Motions in Quadrupedal Robots
,” IEEE
International Conference on Robotics and Automation
, Montreal, QC/Canada, May 20–24, pp.
8484
8490
.10.1109/ICRA.2019.8793669
37.
Villarreal
,
O.
,
Barasuol
,
V.
,
Wensing
,
P. M.
,
Caldwell
,
D. G.
, and
Semini
,
C.
,
2020
, “
MPC-Based Controller With Terrain Insight for Dynamic Legged Locomotion
,”
IEEE International Conference on Robotics and Automation
, Paris, France, May 31–Aug. 31, pp.
2436
2442
.10.1109/ICRA40945.2020.9197312
38.
Neunert
,
M.
,
Stäuble
,
M.
,
Giftthaler
,
M.
,
Bellicoso
,
C. D.
,
Carius
,
J.
,
Gehring
,
C.
,
Hutter
,
M.
, and
Buchli
,
J.
,
2018
, “
Whole-Body Nonlinear Model Predictive Control Through Contacts for Quadrupeds
,”
IEEE Rob. Autom. Lett.
,
3
(
3
), pp.
1458
1465
.10.1109/LRA.2018.2800124
39.
Bledt
,
G.
,
Wensing
,
P. M.
, and
Kim
,
S.
,
2017
, “
Policy-Regularized Model Predictive Control to Stabilize Diverse Quadrupedal Gaits for the MIT Cheetah
,”
IEEE/RSJ International Conference on Intelligent Robots and Systems
, Vancouver, BC/Canada, Sept. 24–28, pp.
4102
4109
.10.1109/IROS.2017.8206268
40.
Fahmi
,
S.
,
Mastalli
,
C.
,
Focchi
,
M.
, and
Semini
,
C.
,
2019
, “
Passive Whole-Body Control for Quadruped Robots: Experimental Validation Over Challenging Terrain
,”
IEEE Rob. Autom. Lett.
,
4
(
3
), pp.
2553
2560
.10.1109/LRA.2019.2908502
41.
Kuindersma
,
S.
,
Permenter
,
F.
, and
Tedrake
,
R.
,
2014
, “
An Efficiently Solvable Quadratic Program for Stabilizing Dynamic Locomotion
,”
IEEE International Conference on Robotics and Automation
, Hong Kong, China, pp.
2589
2594
.https://agile.seas.harvard.edu/publications/efficiently-solvable-quadratic-program-stabilizingdynamic-locomotion
42.
Lehmann
,
D.
,
Henriksson
,
E.
, and
Johansson
,
K. H.
,
2013
, “
Event-Triggered Model Predictive Control of Discrete-Time Linear Systems Subject to Disturbances
,”
European Control Conference
, Zurich, Switzerland, July 17–19, pp.
1156
1161
.https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.838.8695&rep=rep1&type=pdf
43.
Akbari Hamed
,
K.
,
Kamidi
,
V. R.
,
Pandala
,
A.
,
Ma
,
W.
, and
Ames
,
A. D.
,
2020
, “
Distributed Feedback Controllers for Stable Cooperative Locomotion of Quadrupedal Robots: A Virtual Constraint Approach
,”
American Control Conference
, Denver, CO, July 1–3, pp.
5314
5321
.https://authors.library.caltech.edu/96716/2/09147673.pdf
44.
Prautzsch
,
H.
,
Boehm
,
W.
, and
Paluszny
,
M.
,
2002
,
Bézier and B-Spline Techniques
,
Springer-Verlag
, Berlin and Heidelberg, Germany.
45.
Khalil
,
H. K.
,
2001
,
Nonlinear Systems
, 3rd ed.,
Pearson
, Upper Saddle River, NJ.
46.
Hwangbo
,
J.
,
Lee
,
J.
, and
Hutter
,
M.
,
2018
, “
Per-Contact Iteration Method for Solving Contact Dynamics
,”
IEEE Rob. Autom. Lett.
,
3
(
2
), pp.
895
902
.10.1109/LRA.2018.2792536
47.
Pandala
,
A. G.
,
Ding
,
Y.
, and
Park
,
H.
,
2019
, “
qpSWIFT: A Real-Time Sparse Quadratic Program Solver for Robotic Applications
,”
IEEE Rob. Autom. Lett.
,
4
(
4
), pp.
3355
3362
.10.1109/LRA.2019.2926664
48.
Fawcett
,
R. T.
,
2021
, “
Real-Time Planning and Nonlinear Control for Robust Quadrupedal Locomotion with Tails
,” M.S. thesis,
Virginia Tech
, Blacksburg, VA.