SIT-LMPC: Safe Information-Theoretic Learning Model Predictive Control for Iterative Tasks

Structure

Online.

SIT-LMPC architecture: starting from an initial trajectory, the algorithm iteratively updates the safe set and value function model (orange loop), while solving multiple MPPI problems in parallel (blue loop) to generate optimal trajectories. Each MPPI problem corresponds to one set of sampled penalty parameters λi, whose solutions are then filtered and optimized over to ensure optimality while satisfying the constraints

Simulated Experiment

Recorded video of the experiments on map 1: Left: live simulation of the vehicle, displaying planned trajectory in MPPI horizon (yellow), and the control invariant safe set (pink). Right: Speed profile showing the evolution of trajectories.

Real-world Experiment

Publications

Zang, Zirui and Amine, Ahmad and Kokolakis, Nick-Marios T. and Nghiem, Truong X. and Rosolia, Ugo and Mangharam, Rahul

Contributors

Anonymous Submission

Citation