Marc Toussaint on planning as inference and graphical models

Marc Toussaint on planning as inf...

Marc Toussaint on planning as inference and graphical models

The Convergent Science Podcasts on Mind, Brain and Technolog... por Dr. Paul F.M.J. Verschure / Prof. Dr. Tony Prescott / Dr. Anna Mura

T2013 · E7

15 mar 2026

52:18

Notas del episodio

What if planning is not about computing value functions but about performing probabilistic inference? Marc Toussaint shows how recasting optimal control as message passing opens new computational pathways for robotics and decision-making.

Subscribe for more from the Convergent Science Network podcast series.

Marc Toussaint presents a theoretical framework that reformulates planning and optimal control as probabilistic inference in graphical models. Rather than iterating backward through Bellman equations to compute value functions, his approach computes both forward and backward messages whose product yields a posterior distribution over actions. This shift in perspective is not merely notational: it leads to genuinely different approximation algorithms, particularly for complex problems like partially observable Markov decision proce ...

Palabras clave

planning as inferencegraphical modelsreinforcement learningBoltzmann distributionoptimal control