indirect imitation learning

This note last modified January 27, 2021

Require environmental interaction and an evaluation function as opposed to direct imitation learning