Szepesvári, Csaba. 1998. “Non-Markovian Policies in Sequential Decision Problems”. Acta Cybernetica 13 (3), 305-18. https://cyber.bibl.u-szeged.hu/index.php/actcybern/article/view/3493.