In a series of papers, we have formalized an active Bayesian perception approach for robotics based on recent progress in understanding animal perception. However, an issue for applied robot perception is how to tune this method to a task, using: (i) a belief threshold that adjusts the speed-accuracy tradeoff; and (ii) an active control strategy for relocating the sensor, e.g. to a preset fixation point. Here we propose that these two variables should be learnt by reinforcement from a reward signal evaluating the decision outcome. We test this claim with a biomimetic fingertip that senses surface curvature under uncertainty about contact location. Appropriate formulation of the problem allows use of multi-armed bandit methods to optimize the threshold and fixation point of the active perception. In consequence, the system learns to balance speed versus accuracy and sets the fixation point to optimize both quantities. Although we consider one example in robot touch, we expect that the underlying principles have general applicability. © 2013 IEEE.
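The abstract describes treating each candidate (belief threshold, fixation point) setting as an arm of a multi-armed bandit, with a reward that scores the decision outcome. The following is a minimal illustrative sketch of that idea, not the paper's implementation: the candidate settings, the simulated environment, and the reward shaping (accuracy minus a time penalty) are all hypothetical stand-ins for the real tactile system.

```python
import random

random.seed(0)

# Hypothetical discrete arms: (belief threshold, fixation point) pairs.
ARMS = [(th, fx) for th in (0.7, 0.9, 0.99) for fx in (-1.0, 0.0, 1.0)]

def simulate_decision(threshold, fixation):
    """Toy stand-in for an active-perception trial: higher thresholds
    take more sensing steps but decide more accurately; accuracy is
    assumed (arbitrarily) to peak at fixation point 0.0."""
    steps = 1 + int(10 * threshold)
    accuracy = threshold * (1.0 - 0.2 * abs(fixation))
    correct = random.random() < accuracy
    # Reward trades off decision accuracy against decision time.
    return (1.0 if correct else 0.0) - 0.02 * steps

counts = [0] * len(ARMS)
values = [0.0] * len(ARMS)
EPSILON = 0.1  # exploration rate for epsilon-greedy bandit

for trial in range(5000):
    if random.random() < EPSILON:
        arm = random.randrange(len(ARMS))          # explore
    else:
        arm = max(range(len(ARMS)), key=lambda i: values[i])  # exploit
    reward = simulate_decision(*ARMS[arm])
    counts[arm] += 1
    values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean

best = max(range(len(ARMS)), key=lambda i: values[i])
print("learned (threshold, fixation):", ARMS[best])
```

Under this toy reward, the bandit converges toward the setting whose accuracy gain outweighs its time cost, which is the speed-versus-accuracy balancing the abstract describes; any bandit algorithm (UCB, Thompson sampling) could replace the epsilon-greedy rule here.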
Active Bayesian perception and reinforcement learning
Conference proceedings contribution
Institute of Electrical and Electronics Engineers, New York, NY, United States of America
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2013), pp. 4735–4740, 2013