Status, 5 Dec 2011

VU is generating logs to analyse for preference-based learning.
Ran tests with QI as the internal reward (based on GPS at first, now enhanced with accelerometer and joint force-feedback signals, though these were not separately tuned); this gives results comparable to using distance as the reward.
