Internal reward (task 3) progress week 2

Evolutionary Task Force » Internal reward (task 3) progress week 2

Work is underway to test QI as an alternative to distance for the internal reward for lifetime learning.

Secondly, we will endeavour to generate robot traces (i.e., sensori-motor logs) labelled as containing more or less desirable behaviour, where 'desirable' may be seen as walking far or maybe 'naturallly'. These logs can then be used as a basis for preference-based learning and indirectly define a fitness function of desirable sensori-motor states.

page revision: 2, last edited: 01 Dec 2011 10:04

Edit Tags History Files Print Site tools + Options

Click here to edit contents of this page.

Click here to toggle editing of individual sections of the page (if possible). Watch headings for an "edit" link when available.

Append content without editing the whole page source.

Check out how this page has evolved in the past.

If you want to discuss contents of this page - this is the easiest way to do it.

View and manage file attachments for this page.

A few useful tools to manage this Site.

See pages that link to and include this page.

Change the name (also URL address, possibly the category) of the page.

View wiki source for this page without editing.

View/set parent page (used for creating breadcrumbs and structured layout).

Notify administrators if there is objectionable content in this page.

Something does not work as expected? Find out what you can do.

General Wikidot.com documentation and help section.

Wikidot.com Terms of Service - what you can, what you should not etc.

Wikidot.com Privacy Policy.

Symbrion Evolutionary Computing Blog

The evolution of swarms and organisms