{"id":11639,"date":"2023-03-18T17:23:18","date_gmt":"2023-03-18T17:23:18","guid":{"rendered":"https:\/\/aurelis.org\/blog\/?p=11639"},"modified":"2023-03-20T19:01:30","modified_gmt":"2023-03-20T19:01:30","slug":"agents-and-expected-rewards","status":"publish","type":"post","link":"https:\/\/aurelis.org\/blog\/cognitive-insights\/agents-and-expected-rewards","title":{"rendered":"Agents and Expected Rewards"},"content":{"rendered":"\n<h3>This is a story from expectation to motivation. Any motivation is eventually the expectation of \u2018reward,\u2019 be it a concrete one or more abstract; be it deeply value-based or more superficial.<\/h3>\n\n\n\n<p><strong>The simple case looks like conditioning.<\/strong><\/p>\n\n\n\n<p>You may read first about <a href=\"https:\/\/aurelis.org\/blog?p=11625\">what agents are<\/a>. An agent acts upon a reward by learning from it and changing its behavior towards heightening future expected rewards.<\/p>\n\n\n\n<p>Within a non-complex agent, this is theoretically like conditioning: a (seemingly) mechanical process that can be described from the outside without considering anything inside \u2015 no complex reasoning or other mental processing.<\/p>\n\n\n\n<p>Who says, \u2018conditioning theories,\u2019 says <a href=\"https:\/\/aurelis.org\/blog?p=2129\">Pavlov<\/a> and automatically thinks of his dogs.<\/p>\n\n\n\n<p><strong>But these dogs were also complex agents.<\/strong><\/p>\n\n\n\n<p>They were no abstract drawings on a sheet of paper.<\/p>\n\n\n\n<p>To any agent with some complexity \u2013 like Pavlov\u2019s dogs or you or me \u2013 the true reward is the interpretation, not the treat or the sound as a substituting (\u2018unconditioned\u2019) stimulus.<\/p>\n\n\n\n<p>Therefore, one can rightly ask what functions as an expected reward in any concrete case. Naturally, the reward may be <a href=\"https:\/\/aurelis.org\/blog?p=11645\">rewarding because of the underlying (deeper) value<\/a>. This is an everyday issue, yet <em>much <\/em>more complex (and more than simply \u2018subjective\u2019) than is generally thought. A lot of this goes on at the <a href=\"https:\/\/aurelis.org\/blog?p=8486\">subconceptual level<\/a>, as does most of what is related to motivation.<\/p>\n\n\n\n<p><strong>Reward is in the eye of the observer.<\/strong><\/p>\n\n\n\n<p>In real life, the reward is part of a concrete observation, which is the whole of feedback an agent gets after one or more of his actions. Thus, a reward can be seen as part of a pattern on its way toward pattern recognition and completion. [There\u2019s <a href=\"https:\/\/aurelis.org\/blog?p=11637\">more to say about rewards in this vein<\/a>.]<\/p>\n\n\n\n<p>It may be difficult to discern what should be proper feedback to serve as a reward. Is it the treat from the dog\u2019s boss or the attention that comes with it? This difference may seem inconsequential. However, in many cases, it\u2019s crucial, and seeing the difference may be the start of solving many problems in the human sphere \u2015 even at the geopolitical level.<\/p>\n\n\n\n<p>We may be consciously looking at the treats \u2013 or the social status, or the loads of money \u2013 while the brain is looking at the level of meaningfulness. Actually, it is <em>only<\/em> looking at that level.<\/p>\n\n\n\n<p><strong>A little story of dopamine<\/strong><\/p>\n\n\n\n<p>Brain researchers used to think of dopamine as one of the happiness neuromodulators, the brain\u2019s little treat or prize (to itself?) for doing something good.<\/p>\n\n\n\n<p>However, we see now that dopamine has a different function. It flows more on expectation than on fulfillment. It is, therefore, not the motivator, but part of the motivation itself. Meanwhile, it prepares the agent\u2019s brain to learn what is good or bad for the organism.<\/p>\n\n\n\n<p>Dopamine symbolizes the expectation of the reward, not the reward itself. One way to use this insight is by leveraging on the anticipation a bit more when you want to teach your dog (or child or\u2026) to behave in a desired manner.<\/p>\n\n\n\n<p>Keep some suspense in a friendly way.<\/p>\n\n\n\n<p><strong>The other side of the coin: immediate gratification<\/strong><\/p>\n\n\n\n<p>In immediate gratification, there is much less expectation. Dopamine flows in short peaks, not leaving time to savor expectations. May that make people more egotistic? What about gaming, and the use of game-theoretic means to try to motivate people? Is this not degrading \u2015 even culturally if many people are subjected to this?<\/p>\n\n\n\n<p><a href=\"https:\/\/aurelis.org\/blog?p=1376\">Delayed gratification<\/a> can lead to more depth, especially if frustration is meanwhile skillfully avoided. The brain\/mind gets more time and incentive for forming broader patterns. This way, the focus can spontaneously turn toward deeper values.<\/p>\n\n\n\n<p><strong>That is not the end.<\/strong> But at least it is on the road toward end values (not needing any other underlying). May the final one be <a href=\"https:\/\/aurelis.org\/blog?p=6822\">global compassion<\/a>?<\/p>\n<div data-object_id=\"11639\" class=\"cbxwpbkmarkwrap cbxwpbkmarkwrap_no_cat cbxwpbkmarkwrap-post \"><a  data-redirect-url=\"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/posts\/11639\"  data-display-label=\"0\" data-show-count=\"0\" data-bookmark-label=\" \"  data-bookmarked-label=\" \"  data-loggedin=\"0\" data-type=\"post\" data-object_id=\"11639\" class=\"cbxwpbkmarktrig  cbxwpbkmarktrig-button-addto\" title=\"Bookmark This\" href=\"#\"><span class=\"cbxwpbkmarktrig-label\"  style=\"display:none;\" > <\/span><\/a> <div  data-type=\"post\" data-object_id=\"11639\" class=\"cbxwpbkmarkguestwrap\" id=\"cbxwpbkmarkguestwrap-11639\"><div class=\"cbxwpbkmarkguest-message\"><a href=\"#\" class=\"cbxwpbkmarkguesttrig_close\"><\/a><h3 class=\"cbxwpbookmark-title cbxwpbookmark-title-login\">Please login to bookmark<\/h3>\n\t\t<form name=\"loginform\" id=\"loginform\" action=\"https:\/\/aurelis.org\/blog\/wp-login.php\" method=\"post\">\n\t\t\t\n\t\t\t<p class=\"login-username\">\n\t\t\t\t<label for=\"user_login\">Username or Email Address<\/label>\n\t\t\t\t<input type=\"text\" name=\"log\" id=\"user_login\" class=\"input\" value=\"\" size=\"20\" \/>\n\t\t\t<\/p>\n\t\t\t<p class=\"login-password\">\n\t\t\t\t<label for=\"user_pass\">Password<\/label>\n\t\t\t\t<input type=\"password\" name=\"pwd\" id=\"user_pass\" class=\"input\" value=\"\" size=\"20\" \/>\n\t\t\t<\/p>\n\t\t\t\n\t\t\t<p class=\"login-remember\"><label><input name=\"rememberme\" type=\"checkbox\" id=\"rememberme\" value=\"forever\" \/> Remember Me<\/label><\/p>\n\t\t\t<p class=\"login-submit\">\n\t\t\t\t<input type=\"submit\" name=\"wp-submit\" id=\"wp-submit\" class=\"button button-primary\" value=\"Log In\" \/>\n\t\t\t\t<input type=\"hidden\" name=\"redirect_to\" value=\"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/posts\/11639\" \/>\n\t\t\t<\/p>\n\t\t\t\n\t\t<\/form><\/div><\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>This is a story from expectation to motivation. Any motivation is eventually the expectation of \u2018reward,\u2019 be it a concrete one or more abstract; be it deeply value-based or more superficial. The simple case looks like conditioning. You may read first about what agents are. An agent acts upon a reward by learning from it <a class=\"moretag\" href=\"https:\/\/aurelis.org\/blog\/cognitive-insights\/agents-and-expected-rewards\">Read the full article&#8230;<\/a><\/p>\n<div data-object_id=\"11639\" class=\"cbxwpbkmarkwrap cbxwpbkmarkwrap_no_cat cbxwpbkmarkwrap-post \"><a  data-redirect-url=\"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/posts\/11639\"  data-display-label=\"0\" data-show-count=\"0\" data-bookmark-label=\" \"  data-bookmarked-label=\" \"  data-loggedin=\"0\" data-type=\"post\" data-object_id=\"11639\" class=\"cbxwpbkmarktrig  cbxwpbkmarktrig-button-addto\" title=\"Bookmark This\" href=\"#\"><span class=\"cbxwpbkmarktrig-label\"  style=\"display:none;\" > <\/span><\/a> <div  data-type=\"post\" data-object_id=\"11639\" class=\"cbxwpbkmarkguestwrap\" id=\"cbxwpbkmarkguestwrap-11639\"><div class=\"cbxwpbkmarkguest-message\"><a href=\"#\" class=\"cbxwpbkmarkguesttrig_close\"><\/a><h3 class=\"cbxwpbookmark-title cbxwpbookmark-title-login\">Please login to bookmark<\/h3>\n\t\t<form name=\"loginform\" id=\"loginform\" action=\"https:\/\/aurelis.org\/blog\/wp-login.php\" method=\"post\">\n\t\t\t\n\t\t\t<p class=\"login-username\">\n\t\t\t\t<label for=\"user_login\">Username or Email Address<\/label>\n\t\t\t\t<input type=\"text\" name=\"log\" id=\"user_login\" class=\"input\" value=\"\" size=\"20\" \/>\n\t\t\t<\/p>\n\t\t\t<p class=\"login-password\">\n\t\t\t\t<label for=\"user_pass\">Password<\/label>\n\t\t\t\t<input type=\"password\" name=\"pwd\" id=\"user_pass\" class=\"input\" value=\"\" size=\"20\" \/>\n\t\t\t<\/p>\n\t\t\t\n\t\t\t<p class=\"login-remember\"><label><input name=\"rememberme\" type=\"checkbox\" id=\"rememberme\" value=\"forever\" \/> Remember Me<\/label><\/p>\n\t\t\t<p class=\"login-submit\">\n\t\t\t\t<input type=\"submit\" name=\"wp-submit\" id=\"wp-submit\" class=\"button button-primary\" value=\"Log In\" \/>\n\t\t\t\t<input type=\"hidden\" name=\"redirect_to\" value=\"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/posts\/11639\" \/>\n\t\t\t<\/p>\n\t\t\t\n\t\t<\/form><\/div><\/div><\/div>","protected":false},"author":2,"featured_media":11680,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","jetpack_publicize_message":""},"categories":[30],"tags":[],"jetpack_featured_media_url":"https:\/\/i1.wp.com\/aurelis.org\/blog\/wp-content\/uploads\/2023\/03\/2054b.jpg?fit=960%2C560&ssl=1","jetpack_publicize_connections":[],"jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p9Fdiq-31J","jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/posts\/11639"}],"collection":[{"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/comments?post=11639"}],"version-history":[{"count":10,"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/posts\/11639\/revisions"}],"predecessor-version":[{"id":11683,"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/posts\/11639\/revisions\/11683"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/media\/11680"}],"wp:attachment":[{"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/media?parent=11639"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/categories?post=11639"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/aurelis.org\/blog\/wp-json\/wp\/v2\/tags?post=11639"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}