{"id":71,"date":"2015-02-26T12:40:41","date_gmt":"2015-02-26T12:40:41","guid":{"rendered":"http:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/?p=71"},"modified":"2020-11-24T11:31:43","modified_gmt":"2020-11-24T11:31:43","slug":"human-level-control-through-deep-reinforcement-learning","status":"publish","type":"post","link":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/2015\/02\/26\/human-level-control-through-deep-reinforcement-learning\/","title":{"rendered":"Human-level control through deep reinforcement learning"},"content":{"rendered":"\n<figure class=\"wp-block-embed-youtube aligncenter wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"Inside DeepMind\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube-nocookie.com\/embed\/xN1d3qHMIEQ?rel=0\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<p>Demis Hassabis and colleagues at DeepMind have published their work on Q-networks. The agent is capable of learning how to play a number of Atari games by receiving only pixels and score as &#8216;sensory input&#8217;. Its performance surpasses that of any professional human game tester.<\/p>\n\n\n\n<p><a href=\"http:\/\/www.nature.com\/nature\/journal\/v518\/n7540\/full\/nature14236.html?WT.ec_id=NATURE-20150226\">Link to the article<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Demis Hassabis and colleagues at DeepMind have published their work on Q-networks. The agent is capable of learning how to play a number of Atari games by receiving only pixels and score as &#8216;sensory input&#8217;. Its performance surpasses that of any professional human game tester. Link to the article<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_genesis_hide_title":false,"_genesis_hide_breadcrumbs":false,"_genesis_hide_singular_image":false,"_genesis_hide_footer_widgets":false,"_genesis_custom_body_class":"","_genesis_custom_post_class":"","_genesis_layout":"","footnotes":""},"categories":[8],"tags":[],"class_list":{"0":"post-71","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-blog","7":"entry"},"featured_image_src":null,"featured_image_src_square":null,"author_info":{"display_name":"","author_link":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/author\/"},"_links":{"self":[{"href":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/wp-json\/wp\/v2\/posts\/71","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/wp-json\/wp\/v2\/comments?post=71"}],"version-history":[{"count":3,"href":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/wp-json\/wp\/v2\/posts\/71\/revisions"}],"predecessor-version":[{"id":243,"href":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/wp-json\/wp\/v2\/posts\/71\/revisions\/243"}],"wp:attachment":[{"href":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/wp-json\/wp\/v2\/media?parent=71"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/wp-json\/wp\/v2\/categories?post=71"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www2.mrc-lmb.cam.ac.uk\/groups\/tripodi\/wp-json\/wp\/v2\/tags?post=71"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}