Dec 31, 2011
Big Bad Uzbek Solar Laser
Sep 16, 2011
A Change to the Link List was Overdue
Aug 23, 2011
Aug 3, 2011
Paper Drafts
This article presents a modification of reinforcement learning where an agent’s actions lead to rewards being received by a second agent interacting with the same environment. This model can be useful in the development of powerful AIs. Agent policies are proposed for dealing with observable rewards, with non-observable rewards in perfectly rational agents, and with non-observable rewards in bounded rational agents.
Newcomblike Problems and Optimal Agents
Abstract:
This article discusses the family of Newcomblike problems in the context of reinforcement learning. It reframes the problem of rational decision making as one of obtaining maximal rewards in a wide range of environments. Newcomblike problems are characterized by correlations between agent and environment policies. An optimal policy, taking into account these correlations, is given for known environments. For unknown environments, a quality criterion for policies is formulated.
Aug 1, 2011
Spy
Jul 8, 2011
The End of the Shuttle Program
LUKE: You were raised Jewish, right?
ELIEZER: Well that’s what I used to think, and then at one point I was watching a space shuttle launch on TV and getting tears in my eyes and realizing that I didn’t really get tears in my eyes for anything Judaism-related. That was when I realized that my childhood religion that I’d sort of grown away from over time, but still had the power to bring tears to my eyes, wasn’t Judaism so much as space travel.
(Luke Muehlhauser interviewing Eliezer Yudkowsky)
Celebrating the last launch of a space shuttle earlier today, a little music video by Yours Truly. Yoko Kanno's "BLUE" (performed by Mai Yamane, Yoko Kanno & the Seatbelts) from Cowboy Bebop set to Discovery's last launch, and a slideshow of a few shuttle-related images. This is in personal memoriam of A.K., who didn't make it for Discovery's final flight by a few weeks.