Causal decision theory

Causal decision theory is a mathematical theory intended to determine the set of rational choices in a given situation. It is a school of thought in decision theory. In informal terms, it maintains that the rational choice is that with the best expected causal consequences. This theory is often contrasted with evidential decision theory, which recommends those actions that provide the best expected outcome conditional on one’s best evidence about the world.

Informal description

Informally, causal decision theory recommends the agent to make the decision with the best expected causal consequences. For example: if eating an apple will cause you to be happy and eating an orange will cause you to be sad then you would be rational to eat the apple. One complication is the notion of expected causal consequences. Imagine that eating a good apple will cause you to be happy and eating a bad apple will cause you to be sad but you aren't sure if the apple is good or bad. In this case you don't know the causal effects of eating the apple. Instead, then, you work from the expected causal effects, where these will depend on three things: (1) how likely you think the apple is to be good and how likely you think it is to be bad; (2) how happy eating a good apple makes you; and (3) how sad eating a bad apple makes you. In informal terms, causal decision theory advises the agent to make the decision with the best expected causal effects.

Formal description

In a 1981 article, Allan Gibbard and William Harper explained causal decision theory as maximization of the expected utility $U$ of an action $A$ "calculated from probabilities of counterfactuals":[1]

U(A)=\sum \limits _{j}P(A>O_{j})D(O_{j}),

where $D(O_{j})$ is the desirability of outcome $O_{j}$ and $P(A>O_{j})$ is the counterfactual probability that, if $A$ were done, then $O_{j}$ would hold.

Difference from evidential decision theory

David Lewis proved[2] that the probability of a conditional $P(A>O_{j})$ does not always equal the conditional probability $P(O_{j}|A)$ .[3] (see also Lewis's triviality result) If that were the case, causal decision theory would be equivalent to evidential decision theory, which uses conditional probabilities.

Gibbard and Harper showed that if we accept two axioms (one related to the controversial principle of the conditional excluded middle[4]), then the statistical independence of $A$ and $A>O_{j}$ suffices to guarantee that $P(A>O_{j})=P(O_{j}|A)$ . However, there are cases in which actions and conditionals are not independent. Gibbard and Harper give an example in which King David wants Bathsheba but fears that summoning her would provoke a revolt.

Further, David has studied works on psychology and political science which teach him the following: Kings have two personality types, charismatic and uncharismatic. A king's degree of charisma depends on his genetic make-up and early childhood experiences, and cannot be changed in adulthood. Now, charismatic kings tend to act justly and uncharismatic kings unjustly. Successful revolts against charismatic kings are rare, whereas successful revolts against uncharismatic kings are frequent. Unjust acts themselves, though, do not cause successful revolts; the reason uncharismatic kings are prone to successful revolts is that they have a sneaky, ignoble bearing. David does not know whether or not he is charismatic; he does know that it is unjust to send for another man's wife. (p. 164)

In this case, evidential decision theory recommends that David abstain from Bathsheba, while causal decision theory—noting that whether David is charismatic or uncharismatic cannot be changed—recommends sending for her.

When required to choose between causal decision theory and evidential decision theory, philosophers usually prefer causal decision theory.[5]

Criticism

Vagueness

The theory of causal decision theory (CDT) does not itself specify what algorithm to use to calculate the counterfactual probabilities.[4] One proposal is the "imaging" technique suggested by Lewis:[6] To evaluate $P(A>O_{j})$ , move probability mass from each possible world $w$ to the closest possible world $w_{A}$ in which $A$ holds, assuming $A$ is possible. However, this procedure requires that we know what we would believe if we were certain of $A$ ; this is itself a conditional to which we might assign probability less than 1, leading to regress.[4]

Counterexamples

There are innumerable "counterexamples" where, it is argued, a straightforward application of CDT fails to produce a defensibly "sane" decision. Philosopher Andy Egan argues this is due to a fundamental disconnect between the intuitive rational rule, "do what you expect will bring about the best results", and CDT's algorithm of "do whatever has the best expected outcome, holding fixed our initial views about the likely causal structure of the world." In this view, it is CDT's requirement to "hold fixed the agent’s unconditional credences in dependency hypotheses" that leads to irrational decisions.[7]

An early alleged counterexample is Newcomb's problem. Because your choice of one or two boxes can't causally affect the Predictor's guess, causal decision theory recommends the two-boxing strategy.[1] However, this results in getting only $1,000, not $1,000,000. Philosophers disagree whether one-boxing or two-boxing is the "rational" strategy.[8] Similar concerns may arise even in seemingly-straightforward problems like the prisoner's dilemma,[9] especially when playing opposite your "twin" whose choice to cooperate or defect correlates strongly, but is not caused by, your own choice.[10]

In the "Death in Damascus" scenario, an anthropomorphic "Death" predicts where you will be tomorrow, and goes to wait for you there. As in Newcomb's problem, we postulate that Death is a reliable predictor. A CDT agent would be unable to process the correlation, and may as a consequence make irrational decisions:[7][11][12] "You should rather play hide-and-seek against someone who cannot predict where you hide than against someone who can. Causal Decision Theory denies this. So Causal Decision Theory is false."[13]

Another recent counterexample is the "Psychopath Button":[7][14]

Paul is debating whether to press the ‘kill all psychopaths’ button. It would, he thinks, be much better to live in a world with no psychopaths. Unfortunately, Paul is quite confident that only a psychopath would press such a button. Paul very strongly prefers living in a world with psychopaths to dying. Should Paul press the button?

According to Egan, "pretty much everyone" agrees that Paul should not press the button, yet CDT endorses pressing the button.[7]

Philosopher Jim Joyce, perhaps the most prominent modern defender of CDT,[15] argues that CDT naturally is capable of taking into account any "information about what one is inclined or likely to do as evidence". This interpretation of CDT would require solving additional issues: How can a CDT agent avoid stumbling into having beliefs related to its own future acts, and thus becoming provably inconsistent via Gödelian incompleteness and Löb's theorem? How does the agent standing on a cliff avoid inferring that if he were to jump, he would probably have a parachute to break his fall?[16][17]

Alternatives to causal and evidential decision theory

Some scholars believe that a new decision theory needs to be built from the ground up. Philosopher Christopher Meacham proposes "Cohesive Expected Utility Maximization": An agent "should perform the act picked out by a comprehensive strategy which maximizes cohesive expected utility". Meacham also proposes this can be extended to "Global Cohesive Expected Utility Maximization" to enable superrationality-style cooperation between agents.[18][19] In the context of AI, bitcoin pioneer Wei Dai proposes "updateless decision theory", which adds to globally cohesive mechanisms the admittedly difficult concept of "logical counterfactuals" to avoid being blackmailed:[18]

Consider an agent that would pay up in response to a counterfactual blackmail. The blackmailer would predict this and blackmail the agent. Now, instead, consider an agent that would refuse to pay up in response to a counterfactual blackmail... The blackmailer would predict this too, and so would not blackmail the agent. Therefore, if we are constructing an agent that might encounter counterfactual blackmail, then it is a better overall policy to construct an agent that would refuse to pay up when blackmailed in this way.

It is an open question whether a satisfactory formalization of logical counterfactuals exists.[20][21]

Notes

Gibbard, A.; Harper, W.L. (1981), "Counterfactuals and two kinds of expected utility", Ifs: Conditionals, Beliefs, Decision, Chance, and Time: 153–190
Lewis, D. (1976), "Probabilities of conditionals and conditional probabilities", The Philosophical Review, 85 (3): 297–315, doi:10.2307/2184045, JSTOR 2184045
In fact, Lewis proved a stronger result: "if a class of probability functions is closed under conditionalizing, then there can be no probability conditional for that class unless the class consists entirely of trivial probability functions," where a trivial probability function is one that "never assigns positive probability to more than two incompatible alternatives, and hence is at most four-valued [...]."
Shaffer, Michael John (2009), "Decision Theory, Intelligent Planning and Counterfactuals", Minds and Machines, 19 (1): 61–92, doi:10.1007/s11023-008-9126-2
Weirich, Paul, "Causal Decision Theory", The Stanford Encyclopedia of Philosophy (Winter 2016 Edition), Edward N. Zalta (ed.), URL = plato.stanford.edu/archives/win2016/entries/decision-causal/
Lewis, D. (1981), "Causal decision theory" (PDF), Australasian Journal of Philosophy, 59 (1): 5–30, doi:10.1080/00048408112340011, retrieved 2009-05-29
Egan, A. (2007), "Some counterexamples to causal decision theory" (PDF), The Philosophical Review, 116 (1): 93–114, CiteSeerX 10.1.1.642.5936, doi:10.1215/00318108-2006-023, archived from the original (PDF) on 2017-03-11, retrieved 2017-07-27
Bellos, Alex (28 November 2016). "Newcomb's problem divides philosophers. Which side are you on?". The Guardian. Retrieved 27 July 2017.
Lewis, D. (1979), "Prisoners' dilemma is a Newcomb problem", Philosophy & Public Affairs, 8 (3): 235–240, JSTOR 2265034
Howard, J. V. (May 1988). "Cooperation in the Prisoner's Dilemma". Theory and Decision. 24 (3): 203–213. doi:10.1007/BF00148954.
Meacham, Christopher JG. "Binding and its consequences." Philosophical studies 149.1 (2010): 49-71.
Harper, William (January 1984). "Ratifiability and Causal Decision Theory: Comments on Eells and Seidenfeld". PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association. 1984 (2): 213–228. doi:10.1086/psaprocbienmeetp.1984.2.192506.
Ahmed, A. (1 September 2014). "Dicing with death". Analysis. 74 (4): 587–592. doi:10.1093/analys/anu084.
Greaves, Hilary. "Epistemic decision theory." Mind 122.488 (2013): 915-952.
Wedgwood, Ralph. "Gandalf’s solution to the Newcomb problem." Synthese (2013): 1-33.
Weirich, Paul, "Causal Decision Theory", The Stanford Encyclopedia of Philosophy (Winter 2016 Edition), Edward N. Zalta (ed.), URL = plato.stanford.edu/archives/win2016/entries/decision-causal/
Joyce, James M. "Regret and instability in causal decision theory." Synthese 187.1 (2012): 123-145.
Soares, Nate, and Benja Fallenstein. "Toward Idealized Decision Theory." Machine Intelligence Research Institute. 2014.
Meacham, Christopher JG. "Binding and its consequences." Philosophical studies 149.1 (2010): 49-71.
Nate Soares and Benja Fallenstein. Counterpossibles as necessary for decision theory. In Artificial General Intelligence. Springer, 2015.
Everitt, Tom, Jan Leike, and Marcus Hutter. "Sequential extensions of causal and evidential decision theory." International Conference on Algorithmic Decision Theory. Springer, Cham, 2015.

External links

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[GibbardHarper-1] Gibbard, A.; Harper, W.L. (1981), "Counterfactuals and two kinds of expected utility", Ifs: Conditionals, Beliefs, Decision, Chance, and Time: 153–190

[Lewis1976-2] Lewis, D. (1976), "Probabilities of conditionals and conditional probabilities", The Philosophical Review, 85 (3): 297–315, doi:10.2307/2184045, JSTOR 2184045

[3] In fact, Lewis proved a stronger result: "if a class of probability functions is closed under conditionalizing, then there can be no probability conditional for that class unless the class consists entirely of trivial probability functions," where a trivial probability function is one that "never assigns positive probability to more than two incompatible alternatives, and hence is at most four-valued [...]."

[Shaffer2009-4] Shaffer, Michael John (2009), "Decision Theory, Intelligent Planning and Counterfactuals", Minds and Machines, 19 (1): 61–92, doi:10.1007/s11023-008-9126-2

[5] Weirich, Paul, "Causal Decision Theory", The Stanford Encyclopedia of Philosophy (Winter 2016 Edition), Edward N. Zalta (ed.), URL = plato.stanford.edu/archives/win2016/entries/decision-causal/

[Lewis1981-6] Lewis, D. (1981), "Causal decision theory" (PDF), Australasian Journal of Philosophy, 59 (1): 5–30, doi:10.1080/00048408112340011, retrieved 2009-05-29

[Egan2007-7] Egan, A. (2007), "Some counterexamples to causal decision theory" (PDF), The Philosophical Review, 116 (1): 93–114, CiteSeerX 10.1.1.642.5936, doi:10.1215/00318108-2006-023, archived from the original (PDF) on 2017-03-11, retrieved 2017-07-27

[8] Bellos, Alex (28 November 2016). "Newcomb's problem divides philosophers. Which side are you on?". The Guardian. Retrieved 27 July 2017.

[Lewis1979-9] Lewis, D. (1979), "Prisoners' dilemma is a Newcomb problem", Philosophy & Public Affairs, 8 (3): 235–240, JSTOR 2265034

[10] Howard, J. V. (May 1988). "Cooperation in the Prisoner's Dilemma". Theory and Decision. 24 (3): 203–213. doi:10.1007/BF00148954.

[binding-11] Meacham, Christopher JG. "Binding and its consequences." Philosophical studies 149.1 (2010): 49-71.

[12] Harper, William (January 1984). "Ratifiability and Causal Decision Theory: Comments on Eells and Seidenfeld". PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association. 1984 (2): 213–228. doi:10.1086/psaprocbienmeetp.1984.2.192506.

[13] Ahmed, A. (1 September 2014). "Dicing with death". Analysis. 74 (4): 587–592. doi:10.1093/analys/anu084.

[14] Greaves, Hilary. "Epistemic decision theory." Mind 122.488 (2013): 915-952.

[15] Wedgwood, Ralph. "Gandalf’s solution to the Newcomb problem." Synthese (2013): 1-33.

[sep-16] Weirich, Paul, "Causal Decision Theory", The Stanford Encyclopedia of Philosophy (Winter 2016 Edition), Edward N. Zalta (ed.), URL = plato.stanford.edu/archives/win2016/entries/decision-causal/

[17] Joyce, James M. "Regret and instability in causal decision theory." Synthese 187.1 (2012): 123-145.

[soares-18] Soares, Nate, and Benja Fallenstein. "Toward Idealized Decision Theory." Machine Intelligence Research Institute. 2014.

[19] Meacham, Christopher JG. "Binding and its consequences." Philosophical studies 149.1 (2010): 49-71.

[20] Nate Soares and Benja Fallenstein. Counterpossibles as necessary for decision theory. In Artificial General Intelligence. Springer, 2015.

[21] Everitt, Tom, Jan Leike, and Marcus Hutter. "Sequential extensions of causal and evidential decision theory." International Conference on Algorithmic Decision Theory. Springer, Cham, 2015.