Experimental philosophy and the problem of evil

Ian M. Church; Blake McAllister; James S. Spiegel

doi:10.1017/S0034412525000125

Experimental philosophy and the problem of evil

Published online by Cambridge University Press: 04 March 2025

Ian M. Church

Blake McAllister and

James S. Spiegel

Show author details

Ian M. Church*: Affiliation:
Department of Philosophy and Religion, Hillsdale College, Hillsdale, MI, USA
Blake McAllister: Affiliation:
Department of Philosophy and Religion, Hillsdale College, Hillsdale, MI, USA
James S. Spiegel: Affiliation:
Department of Philosophy and Religion, Hillsdale College, Hillsdale, MI, USA
*: Corresponding author: Ian M. Church; Email: [email protected]

Article contents

Abstract
Recent experimental philosophical research on the problem of evil
Philosophical research on the problem of evil: the current state of play
Potential areas of experimental philosophical research on the problem of evil
Conclusion
Competing Interests
Footnotes
References

Rights & Permissions

Abstract

The problem of evil is an ideal topic for experimental philosophy. Suffering – which is at the heart of most prominent formulations of the problem of evil – is a universal human experience and has been the topic of careful reflection for millennia. However, interpretations of suffering and how it bears on the existence of God are tremendously diverse and nuanced. Why does suffering push some people toward atheism while pushing others toward deeper faith? What cultural, psychological, or sociological differences account for this diversity of responses? And, importantly, what light might this diversity of responses shed on the problem of evil and how it has been formulated by philosophers in recent years? The aim of this article is to highlight how the tools and resources of experimental philosophy might be fruitfully applied to the problem of evil. In the first section, we review some recent work in this area and describe the current state of this emergent body of literature. In the second section, we review the broader and more recent theoretical developments on the problem of evil. In the final section, we outline some potential areas of future empirical research that we see as especially promising given those developments.

Keywords

problem of evil experimental philosophy psychology of religion arguments against theism experimental philosophy of religion

Type: Original Article
Information: Religious Studies , First View , pp. 1 - 20

DOI: https://doi.org/10.1017/S0034412525000125 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press.

The problem of evil is an ideal topic for experimental philosophy. Suffering – which is at the heart of most prominent formulations of the problem of evil – is a universal human experience and has been the topic of careful reflection for millennia. However, interpretations of suffering and how it bears on the existence of God are tremendously diverse and nuanced. As empirically minded philosophers, we might immediately find ourselves wondering why (and how!) something so universal might be understood in so many different ways. Why does suffering push some people toward atheism while pushing others toward deeper faith? What cultural, psychological, or sociological differences account for this diversity of responses? And, importantly, what light might this diversity of responses shed on the problem of evil and how it has been formulated by philosophers in recent years?

The primary aim of this article is to highlight how the tools and resources of experimental philosophy might be fruitfully applied to the problem of evil.Footnote ¹ To start, in the first section, we review some recent work in this area and describe the current state of this emergent body of literature. So far, experimental philosophy of religion regarding the problem of evil has primarily focused on the two most influential formulations of the problem of evil: William Rowe’s ‘The Problem of Evil and Some Varieties of Atheism’ (Rowe Reference Rowe1979) and Paul Draper’s ‘Pleasure and Pain: An Evidential Problem for Theists’ (Draper Reference Draper1989). In the second section we review the broader and more recent theoretical developments on the problem of evil. Finally, in the third section, we outline some potential areas of future empirical research that we see as especially promising given those developments.

Recent experimental philosophical research on the problem of evil

The problem of evil is, of course, an ancient problem. Traditionally, the existence of evil was taken to be logically incompatible with the existence of an all-good, all-knowing, all-powerful God. In his seminal article, ‘Evil and Omnipotence’, J.L. Mackie put the problem like this: ‘In its simplest form the problem is this: God is omnipotent; God is wholly good; and yet evil exists. There seems to be some contradiction between these three propositions, so that if any two of them were true the third would be false’ (Mackie Reference Mackie1955, 200). This approach to the problem is called the logical problem of evil.

In his landmark book, God, Freedom, and Evil (Plantinga Reference Plantinga1974), Alvin Plantinga argued that, contrary to Mackie, there is no logical incompatibility between the existence of evil and the claims that God is wholly good and omnipotent. If a world with free moral agents is always more valuable than a world without free moral agents, and if the only way to guarantee a world without evil is to create a world without free moral agents, then a wholly good and omnipotent God might allow for evil to accomplish the goal of having a world with free moral agents.

Plantinga’s argument was broadly taken to be persuasive. In response, proponents of the problem of evil changed tack. Instead of arguing that the existence of evil is logically incompatible with a traditional conception of God, they argue for the much weaker thesis that evil poses an epistemic challenge for those forms of theism. Some argued that the existence of gratuitous evil was evidence against traditional brands of theism, a type of argument that came to be called the evidential problem of evil. Others framed the problem in terms of epistemic probability – that the amount and distribution of evil in the world made the truth of theism epistemically less probable than alternative hypotheses. This type of argument is called the probabilistic problem of evil.Footnote ² William Rowe’s seminal article, ‘The Problem of Evil and Some Varieties of Atheism’ (Rowe Reference Rowe1979), is the most cited formulation of the problem of evil in the extant philosophical literature and an archetypal example of an evidential problem. And Paul Draper’s landmark article, ‘Pleasure and Pain: An Evidential Problem for Theists’ (Draper Reference Draper1989), has come to be considered, perhaps somewhat confusingly given the title, an archetypal example of the probabilistic problem of evil.

As experimental philosophy proliferated at the start of the twenty-first century, its tools and resources were slow to be adopted by philosophers of religion, and attempts to apply those tools and resources to the problem of evil have only started in the past few years.Footnote ³ So far, work has exclusively focused on those two seminal formulations, Rowe’s evidentialist formulation and Draper’s probabilistic formulation. We will briefly elucidate each before discussing the empirical research that has already been done.

Rowe’s formulation

In William Rowe’s seminal version of the problem of evil,Footnote ⁴ he levels the following argument against theism:

1. There exist instances of [pointless suffering–defined as] intense suffering which an omnipotent, omniscient being could have prevented without thereby losing some greater good or permitting some evil equally bad or worse.
2. An omniscient, wholly good being would prevent the occurrence of any intense suffering it could, unless it could not do so without thereby losing some greater good or permitting some evil equally bad or worse.
3. [Therefore] there does not exist an omnipotent, omniscient, wholly good being. (Rowe Reference Rowe1979, 336)

Of course, such an argument is valid, but why should we think that the premises are true? Premise 2 seems fairly unobjectionable. Indeed, as Rowe notes, ‘This premise … is, I think, held in common by many atheists and nontheists’ (Rowe Reference Rowe1979, 336). For this article, we’re happy to agree; our focus will be on premise 1.

Why should we think that there is pointless suffering? Here Rowe has us think about an example of what seems like a good candidate for a pointless evil:

Fawn: Suppose in some distant forest lightning strikes a dead tree, resulting in a forest fire. In the fire a fawn is trapped, horribly burned, and lies in terrible agony for several days before death relieves its suffering (Rowe Reference Rowe1979, 337).

According to Rowe, ‘so far as we can see, the fawn’s intense suffering is pointless’ (Rowe Reference Rowe1979, 337). (Though, whether or not the suffering is actually pointless is the subject of the debate.) While an omnipotent, omniscient, all-good being certainly could have prevented such an event, it’s extremely difficult to imagine how permitting something like the suffering of Fawn could either prevent a greater evil from occurring or usher in some greater good. As such, premise 1 looks plausible.

But, as Rowe is quick to note, this doesn’t amount to a proof. For all we can tell, there is a greater evil or good that allowing Fawn prevents or affords, respectively. The problem, as Rowe sees it, is that given ‘our experience and knowledge of the variety and profusion of suffering in our world’ it sure seems like such evils are pointless; hence, the argument still provides ‘rational support for atheism’ and makes it ‘reasonable for us to believe that the theistic God does not exist’ (Rowe Reference Rowe1979, 338, emphasis ours).Footnote ⁵

Critically, it is our intuitions regarding Fawn (and related cases) that are the driving force for thinking that premise 1 is true and, thus, for Rowe’s evidential argument more generally. As Alvin Plantinga elucidates Rowe’s argument, if it seems as though the suffering in FAWN is pointless, then that gives us a reason for thinking that the suffering in Fawn is pointless (Reference Plantinga2000, 465–466).

Empirical work on Rowe’s formulation

In a new article, McAllister, Church, Rezkalla, and Nguyen propose to study whether the intuitions that underwrite Rowe’s seminal formulation of the problem of evil vary across demographics (McAllister et al. Reference McAllister, Church, Rezkalla, Nguyen, Knobe and Nichols2024, sec. 2.1). They hypothesize that intuitions regarding Rowe’s case will significantly diverge according to respondents’ religious beliefs, nationality, ethnicity, gender, and education level. McAllister et al. also inquire into the psychological mechanisms underwriting these intuitions. For instance, given that fawns are quintessentially cute animals (think of Bambi!), it raises the question: does cuteness influence the perception of pointlessness? Would the tragic demise of a less adorable animal be considered equally pointless?

Additionally, the concise nature of the fawn case, spanning just two sentences, may play a role in its perceived pointlessness. Some scholars have stressed the significance of context and narrative in addressing the issue of suffering.Footnote ⁶ Would the inclusion of more background details alter perceptions of pointlessness?

To investigate these questions, Ian Church, Justin Barrett, and the rest of their team conducted an empirical study (see Church et al. Reference Church, Carlson and Barrett2021, Reference Church, Worcol and Barrett2022), the results of which were more fully reported in McAllister et al.’s (Reference McAllister, Church, Rezkalla, Nguyen, Knobe and Nichols2024) article.Footnote ⁷ Participants read some variation of Rowe’s vignette of the fawn from the Rowe (Reference Rowe1979) paper. For half of the participants the vignette was accompanied by a description of the role of wildfires in a forest ecosystem, including their contributions to the health of the ecosystem by clearing away dead organic material and leaving behind a topsoil dense in organic materials. The other half of participants read the vignette without context, just as it appeared in Rowe’s Reference Rowe1979 paper. The subject of the vignette varied as either a fawn, a boar, or a vulture. Finally, in half of the cases a picture of the subject of the vignette accompanied the vignette. Thus, this experiment contained three variables: context (high or low), picture (picture or no picture), and animal (fawn, boar, or vulture).

After reading the vignette, participants rated the following statements designed to assess their degree of agreement or disagreement with Rowe’s intuition that the suffering described in the vignette is pointless: ‘The story you just read is an example of pointless suffering’, ‘Some equal or greater evil could have been prevented because of the situation in the story’, and ‘Some equal or greater good could be accomplished because of the situation in the story’. Church and the rest of the team initially intended to measure the degree to which participants shared Rowe’s intuitions through an index compiled of the score of these three statements, however, they found that whereas scores of the last two questions were highly correlated (r = .478, p < 0.01) the first question was not highly correlated in the expected direction with the last two questions (r = .071, p < 0.01 and r = − .171, p < 0.01). Therefore, they measured agreement with Rowe through an index of the reverse scored second and third questions.Footnote ⁸

What did they find? In summary, they found significant variations across each demographic variable when it comes to agreement with Rowe, though sometimes in surprising ways – for instance,women were more likely to agree with Rowe than men and agreement with Rowe consistently dropped with increases in education-level. They also found that the inclusion of context dramatically diminished the perceptions of pointlessness. The cuteness of the animal (with or without a picture) did not have a measurable impact on people’s responses. Perhaps most remarkable was the fact that so few people actually agreed with Rowe’s assessment of the target vignette. To quote McAllister et al., ‘One of the more notable results is how few people, only 12.61%, share any level of agreement with Rowe about the gratuitousness of the fawn’s suffering’ (McAllister et al. Reference McAllister, Church, Rezkalla, Nguyen, Knobe and Nichols2024, sec. 3.1).

McAllister et al. argued that, if these results are accurate, then they put those who agree with Rowe in an epistemically perilous position. Few, it seems, will have strong reasons for privileging the judgement of those who agree with Rowe over those who don’t. Accordingly, one’s justification for believing that the fawn’s suffering is gratuitous could be partially or even fully defeated, undermining Rowe’s argument. Given the exploratory nature of the research thus far, McAllister et al. acknowledge that such a conclusion is highly defeasible and conditional on whether the study is accurately measuring agreement with Rowe. That said, these findings certainly raise questions about the reach and strength of Rowe’s argument – questions that deserve further investigation.

Follow-up research on Rowe’s formulation

Serious questions remain about the accuracy of these findings and what to make of them. To start, there is the question of whether McAllister et al.’s two-item index is an accurate metric for agreement with Rowe. The challenge of making sure what is measured lines up with what philosophers are talking about is a perennial challenge for any work within experimental philosophy, and the situation is no different here.

For instance, participants might agree that some greater good could be accomplished by the fawn’s suffering (or some worse evil prevented by it) but think of this possibility as extremely remote. To disagree with Rowe, however, participants need to think of this possibility as being plausible, or at least not particularly unlikely.

To address this concern, McAllister, Church, and the rest of their team have developed an eight-item index to measure agreement with Rowe which they plan to use in follow-up research. First, the two from McAllister et al. (Reference McAllister, Church, Rezkalla, Nguyen, Knobe and Nichols2024):

‘Some equal or greater evil could have been prevented because of the situation in the story.’

‘Some equal or greater good could be accomplished because of the situation in the story.’

Plus the following six items:

‘It is plausible that the situation in the story brings about some greater good.’

‘It is plausible that the situation in the story prevents some worse evil.’

‘Probably, the situation in the story doesn’t bring about some greater good.’ (reverse scored)

‘Probably, the situation in the story doesn’t prevent some worse evil.’ (reverse scored)

‘It is reasonable to believe that the situation in the story doesn’t bring about some greater good.’ (reverse scored)

‘It is reasonable to believe that the situation in the story doesn’t prevent some worse evil.’ (reverse scored)

This expanded index should allow them to better gauge how credible participants find the possibility of a greater good or worse evil. And since it includes the original two-item index, it also allows them to see whether their original findings replicate.

In their 2022 article, ‘The Context of Suffering’, Church, Warchol, and Barrett draw on this empirical research to explore the idea that context plays an important role in diminishing perceptions of suffering as pointless. Referees and critical commentators raised a suite of important questions about why and whether context really diminish perceptions of pointlessness and what significance this ‘contextual effect’ has for evaluating Rowe’s argument.

One suggestion offered by commentators is that the context given to participants was too positive and that this positive valence is what diminishes perceptions of pointlessness. Call this the positive valence hypothesis. Or maybe, as some referees mused, the context that was given to participants contained a hidden theodicy. On this proposal, contexts drive down the perceptions of pointlessness only (or primarily) when they invoke a theodicy. Call this the sneaky theodicy hypothesis. Or we might think that the context is simply distracting participants from the pointlessness of the suffering. So on this hypothesis, it’s not as if participants are somehow seeing a ‘point’ in the suffering; instead, the sense of pointlessness is just being lost in the shuffle of new information. Call this the distraction hypothesis. And, finally, we might worry that the effect of context is merely a result of making participants aware of how little they know regarding the situation surrounding Rowe’s vignette and it is this awareness that is driving down perceptions of pointlessness. Call this the sceptical theism objection.

In follow-up research, McAllister, Church, and the rest of their team aim to explore each of these hypotheses by introducing new context conditions. Let’s call the original two context types ‘Regular Context’ and ‘No Context’. The following three context types will be added in future research: ‘Negative Context’, ‘Theodicy Context’, ‘Junk Context’ to address the first three of those objections.

Addressing the positive valence hypothesis, ‘Negative Context’ will give negative context to the target case of suffering (i.e. absolutely nothing good is reported to come about from the fawn’s suffering, only bad). If ‘Negative Context’ significantly diminishes the perception of pointlessness, then we’ll have reason to think that it’s not just the more positive valence of ‘Regular Context’ that was causing participants to report diminished perceptions of pointlessness.

Addressing the sneaky theodicy hypothesis, the ‘Theodicy Context’ will embed a full-throated theodicy into the original vignette. If the sneaky theodicy hypothesis were true, we would expect to see the ‘Theodicy Context’ significantly diminish the perception of pointlessness, even more than the ‘Regular Context’ condition. If the ‘Theodicy Context’ doesn’t diminish perceptions of pointlessness as significantly as ‘Regular Context’, then we may doubt that the sneaky theodicy hypothesis best explains the observed contextual effect.

Addressing the distraction hypothesis, ‘Junk Context’ will provide participants with contextual information that is only tangentially related to the target example of suffering (e.g. how forest fires produce light, that light travels nearly 300,000 km per second, etc.); if this context significantly reduces the perception of pointlessness, then that would lend further credence to the distraction hypothesis. If ‘Junk Context’ doesn’t significantly diminish participants’ perception of pointlessness, then it seems unlikely that the distraction hypothesis best explains the contextual effect.

Finally, in response to the sceptical theism hypothesis, instead of being assigned to one of the five context conditions, some participants will be randomly assigned to what we’re calling the Sceptical Theism condition. Participants who are assigned this condition will read a brief discussion on the significant limitations humans face in trying to understand the suffering we encounter in this world before being asked to reflect on the original fawn vignette. If participants who were assigned the Sceptical Theism condition report a significantly diminished perception of pointlessness in the target suffering, then that would lend credence to the sceptical theism hypothesis. If, however, subjects who were assigned to the Sceptical Theism condition don’t report a significantly diminished perception of pointlessness, then we’ll have reason to think that the sceptical theism hypothesis does not best explain the observed contextual effect. But maybe none of these hypotheses – positive valence, sneaky theodicy, distraction, or sceptical theism – best explain the role context played in diminishing perceptions of pointlessness. Further research could suggest that the role context is playing is more mysterious than we might have previously expected.

Draper’s formulation

In formulating his probabilistic problem of evil, Draper does not argue that God’s existence is logically incompatible with evil but only that ‘our knowledge about pain and pleasure creates a problem for theists’ (Draper Reference Draper1989, 331). Draper asks us to let ‘O’ represent ‘a statement reporting both the observations one has made of humans and animals experiencing pain or pleasure and the testimony one has encountered concerning [such things]’ (Draper Reference Draper1989, 332). He then contends that O is less epistemically probable given theism than the hypothesis of indifference (HI).Footnote ⁹ He provides the following definitions:

HI: ‘Neither the nature nor the condition of sentient beings on earth is the result of benevolent or malevolent actions performed by non-human persons’ (Draper Reference Draper1989, 332).

Theism: ‘There exists an omnipotent, omniscient, and morally perfect person who created the Universe’ (Draper Reference Draper1989, 331).

Let ‘P(X|Y)’ stand for the epistemic probability that X given Y. We can now express Draper’s conclusion a bit more formally as:

Draper’s Conclusion: ‘P(O|HI) is much greater than P(O|theism)’ (Draper Reference Draper1989, 333).

Note that according to Draper, the epistemic probability of O given HI isn’t just greater than the epistemic probability of O given theism; it’s much greater. Hence, the evidence that O provides for HI over theism is supposedly quite significant.

Importantly, the exact degree to which O confirms HI over theism will depend on each subject’s unique epistemic situation. For one, not all subjects start with the same ‘O’ since not everyone has the same observations or testimony about pleasure and pain. For another, background information may differ from one person to another such that the relevant conditional probabilities will also differ from subject to subject.Footnote ¹⁰ Nevertheless, Draper seems confident that his conclusion will hold for most if not all subjects given ‘the biological role played by both pain and pleasure in goal-directed organic systems’ (334). Such a role fits perfectly with the hypothesis of indifference but not nearly so well, he claims, with theism.

Empirical work on Draper’s formulation

In these early stages, experimental philosophers have primarily been interested in exploring whether people’s own assessments of the relevant conditional probabilities align with those of Draper. Ian Church, Blake McAllister, James Spiegel, and Justin Barrett (hereafter Church et al.) explore this in their manuscript, ‘Testing Folk Perceptions of Draper’s Probabilistic Problem of Evil’. Their initial hypothesis was that agreement with Draper’s Conclusion – that is, answers indicating that one’s ‘O’ is more epistemically probable given HI than given theism – would not be widely shared across demographic variables.Footnote ¹¹ If anything, in keeping with the empirical research on Rowe’s formulation discussed above, Church et al. hypothesized agreement with Draper’s Conclusion to be in the clear minority.

They also expected atheists and agnostics to be significantly more likely than various religious groups (e.g. Christians, Hindus, etc.) to give answers in line with Draper’s Conclusion. Likewise, given that men are statistically more likely than women to be atheists (Cragun Reference Cragun and Yamane2016, 307), they expected men to agree with Draper’s Conclusion significantly more on average than women. And given that education level negatively correlates with religiosity (Beit-Hallahmi Reference Beit-Hallahmi and Martin2006, 313), they expected agreement with Draper’s Conclusion to correlate with education level.

To gauge agreement with Draper’s Conclusion, participants were given the following two thought experiments in a random order:

Planet X: Imagine that you are transported to another universe, to a planet (call it Planet X) that is very similar to Earth and with creatures (including humans) that are very similar to Earth’s creatures and humans. Before exploring Planet X, imagine that you also know that this planet was created by an all-powerful, wholly good God who is involved in the lives of the humans there. Based solely on this knowledge, how confident are you that Planet X would contain a similar distribution of pain and pleasure that you have observed (and heard about from other people) in our own world?

Planet Y: Imagine that you are transported to another universe, to a planet (call it Planet Y) that is very similar to Earth and with creatures (including humans) that are very similar to Earth’s creatures and humans. Before exploring Planet Y, imagine that you also know the condition of the humans on Planet Y is NOT the result of actions performed by supernatural beings. Based solely on this knowledge, how confident are you that Planet Y would contain a similar distribution of pain and pleasure that you have observed (and heard about from other people) in our own world?Footnote ¹²

Using a 7-point Likert confidence scale ranging from ‘not at all confident’ to ‘entirely sure’, participants were asked to rate their confidence that the given planet would have a ‘similar distribution of pain and pleasure as [they] have observed (and heard about from other people) in our own world’. Plausibly, the more confident one is that the pleasure and pain on Planet X or Y will resemble our own, the higher one takes P(O|theism) and P(O|HI) to be, respectively.

Given this, Church et al. defined agreement with Draper’s Conclusion as registering a Planet Y confidence score (ranging from 1–7) that is greater than a Planet X confidence score (again, ranging from 1–7). Falling under this umbrella are two different types of agreement:

Partial Agreement: A participant reports partial agreement with Draper’s Conclusion if and only if their Planet X score is lower than their Planet Y score by one point.

Full Agreement: A participant reports full agreement with Draper’s Conclusion if and only if their Planet X score is at least two points lower than their Planet Y score (Reference Church, McAllister, Spiegel and BarrettChurch et al., in progress).

Contrary to what Church et al. hypothesized, their research found that Draper’s Conclusion enjoys comparatively broad intuitive support amongst the folk. Where only 12.6 per cent of participants ‘share any level of agreement’ with William Rowe’s Reference Rowe1979 formulation of the problem of evil (McAllister et al. Reference McAllister, Church, Rezkalla, Nguyen, Knobe and Nichols2024, sec. 3.1), around 41 per cent of participants surveyed by Church et al. reported at least Partial Agreement with Draper’s Conclusion. 28 per cent registered Full Agreement. Put another way, this line of research suggests that Draper faces far less of an uphill battle regarding folk intuitions than Rowe does. Draper took his conclusion to capture the intuitions ‘of a great many people who have regarded evil as an epistemic problem for theists’, and he certainly seems to be correct in that regard (Draper Reference Draper1989, 332).

Church et al. did find some variation across demographics, but variations across gender, ethnicity, nationality, education level, income, and so on were far more muted than the variation found in the McAllister et al. study of Rowe’s argument. Religion, however, seemed to have an enormous influence on whether or not (and to what degree) participants registered agreement with Draper. Atheists and agnostics were significantly more likely (p < .001) to agree with Draper than any of the other religious categories considered (such as Protestant, Catholic, Other Christian, Hindu).

Across all participants, the mean confidence score for Planet Y was higher (M = 4.83, SD = 1.44) than Planet X (M = 4.41, SD = 1.72). A 1-tailed t-test showed that the differences between the Planet Y scores and Planet X scores was significant (p < .001; d_s = .26). Other things being equal, Church et al. propose that there may be collective wisdom in the aggregate scores – that other things being equal, we should expect the average confidence scores of all participants to be a more accurate measure of the conditional probabilities relevant to Draper’s Conclusion than the individual confidence scores of any (arbitrarily chosen) individual participant. If that’s right, then such findings might provide evidential support for Draper’s Conclusion.Footnote ¹³

Follow-up research on Draper’s formulation

Church et al. caution that further research is needed before any firm conclusions can be drawn. To start, Church et al. acknowledge that atheists and agnostics are significantly over-represented in their sample. Given that atheists and agnostics were significantly more likely to agree with Draper than other religious groups, this over-representation may have skewed the averages in Draper’s direction. Church, McAllister, and the rest of their team are conducting follow-up studies with a more representative sample.

Another worry arises from the way Church et al.’s study directly asks participants how confident they are that ‘an all-powerful, wholly good God’ would create a world with a similar distribution of pain and pleasure as our own. Many theists, we might worry, will be quite confident that God would create such a world, because they already believe that God created this world. This is, however, to stack the deck in favour of theism and so might be called the improper weighting effect. Their evaluation of P(O|theism) should be conducted given only the fact that an all-powerful, wholly good God exists (and general background evidence acceptable to all parties).

A related concern is that the explicit religious framing might tempt participants to simply parrot whatever they take to be theological (or atheological) orthodoxy. Call this the parroting effect. Given the significant variation along religious lines, we might worry that reported answers aren’t really reflecting the intuitions of participants but merely their pre-established theological or atheological commitments.

To address both of these concerns, Church’s research group is designing follow-up studies that discourage improper weighting or parroting by reorienting the questions around third-parties (e.g. ‘What should this other person with limited information think?’) or removing explicitly religious framing (e.g. ‘What would this world look like if governed by a wholly good AI?’).

A final worry is that Draper’s exclusive focus on pleasure and pain may limit the reach of these results. People may agree that the distribution of pleasure and pain fits better with the hypothesis of indifference than with theism, but would they say the same about good and bad states of affairs more generally? What about distributions of virtue and vice? Beauty and ugliness? Once again, Church’s team plans to explore these issues further.

Philosophical research on the problem of evil: the current state of play

The literature on the problem of evil is immense, and extends far beyond the work of Rowe and Draper. There is no hope of us surveying it in its entirety. Our overview will be selective, giving shape to the broad contours of the debate but with a preference for certain issues that we find especially suitable for empirical probing, which will be discussed below.

Responses to evidential formulations

Non-empirical responses to the evidential and probabilistic problems of evil have generally taken three forms. The first is to give a theodicy (or a defence). The second is to adopt sceptical theism. The third, only now gaining prominence on the contemporary scene, is to claim for God some form of moral immunity.

Both theodicies and defences articulate reasons that would justify God in allowing evil (or certain types of evil) to exist. A defence claims only that it is epistemically possible, or true for all we know, that God has such reasons. A theodicy claims it is plausible God has such reasons.Footnote ¹⁴ A theodicy is more suitable in this context, since the mere possibility of justifying reasons does not seem sufficient to defuse either Rowe’s or Draper’s arguments.Footnote ¹⁵ Sceptical theism contests our ability to tell whether there are any justifying reasons.Footnote ¹⁶ In particular, we are in no position to conclude that God does not have justifying reasons. A final strategy says there is no need for justifying reasons since God enjoys moral immunity. That is, whatever moral principles underlie the contention that a good God would not allow evil (or evil of a certain sort) do not apply to God. We’ll look at each type of response more closely, from last to first.

Moral immunity

The terminology of ‘moral immunity’ is our own and seeks to cover a variety of positions all united by the contention that none of the moral principles on which the problem of evil is grounded, nor any other moral principles in their general vicinity, apply to God. Some go so far as to claim that God is not morally good (Davies Reference Davies2006). He does not exist in our moral economy in such a way that requires justifying reasons for acting in certain ways. Indeed, God does not act for reasons at all. Others concede God’s moral goodness but deny that his goodness gives him any obligation to promote our well-being (Adams Reference Adams1999; Murphy Reference Murphy2017). For Mark Murphy, God has reasons to promote our well-being but not requiring reasons. It may still be the case on such views that God’s treatment of us must meet certain requirements if he is to count as good or loving toward us (Adams Reference Adams2013, 16–17), but whether he is good or loving toward us is up to his discretion. Still others might concede that God has moral obligations to us but contest John Stuart Mill’s insistence that these must be substantially on par with our own (Harrison Reference Harrison2017). Treatment that may be forbidden for us can be permissible for God. While growing in popularity, such criticisms of the problem of evil remain in the minority, perhaps in part due to the predominance of Christianity within the debate.

Sceptical theism

Much more common is sceptical theism. Sceptical theists attack key inferences or rational processes relied on to support the problem of evil. For instance, how does one support the notion of gratuitous evils? Presumably, by surveying all of the justifying reasons we can think of and, finding none of them satisfactory, inferring that there are no justifying reasons. Sceptical theists argue that, in this context, such noseeum inferences should not be trusted.

Sceptical theists substantiate their scepticism in a variety of ways, all of them highlighting reasons why we should take ourselves not to be in a position to discern potential justifying reasons. In one of the earliest defences of sceptical theism, Stephen Wykstra (Reference Wykstra1984) appeals to a parent-child analogy. Just as we should not expect a small child to understand why her parents allow her to suffer, even when there are such reasons (e.g. she is receiving cancer treatments), so we should not expect ourselves to understand why God allows us to suffer. Appeal to the parent-child analogy is controversial and raises larger questions about what responsibilities God has to (try to) communicate his plans to us and/or to comfort us with his presence in the midst of indecipherable suffering. Here discussions of the problem of evil and the problem of divine hiddenness converge.

Alston (Reference Alston1991) defends sceptical theism more systematically by inventorying limitations of the human cognitive condition vis-à-vis potential justifying reasons:

1. Lack of relevant data.
2. Complexity greater than we can handle.
3. Difficulty of determining what is metaphysically possible or necessary.
4. Ignorance of the full range of possibilities.
5. Ignorance of the full range of values.
6. Limits to our capacity to make well-considered value judgements. (Dougherty Reference Dougherty, Zalta and Nodelman2014)

If we are limited in these ways, then it is difficult to see how we could ever reasonably believe there to be no justifying reasons for allowing evil.Footnote ¹⁷ The most pressing challenge for sceptical theism is usually to prevent their scepticism from spilling over into unwarranted scepticism in other domains (moral, religious, perceptual, etc.).

Traditional theodicies

Theodicies try to articulate plausible justifying reasons for God to allow evil. They tend to fall into a few basic categories. Free-will theodicies, which trace back at least to Augustine,Footnote ¹⁸ start with incompatibilism – the view that a choice cannot be both free and determined. Thus, not even God can allow creatures to make free choices and ensure that they never use that freedom to produce moral evil (evils caused by free will). Nevertheless, God is justified in allowing creatures to make free choices, and whatever moral evil may come of them, because doing so is necessary to make possible great goods such as moral righteousness or genuine loving relationships. Contemporary defenders often stress the importance that our free choices have more than token significance. Plantinga (Reference Plantinga1974) emphasizes the need for ‘morally significant freedom’ while Swinburne (Reference Swinburne1998) touts the value of bearing responsibility for our own welfare and that of others. Greater good theodicies, in contrast, argue that evil is integral to achieving a greater good. John Hick’s (1976) soul-making theodicy is the most prominent of this sort. Hick argues that the only environment suitable for growth in virtue is one in which suffering is present. Without it, people would lack the motivation and opportunity to acquire and exercise moral characteristics such courage, compassion, generosity, selflessness, and the like.

Free will and greater good theodicies are often presented as rivals, and there are certainly important differences between them. In free-will theodicies, evil is not necessary. If humans always used their free will rightly, then we could achieve the desired goods without any evil. Evil is an avoidable by-product. Whereas in greater good theodicies, evil is necessary for (or at least integral to) producing the great goods in question. That being said, almost all free-will theodicies also make appeal to greater good considerations, and many greater good theodicies also rely on incompatibilist conceptions of free will.Footnote ¹⁹ Regarding the former, those advocating for free-will theodicies almost always add that God will not permit the creation of free creatures to be an unmitigated tragedy.Footnote ²⁰ Thus, God ensures that the moral evil resulting from free choices is ultimately used for good. Regarding the latter, many of those advocating for greater good theodicies rely crucially on the idea that free choices cannot be determined. For example, Hick’s account of why suffering is essential to soul-making is only plausible on the supposition that God cannot develop these virtues within us (or cannot do so in an equally valuable way) without our free and undetermined cooperation.Footnote ²¹

So it may be that there is less tension between these two approaches than commonly supposed. Be that as it may, not all greater good theodicies assume incompatibilism. Compatibilist-friendly theodicies include the divine glory theodicy and the ‘O Felix Culpa’ theodicy. The former says that the existence of moral evil allows God to display his mercy and love in saving sinners and his justice and wrath in punishing them, thereby increasing his own glory.Footnote ²² The latter says that the very best worlds are those in which God is allowed to execute his rescue plan of becoming incarnate, atoning for sin, and raising fallen creatures to new life. However, such a plan can be executed only in a world filled with moral evil.Footnote ²³

So far, most of these theodicies have focused exclusively on moral evils–those stemming from the misuse of free will. But what of natural evils? These are evils that do not result from free will such as earthquakes, diseases, or non-culpable accidents (e.g. skidding on a patch of ice while driving resulting in the death of the driver). Some theodicies, such as Hick’s soul-making theodicy, are equipped to explain natural evils as well as moral evils, but often multiple theodicies must be patched together to account for the full variety of evils in the world. Regarding natural evils, a prominent theodicy is the natural law theodicy. This maintains that any world with sufficiently stable natural laws will inevitably contain natural evils, and that the goods made possible by stable natural laws (such as providing a suitable environment for the exercise of human freedom and responsibility, the development of human moral character, and the pursuit of scientific discovery) are enough to justify the allowance of such evils.Footnote ²⁴

Traditional challenges for theodicies

Theodicies face several perennial challenges. First, we have already seen the challenge in accounting for the full variety of evils observed in our world. Even if successful, a theodicy may justify one type of evil (e.g. human suffering) while leaving another untouched (e.g. animal suffering). Second, many of the most plausible theodicies rely crucially on incompatibilism about free will, or at least, on the idea that libertarian freedom is significantly more valuable than whatever sorts of freedom are compatible with determinism. This assumption is highly controversial, however.Footnote ²⁵ Third, the plausibility of a theodicy often comes down to the question, ‘Is it worth it?’ Are the goods to which the theodicy appeals good enough to justify the evils in question? This is a basic value judgement.Footnote ²⁶ What is the theodicist to do with those (seemingly always a significant number) who reasonably disagree? Fourth and finally, many intuit that allowing evils of an especially horrendous sort is inherently wrong. In other words, allowing those evils is itself an evil and, as the Pauline Principle tells us, one must never do evil that good may come of it.Footnote ²⁷

The rise of ‘good-hearted’ theodicies

Work on theodicy reached something of a crescendo near the turn of the millennium, with a flurry of seminal books and articles being published by William Hasker, Alvin Plantinga, Richard Swinburne, Peter van Inwagen, and others. Among the most innovative was Marilyn McCord Adams’s Horrendous Evils and the Goodness of God (Adams Reference Adams1999). This book, along with earlier work by Eleonore Stump (Reference Stump1985), seems largely responsible for the subsequent rise in what Evan Fales calls ‘good-hearted theodicies’ (Fales Reference Fales, McBrayer and Howard-Snyder2013, 352), the most notable of which is found in Stump’s Wandering in Darkness (Stump Reference Stump2010).Footnote ²⁸

Following the lead of Adams and Stump, good-hearted theodicies emphasize God’s love for all individuals and insist that this love be expressed in ways we can appreciate when viewed in their proper context. That is, good-hearted theodicies try to ensure that the charitable reader will not revile God’s decisions nor see him as indifferent toward human suffering. This concern for maintaining a sense of God’s compassion and care for individual persons is perhaps why Fales also calls such theodicies ‘soft-hearted’ (Fales Reference Fales, McBrayer and Howard-Snyder2013, 351–352). This sensitivity is manifest in several commonly proposed requirements on the content of good-hearted theodicies, each of which is contentious among theists more generally:

1. The Horrendous Evil Requirement. A theodicy must account not only for evil in general but for the most horrifying instances of it.
2. The Good Life Requirement. A life that is worth living (on-the-whole good and meaningful) must be guaranteed, or at least made available, to all persons, including those who endure horrendous evils.Footnote ²⁹
3. The Sufferer-centred Requirement. Horrendous suffering must result in some sufficiently great good for the one who suffered, lest he or she be treated as a mere means to an end.
4. The Defeat Requirement. The sufficiently great good that accrues to the sufferer must arise organically from the suffering itself, thereby ‘defeating’ the suffering (Chisholm Reference Chisholm1968) and avoiding any impression of an arbitrary pay-off.
5. The Redemption Requirement. The defeating good must be subjectively appropriated by the sufferer such that he or she no longer wishes away that suffering.

Along with these substantive constraints come certain stylistic ones as well. Proponents try to write with empathy and are careful not to downplay the horror of suffering.

In seeking to meet all of these requirements, several important trends have emerged, each representing a key development in the domain of theodicy.

1. Eschatology. It is plain that defeat and redemption are not always made available in this life, so good-hearted theodicies have followed Hick (Reference Hick1976, 338–341) in relying crucially on eschatological goods.
2. Special Revelation. In fleshing out an eschatological vision, good-hearted theodicies have found it helpful, if not indispensable, to make use of the resources of particular religious traditions, most commonly Christianity.
3. Worldviews and Perspectives. As the appeal to specific religious traditions makes clear, in giving a theodicy one can no longer assume that the larger worldview supposed by that theodicy is shared by (or even comprehensible to) most of one’s readers. In this case, the theodicy is liable to be misunderstood and unfairly dismissed (Stump Reference Stump2010). It therefore becomes incumbent on the theodicist to articulate this larger worldview and make it as intuitive as possible. Hence, how things seem to the individual – that is, one’s ‘perspective’ – becomes an increasing point of focus (McAllister Reference McAllister2020).
4. Narrative. Some argue that embedding a theodicy within a narrative gives the reader special understanding of that theodicy and the worldview presumed by it–one that registers on an intuitive and emotional level (Stump Reference Stump2010, Chs 2–4 and 373–374).
5. Pastoral Awareness. While not completely disowning Plantinga’s distinction between the philosophical and the pastoral (Plantinga Reference Plantinga1974, 63–64), good-hearted theodicies seem aware that pastoral considerations cannot be altogether ignored in one’s philosophical response to the problem of evil (McAllister Reference McAllister2020, 445).

Based on current trends, we think the future of theodicy most likely lies in this good-hearted direction.

Potential areas of experimental philosophical research on the problem of evil

Having surveyed the most prominent formulations of the problem of evil as well as the range of responses to the problem, let’s now consider how experimental philosophy might continue to contribute to that emerging scholarly discussion. Below is just a sampling of some of the lines of research that come to mind. There are certainly many more.

As regards the moral immunity approach to the problem of evil, it would be worth empirically exploring how people think about the goodness of God. Do people think of God as being morally good? If so, then do they think that God’s moral goodness and obligations are similar to those of humans? And what correlations might there be between various theists’ answers to these questions and the doctrinal commitments distinctive to their respective theological traditions? How is someone’s understanding of divine goodness impacted by the images of God they’re most attracted to (see, for example, Johnson et al. Reference Johnson, Okun, Cohen, Sharp and Hook2019)? Could it be that those who rely more heavily on religious coping strategies think of God’s goodness as more similar to our own, perhaps as having moral attitudes and obligations toward us similar to that of human parents?Footnote ³⁰ Could those who think more analytically, or who score lower in cognitive empathy, be more inclined to think of God’s goodness as non-moral or entailing little by way of beneficence? Answers to such questions might be helpful not only in ascertaining why particular theists are or are not drawn to the moral immunity approach, but may even bear on how rational we find those approaches and/or rejections of them.

The sceptical theist approach likewise raises many questions warranting experimental philosophical inquiry. For starters, why do people believe that certain forms of evil are gratuitous? Are such convictions bare intuitions or are they conclusions reached on the basis of inferences? In either case, what sorts of cognitive processes are involved? Sceptical theists tend to appeal to the ‘noseeum’ inference, but is this the true basis of most people’s beliefs? If it’s not, then sceptical theists may be attacking the wrong thing.

Regarding the plausibility of sceptical theism itself, how credible do the folk find the notion that we really are in the dark as regards the potential justifying reasons for God’s allowance of evil? What grounds might they have for this sceptical stance?Footnote ³¹ Is this attitude prompted by rational reflection, personal experiences, the influence of their religious tradition, or something else? Do they display the same level of scepticism in other, non-religious contexts? If not, is this because they are being more careful here or because they are engaged in motivated reasoning to preserve prior religious commitments? We might also wonder if many people have inconsistent beliefs regarding their ability to discern gratuitous evils. Perhaps, for example, many people believe that their inability to see a ‘point’ to an instance of suffering counts as strong evidence for thinking there is no point, while also confidently affirming that they wouldn’t be privy to a ‘point’ if there was one (see Church et al. Reference Church, Carlson and Barrett2020). The answers to such questions will help us better understand the draw to sceptical theism and might even increase or diminish how much credence we assign it.

Regarding theodicies, a wide range of issues call for experimental probing. First off, where do the folk generally land as regard preferences for either the free will or greater good theodicy? And how might these preferences be correlated with people’s doctrinal commitments about such things as divine providence and the authority of Scripture? How might people’s theodicy preferences be correlated with their intuitions about God’s priorities for humanity? What are people’s basic value judgements regarding what evils God could reasonably allow and for what reasons (e.g. the goods of respecting our moral autonomy, increasing our moral maturity, making genuine relationships possible, the achievement of great beauty, etc.)? In the view of the folk, is the display of particular divine attributes reasonable grounds for God’s permission of evil? If so, what attributes do people find most compelling in this regard (e.g. divine justice, divine mercy, divine loving relationality)?Footnote ³² How do people’s assessments differ, if at all, regarding their assessments of God’s permission of natural and moral evils? And what sorts of factors do people regard as potential grounds for divine permission of horrendous evils? Is certainty of full and final redemption necessary, sufficient, or both? What are the folk intuitions regarding whether a loving human parent would bring a child into the world knowing that he or she would eventually experience horrendous evil? And do these intuitions line up with their intuitions on whether God would do the same? What sorts of considerations might inform those for whom these intuitions do not correlate?

With regard to such questions, the point is not simply to understand what the folk think but why they think it. What are the underlying cognitive, psychological, social factors influencing their judgements and what insight might this give us into the nature of the debate and (possibly) into the plausibility of various positions within that debate?

To give an example, moral foundations theory has suggested that framing the moral landscape exclusively in terms of harm/care and fairness/reciprocity is a hallmark of secular liberalism, whereas religious conservatives tend to draw from a more diverse moral palate (Haidt Reference Haidt, Schloss and Murray2009). Since many theodicies appeal to considerations beyond harm and fairness, could this division help explain why people assess the plausibility and prospects of theodicies so differently? This would certainly make sense of why the problem of evil ascended to new heights alongside the rise of secular liberalism in the modern West,Footnote ³³ as well as why those problems are nowadays almost exclusively formulated in terms of suffering and well-being (or, as with Draper, pain and pleasure). It might also explain why good-hearted theodicies, which give special focus to considerations of harm/care and fairness/reciprocity, are gaining in prominence.

Speaking of this, the recent movement toward good-hearted theodicies also suggests a gamut of questions for experimental inquiry. What are we to make of the increasing focus on horrendous suffering? Is this a constructive development (honing in on those cases that pose the greatest rational challenge to God’s existence) or a distracting rhetorical tactic (rife for triggering emotional reactions and one-sided judgements)?Footnote ³⁴ Could it perhaps be both? To what extent do people find sufferer-centred requirements plausible, and why or why not? As floated above, does their appeal stem from a devoted concern to individual well-being and fairness most common to the modern West? In any case, do theodicies that meet such requirements stand a better chance at persuading audiences today than those that don’t? When it comes to the final defeat or redemption of evil, what sorts of factors do people find most compelling (e.g. moral maturity and Godlikeness, the experience of divine gratitude or love, etc.)? Is appeal to an afterlife necessary for final redemption to be plausible? If so, must one inevitably approach these issues from the standpoint of a particular theological perspective beyond that of bare theism? How do one religion’s assets for dealing with the problem of evil compare to those of alternative religions? And how does the strength or weakness of various faith traditions in this regard speak to their ultimate sustainability in light of the challenges posed by evil?

Still other important questions related to the problem of evil more generally await probing, particularly pertaining to the various presuppositions people bring to their reflections on God and evil. As should already be plain from our discussion thus far, an especially relevant area to probe is moral psychology. While it can easily be overlooked, assessments of the problem of evil are ultimately moral in nature, involving basic judgements about what an all-powerful, perfectly good God ought or ought not to do, usually based on an evaluation of whether certain goods are good enough to outweigh or defeat the evils that make them possible. Obviously, then, we might expect one’s moral-theoretical commitments and tendencies – even if only inchoate or intuitively expressed – to affect one’s take on the problem of evil. But how exactly does this play out? For example, those on both sides of the problem of evil have accused the other of overly-utilitarian thinking.Footnote ³⁵ Do we see those with more consequentialist intuitions tending toward one side of the argument or the other? The same might be asked of those whose intuitions are more deontological or virtue-oriented.

One may also easily overlook the fact that such moral assessments are necessarily influenced by one’s own moral-psychological condition. So, for starters, we might wonder how one’s level of moral seriousness impacts one’s take on the problem of evil? Could there be a correlation between how committed a person is to living rightly and how inclined one is to make particular kinds of judgements about whether God would allow evil, or the sorts of evils, we find in this world? To what degree and in what way do people’s hedonistic tendencies impact their assessment? In terms of personal psychology, we might ask how people’s level of emotional empathy affects their evaluation of the argument. Or, looking at personal psychological characteristics more broadly, how might whole personality types differ in their approaches to the problem of evil? Empirical studies could be conducted using any of the major personality typologies, such as the OCEAN model or ‘Big Five’ personality inventory (Goldberg Reference Goldberg1993) and the HEXACO model of personality structure (Lee and Ashton Reference Lee and Ashton2004).

In justifying God’s allowance of evil, theodicists most often appeal to moral or spiritual goods, but some have leaned heavily on aesthetic goods as well.Footnote ³⁶ Since people vary not only in taste but in sensitivity to beauty and interest in aesthetic considerations more generally, a study probing the relationship between such factors and judgements about the problem of evil could be telling. Could the plausibility of certain theodicies vary alongside aesthetic sensibilities or tastes? Perhaps those drawn to epic storytelling are more positively inclined toward ‘O Felix Culpa’ theodicies, which appeal to the beauty and glory of forgiveness and redemption. Perhaps only those without a strong concern for aesthetics find pure free-will theodicies (which cast moral evils as unmitigated tragedies) persuasive. We would not be surprised if many theodicies are rejected due to a failure to adequately portray the ugliness of evil as being organically integrated into an aesthetically pleasing whole. This would explain why the defeat of evil has become an increasing prominent element to theodicies, since the notion of defeat is drawn originally from aesthetics.Footnote ³⁷ In any case, we suspect that the beauty or ugliness of a theodicy and the story it tells plays a larger factor in people’s acceptance or rejection of that theodicy than is commonly appreciated.

The problem of evil hinges not only on moral and aesthetic assessments but on epistemological ones as well. Thus, we might find it fruitful to explore what personal psychological factors underlie these assessments. One basic question concerns how much epistemic self-confidence people have, both in themselves and in the effective reach of the human intellect more generally. Do they tend to trust their intuitive judgements or are they more prone to reflective scrutiny and second-guessing? How comfortable are people with admitting ignorance and acknowledging mystery on these issues? Do they display a strong need for closure or struggle with uncertainty? How ready are they to defer to ‘the experts’ when it comes to the problem of evil? Who even do they perceive as experts on these issues (philosophers or religious leaders)? And how does all of this connect with their willingness or unwillingness to rely on faith?

With respect to both epistemic and axiological evaluations, we might also wonder how broader cultural factors are biasing, steering, or otherwise influencing our thinking about the issue, whether directly or indirectly. For example, the most cited figures on the problem of evil are almost all from WEIRD societies (Western, Educated, Industrialized, Rich, Democratic).Footnote ³⁸ Could this be shaping the debate in globally and historically peculiar ways largely invisible to most of its participants?

And finally, experimental philosophers of religion should also pay attention to how easily folk beliefs in the above areas are formed, given up, or revised. For example, imagine that Maria reports that the distribution of pain and pleasure in this world is epistemically more probable on Draper’s hypothesis of indifference than on theism; however, let’s also imagine that it takes almost no effort at all to get Maria to think differently (either changing her mind entirely or becoming agnostic on this issue). In a different vein, imagine that Tomás, a theist, reports that there is plausibly some greater good that comes about because of the suffering of Rowe’s fawn; however, when placed under cognitive-load (i.e. forced to think fast or has his reflective-reasoning weakened by other tasks) he denies that this is plausible. How would these results change our analysis of their initial reports? Between this sort of evidence (i.e. how easy it is to form or revise a belief), cross-cultural evidence, and the other sorts of evidence mentioned above, experimental philosophers of religion might get a very different picture of what the folk think about these issues.Footnote ³⁹

These represent just a handful of the directions empirical inquiry into the problem of evil might take.

Conclusion

In this article, we have undertaken an examination of the problem of evil through the lens of experimental philosophy, an approach that has demonstrated considerable promise in addressing some of philosophy’s most enduring and complex issues. At the core of our investigation lies the phenomenon of suffering – a universal human experience that nevertheless elicits a remarkably diverse range of responses, being a central motivation for atheism in some and inspiring deeper faith in others. Further research applying the tools and resources of experimental philosophy to the problem of evil shows potential for uncovering the underlying cultural, psychological, and sociological factors that contribute to this diversity, thereby enriching our understanding of the problem of evil and its various philosophical formulations.

To conclude, we are calling for a rigorous, empirically informed approach to the problem of evil within the domain of experimental philosophy. By integrating empirical research with philosophical inquiry, we can achieve a more nuanced and comprehensive understanding of the complexities surrounding evil and suffering. This approach not only contributes to the philosophical discourse on the problem of evil but also exemplifies the broader potential of experimental philosophy to illuminate and challenge traditional philosophical problems.

Acknowledgements

We’re enormously grateful to Justin Barrett for providing feedback on this article for us. This research was made possible thanks to the generous support of the John Templeton Foundation (grant ID: 61886).

Competing Interests

The authors delcare that they have no competing interests.

Footnotes

1. Roughly speaking, experimental philosophy is the project of taking the tools and resources of the human sciences (especially psychology and cognitive science) and bringing them to bear on issues within philosophy toward genuine philosophical ends.

2. It is worth noting that this distinction between evidential and probabilistic versions of the problem of evil is common but not at all universal. Some scholars simply categorize all versions of the problem of evil that aren’t the logical problem of evil as evidential. We find the distinction between evidential and probabilistic helpful, but there are many ways to carve up the problem of evil pie.

3. Though Helen De Cruz has been a pioneer in the field of experimental philosophy of religion for many years. SeeDe Cruz (De Cruz Reference De Cruz, Green and Stump2015a, Reference De Cruz2015b, Reference De Cruz2017, Reference De Cruz2018). Other early contributions include Barrett and Church (Reference Barrett and Church2013); Leeuwen (Reference Van Leeuwen2014); Lim (Reference Lim2017); De Cruz and Nichols (Reference De Cruz and Nichols2016); and Draper and Nichols (Reference Draper and Nichols2013). For more on why philosophy of religion was particularly slow to adopt experimental philosophy, see Church (Reference Church2024).

4. Modified versions of Rowe’s argument can, of course, be found in later work – see for example, Rowe (Reference Rowe and Howard-Snyder1996) – but empirical work has primarily focused on Rowe’s Reference Rowe1979 formulation because it is the most influential and the most cited variation of the problem in the academic literature.

5. That said, it is somewhat unclear precisely how much evidential weight Rowe ascribes to his argument. For a fuller consideration of the array of possible interpretations, see Wykstra (Reference Wykstra and Howard-Snyder1996).

6. See, for example, Stump (Reference Stump2010).

7. The experiment was pre-registered on the Open Science Framework here: https://osf.io/ebgpd.

8. Given that (i) Rowe uses ‘pointlessness’ as a loose shorthand for not bringing about a greater good or preventing a greater evil and (ii) the second and third items correlate with each other but not with the first item, it makes sense to prioritize the second and third items on this index. However, that said, it’s worth noting that using the three-item index or the just first item would not radically change McAllister et al.’s results.

9. Draper provides the following analysis of epistemic probability: ‘Relative to K, p is epistemically more probable than q, where K is an epistemic situation and p and q are propositions, just in case any fully rational person in K would have a higher degree of belief in p than in q’ (Draper Reference Draper1989, 349 n).

10. Background information, if not mentioned explicitly, is always tacitly featured in conditional probabilities. That is, P(O|theism) should really be written P(O|theism & B) where B represents the subject’s background evidence. The present point is that two subjects can have two different sets of background evidence such that P(O|theism & B₁) may differ from P(O|theism & B₂).

11. Work in experimental philosophy has repeatedly shown that many (though certainly not all) philosophical intuitions can vary based on education, gender, ethnicity, etc., and we expected to find a similar result here. See, for example, Weinberg, Nichols, and Stich (Reference Weinberg, Nichols and Stich2001); Machery et al. Reference Machery, Stich, Rose, Alai, Angelucci, Berniūnas, Buchtel, Chatterjee, Cheon, Cho, Cohnitz, Cova, Dranseika, Lagos, Ghadakpour, Grinberg, Hannikainen, Hashimoto, Horowitz and Zhu2017; Stich and Machery Reference Stich and Machery2023. For work on the stability of philosophical intuitions across cultures, see Knobe (Reference Knobe2019).

12. This research was pre-registered on the Open Science Framework. See https://osf.io/xqtn5.

13. And that would be evidential support in addition to the support Draper provides from his ‘armchair’ argument.

14. Plantinga (Reference Plantinga1974) defines theodicies as maintaining that God does have such reasons. We are instead following the usage of Van Inwagen (Reference Van Inwagen2006) and others.

15. One can acknowledge the possibility of justifying reasons and still reasonably believe that there are none. Hence, a defence may not block reasonable belief in premise 2 of Rowe’s argument. As for Draper’s, the mere possibility of justifying reasons may raise the probability of O given T slightly but not enough to prevent significant disconfirmation of theism.

16. Sceptical theism might better be titled ‘agnosticism’, as Howard-Synder (Reference Howard-Snyder and Kvanvig2009) suggests, since it can be endorsed by theists and non-theists alike.

17. Most other defences of sceptical theism build a case for one or more of the limitations identified by Alston. Notable examples include Van Inwagen (Reference Van Inwagen2006) and Bergmann (Reference Bergmann2001).

18. See Augustine’s On Free Choice of the Will.

19. For more on this point, see Spiegel (Reference Spiegel2011).

20. Davis (Reference Davis2001) provides a good example of this.

21. See Hick 1976, 255-256. See alsoEckstrom (Ekstrom Reference Ekstrom2021, 33-34).

22. This theodicy is most prominent among Calvinists. See, e.g., some of the essays in Alexander and Johnson (Reference Alexander and Johnson2016) (especially those of Johnson, Green, and Hart).

23. Plantinga (Reference Plantinga and van Inwagen2004). While this is a compatibilist friendly theodicy, it doesn’t require compatibilism either. Plantinga himself is an incompatibilist.

24. Reichenbach (Reference Reichenbach1976), Swinburne (Reference Swinburne1998), Van Inwagen (Reference Van Inwagen2006), and Hasker (Hasker Reference Hasker2008, ch. 5).

25. See, e.g., Ekstrom (Reference Ekstrom2021) and Speaks (Speak Reference Speak2015, 97-98), on this point. According to the PhilPapers 2020 survey, 57.68% of respondents accepted or leaned toward compatibilism compared to only 18.20% who accepted or leaned toward libertarianism.

26. See, e.g., Hick (Reference Hick1976), 255–256.

27. Sterba (Reference Sterba2019) appeals to this Pauline Principle (taken from Romans 3:8) in supporting his version of the problem of evil.

28. Adams herself eschews the word ‘theodicy’ and does not attempt to give a justification of God in the traditional sense. She does, however, attempt to reconcile the goodness of God with evils in our world (Adams Reference Adams1999), and many of the unique aspects to her approach can and have been incorporated into theodicies by Stump and others.

29. The debate over whether the goods under consideration must be guaranteed (as in Adams Reference Adams1999) or only made available (as in Stump Reference Stump2010) applies to 3–5 as well, though I will ignore this nuance for the sake of readability. Some only require that this good be made available to the sufferer (and likewise for the goods in 3–4).

30. Freud is wrong that we believe in God’s existence because we want a daddy in the sky, but maybe some of us believe God is morally good for this reason. Work applying attachment styles to perceived relationships to the divine might be especially relevant here. See, for example, Bradshaw et al. (Reference Bradshaw, Ellison and Marcum2010).

31. And we might wonder if an overly confident assessment of an evil’s gratuitousness might be a product of the Dunning-Kruger effect.

32. And we might plausibly wonder if the kinds of theodicies people find most intellectually attractive are also the ones they find pastorally or even existentially satisfying.

33. Ancients and medievals tended to think of the problem of evil differently–often as more of a metaphysical or ontological problem than a moral one–and generally found it much less convincing.

34. Returning to moral foundations theory, Haidt (Reference Haidt2012), 154–155, proposes that our harm/care moral foundation originated as a way of encouraging concern for vulnerable children. Is it any wonder, then, that the examples of horrendous evils that add the most credence to the problem of evil (such as those presented by Ivan in Dostoyevsky’s The Brothers Karamazov) are ones featuring children?

35. Critics of the problem of evil, particularly greater good theodicists, are regularly accused of employing ‘ends justify the means’ reasoning. Proponents of the problem of evil are sometimes accused of thinking that God is interested only in creating a hedonistic paradise.

36. For example, Augustine in the ancient period and Marilyn Adams more recently. See Augustine (Reference Augustine, Benjamin and Hackstaff1964) and Adams (Reference Adams and Sharma1993).

37. The defeat of evil is compared to the way in which ugly patches of colour might enhance the beauty of the overall painting. It is not that the ugly patch is a flaw that is merely compensated for by the other parts of the painting; rather, the beauty of the whole is unachievable apart from the ugliness of that particular patch.

38. See Henrich (Reference Henrich2020) for an overview of how WEIRDness can skew our understanding of an issue.

39. We’re grateful to Justin Barrett for pointing this out to us.

References

Adams, MM (1993) Aesthetic goodness as a solution to the problem of evil. In Sharma, A (ed.), God, Truth and Reality: Essays in Honor of John Hick. New York: St. Martin’s, 46–61.CrossRef Google Scholar

Adams, MM (1999) Horrendous Evils and the Goodness of God. Ithaca: Cornell University Press.Google Scholar

Adams, MM (2013) Ignorance, instrumentality, compensation, and the problem of evil. Sophia 52, 7–26.CrossRef Google Scholar

Alexander, D and Johnson, D (eds) (2016) Calvinism and the Problem of Evil. Portland: Pickwick.Google Scholar

Alston, WP (1991) The inductive argument from evil and the human cognitive condition. Philosophical Perspectives 5, 29–67.CrossRef Google Scholar

Augustine, ST (1964) On Free Choice of the Will. Benjamin, AS and Hackstaff, LH (trans.), Indianapolis: The Bobbs-Merrill.Google Scholar

Barrett, JL and Church, IM (2013) Should CSR give atheists epistemic assurance? On beer-goggles, bffs, and skepticism regarding religious beliefs. The Monist 96, 311–324.CrossRef Google Scholar

Beit-Hallahmi, B (2006) Atheists: A Psychological Profile. In Martin, M (ed.), The Cambridge Companion to Atheism. Cambridge: Cambridge University Press, 300–318.CrossRef Google Scholar

Bergmann, M (2001) Skeptical theism and Rowe’s new evidential argument from evil. Noûs 35, 278–296.CrossRef Google Scholar

Bradshaw, M, Ellison, CG and Marcum, JP (2010) Attachment to god, images of god, and psychological distress in a nationwide sample of presbyterians. The International Journal for the Psychology of Religion 20, 130–147.CrossRef Google Scholar

Chisholm, R (1968) The defeat of good and evil. Proceedings and Addresses of the American Philosophical Association, 42, 21–38.CrossRef Google Scholar

Church, IM (2023) Experimental philosophy of religion: The problem of evil. In Bauer, AM, and Kornmesser, S (eds), The Compact Compendium of Experimental Philosophy. Berlin: De Gruyter, 371–392.CrossRef Google Scholar

Church, IM (2024) Data over dogma: A brief introduction to experimental philosophy of religion. Philosophy Compass 19(6), 1–13.CrossRef Google Scholar

Church, IM, Carlson, R and Barrett, JL (2021) Evil intuitions? The problem of evil, experimental philosophy, and the need for psychological research. Journal of Psychology and Theology 49, 126–141.CrossRef Google Scholar

Church, IM, McAllister, B, Spiegel, JS and Barrett, JL (in progress) Testing folk perceptions of Draper’s probabilistic problem of evil.Google Scholar

Church, IM, Worcol, I and Barrett, JL (2022) The context of suffering: Empirical insights into the problem of evil. TheoLogica: An International Journal for Philosophy of Religion and Philosophical Theology 6, 4–19.Google Scholar

Cragun, RT (2016) Nonreligion and atheism. In Yamane, D (ed.), Handbook of Religion and Society. New York: Springer, pp. 301–320.CrossRef Google Scholar

Davies, B (2006) The Reality of God and the Problem of Evil. London: Bloomsbury.Google Scholar

Davis, ST (ed.) (2001) Encountering Evil: Live Options in Theodicy, 2nd edn. Louisville: Westminster John Knox.Google Scholar

De Cruz, H (2015a) Divine hiddenness and the cognitive science of religion. In Green, A and Stump, E (eds), Hidden Divinity and Religious Belief: New Perspectives. Cambridge: Cambridge University Press, 53–68.Google Scholar

De Cruz, H (2015b) Where philosophical intuitions come from. Australasian Journal of Philosophy 93, 233–249.CrossRef Google Scholar

De Cruz, H (2017) Religious disagreement: An empirical study among academic philosophers. Episteme 14, 71–87.CrossRef Google Scholar

De Cruz, H (2018) Religious beliefs and philosophical views: A qualitative study. Res Philosophica 95, 477–504.CrossRef Google Scholar

De Cruz, H and Nichols, R (2016) Advances in Religion, Cognitive Science, and Experimental Philosophy. London: Bloomsbury.Google Scholar

Dougherty, T (2014) Skeptical theism. In Zalta, EN and Nodelman, U (eds), The Stanford Encyclopedia of Philosophy. Stanford: Stanford University Press.Google Scholar

Draper, P (1989) Pain and pleasure: An evidential problem for theists. Noûs 23, 331–350.CrossRef Google Scholar

Draper, P and Nichols, R (2013) Diagnosing bias in philosophy of religion. The Monist 96, 420–446.CrossRef Google Scholar

Ekstrom, LW (2021) God, Suffering, and the Value of Free Will. Oxford: Oxford University Press.CrossRef Google Scholar

Fales, E (2013) Theodicy in a vale of tears. In McBrayer, JP and Howard-Snyder, D (eds), The Blackwell Companion to the Problem of Evil. Hoboken NJ: Wiley-Blackwell, pp. 349–362.CrossRef Google Scholar

Goldberg, LR (1993) The structure of phenotypic personality traits. American Psychologist 48, 26–34.CrossRef Google Scholar PubMed

Haidt, J (2009) Moral psychology and the misunderstanding of religion. In Schloss, J and Murray, M (eds), The Believing Primate: Scientific, Philosophical, and Theological Reflections on the Origin of Religion. Oxford, UK: Oxford University Press, pp. 278–291.CrossRef Google Scholar

Haidt, J (2012) The Righteous Mind: Why Good People Are Divided by Politics and Religion. New York, NY: Vintage.Google Scholar

Harrison, GK (2017) A Radical Solution to the Problem of Evil. Sophia, 56(2), 279–287. https://doi.org/10.1007/s11841-016-0526-0CrossRef Google Scholar

Hasker, W (2008) The Triumph of God over Evil: Theodicy for a World of Suffering. Downers Grove: InterVarsity Press.Google Scholar

Henrich, J (2020) The WEIRDest People in the World: How the West Became Psychologically Peculiar and Particularly Prosperous. New York: Farrar, Straus and Giroux.Google Scholar

Hick, J (1976) Death and Eternal Life. London: Collins.Google Scholar

Howard-Snyder, D (2009) Epistemic humility, arguments from evil, and moral skepticism. In Kvanvig, JL (ed.), Oxford Studies in Philosophy of Religion, Vol. 2. Oxford: Oxford University Press, pp. 17–57.CrossRef Google Scholar

Johnson, KA, Okun, MA, Cohen, AB, Sharp, CA and Hook, JN (2019) Development and validation of the five-factor LAMBI measure of god representations. Psychology of Religion and Spirituality 11, 339–349.CrossRef Google Scholar

Knobe, J (2019) Philosophical Intuitions Are Surprisingly Robust Across Demographic Differences. Epistemology and Philosophy of Science, 56(2), 29–36. https://doi.org/10.5840/eps201956225CrossRef Google Scholar

Lee, K and Ashton, MC (2004) Psychometric properties of the HEXACO personality inventory. Multivariate Behavioral Research 39, 329–358.CrossRef Google Scholar PubMed

Lim, DF (2017) Experimental philosophy and philosophy of religion. European Journal for Philosophy of Religion 9, 139–158.CrossRef Google Scholar

Machery, E, Stich, S, Rose, D, Alai, M, Angelucci, A, Berniūnas, R, Buchtel, EE, Chatterjee, A, Cheon, H, Cho, I-R, Cohnitz, D, Cova, F, Dranseika, V, Lagos, ÁE, Ghadakpour, L, Grinberg, M, Hannikainen, I, Hashimoto, T, Horowitz, A, … Zhu, J (2017) The Gettier Intuition from South America to Asia. Journal of Indian Council of Philosophical Research, 34(3), 517–541. https://doi.org/10.1007/s40961-017-0113-yCrossRef Google Scholar

Mackie, JL (1955) Evil and omnipotence. Mind 64, 200–212.CrossRef Google Scholar

McAllister, B (2020) The perspectival problem of evil. Faith and Philosophy 37, 421–450.CrossRef Google Scholar

McAllister, B, Church, IM, Rezkalla, P, and Nguyen, L (2024) Empirical challenges to the evidential problem of evil. In Knobe, J, and Nichols, S (eds), Oxford Studies in Experimental Philosophy, Vol. 5. Oxford: Oxford University Press, 185–220.CrossRef Google Scholar

Murphy, MC (2017) God’s Own Ethics: Norms of Divine Agency and the Argument from Evil. New York: Oxford University Press.CrossRef Google Scholar

Plantinga, A (1974) God, Freedom, and Evil. Grand Rapids, MI: Eerdmans.Google Scholar

Plantinga, A (2000) Warranted Christian Belief. New York, NY: Oxford University Press.CrossRef Google Scholar

Plantinga, A (2004) Supralapsarianism, or ‘O Felix Culpa. In van Inwagen, P (ed.), Christian Faith and the Problem of Evil. Grand Rapids, MI: Eerdmans, 1–25.Google Scholar

Reichenbach, B (1976) Natural evils and natural laws. International Philosophical Quarterly 16, 179–196.CrossRef Google Scholar

Rowe, WL (1979) The problem of evil and some varieties of atheism. American Philosophical Quarterly 16, 335–341.Google Scholar

Rowe, WL (1996) The evidential argument from evil: A second look. In Howard-Snyder, D (ed.), The Evidential Argument From Evil. Bloomington, IN: Indiana University Press, pp. 262–285.Google Scholar

Speak, D (2015) The Problem of Evil. Malden, MA: Polity.Google Scholar

Spiegel, JS (2011) On free will and soul-making: Complementary approaches to the problem of evil. Philosophia Christi, Series II 13, 405–413.CrossRef Google Scholar

Sterba, JP (2019) Is a Good God Logically Possible? New York, NY: Palgrave Macmillan.CrossRef Google Scholar

Stich, SP and Machery, E (2023) Demographic Differences in Philosophical Intuition: A Reply to Joshua Knobe. Review of Philosophy and Psychology, 14(2), 1–34. https://doi.org/10.1007/s13164-021-00609-7CrossRef Google Scholar

Stump, E (1985) The problem of evil. Faith and Philosophy 2, 392–423.CrossRef Google Scholar

Stump, E (2010) Wandering in Darkness: Narrative and the Problem of Suffering. Oxford: Oxford University Press.CrossRef Google Scholar

Swinburne, R (1998) Providence and the Problem of Evil. Oxford: Oxford University Press.CrossRef Google Scholar

Van Inwagen, P (2006) The Problem of Evil: The Gifford Lectures Delivered in the University of St. Andrews in 2003. New York: Oxford University Press.CrossRef Google Scholar

Van Leeuwen, N (2014) Religious credence is not factual belief. Cognition 133, 698–715.CrossRef Google Scholar

Weinberg, JM, Nichols, S and Stich, S (2001) Normativity and Epistemic Intuitions. Philosophical Topics, 29(1–2), 429–460.CrossRef Google Scholar

Wykstra, SJ (1984) The Humean obstacle to evidential arguments from suffering: On avoiding the evils of ‘appearance’. International Journal for Philosophy of Religion 16, 73–93.CrossRef Google Scholar

Wykstra, SJ (1996) Rowe’s Noseeum arguments from evil. In Howard-Snyder, D (ed.), The Evidential Argument From Evil. Bloomington, IN: Indiana University Press, pp. 126–150.Google Scholar

Article contents

Experimental philosophy and the problem of evil

Abstract

Keywords

Recent experimental philosophical research on the problem of evil

Rowe’s formulation

Empirical work on Rowe’s formulation

Follow-up research on Rowe’s formulation

Draper’s formulation

Empirical work on Draper’s formulation

Follow-up research on Draper’s formulation

Philosophical research on the problem of evil: the current state of play

Responses to evidential formulations

Moral immunity

Sceptical theism

Traditional theodicies

Traditional challenges for theodicies

The rise of ‘good-hearted’ theodicies

Potential areas of experimental philosophical research on the problem of evil

Conclusion

Acknowledgements

Competing Interests

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests