The (Dis)Advantage of Certainty: The Importance of Certainty in Language

Pamela C. Corley; Justin Wedeking

doi:10.1111/lasr.12058

Published online by Cambridge University Press: 01 January 2024

Abstract

How can legal decision makers increase the likelihood of a favorable response from other legal and social actors? To answer this, we propose a novel theory based on the certainty expressed in language that is applicable to many different legal contexts. The theory is grounded in psychology and legal advocacy and suggests that expressing certainty enhances the persuasiveness of a message. We apply this theory to the principal–agent framework to examine the treatment of Supreme Court precedent by the Federal Courts of Appeal. We find that as the level of certainty in the Supreme Court's opinion increases, the lower courts are more likely to positively treat the Court's decision. We then discuss the implications of our findings for using certainty in a broader context.

Type: Articles
Information: Law & Society Review, Volume 48, Issue 1, March 2014, pp. 35–62

DOI: https://doi.org/10.1111/lasr.12058
Copyright: © 2014 Law and Society Association.

How can political and legal decision makers increase the likelihood of a favorable response from other actors? Scholars have identified several mechanisms. Presidential administrations can control resources to exert influence on independent regulatory commissions (Moe 1982). Presidents can staff agencies with appointees who will remain ideologically compatible (Wood & Waterman 1991). Congress can monitor bureaucratic agencies by responding to “fire alarms” (McCubbins & Schwartz 1984) and can constrain federal judges by passing statutes with more detailed language (Randazzo, Waterman, & Fine 2006). Courts can achieve higher compliance from executive agencies by writing clear and explicit opinions (Spriggs 1996) and can encourage positive treatment from lower courts when they speak with a unified voice in the form of a unanimous opinion (Benesh & Reddick 2002). Within the courtroom, litigants and witnesses can increase their credibility, competency, and trustworthiness if they testify in a “style characteristic of high-status people” (Black 1989: 18). Given that policymaking typically requires some level of cooperation from another political actor, a vital question facing legal actors is how to ensure a favorable response to their decisions.

We argue that a basic but unrecognized tool is available to policy makers; it involves the certainty or “authoritativeness” of language. Language is important because it is the primary way that political and legal actors communicate with each other. Moreover, institutional and legal actors primarily communicate with each other using written language, which means the receiver of the communicated message must interpret the language before responding to it. Thus, we argue that by varying the degree of certainty or authority expressed in the language, the sender of the message has an opportunity to enhance the favorability of the response.

We apply this theory to a principal–agent framework. Specifically, we examine U.S. Supreme Court opinions, analyzing how lower courts respond to Supreme Court precedents based on the variation in the degree of certainty expressed in Court opinions. Our findings also hold larger implications for the use of certainty in other legal settings, which we discuss at the end. A key part of this hierarchical (or principal–agent) relationship concerns the relevance of “stare decisis,” which requires lower courts to defer to the outcome and reasoning established by a higher court, irrespective of their preferences. As the Supreme Court has stated, “unless we wish anarchy to prevail within the federal judicial system, a precedent of this Court must be followed by the lower federal courts no matter how misguided the judges of those courts may think it to be” (Hutto v. Davis 1982: 375). Yet, several studies show that lower courts' treatment of Supreme Court precedent is far from perfect, with lower courts citing some precedents positively while citing others negatively (e.g., Hansford & Spriggs 2006; Johnson 1979). This suggests there may be something different across opinions, related to the language of the opinion, that enhances or mitigates the response. Understanding this difference becomes vital because the legal reasoning underlying a particular decision is crucial in determining its precedential value and in understanding the evolution of the law. In fact, Spriggs (2003) argues that justices care more about substantive policy outcomes and observes that the bargaining over opinions is largely concerned with the language of the opinion. We suggest that the degree of certainty expressed in the opinion is a strong indicator of how lower courts treat Supreme Court precedents.

In this article, our goal is to understand the extent to which the language of the majority opinion influences how that opinion is treated by the lower courts. Do lower courts respond to the content of court opinions? Specifically, if the language of the opinion is more authoritative, are lower courts more likely to positively treat that opinion? In order to answer these questions, we utilize linguistic software designed to assess the certainty, or the authoritativeness, of the words used by the justices. Specifically, we use the certainty of the language expressed in the opinion to capture the authoritativeness of the opinion.¹ Our approach expands on the growing trend in empirical legal scholarship to employ computer content analysis to understand judicial decisionmaking (e.g., Black et al. 2011; Corley 2008; Owens & Wedeking 2011; Wahlbeck, Spriggs, & Sigelman 2002). It also has the benefit of taking opinion content seriously, and directly links the content of Court opinions to lower court policy outcomes. Additionally, this research further informs our understanding of how the courts of appeals treat Supreme Court opinions, a significant topic given that the courts of appeals are the de facto courts of last resort in the federal system (Hettinger, Lindquist, & Martinek 2006). We find that the language used in the majority opinion does influence the extent to which the lower courts positively treat Supreme Court precedent. Specifically, we find that as the authoritativeness of the words used in the majority opinion increases, the lower courts are more likely to positively treat the Supreme Court decision. Importantly, we believe our findings about the influence of the language are portable to other broader contexts.

Language and Persuasion

We propose studying the importance of certainty through the principal–agent framework. Principal–agent models were initially developed to examine questions of incomplete information and risk sharing, were eventually applied to organizational theory in economics, and were subsequently introduced into other fields (Moe 1984). This approach, which treats the Court as the principal and the Courts of Appeals as the agent, is common within the courts literature (e.g., Benesh 2002; George & Yoon 2003; Songer, Segal, & Cameron 1994).² Furthermore, courts that are subordinate to the Supreme Court are “subject to an absolute duty to follow its precedents” (Kim 2007). However, lower federal courts still retain discretion when deciding cases, and we argue that a persuasively written opinion is more likely to lead to positive treatment.

An appellate opinion must be written persuasively. Writing a persuasive opinion can hold together a majority coalition or enhance the reputation of the opinion (and its author) in the legal community (Hume 2006). Additionally, Supreme Court justices wish to write legally strong, persuasive opinions because they desire to produce good law and good policy and in order to be cast in as favorable a light as possible (Corley, Collins, & Calvin 2011).

Finally, a strong, persuasive opinion can enhance the extent to which opinions are implemented by lower courts. “To properly communicate the disposition of a case the judge must enable the reader to understand and accept the judge's decision. Thus the document communicating that decision must be clear and persuasive” (George 2007: 4). The success of this persuasive attempt is largely based on the ability to distinguish opinion from fact and portray assertions in persuasive ways. The Court “can firmly endorse rules or they can equivocate …” (Hume 2006: 817). In other words, the Court can choose to use words that reflect a high degree of certainty or a low degree of certainty.³ Thus, we argue a key feature of an opinion's level of certainty, or “authoritativeness,” stems from the language used in the content of the opinion. Judges wield enormous power in shaping the law and influencing society and, accordingly, “there is pressure on them to speak decisively” (Solan 1993: 2). Specifically, judges are taught that opinions “should … carry conviction …” (Federal Judicial Center 1991: 19).⁴

This argument is supported by empirical research that finds more assertive messages to be more persuasive (Hazelton, Cupach, & Liska 1986; Sniezek & Buckley 1995). This raises the question: why is certainty important for persuasiveness? Yates et al. (1996) attribute part of the answer to extremeness as an indicator of competence, having found that consumers prefer sources that make extreme confidence judgments. In addition, Sniezek and Van Swol (2001) find that principals who express more confidence are trusted more, even when the agent has less expertise, and this leads to their advice being followed more often. In sum, source certainty is important because it helps reduce the decision maker's cognitive burden.⁵

When viewed from the perspective of persuasion then, certainty is relevant and important in judicial opinions because “[w]riting opinions is a lot like writing briefs. Both are, at bottom, efforts to persuade. Lawyers want to satisfy clients and win cases. Judges want to persuade lawyers, litigants, and the community at large that the decision they have made … is the absolutely correct one” (Higdon 2010: 1242). Furthermore, the persuasive power of courts is integral for judicial effectiveness because it is the only leverage courts have (see Baird & Javeline 2007). Given that courts must rely on other actors to implement their decisions, possessing neither the power of the purse nor the sword (Hamilton 1788, Federalist Paper 78), courts instead must rely on their persuasive power, which encourages implementation of their decisions.

What are the elements of persuasive legal argument? Gardner (1993) argues that one of the key strategies is “to create the impression that the judge has no choice” (54, emphasis in original). While Gardner admits that judges exercise discretion, he is quick to point out that judges do not like to exercise discretion because it increases the difficulty of the choice (i.e., increases the cognitive burden on the judge). As part of his formula, one of Gardner's key principles is “Establishing Certainty of Authoritativeness” (1993: 55).

Gardner's advice is joined by many others. Rieke and Stutman (1990), in their book on communication in legal advocacy, spend an entire chapter on source credibility where some of the key components are the assertiveness and confidence of the message. For example, Rieke and Stutman write, “when receivers perceive a source to be confident, they confer the source higher credibility” (1990: 120). But this advice is not limited to a handful of texts. Rather, advice like it can be found in almost any guide to legal advocacy or writing. For example, Waicukauski, Sandler, and Epps (2001) write on the importance of a speaker's “ethos,” which is the Greek word for character or credibility, and argue for the importance of “Convey[ing] your Conviction” (2001: 41). Finally, Lebovits and Hidalgo (2009), advising law clerks how to draft their first judicial opinion, counsel them to “[b]e definite … not tentative” (35) and in The Judicial Opinion Writing Handbook, judges are advised, when writing opinions, to “be definite. …” (George 2007: 27). Importantly, while we focus on judges, this is not limited to judges. In his sociological look at the justice system, Black (1989) finds that how litigants and witnesses speak in court matters for their success and credibility in front of judges and juries. Thus, we argue that the more certainty expressed in Supreme Court opinions, the more persuasive those opinions will be, leading to an increase in compliance.

Although previous scholars have used different measures to capture the authoritativeness of a Supreme Court decision, including whether it was unanimous, whether it was a minimum winning coalition decision, the size of the voting majority, the number of dissenting justices, the number of dissenting opinions, and the number of special concurring opinions (Benesh & Reddick 2002; Hansford & Spriggs 2006; Johnson 1979),⁶ those measures have neglected an important feature of our legal system: the content of the opinion. In sum, by using a higher degree of authoritative language in the opinion, the opinion is more likely to be persuasive. This leads to the following hypothesis: As the degree of certainty in an opinion increases, lower courts will be more likely to interpret the precedent positively than negatively or neutrally.

Data and Method

To determine whether the language of the opinion affects the treatment of Supreme Court precedent by the lower courts, we identify and examine lower court treatments of a random sample of 110 Supreme Court cases from the 1976 to 1986 terms.⁷ The unit of analysis is a Court of Appeals decision that has interpreted one of our 110 Supreme Court opinions, which includes circuit court cases from 1976 to 2005. For our analysis, we identified 2,772 Courts of Appeals decisions through Shepard's Citations via Lexis. Shepard's is a legal resource that provides for each Supreme Court decision a list of all the subsequent cases (Supreme Court, Courts of Appeals, District Courts, and state courts) that cite the decision. Although Shepard's does not capture whether lower courts are ignoring Supreme Court precedent, Benesh and Reddick (2002) find that the Courts of Appeals do not disregard precedent they disagree with. In fact, they did not find a single opinion that overtly ignored the overruling decision.⁸

Dependent Variable

Important for purposes here, Shepard's offers an editorial analysis indicating how the subsequent decision (the “citing” case) legally interpreted the previous decision (the “cited” case). The goal of Shepard's is to ascertain whether the precedent is still good law, or whether it has been diminished based on how it is being treated (Hansford & Spriggs 2006). To be judged by Shepard's, the subsequent case must contain specific language that legally interprets the cited case (see Spriggs & Hansford 2000). In other words, a cited case is not considered to be “legally interpreted” just because it is cited.⁹ Shepard's offers for each citing case the following types of legal interpretations that are relevant to this study: “Question,” “Limit,” “Criticize,” “Distinguish,” “Follow,” “Explain,” or “Harmonize.” Shepard's labels “Followed” as positive treatment, “Explained” and “Harmonized” as neutral treatment, and “Question,” “Limit,” “Criticize,” and “Distinguish” as negative treatment. Although Shepard's codes treatments of precedent in concurring and dissenting opinions, we focus only on treatments that occur in majority opinions.

Shepard's uses “Followed” to indicate that a citing case's majority opinion “expressly” relied on a cited case as precedent (Spriggs & Hansford 2000: 330). Examples of language that lead to an opinion being coded by Shepard's as “Followed” are “controlling,” or “determinative” or “such a conclusion is required by” (Spriggs & Hansford 2000: 330). Thus, we code a circuit case that Shepard's indicates “Followed” a Supreme Court decision as positive.¹⁰ Consistent with Shepard's typology of legal treatment, we code a case that “Questioned,” “Limited,” “Criticized,” or “Distinguished” a Supreme Court decision as negative.¹¹ Finally, we code a case that “Explained” or “Harmonized” a Supreme Court decision as neutral. “Explained” indicates that the citing opinion “clarifies, interprets, construes or otherwise annotates the decision in the cited case” and “Harmonized” means “that the cases differ in some way; however, the court has found a way to reconcile and bring into harmony the apparent inconsistency” (Hansford & Spriggs 2006: 44).
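The coding rule described above amounts to a lookup from a Shepard's label to one of three categories. The following is a minimal sketch of that rule, not the authors' actual coding script; the names TREATMENT_CATEGORY and code_treatment are hypothetical.

```python
# Shepard's labels named in the text, mapped to the three-category
# dependent variable. Names in this sketch are illustrative assumptions.
TREATMENT_CATEGORY = {
    "Followed": "positive",
    "Explained": "neutral",
    "Harmonized": "neutral",
    "Questioned": "negative",
    "Limited": "negative",
    "Criticized": "negative",
    "Distinguished": "negative",
}


def code_treatment(shepards_label: str) -> str:
    """Return 'positive', 'neutral', or 'negative' for a Shepard's label."""
    try:
        return TREATMENT_CATEGORY[shepards_label]
    except KeyError:
        # Labels outside the seven used in this study are rejected rather
        # than silently assigned to a category.
        raise ValueError(f"Unrecognized Shepard's label: {shepards_label!r}")
```

Rejecting unknown labels keeps treatments that fall outside the study's typology from being miscounted.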

Spriggs and Hansford (2000) empirically tested the reliability of Shepard's analysis of Supreme Court opinions and assessed the validity of Shepard's treatment codes, finding them to be reliable and valid (see also Hansford & Spriggs 2006). Specifically, when collapsing the treatment codes into three broad categories (positive treatment, neutral treatment, and negative treatment), they found that the negative treatment code is the most reliable, and the neutral treatment code is the least reliable, although it is still considered reliable. Thus, we categorize the treatments by the circuit courts into three types: positive treatment, neutral treatment, and negative treatment. In our data, 62 percent of the cases received positive treatment, 13 percent received neutral treatment, and 25 percent received negative treatment. Because our dependent variable is nonordered and categorical, we estimate a multinomial logit (importantly, we get similar results if we estimate an ordered logit).¹²
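For readers less familiar with the estimator, the standard multinomial logit can be written as follows. The notation is ours and is only a sketch: x_i stands for the covariates of citing decision i, β_j for the coefficient vector of treatment category j, and b for the baseline category, which this section does not identify.

```latex
\Pr(Y_i = j \mid x_i)
  = \frac{\exp\!\left(x_i^{\top}\beta_j\right)}
         {\sum_{k \in \{\mathrm{pos},\,\mathrm{neu},\,\mathrm{neg}\}} \exp\!\left(x_i^{\top}\beta_k\right)},
\qquad \beta_b = 0 \ \text{for the baseline category } b .
```

Because the three probabilities sum to one, only two coefficient vectors are estimated, each interpreted relative to the baseline.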

Primary Independent Variable

Our main independent variable, degree of certainty, measures the degree of certainty expressed in the majority opinion. We generate this measure using the computer content analysis program Linguistic Inquiry and Word Count (LIWC). LIWC is a dictionary-based program, meaning that it contains lists of words that correspond to separate dictionaries that represent a larger concept. Specifically, we use LIWC's dictionary for “certainty,” which we explain more fully below. LIWC was developed by psychologists to measure a variety of things, such as expression of emotions, cognitive thought processes, use of pronouns, as well as several others (Tausczik & Pennebaker 2010). Using dictionaries, thesauruses, and questionnaires, an initial selection of words for each category was made by research assistants. Groups of three judges then independently rated whether each word was appropriate for that category. The category word lists were then updated: a word remained in the category list if two out of the three judges agreed it should be included, was deleted if at least two judges agreed it should be excluded, and was added to the category if at least two of the judges agreed it should be added. That process was then repeated by a separate group of three judges.

Dozens of studies have used indicators from LIWC to explain various phenomena, with these results demonstrating predictive validity. Moreover, LIWC's validity and reliability on a variety of its indicators have been established by several studies (e.g., Alpers et al. 2005; Bandum & Owen 2009; Cohen 2012; Kahn et al. 2007). For example, Cohen (2012) demonstrates the concurrent validity of LIWC's “certainty” indicator by showing that it correlated with a corpus-based dictionary of cognitive rigidity. In short, LIWC appears to be widely accepted as a text analysis tool. We should note, however, as with any linguistic software program, LIWC has its limitations.¹³

The 2007 LIWC dictionary for the concept “certainty” contains 83 words. Some examples include: absolutely, always, certain*, clearly, commit, completely, every, exact*, extremely, forever, indeed, inevitab*, must, never, perfect*, positiv*, precis*, totally, truly, undeniab*, undoubt*, unquestion*, where the asterisk allows the program to count any variation of the word with that stem. Appendix A contains the full list of words.¹⁴ The LIWC program works simply by searching the text for these words and counting their occurrences as a proportion of the total number of words, yielding a percentage for each category. In our sample of 110 Supreme Court cases, the certainty category ranges from 0.61 to 2.34, with a mean of 1.25 and a standard deviation of 0.323. Higher values are theorized to measure higher levels of certainty expressed by the writer. While these percentages may seem small at first glance, they are more substantial upon closer inspection. For example, a document with 5,864 words (the mean opinion length in our dataset) with 1 percent “certain” words will have about 59 “certain” words, which roughly equals 3.5 “certain” words per page in the U.S. Reports (this assumes a 17-page opinion with an average of 350 words per page). In short, while 3.5 words per page do not seem overwhelming, what matters is the cumulative effect over the course of an opinion, compared to an opinion with far fewer “certain” words.¹⁵
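The counting procedure described above can be illustrated with a short sketch. It is not LIWC itself: the tokenizer is a simplifying assumption, the function name certainty_score is hypothetical, and the word list holds only a few of the example entries quoted in the text rather than the full 83-word dictionary.

```python
import re

# A few of the "certainty" entries quoted in the text; a trailing "*" marks
# a stem. This is NOT the full LIWC 2007 list.
CERTAINTY_ENTRIES = [
    "absolutely", "always", "certain*", "clearly", "must", "never", "undoubt*",
]


def certainty_score(text: str, entries=CERTAINTY_ENTRIES) -> float:
    """Percentage of word tokens that match a dictionary word or stem."""
    # Simplified tokenizer (an assumption): lowercase runs of letters/apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    exact = {e for e in entries if not e.endswith("*")}
    stems = tuple(e[:-1] for e in entries if e.endswith("*"))
    # A token counts if it is an exact entry or begins with any stem.
    hits = sum(1 for t in tokens if t in exact or t.startswith(stems))
    return 100.0 * hits / len(tokens)
```

For instance, the sentence "It must always be certainly so" has six tokens, three of which match ("must," "always," and "certainly" via the stem certain*), giving a score of 50 percent. Full opinions score far lower because most of their words are not in the dictionary.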

To better illustrate how our measure relates to a Supreme Court opinion, we provide an example. Nixon v. Administrator of General Services, 433 U.S. 425 (1977), the legal dispute over former President Nixon's White House tapes, has a certainty score of 1.44, which is slightly above the sample mean. In the majority opinion's discussion of separation of powers and the “abundant” statutory precedent for mandatory disclosure of documents that the Executive branch possesses, consider Brennan's use of the word “never.” “Such regulation of material generated in the Executive Branch has never been considered invalid as an invasion of its autonomy” (446), where “never” is one certainty word that the LIWC program captures. Further, consider this sentence, “As the careful research by the District Court clearly demonstrates, there has never been an expectation that the confidences of the Executive Office are absolute and unyielding” (450, emphasis added). While these two sentences represent only a small sample of what the LIWC program captures, their significance becomes clearer when one considers alternative ways to construct the sentence. For example, Brennan could have simply written that “… there is no expectation that the confidences of the Executive Office are absolute and unyielding” and it would have carried the same substantive meaning, but it would have lacked the force that the added certainty brings.¹⁶

Control Variables

Previous research shows that a number of other factors influence treatment of Supreme Court precedent by the lower courts. The first factor is age of precedent, measured in years. There are two views of how the age of a precedent might influence treatment by lower courts, suggesting competing hypotheses. The first view suggests that older decisions have become fundamental to the Court and lower courts would be more likely to positively treat those cases. The second view argues recent precedents deserve more respect from the lower courts because the Supreme Court is not likely to overturn recently established precedents (see Brenner & Spaeth 1995).

Research also suggests competing directional hypotheses for the complexity of a case. Wasby (1970) views complex decisions as confusing to the lower courts and thus expects them to limit positive treatment. Alternatively, Johnson (1987) found that complex decisions were followed more often and Benesh and Reddick (2002) viewed complex decisions as fostering higher levels of positive treatment since they engender a closer reading. For complexity, we count the number of legal provisions relied upon and the number of issues raised in the precedent (Spaeth 2006).

We also include the ideological consistency with the Supreme Court majority opinion. Ideology influences lower court judges (see, e.g., Hall & Brace 1992; Songer & Haire 1992), and as the distance between the ideology of the Supreme Court decision and the members of the deciding appeals court panel increases, the likelihood of the panel treating the precedent positively should decrease. We use the Judicial Common Spaces score (Epstein et al. 2007a) for each federal court of appeals judge, district court judge,¹⁷ and each Supreme Court justice, a measure of personal ideology that places them in the same policy space. We take the absolute value of the difference between the median of the appeals court panel and the median of the precedent's majority coalition. This distance should capture whether the appeals court panel is ideologically consistent with the Supreme Court decision.
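The distance measure described above is the absolute difference between two medians. A minimal sketch, assuming each judge or justice is represented by a single numeric ideology score, is shown below; the function name and the example scores are hypothetical and are not values from the study.

```python
from statistics import median


def ideological_distance(panel_scores, majority_scores):
    """Absolute gap between the panel median and the majority-coalition median."""
    return abs(median(panel_scores) - median(majority_scores))


# Hypothetical scores for illustration only (not data from the study).
panel = [-0.2, 0.1, 0.4]                 # three-judge appeals court panel
majority = [0.3, 0.35, 0.5, 0.6, 0.7]    # justices in the majority coalition
distance = ideological_distance(panel, majority)
```

Taking the absolute value means the measure records how far apart the two medians are, not which side is more liberal or conservative.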

We also control for case importance. Although some scholars argue that important Supreme Court cases are more likely to be followed by lower courts since they are more visible (see Benesh & Reddick 2002), important cases are also more likely to be controversial, and a number of scholars (Baum 1978; Gruhl 1980; Wasby 1970) have suggested that controversial Supreme Court decisions are more likely to receive negative treatment by lower courts. Thus, we use two measures to tap into the importance of a Supreme Court case.¹⁸ The first is a measure of political importance, a dichotomous variable coded 1 if the case is a major case using the New York Times measure, and 0 otherwise (Epstein & Segal 2000; Epstein et al. 2007b).¹⁹ The second is a measure of legal importance, also a dichotomous variable, coded 1 if the case struck down a law as unconstitutional or overturned an existing precedent, and 0 otherwise (Spaeth 2006).

Next, we control for the possibility that lower courts sometimes engage in anticipatory behavior. How far away has the Supreme Court moved from the precedent? The deciding appeals court panel may engage in anticipatory behavior, taking into consideration the ideology of the Supreme Court that is sitting at the time the lower court interprets the precedent relative to that of the Supreme Court that handed down the precedent. This may be because the judges fear reversal by the Supreme Court or because they believe that is their proper role. Klein (2002) found evidence that two federal appellate judges indicated they sometimes engage in anticipatory decisionmaking. In addition, Gruhl (1981) found that federal courts were more likely to act in anticipatory compliance. Thus, we control for a change in Supreme Court ideology. Using the same ideology scores as above, we calculate the change in Supreme Court ideology from the time of the precedent by taking the absolute value of the difference between the median of the Court sitting at the time the lower court treats the decision and the median of the justices that issued the precedent. We expect that, as the distance grows, the lower courts will be less likely to treat the Supreme Court case positively.

We also account for the treatment of precedent by the Supreme Court. Hansford and Spriggs (2006) found that lower courts respond to how the Supreme Court has interpreted its own precedent. If the Court treats a case positively, by following it and declaring it to be good law, then the authority of the case is enhanced. Conversely, if the Court negatively interprets a case, then the authority of the case is diminished. To account for this, for each Supreme Court case decided during the 1976–1986 terms, we use Shepard's to identify all subsequent Supreme Court cases that positively or negatively treated it. We then count the number of times the Court's majority opinions interpreted the precedent in a positive or negative manner at the time the lower court treats the decision.²⁰ We take the difference between the number of prior positive and negative interpretations. Thus, positive values of this variable indicate that the Court has interpreted the precedent positively more often than negatively at the time the lower court treats the decision and negative scores indicate that the precedent has had more negative treatments than positive. We expect that the more often the Court has treated the precedent positively than negatively, the more likely the lower courts will treat the precedent positively.
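The net-treatment variable described above can be sketched as a running tally that depends on when the lower court acts. The sketch below is illustrative only: the function name and data layout are hypothetical, and it assumes that "prior" means strictly before the date of the lower court decision.

```python
from datetime import date


def net_prior_treatment(treatments, as_of):
    """Positive minus negative Supreme Court treatments decided before `as_of`.

    `treatments` is an iterable of (decision_date, direction) pairs, where
    direction is "positive" or "negative"; other directions are ignored.
    """
    net = 0
    for decided, direction in treatments:
        if decided >= as_of:
            continue  # assumption: only strictly earlier treatments count
        if direction == "positive":
            net += 1
        elif direction == "negative":
            net -= 1
    return net


# Hypothetical treatment history for a single precedent.
history = [
    (date(1980, 3, 1), "positive"),
    (date(1984, 6, 15), "negative"),
    (date(1990, 1, 10), "positive"),
]
```

With this history, a lower court deciding in 1985 would face a net score of zero, while one deciding in 1995 would face a net score of one, so the same precedent can carry different values for different citing decisions.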

We also control for how much support the opinion has garnered. Thus, we include a variable that is equal to 1 if the vote in the case is unanimous, and 0 otherwise.²¹ To control for Supreme Court cases that have been overruled, we include a dummy variable that is equal to 1 if the case has been overruled by the Supreme Court, and 0 otherwise. We use Shepard's to identify cases that have been overruled and expect that these cases are less likely to receive positive treatment by the lower court.

Another potential factor is opinion clarity. Principal–agent theory suggests that agents have a more difficult time evading a principal's commands when those commands are clear (Brent 2003). In other words, a clearly written opinion leaves less discretion for the lower court. Spriggs (1996) found that opinion clarity mattered for agency compliance. In addition, Staton and Vanberg (2008) show that when judges value policy outcomes (rather than managing institutional prestige), judges will value clarity over vagueness (as part of the tradeoff). Additionally, if an opinion is written very clearly, it may be more persuasive. While there are many possible ways to measure an opinion's clarity, for one proxy we use the average number of words per sentence. This is a variant of the commonly used readability measures, which capture surface characteristics of a text (such as the average sentence length) to use as a proxy for the difficulty of reading the text. We expect that as opinion clarity increases, lower courts should treat precedents more positively.
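The words-per-sentence proxy described above can be sketched in a few lines. The sentence splitter is a naive assumption that breaks on ".", "!", and "?", so it would miscount legal citations such as "433 U.S. 425"; a real implementation would need a more careful tokenizer. The function name is hypothetical.

```python
import re


def mean_sentence_length(text: str) -> float:
    """Average number of words per sentence, using a naive sentence splitter."""
    # Split on runs of terminal punctuation and drop empty fragments.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not sentences:
        return 0.0
    word_counts = [len(s.split()) for s in sentences]
    return sum(word_counts) / len(sentences)
```

Under this proxy, a lower average indicates shorter sentences and is read as a clearer opinion.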

While the readability measure captures one dimension of legal clarity, we also control for a second type of opinion clarity: the attention to detail.²² In this vein, Randazzo, Waterman, and Fine (2006) use a measure of statutory constraint, borrowed from Huber, Shipan, and Pfahler (2001) and Huber and Shipan (2002), which suggests that more detail in a statute will constrain other actors who are responsible for implementing the policy. Our measure is simply the log of the total number of words in the opinion.²³ We expect clear precedents to be treated more positively. Finally, we also include in the model dummy variables for each circuit except the First Circuit, which is used as the baseline, so that each dummy can be interpreted as the impact of a given circuit relative to the First Circuit.²⁴
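A minimal sketch of these two controls follows. It assumes a natural logarithm and whitespace tokenization (the text says only "the log of the total number of words"), and the circuit labels are hypothetical placeholders rather than the list of circuits in the authors' data.

```python
import math

def attention_to_detail(opinion_text):
    """Natural log of the opinion's word count (log base and tokenization are assumptions)."""
    n_words = len(opinion_text.split())
    if n_words == 0:
        raise ValueError("opinion has no words")
    return math.log(n_words)

def circuit_dummies(circuit, circuits, baseline):
    """One 0/1 indicator per circuit, omitting the baseline circuit."""
    return {c: int(c == circuit) for c in circuits if c != baseline}

labels = ["1st", "2nd", "3rd"]  # hypothetical subset of circuit labels
detail = attention_to_detail("a b c")
dummies = circuit_dummies("2nd", labels, baseline="1st")
```

Omitting the baseline means a case from the First Circuit has every indicator set to 0, which is what lets each remaining coefficient be read relative to that circuit.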

Results

Does more authoritative language affect the treatment of Supreme Court precedent by the Circuit Courts of Appeal? Table 1 suggests that opinions that have a higher level of certainty are more likely to be treated positively by the lower courts.

Table 1. Multinomial Logit Model of the Impact of “Certainty” on Lower Court Compliance

N = 2,772; * P < 0.05, ** P < 0.01, *** P < 0.001 (one-tailed tests where directionality hypothesized).

Note: Fixed effects for each circuit are not reported.

Specifically, even after controlling for alternative explanations, the coefficient for degree of certainty is statistically significant. Thus, the fact that the decision is more authoritative appears to affect treatment by the lower courts.²⁵ As the level of certainty increases, circuit courts are more likely to treat the majority opinion positively and less likely to treat that opinion negatively. However, the coefficient for neutral treatment is not statistically significant, suggesting that the degree of certainty does not influence how likely the lower court is to treat the opinion neutrally. Specifically, as the degree of certainty increases from the mean to one standard deviation above it, the probability of positive treatment goes from 0.634 (the baseline) to 0.662, an increase of 0.028. The probability of negative treatment drops from 0.251 to 0.218, a decrease of 0.033. When the certainty score is at its highest compared to its lowest, the probability of positive treatment increases from 0.571 to 0.716, an increase of 0.145. This finding suggests that the Supreme Court can increase compliance by using more certain, authoritative language in its opinions.
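To show how predicted probabilities of this kind are obtained from a multinomial logit, here is a minimal sketch. The intercepts and certainty coefficients are invented for illustration and are not the estimates from Table 1; positive treatment is assumed to be the reference category, and certainty is assumed to be standardized so that 0 is the mean and 1 is one standard deviation above it.

```python
import math

# Hypothetical values, NOT the estimates reported in the article.
INTERCEPTS = {"positive": 0.0, "negative": -0.9, "neutral": -1.7}
CERTAINTY_COEF = {"positive": 0.0, "negative": -0.2, "neutral": 0.0}

def predicted_probabilities(certainty_z):
    """Multinomial logit probabilities: exp(x'b_k) divided by the sum over all outcomes."""
    scores = {k: math.exp(INTERCEPTS[k] + CERTAINTY_COEF[k] * certainty_z)
              for k in INTERCEPTS}
    total = sum(scores.values())
    return {k: v / total for k, v in scores.items()}

at_mean = predicted_probabilities(0.0)
plus_one_sd = predicted_probabilities(1.0)
change_in_positive = plus_one_sd["positive"] - at_mean["positive"]
```

Because the three probabilities must sum to one, a negative coefficient on the negative-treatment equation raises the probability of the other outcomes even when their own coefficients are zero.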

To better illustrate the magnitude of our findings, Figure 1 displays the predicted probabilities of the three types of treatment based on the level of certainty, holding the other variables at their mean or modal values. Each shaded region corresponds to a different treatment: positive, neutral, and negative. Figure 1 supports our hypothesis, as it illustrates that as certainty increases, the probability of a positive treatment increases while the probability for a negative treatment decreases, with the probability of a neutral treatment staying the same. To further highlight the substantive effect a change in certainty might have, consider how a modest change in certainty will increase the number of positive treatments of an opinion. For example, with a one standard deviation increase in certainty (which is approximately 1 more certainty word per page of a 17-page opinion), our model would predict 40 more positive treatments of Supreme Court precedent. This becomes even more substantial when we consider the possibility that one Supreme Court opinion might get multiple positive treatments from several Courts of Appeals decisions.

Figure 1. Predicted Probabilities of Each Type of Treatment.

Our model also includes a series of control variables and there are several variables related both to increased negative treatment and neutral treatment.²⁶ Although opinion clarity is not statistically significant (P = 0.072, one tailed), attention to detail is statistically significant and signed in the expected direction. Specifically, more detailed precedents are treated more positively than either negatively or neutrally. As attention to detail increases, positive treatment goes from 0.634 to 0.693, an increase of 0.059. This suggests that opinion clarity, along with certainty, matters when it comes to treatment of Supreme Court precedent by lower courts, which is consistent with earlier research by Huber and Shipan (2002), Randazzo, Waterman, and Fine (2006), Spriggs (1996), and Staton and Vanberg (2008).

As we noted, scholars have disagreed about whether older or more recent Supreme Court cases are more likely to be treated positively by the lower courts. The results of this study show that the age of the Supreme Court precedent has a positive impact on lower court treatment. As precedents age, lower courts are less likely to treat a Supreme Court decision negatively or neutrally. If the Supreme Court precedent is 15 years old (one standard deviation above the mean) compared with 8 years old (the mean), the probability of positive treatment goes from 0.634 to 0.708, an increase of 0.074. Thus, the results suggest that older decisions have become fundamental and that lower courts are more likely to follow those decisions and less likely to negatively interpret them.

In addition, if the decision has more positive treatments than negative treatments by the Supreme Court, the lower courts are less likely to treat the case negatively. If the Supreme Court has treated its own precedent more positively than negatively, the probability of positive treatment goes up by 0.049 (0.634–0.683).

More complex cases are more likely to be treated neutrally than positively, while legally important cases are more likely to be treated negatively than positively. Additionally, as the difference in ideology between the Supreme Court case and the deciding appeals panel increases, the lower court is more likely to treat the precedent negatively versus positively. However, the political importance of the case does not appear to exert any systematic influence on the lower courts' treatment of the precedent.

With respect to changes in Supreme Court ideology, the odds of the lower court neutrally treating the case rather than positively treating the case increase. Specifically, the probability of positive treatment decreases by 0.029 when the distance between the ideology of the Supreme Court sitting at the time the lower court treats the precedent and the ideology of the Supreme Court that handed down the precedent increases (0.634–0.605). Thus, the lower court is treating the case in a less positive manner when the Supreme Court has moved away from the precedent. However, the case still stands as precedent. This suggests that the appeals court panel is not more likely to negatively treat the case. Thus, it appears that the circuit courts are somewhat engaging in anticipatory behavior. Finally, if the precedent was a unanimous opinion, lower courts are more likely to positively treat the precedent than negatively treat it, with the probability of positive treatment increasing by 0.047 (0.634–0.681).

Conclusion

Past research was mixed regarding whether the authoritativeness of a Supreme Court opinion influenced lower court compliance. However, those studies defined authoritativeness based on the amount of support the majority opinion had. In contrast, we examine the authoritativeness of the majority opinion based on the language the justices use in the opinion. “Opinion writing is public writing of the highest order; people are affected not only by judicial opinions but also by how they are written” (Lebovits, Curtin, & Solomon 2008: 237). Given that the judiciary's power comes arguably from its words alone, it is important to understand how the language the Court uses in its opinions can influence how lower courts treat those decisions.²⁷

Lower courts are more likely to positively treat Supreme Court precedents when the precedent contains more certain language, suggesting that there is a connection between the content of court opinions and implementation by other actors. When the Court uses more certain language, lower court judges may be more persuaded by the opinion. Although we argue that lexical choice is an important feature of the Supreme Court's persuasiveness, we are by no means assuming that lexical choice is the only tool that judges have to persuade. It is also possible that other nonlexical linguistic features, such as presuppositions used to provide background information on sentences that convey an author's purported assumption that the proposition in question is already assumed true by the addressee, may also persuade lower court judges to treat a Supreme Court precedent more favorably. However, an examination of presuppositions would entail a different type of analysis that requires a close reading of a much smaller number of opinions and is beyond the scope of the current article. Importantly, we believe that both lexical choice and other nonlexical tools, such as presuppositions, sentential syntax, and discourse coherence, all make important contributions to an opinion's ability to persuade.

Additionally, the degree of certainty used may not be completely a conscious judicial strategy. It is also entirely likely that some justices are (just as some laypeople are) more inherently gifted communicators, allowing them to write and speak “automatically” in a manner that is more convincing and certain. Some judges and lawyers undoubtedly have a “gift” or “knack” for phrasing arguments in just such a way that it makes it very difficult to disagree with them. This argument is echoed in Black's (1989) book, where he finds that speaking style matters greatly for the credibility of witnesses who testify. For those for whom expressing certainty is partly an unconscious act, we think the legal domain is no different from other domains. The presence (or absence) of this personality characteristic, however, does not diminish the fact that certainty can also be a conscious strategy that is used to try to enhance the persuasiveness of a message.

This raises a couple of important broader questions. First, what does certainty stand for? Second, is it legally relevant, and might it apply to other areas of legal decisionmaking? With respect to the first question, a rudimentary examination of the definition of certainty might suggest it stands for a firm conviction or belief that something is reliably true. That someone is willing to phrase an opinion with more certainty not only means that a particular response is desired, it also suggests that the person's reputation and keen judgment are being called to speak for the legal actor. It is another way of saying “trust me” without having to explicitly reference those terms, and its strength lies in the fact that one can use this phrasing of language to communicate with either friend or foe. It is a linguistic mechanism designed to signal that only a certain logic could have led the debate to a particular point and that it leads to only one proper conclusion. In sum, certainty stands for something that can help tip the scales in a case.

As for certainty's legal relevance, we posit that it is especially relevant to appellate court judges and hierarchical relations in the judiciary. Importantly, that is the only relationship we tested for in this article, yet we believe it also applies to other judges (e.g., lower court judges applying a higher court's ruling on the admissibility of evidence) as well as lawyers who might, when advocating for a client, phrase an important case precedent or fact in such a way to make it appear the judge has little other choice but to decide in their favor. We also think certainty is relevant for juries and their decisionmaking, as well as litigants. Consider this quote from Donald Black:

It should finally be mentioned that the success of litigants also depends on how they speak. Recent experiments show that the credibility of people in court increases if they testify in a style characteristic of high-status people. We can distinguish between “powerful” and “powerless” speech by witnesses in courtrooms … those testifying in the powerful mode have more credibility. To a judge or jury, they seem more competent and trustworthy … In various ways, then, how people speak allows the social structure of a case to insinuate itself into the courtroom when it might otherwise be unknown. (Black 1989: 18–19)

We believe it even applies to broader coverage of social movements, including how the news media cover and frame a reaction, a political protest, or a social response to a Court ruling. For example, Gamson and Modigliani (1987) document the trends in which certain affirmative action frames are used over time by columnists. They find that columnists' usage of the “delicate balance” frame, which argues the government should maintain a proper balance between remedying past discrimination and avoiding future discrimination, peaks at the time of the Bakke decision, something that is notable because of the “balance” that was eventually struck by the Court, striking down racial quotas but allowing the use of race as a criterion in school admissions. This suggests that if the media frames the case in such a way that it emphasizes certainty of an outcome or some ramification, then the case might have a much broader impact, possibly greatly influencing the momentum of a social movement or even changing the social structure of future cases.

Furthermore, these findings raise important empirical and normative questions. From an empirical standpoint, given our findings, one might suggest that all justices need to do to increase compliance is to add language to increase the certainty of an opinion. However, as we noted above, if justices want to be regarded as credible and respectable jurists, they need to exercise their own discretion, realizing that sending the signal of high certainty “all the time” will lose its value. In other words, justices need to demonstrate some modesty and temper any inclination that demands perfect compliance.

From a normative perspective, one might wonder whether having an ability to increase compliance (by changing the certainty of the language in an opinion) is a “good” or “bad” thing for the law as well as legal change in society. Although we are not entering the normative debate, we recognize that increasing the certainty of opinion language to ensure compliance may (or may not) have negative consequences that may be intended or unintended. For example, Brewer and Burke (2002) found that a more confident witness was perceived by jurors to be more credible, as indicated by the jurors' higher likelihood of believing a crime was committed, regardless of whether the witness was consistent or inconsistent in testimony (see also Whitley & Greenberg 1986). In contrast, increased certainty can also have positive effects. For example, it has long been widely accepted that one of the Supreme Court's main purposes is to clarify the law. In other words, when multiple circuit court decisions are in conflict with each other, creating uncertainty in the law, many view it as an important function of the Court to reduce this conflict. In situations where there is lower court conflict, it can be beneficial if the Supreme Court increases its level of certainty in an opinion to better ensure compliance, thus helping to alleviate conflict in the lower courts. In sum, our larger point is to emphasize the importance of documenting and highlighting the presence of the empirical finding of certainty and how it influences judges on the Courts of Appeals.²⁸

Beyond this article's primary contribution to a greater understanding of the connection between the language of court opinions and treatment by lower courts, this research corroborates the value of using computerized text analysis to understand judicial opinions. Much can be learned by employing computer-based text analysis programs (Owens & Wedeking 2012), such as the LIWC software used here (e.g., Owens & Wedeking 2011) as well as other automated methods (e.g., Corley 2008; Corley, Collins, & Calvin 2011; Laver, Benoit, & Garry 2003). For example, future research might use the LIWC software to evaluate whether more certain language used in parties' briefs leads to more favorable outcomes or whether it influences the extent to which the Supreme Court borrows from the parties' briefs. We believe that the addition of systematic research into this area will provide more insight into understanding how the law is crafted.

Appendix A

List of “Certain” Words in the 2007 LIWC Dictionary

absolute, absolutely, accura*, all, altogether, always, apparent, assur*, blatant*, certain*, clear, clearly, commit, commitment*, commits, committ*, complete, completed, completely, completes, confidence, confident, confidently, correct*, defined, definite, definitely, definitive*, directly, distinct*, entire*, essential, ever, every, everybod*, everything*, evident*, exact*, explicit*, extremely, fact, facts, factual*, forever, frankly, fundamental, fundamentalis*, fundamentally, fundamentals, guarant*, implicit*, indeed, inevitab*, infallib*, invariab*, irrefu*, must, mustnt, must'nt, mustn't, mustve, must've, necessar*, never, obvious*, perfect*, positiv*, precis*, proof, prove*, pure*, sure*, total, totally, true, truest, truly, truth*, unambigu*, undeniab*, undoubt*, unquestion*, wholly.
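A dictionary of this form can be applied with a simple matcher, sketched below under stated assumptions: an entry ending in * matches any word that begins with the preceding stem, other entries must match exactly, and the score is the percentage of all words in the text that match. The tokenizer and the short subset of entries are illustrative choices, and this is not the LIWC program itself.

```python
import re

# A small subset of the entries listed above, for illustration only.
CERTAIN_SUBSET = ["absolute", "always", "certain*", "clearly", "must", "never", "undoubt*"]

def certainty_score(text, entries=CERTAIN_SUBSET):
    """Percentage of words matching the dictionary (exact entries plus wildcard stems)."""
    exact = {e for e in entries if not e.endswith("*")}
    stems = tuple(e[:-1] for e in entries if e.endswith("*"))
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    hits = sum(1 for w in words if w in exact or w.startswith(stems))
    return 100.0 * hits / len(words)

sentence = "The statute must clearly apply and it is certainly never ambiguous"
score = certainty_score(sentence)
```

In this sentence, "must", "clearly", "certainly", and "never" match, so four of the eleven words count toward the score. As the footnotes caution, a word-level matcher of this kind ignores context, irony, and negation.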

Multinomial Logit Model of the Impact of “Certainty” on Lower Court Compliance

Footnotes

We thank the editors and anonymous reviewers for helpful comments. In addition, we thank Brad Canon, Paul Collins, Chris Olds, James Pennebaker, and Rick Waterman for helpful advice or comments on earlier versions of this article. An earlier version was presented at the 2013 Midwest Political Science Association and the 2013 Kentucky Political Science Association. All mistakes are our own.

¹ Common synonyms for authoritativeness are commanding, confident, decisive, assertive, and self-assured. Thus, we use the terms authoritative and certainty interchangeably.

² We recognize that viewing the Supreme Court and the federal Courts of Appeals through a principal–agent lens is not the only way to analyze their relationship. Accordingly, while acknowledging the principal–agent framework has some merit, though certainly not without flaws, Kim (2011) also notes that judges share a common goal—the production of law itself. Specifically, “[l]aw is the joint product of judicial efforts at all levels of the hierarchy, but it is also inevitably the ground for contestation over policy choices” (Kim 2011: 572–73). Thus, there are elements of cooperation and conflict, which most legal scholars do not explore. Moreover, as Kim (2007) points out, this hierarchy also presents the possibility of lower court judges legitimately exercising their discretion, which should not necessarily be viewed as “shirking.”

³ Importantly, the literature suggests that certainty and clarity are different aspects. For example, Hazelton, Cupach, and Liska (1986) differentiate between assertiveness and ambiguity. Miller and Peterson (2004) suggest certainty is just one indicator of attitude strength. Furthermore, Petrocelli, Tormala, and Rucker (2007) show that attitude certainty has two theoretically and empirically separate aspects. The first is clarity, the sense that one knows what one's attitude is, and the second is correctness, the sense that one's attitude is correct or valid.

⁴ However, Posner (2009: 256) has argued the following: “One judicial opinion might be better than another not because the argument was more persuasive but because by candidly disclosing the facts and authorities tugging against its result, by being tentative and concessive in tone, even by confessing doubt about the soundness of its result, it was a more credible, a more impressive judicial document.”

⁵ This does not suggest, however, that justices can artificially inject their opinions with rhetoric that oozes certainty. There are limits. Specifically, consumers (i.e., lower court judges) will still desire a strong explanation to accompany the certainty (e.g., Yates et al. 1996). In addition, there must be some variation in the certainty of recommendations, otherwise certainty will lose its value (i.e., certainty all the time will become meaningless). Furthermore, legal convention suggests practical constraints, such as one of Supreme Court Justice Antonin Scalia's proposed principles of argumentation for persuading judges. Scalia advises lawyers to “[n]ever overstate your case. Be scrupulously accurate” (Scalia & Garner 2008: 13). Finally, recent research suggests there is value in having a vague opinion because it enables judges to manage their uncertainty over policy outcomes, where vague opinions enable judges to mask noncompliance (Staton & Vanberg 2008). Thus, there may be some conditions where judges value less certainty in their opinions.

⁶ Johnson (1979) tried many different measures and found that the degree of Supreme Court support or nonsupport for a particular case has virtually no influence on how the lower courts treat that case. Along the same lines, Hansford and Spriggs (2006) find that the size of the majority coalition and the number of special concurring opinions accompanying the precedent do not influence the lower courts' treatment of the precedent. In contrast, Benesh and Reddick (2002) find that lower courts are more likely to positively treat unanimous decisions.

⁷ We exclude plurality opinions from this random sample given that plurality opinions create precedential uncertainty and lower courts are less likely to treat a plurality opinion positively and more likely to treat that opinion negatively or neutrally (see Corley 2009).

⁸ Specifically, Benesh and Reddick (2002) analyzed Courts of Appeals treatment of Supreme Court alterations of precedent. They identified common West Key numbers between the overruled and overruling decisions, ascertaining the issues that were the basis of the overruling. Then they obtained every lower court decision under those keys from the year of the overruling decision to 1999. Out of the thousands of cases generated, they examined a sample of those to determine if the lower courts were ignoring the change in precedent. They did not find a single opinion that overtly ignored the overruling decision.

⁹ If a citing case refers to a cited case but no treatment code is provided, this means the citing case referenced but did not legally treat the cited case. This is coded as a nonsubstantive treatment of a cited case, which we do not include.

¹⁰ Prior to 1993, Shepard's used the “strongest letter rule” to determine which code to apply if two codes could be applied to the same point of law in the cited case (Spriggs & Hansford 2000). This rule arranged treatment codes in terms of strength. The order of strength was: “Overruled,” “Questioned,” “Limited,” “Criticized,” “Followed,” “Distinguished,” “Explained,” and “Harmonized.” Beginning in 1993, Shepard's began giving multiple legal treatments to a cited case. In coding the cases used in this study, rather than have multiple legal treatments, we continue using Shepard's “strongest letter rule” to determine which code to apply.

¹¹ The Shepard's coding scheme categorizes distinguished treatments as weaker negative treatments than treatments coded as criticized or limited. Nevertheless, when the lower court distinguishes a Supreme Court precedent, it explicitly chooses not to apply the precedent. In so doing, the lower court limits the impact of the Supreme Court decision to a narrower set of facts, and thus limits the potential impact on future cases, regardless of the motivation of the lower court or whether others would consider the treatment reasonable.

¹² This raises two issues. First, one might alternatively consider the three categories as a set of ordered choices: positively treat > neutrally treat > negatively treat. However, it is not clear that the dependent variable is ordinal (e.g., is a negative treatment always more deleterious than a neutral treatment?). When there is uncertainty whether the dependent variable should be considered ordinal, multinomial logit is appropriate (Long 1997). To be confident, we also estimated an ordered logit model and the results are substantively the same as those produced by the multinomial logit model (i.e., certainty increases positive treatment). Second, use of a multinomial logit requires us to make the assumption of the independence of irrelevant alternatives (IIA), which requires that the categories of our dependent variable cannot be plausible alternatives for one another. Given the clear substantive differences across the categories, we think this is a plausible assumption. While this assumption can be tested with the Hausman–McFadden or Small–Hsiao tests, we note that Long and Freese (2006: 243–44) specifically counsel against using them. Nonetheless, we ran the tests where they met the assumptions and all supported the conclusion that the alternatives are independent.

¹³ For example, the website for LIWC (http://www.liwc.net) frankly admits that assessing the reliability and validity of text analysis output is “tricky” and is not the same as with questionnaires. Furthermore, because it is a dictionary-based program, it does not capture all nuances of communication: it ignores context, irony, and sarcasm. It will also miss homonyms and double entendres. Additionally, judicial opinions are a unique form of written language, written with a specific format and structure by people with specialized training, and thus some of the tools of linguistic analysis are only in the beginning stages of being applied to the evaluation of legal opinions. To be sure, this does not mean the program cannot be used on legal texts, as one of its primary creators, James Pennebaker, has a recent working paper that applies LIWC to Supreme Court opinions (Cross & Pennebaker 2012). Despite these limitations, we think it worthwhile to mention that there is no current program available that is able to capture all of the relevant elements of judicial opinions. And combined with the fact that it is important to understand whether different linguistic styles make opinions more or less powerful, we think LIWC has something beneficial to offer.

¹⁴ In addition to the 2007 dictionary, LIWC also contains a 2001 dictionary that contains only 25 words in the certainty category. Although we present the results from the 2007 dictionary, we also include the results from using the 2001 dictionary in Appendix A. Importantly, the results from either dictionary support our hypothesis (increases in certainty result in more positive treatments by lower courts). The only relevant difference in the findings involves a small change involving the coefficients for the neutral treatment category. One important consideration in choosing whether to use the 2001 or 2007 dictionary is where one prefers to reduce measurement error. Using a larger number of words (i.e., 2007 LIWC dictionary) will undoubtedly capture more raw instances of certainty in text, but it will also increase the number of “false positives” (type 1 errors). Type 1 errors are instances where a word is counted in the certainty index when it does not actually indicate certainty. In contrast, using a smaller number of words (i.e., 2001 LIWC dictionary) will undoubtedly capture fewer raw instances of certainty in text, but it will also decrease the number of “false positives” though it also increases the number of “false negatives” (type 2 errors), instances where a word is not counted in the certainty index when it actually indicates certainty. In the context of this article, a type 1 error artificially inflates the measure of certainty, increasing the likelihood of finding an association between certainty and positive treatment while a type 2 error increases the chance of finding no relationship. Normally, social science convention places a higher value on avoiding type 1 errors in this context, but we defer to the reviewers' requests to use the 2007 dictionary. Our primary hypothesis is supported by both the 2001 and 2007 dictionaries.

¹⁵ To alleviate concern that our search terms are capturing quotations made from federal statutes, we checked the 12 most “certain” opinions, which is 11 percent of our sample and a place where we might expect our measure to register these “false positives.” We found only a trivial number of quotations of federal statutes that contained the search terms; what Court opinions tend to quote from are prior opinions, especially their own prior opinions. Importantly, we assume they are selecting these quotes intentionally to have a desired effect (i.e., that they are not random). Hence, when an opinion writer selects an authoritative quotation from prior case law (one that contains a search term), we believe it a reasonably safe assumption that they are intentionally picking that to buttress the authority of their own opinion.

¹⁶ To further explore the underlying construct of certainty, perhaps another alternative would be to use LIWC's dictionary for “tentativeness” as a proxy for uncertainty. However, we believe that certainty and tentativeness are separate constructs, where being low on the certainty scale does not imply being tentative. This is supported by the fact that certainty and tentativeness are only weakly correlated (r = –0.082, P < 0.01). When we insert our measure for “tentativeness” into the model below, it is not significant (though close for some conditions) and does not change any of our findings with regard to certainty.

¹⁷ Many appeals court panels include a district court judge (e.g., Collins & Martinek 2011). Thus, we calculated their scores using the same method.

¹⁸ Case importance operates on both a political and legal dimension (see Maltzman & Wahlbeck 2004). Thus, we need two measures of case importance.

¹⁹ These are cases that (1) led to a story on the front page of the New York Times on the day after the Court handed down the decision; (2) were the lead cases in the story; and (3) were orally argued and decided with an opinion.

²⁰ We exclude “Overruled” from this count because of the following variable. We also exclude from this count any memorandum opinion that interpreted a precedent (see Hansford & Spriggs 2006). If Shepard's codes a particular treatment as negatively interpreting a precedent in more than one way, we only count this as one negative interpretation of the Supreme Court precedent. If Shepard's codes a treatment both positively and negatively, we include both of these treatments in the counts of positive and negative treatments (see Hansford & Spriggs 2006).

²¹ As a robustness check, we also added a variable for the size of the majority coalition, and the results were substantively similar. Alternatively, we inserted a dummy variable for each size of the majority coalition (e.g., five members, six members, seven members, and eight members, where a unanimous coalition was the baseline) and the results were again substantively similar.

²² We do not specifically control for clarity of legal rules. Thus, we recognize that there are other types of clarity that we are not capturing. However, we do believe that the clarity of legal rules would be correlated with at least one of our two measures of clarity. As we mentioned above, we also believe that certainty and clarity are separate constructs, and that clarity has different dimensions. Our checks support this: the two measures of opinion clarity appear to be tapping a different construct than certainty. Specifically, they are only weakly correlated with certainty (no correlation is stronger than .24), and the alpha scale reliability coefficient for the two clarity measures and the certainty measure is low (less than .08). Furthermore, the correlation between the two measures of clarity is –0.01 (P = .56). Thus, because clarity of legal rules is a separate theoretical construct from certainty, and because of the weak empirical correlations between certainty and our two measures of clarity, we believe clarity of legal rules would be only weakly correlated with certainty. Given this, we do not feel that leaving the clarity of legal rules unmeasured would change the effect of certainty on how the court of appeals treats the Supreme Court precedent.
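For readers unfamiliar with the alpha scale reliability coefficient mentioned in note 22, the following is a minimal sketch of how Cronbach's alpha can be computed for a set of measures. It is an illustration only, not the procedure used to produce the reported figures: the function name `cronbach_alpha` and the toy scores are hypothetical, and the real clarity and certainty scores are not reproduced here.

```python
from statistics import pvariance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of measures (one list of scores per measure).

    Hypothetical helper for illustration; each inner list holds one measure's
    scores across the same set of opinions, in the same order.
    """
    k = len(items)
    if k < 2:
        raise ValueError("need at least two measures")
    n = len(items[0])
    if any(len(scores) != n for scores in items):
        raise ValueError("all measures need the same number of observations")

    # Sum of the variances of the individual measures.
    item_variance = sum(pvariance(scores) for scores in items)

    # Variance of the summed scale (one total per opinion).
    totals = [sum(observation) for observation in zip(*items)]
    total_variance = pvariance(totals)
    if total_variance == 0:
        raise ValueError("summed scale has zero variance")

    # alpha = k/(k-1) * (1 - sum of item variances / variance of the total)
    return (k / (k - 1)) * (1 - item_variance / total_variance)


# Toy data (made up): two uncorrelated measures do not form a reliable scale.
measure_a = [1, 2, 3, 4]
measure_b = [2, 4, 1, 3]
low_alpha = cronbach_alpha([measure_a, measure_b])
```

An alpha near zero, as with the two uncorrelated toy measures above, suggests that the measures do not tap a single underlying construct, which is the sense in which note 22 reads a low alpha as evidence that certainty and clarity are distinct.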

²³ While we use two measures of opinion clarity, we recognize that our two measures may not capture other forms of clarity (besides clarity of legal rule). Other possible forms of clarity are: pronoun reference resolution, topic maintenance, explicit marking of organization, including topic shifts, and explicit connector words (e.g., because, next, etc.).

²⁴ We also ran two additional robustness checks: a multinomial logit regression (MNL) with fixed effects for the majority opinion writer and another MNL with fixed effects for circuit and opinion writer. Those results are almost identical to the results presented here.

²⁵ To further explore the findings on the relationship between certainty and lower court treatment, we examined the bivariate relationship between them, and the results are the same for both the 2007 and 2001 dictionaries. Certainty continues to exert a strong, significant effect on lower court treatment even when it is the only variable in the model. This test helps ensure that the observed effect was not being driven by some other confounding or collinear variable in the model. It also speaks partly to the question of whether sophisticated judges (who know other judges are doing this) are influenced by opinion language that expresses uncertainty. We believe this strategy can be both conscious and unconscious; we address the unconscious part in the conclusion. With respect to using certainty as a conscious strategy, we think that varying the certainty of language is partly responsible for success in persuasiveness. It is odd, we think, to be skeptical that Supreme Court justices craft opinions with more certainty to enhance their persuasiveness, considering that all lawyers are extensively trained to write and speak in an authoritative manner. To think that the entire legal profession (lawyers, judges, etc.) devotes this much time and this many resources to the way legal briefs read and to how lawyers speak, and yet expects these efforts not to be persuasive, is somewhat baffling. We are not arguing that expressing more certainty is guaranteed to lead to a more positive treatment. Recall that we are estimating a probabilistic model, so there will be instances where expressing high amounts of certainty will not work. But judges are, in many ways, just as likely as laypeople to fall prey to many common cognitive illusions. For just one study on this, Guthrie, Rachlinski, and Wistrich (2001) show that judges' decisionmaking was affected by five different types of cognitive illusion to which we would normally expect them to be immune. In short, judges are human. Moreover, we believe that overstating one's case, as we argue earlier, can backfire; this risk also serves as a deterrent, so that when the strategy is invoked, it is harder to detect.

²⁶ Although Table 1 does not report the circuit court dummy variables, the Third and the Ninth Circuits are more likely to treat Supreme Court cases negatively than the First Circuit, and the Fifth and Seventh Circuits are more likely to treat Supreme Court cases neutrally than the First Circuit. In addition, we also estimated a model with opinion author fixed effects to account for the fact that lower court judges may be responding to the signal of the opinion author. We find nothing changes (in fact, the certainty effect increases slightly).

²⁷ As a caveat, we should note it is possible that some third variable, such as a legal regime or political environment or process, may be at work that is influencing the amount of certainty used in opinions. If this is the case, it is possible that certainty is a proxy that is capturing the influence of this third variable. Importantly, though, if it is working through certainty, it is no longer having a direct impact on how the lower courts treat precedent.

²⁸ It is also important to recognize that we only examine the effects of certainty on Courts of Appeals judges, or what Hall (2011) refers to as vertical issues (those issues interpreted and enforced by lower courts). While Hall (2011) shows that compliance with Court decisions is much better on vertical issues compared to lateral issues (issues interpreted by noncourt actors), compliance is still not perfect on vertical issues, in particular with unpopular vertical issues, and is even lower on what Hall (2011) identifies as “unpopular lateral issues.”

References

Alpers, Georg W., et al. (2005) “Evaluation of Computerized Text Analysis in an Internet Breast Cancer Support Group,” 21 Computers in Human Behavior 361–76.

Baird, Vanessa A., & Javeline, Debra (2007) “The Persuasive Power of Russian Courts,” 60 Political Research Q. 429–42.

Bandum, Erin O'Carroll, & Owen, Jason E. (2009) “Evaluating the Validity of Computerized Content Analysis Programs for Identification of Emotional Expression in Cancer Narratives,” 21 Psychological Assessment 79–88.

Baum, Lawrence (1978) “Lower Court Response to Supreme Court Decisions: Reconsidering a Negative Picture,” 3 Justice System J. 208–19.

Benesh, Sara C. (2002) The U.S. Court of Appeals and the Law of Confessions: Perspectives on the Hierarchy of Justice. New York: LFB Scholarly Publishing LLC.

Benesh, Sara C., & Reddick, Malia (2002) “Overruled: An Event History Analysis of Lower Court Reaction to Supreme Court Alteration of Precedent,” 64 J. of Politics 534–50.

Black, Ryan, et al. (2011) “Emotions, Oral Arguments, and Supreme Court Decision Making,” 73 J. of Politics 572–81.

Black, Donald (1989) Sociological Justice. Oxford: Oxford Univ. Press.

Brenner, Saul, & Spaeth, Harold J. (1995) Stare Indecisis: The Alteration of Precedent on the Supreme Court, 1946–1992. Cambridge: Cambridge Univ. Press.

Brent, James C. (2003) “A Principal-Agent Analysis of U.S. Courts of Appeals Responses to Boerne v. Flores,” 31 American Politics Research 557–70.

Brewer, Neil, & Burke, Anne (2002) “Effects of Testimonial Inconsistencies and Eyewitness Confidence on Mock Juror Judgments,” 26 Law and Human Behavior 353–64.

Cohen, Shuki J. (2012) “Construction and Preliminary Validation of a Dictionary for Cognitive Rigidity: Linguistic Markers of Overconfidence and Overgeneralization and Their Concomitant Psychological Distress,” 41 J. of Psycholinguistic Research 347–79.

Collins, Paul M. Jr, & Martinek, Wendy L. (2011) “The Small Group Context: Designated District Court Judges in the U.S. Courts of Appeal,” 8 J. of Empirical Legal Studies 177–205.

Corley, Pamela C. (2008) “The Supreme Court and Opinion Content: The Influence of Parties' Briefs,” 61 Political Research Q. 468–78.

Corley, Pamela C. (2009) “Uncertain Precedent: Circuit Court Responses to Supreme Court Plurality Opinions,” 37 American Politics Research 30–49.

Corley, Pamela C., Collins, Paul M Jr, & Calvin, Bryan (2011) “Lower Court Influence on U.S. Supreme Court Opinion Content,” 73 J. of Politics 31–44.

Cross, Frank B., & Pennebaker, James W. (2012) “The Language of the Roberts Court.” ExpressO. Working paper available online: http://works.bepress.com/frank_cross/4.

Epstein, Lee, & Segal, Jeffrey A. (2000) “Measuring Issue Salience,” 44 American J. of Political Science 66–83.

Epstein, Lee, et al. (2007a) “The Judicial Common Space,” 23 J. of Law, Economics, & Organization 303–25.

Epstein, Lee, et al. (2007b) The Supreme Court Compendium. Washington, DC: CQ Press.

Federal Judicial Center (1991) Judicial Writing Manual.

Gamson, William A., & Modigliani, Andre (1987) “The Changing Culture of Affirmative Action,” in Braungart, Richard D, ed., Research in Political Sociology, 3. Greenwich, CT: JAI. 137–77.

Gardner, James A. (1993) Legal Argument: The Structure and Language of Effective Advocacy. Charlottesville, VA: The Michie Company.

George, Joyce J. (2007) Judicial Opinion Writing Handbook. Buffalo, NY: William S. Hein & Co., Inc.

George, Tracey E., & Yoon, Albert H. (2003) “The Federal Court System: A Principal-Agent Perspective,” 47 St. Louis Univ. Law J. 819–34.

Gruhl, John (1980) “The Supreme Court's Impact on the Law of Libel: Compliance by Lower Federal Courts,” 33 Western Political Q. 503–19.

Gruhl, John (1981) “Anticipatory Compliance with Supreme Court Rulings,” 14 Polity 294–313.

Guthrie, Chris, Rachlinski, Jeffrey J., & Wistrich, Andrew J. (2001) “Inside the Judicial Mind,” 86 Cornell Law Rev. 777–830.

Hall, Matthew E. K. (2011) The Nature of Supreme Court Power. Cambridge: Cambridge Univ. Press.

Hall, Melinda Gann, & Brace, Paul (1992) “Toward An Integrated Model of Judicial Voting Behavior,” 20 American Politics Q. 147–68.

Hamilton, Alexander (1788) Federalist Paper #78.

Hansford, Thomas G., & Spriggs, James F. II (2006) The Politics of Precedent on the U.S. Supreme Court. Princeton, NJ: Princeton Univ. Press.

Hazelton, Vincent, Cupach, William R., & Liska, Jo (1986) “Message Style: An Investigation of the Perceived Characteristics of Persuasive Messages,” 1 J. of Social Behavior and Personality 565–74.

Hettinger, Virginia A., Lindquist, Stefanie A., & Martinek, Wendy L. (2006) Judging on A Collegial Court: Influences on Federal Appellate Decision Making. Charlottesville, VA: Univ. of Virginia Press.

Higdon, Michael J. (2010) “Something Judicial This Way Comes … The Use of Foreshadowing as a Persuasive Device in Judicial Narrative,” 44 Univ. of Richmond Law Rev. 1213–60.

Huber, John D., & Shipan, Charles R. (2002) Deliberate Discretion? The Institutional Foundations of Bureaucratic Autonomy. New York: Cambridge Univ. Press.

Huber, John D., Shipan, Charles R., & Pfahler, Madelaine (2001) “Legislatures and Statutory Control of Bureaucracy,” 45 American J. of Political Science 330–45.

Hume, Robert J. (2006) “The Use of Rhetorical Sources by the U.S. Supreme Court,” 40 Law & Society Rev. 817–44.

Johnson, Charles A. (1979) “Lower Court Reactions to Supreme Court Decisions: A Quantitative Examination,” 23 American J. of Political Science 792–804.

Johnson, Charles A. (1987) “Law, Politics, and Judicial Decision Making: Lower Federal Court Uses of Supreme Court Decisions,” 21 Law & Society Rev. 325–39.

Kahn, Jeffrey H., et al. (2007) “Measuring Emotional Expression with the Linguistic Inquiry and Word Count,” 120 The American J. of Psychology 263–86.

Kim, Pauline T. (2007) “Lower Court Discretion,” 82 New York Univ. Law Rev. 383–442.

Kim, Pauline T. (2011) “Beyond Principal-Agent Theories: Law and the Judicial Hierarchy,” 105 Northwestern Univ. Law Rev. 535–75.

Klein, David E. (2002) Making Law in the United States Courts of Appeals. New York: Cambridge Univ. Press.

Laver, Michael, Benoit, Kenneth, & Garry, John (2003) “Extracting Policy Positions from Political Texts,” 97 American Political Science Rev. 311–32.

Lebovits, Gerald, Curtin, Alifya V., & Solomon, Lisa (2008) “Ethical Judicial Opinion Writing,” 21 Georgetown J. of Legal Ethics 237–309.

Lebovits, Gerald, & Hidalgo, Lucero R. (2009) “Advice to Law Clerks: How to Draft Your First Judicial Opinion,” 36 Westchester Bar J. 29–37.

Long, J. Scott, & Freese, Jeremy (2006) Regression Models for Categorical Dependent Variables Using Stata. College Station, TX: Stata Press.

Long, J. Scott (1997) Regression Models for Categorical and Limited Dependent Variables. Thousand Oaks, CA: Sage.

Maltzman, Forrest, & Wahlbeck, Paul (2004) “A Conditional Model of Opinion Assignment on the Supreme Court,” 57 Political Research Q. 551–63.

McCubbins, Mathew, & Schwartz, Thomas (1984) “Congressional Oversight Overlooked: Police Patrols versus Fire Alarms,” 28 American J. of Political Science 165–79.

Miller, Joanne M., & Peterson, David A. M. (2004) “Theoretical and Empirical Implications of Attitude Strength,” 66 J. of Politics 847–67.

Moe, Terry M. (1982) “Regulatory Performance and Presidential Administration,” 26 American J. of Political Science 197–224.

Moe, Terry M. (1984) “The New Economics of Organization,” 28 American J. of Political Science 739–77.

Owens, Ryan J., & Wedeking, Justin (2011) “Justices and Legal Clarity: Analyzing the Complexity of U.S. Supreme Court Opinions,” 45 Law & Society Rev. 1027–61.

Owens, Ryan J., & Wedeking, Justin (2012) “Some (Potential) Applications of Computer Content Analysis to the Study of Law and Courts,” 22 Law & Courts 26–32.

Petrocelli, John V., Tormala, Zakary, & Rucker, Derek D. (2007) “Unpacking Attitude Certainty: Attitude Clarity and Attitude Correctness,” 92 J. of Personality and Social Psychology 30–41.

Posner, Richard A. (2009) Law and Literature. Cambridge, MA: Harvard Univ. Press.

Randazzo, Kirk, Waterman, Richard W., & Fine, Jeffrey A. (2006) “Checking the Federal Courts: The Impact of Congressional Statutes on Judicial Behavior,” 68 J. of Politics 1006–17.

Rieke, Richard D., & Stutman, Randall K. (1990) Communication in Legal Advocacy. Columbia, SC: Univ. of South Carolina Press.

Scalia, Antonin, & Garner, Bryan A. (2008) Making Your Case: The Art of Persuading Judges. St. Paul, MN: Thomson/West.

Sniezek, Janet, & Buckley, Timothy (1995) “Cueing and Cognitive Conflict in Judge-Advisor Decision Making,” 62 Organizational Behavior and Human Decision Processes 159–74.

Sniezek, Janet, & Van Swol, Lyn (2001) “Trust, Confidence, and Expertise in a Judge-Advisor System,” 84 Organizational Behavior and Human Decision Processes 288–307.

Solan, Lawrence M. (1993) The Language of Judges. Chicago, IL: Univ. of Chicago Press.

Songer, Donald R., & Haire, Susan (1992) “Integrating Alternative Approaches to the Study of Judicial Voting: Obscenity Cases in the U.S. Courts of Appeals,” 36 American J. of Political Science 963–82.

Songer, Donald R., Segal, Jeffrey A., & Cameron, Charles M (1994) “The Hierarchy of Justice: Testing a Principal-Agent Model of Supreme Court-Circuit Court Interactions,” 38 American J. of Political Science 673–96.

Spaeth, Harold J. (2006) The Original United States Supreme Court Database, 1953–2005 Terms. East Lansing, MI: Michigan State Univ., Dept. Political Science.

Spriggs, James F. II (1996) “The Supreme Court and Federal Administrative Agencies: A Resource-Based Theory and Analysis of Judicial Impact,” 40 American J. of Political Science 1122–51.

Spriggs, James F. II (2003) “The Attitudinal Model: An Explanation of Case Dispositions, Not Substantive Policy Outcomes,” 13 Law and Courts 23–6.

Spriggs, James F. II, & Hansford, Thomas G (2000) “Measuring Legal Change: The Reliability and Validity of Shepard's Citations,” 53 Political Research Q. 327–41.

Spriggs, James F. II, & Hansford, Thomas G (2002) “The U.S. Supreme Court's Incorporation and Interpretation of Precedent,” 36 Law & Society Rev. 139–57.

Staton, Jeffrey K., & Vanberg, Georg (2008) “The Value of Vagueness: Delegation, Defiance, and Judicial Opinions,” 52 American J. of Political Science 504–19.

Tausczik, Yla R., & Pennebaker, James W. (2010) “The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods,” 29 J. of Language and Social Psychology 24–54.

Wahlbeck, Paul, Spriggs, James F. II, & Sigelman, Lee (2002) “Ghostwriters on the Court? A Stylistic Analysis of U.S. Supreme Court Opinion Drafts,” 30 American Politics Research 166–92.

Waicukauski, Ronald, Sandler, Paul M., & Epps, JoAnne (2001) The Winning Argument. Chicago, IL: American Bar Association Publishing.

Wasby, Stephen L. (1970) The Impact of the United States Supreme Court: Some Perspectives. Homewood, IL: The Dorsey Press.

Whitley, Bernard E., & Greenberg, Martin S. (1986) “The Role of Eyewitness Confidence in Juror Perceptions of Credibility,” 16 J. of Applied Social Psychology 387–409.

Wood, B. Dan, & Waterman, Richard W. (1991) “The Dynamics of Political Control of the Bureaucracy,” 85 American Political Science Rev. 801–28.

Yates, J. Frank, Price, Paul C., Lee, Ju-Whei, & Ramirez, James (1996) “Good Probabilistic Forecasters: The ‘Consumer's’ Perspective,” 12 International J. of Forecasting 41–56.

Cases Cited

Hutto v. Davis 454 U.S. 370 (1982).

Nixon v. Administrator of General Services 433 U.S. 425 (1977).

Table 1. Multinomial Logit Model of the Impact of “Certainty” on Lower Court Compliance

Figure 1. Predicted Probabilities of Each Type of Treatment.
