Positive public sector stereotypes and their impact on public service delivery: an audit experiment

Gabriela Szydlowski; Noortje de Boer; Lars Tummers

doi:10.1017/bpp.2024.27

Published online by Cambridge University Press: 07 November 2024

Gabriela Szydlowski*: Utrecht School of Governance, Utrecht University, Utrecht, the Netherlands
Noortje de Boer: Utrecht School of Governance, Utrecht University, Utrecht, the Netherlands
Lars Tummers: Utrecht School of Governance, Utrecht University, Utrecht, the Netherlands; City University of Hong Kong, Hong Kong
*: Corresponding author: Gabriela Szydlowski; Email: [email protected]

Abstract

There are both negative and positive stereotypes about public sector workers. Most studies focus on negative stereotypes, like the idea that public servants are lazy. We, however, do the opposite. We focus on a positive stereotype: public sector workers are seen as caring and helpful. We test the effects of positive stereotypes on the quality of public service delivery. Using a pre-registered audit experiment in elderly care in the Netherlands and Belgium, we find that activating a pro-social stereotype does not affect the outcome of public service quality in terms of response rate and information provision. However, it does improve the bureaucratic process: public sector workers are friendlier toward citizens. They say around 12% more ‘thank you’ in their replies. Moreover, the citizens’ gender affects the response rate: female citizens receive around 10% more replies. Concluding, we show that positive stereotyping can improve parts of the quality of public service delivery but not all.

Keywords

pro-social stereotypes; public sector workers; public service delivery; gender differences; bureaucratic encounter

Type: New Voices
Information: Behavioural Public Policy, First View, pp. 1-38

DOI: https://doi.org/10.1017/bpp.2024.27
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: Copyright © The Author(s), 2024. Published by Cambridge University Press

Introduction

Public sector workers are the face of bureaucracies. Citizens have stereotypical beliefs about them. There is a wealth of competing stereotypes about public sector workers, both positive and negative. Public sector workers are often assumed to be lazy 9-to-5 workers (London Chamber of Commerce and Industry and Hays, 2011). Scholars have started to investigate empirically what stereotypes exist of public sector workers. Contrary to traditional beliefs, this literature shows that citizens not only hold negative stereotypes, such as lazy, corrupt and inefficient, but also positively stereotype public sector workers. Public sector workers are seen as warm, competent, caring, helpful and dedicated (de Boer, 2020; Willems, 2020; Neo et al., 2023). Most studies focus on negative stereotypes, like the idea that public servants are lazy or not creative (Chen and Bozeman, 2014). We, however, do the opposite. We focus on a positive stereotype: public sector workers are seen as caring and helpful.

There is some variation in the degree of positivity of stereotypes based on characteristics of the public sector workers, such as occupation, or characteristics of the citizens themselves such as subjective level of income (de Boer, 2020; Bertram et al., 2022). While this literature is valuable for mapping what stereotypes exist, it does not help us understand the possible effects of these positive stereotypes. We assess the effect of positive stereotypes of public sector workers on workers’ behavior.

Within the stereotype activation literature, empirical evidence shows that activating positive stereotypes, also known as stereotype boost, can improve performance (Levy, 1996; Shih et al., 1999; Shih et al., 2012; Clark et al., 2017). In other words, people perform better in situations in which positive stereotypes of their group are activated (Shih et al., 2012). For instance, reminding men that they are stereotypically better at sports than women led them to perform better (Chalabaev et al., 2008). This study sets out to investigate the positive effects of stereotypes by focusing on positive stereotypes of public sector workers: What is the effect of positive stereotypes of public sector workers on their public service delivery? A pre-registered audit experiment is used in the Netherlands and Belgium to answer this research question.

Performance matters to citizens and is one of the main criteria on which citizens evaluate the government (Bouckaert and Van de Walle, 2003). All citizens encounter government administrators, and evaluations of service quality are based on facts and not pre-conceived ideas (Bouckaert and Van de Walle, 2003). People's experience of the administrative process is very important in the shaping of their attitude towards the government (Reisig and Parks, 2003; Tyler, 2006; Wang et al., 2021). There are several criteria by which we can measure government quality and performance. One such approach, as outlined by Boyne et al. (2003), includes three main criteria: efficiency, responsiveness and equity. In this paper, we focus on responsiveness.

We make three theoretical contributions. First, we contribute to the literature on public sector stereotypes. Most of the literature focuses on public sector workers and describes what stereotypes exist (de Boer, 2020; Willems, 2020; Bertram et al., 2022; with a notable exception of Szydlowski et al., 2022; Neo et al., 2023). We move this debate forward by studying the consequences of public sector stereotypes. Second, we expand the limited literature on positive stereotyping, and specifically of job stereotyping by testing how it affects public sector workers (Shih et al., 2012). Finally, the stereotype literature is focused almost exclusively on negative effects. We answer a call toward a positive public administration (Douglas et al., 2019) by focusing on the positive effects of positive stereotyping theory.

We make two practical contributions, for managers and for policy-makers. Firstly, from a practical standpoint, we know little about how concrete managerial actions can influence desired employee outcomes (Vogel and Willems, 2020). A practical implication of our study is that applying pro-social stereotypes to workers could be used by managers to influence public service delivery. Hence it is a potential micro-intervention (Vogel and Willems, 2020). Thus, if activating a positive stereotype associated with work can affect public service delivery quality in a positive manner, it would be a concrete, low-cost managerial action to implement which can foster positive interaction between citizens and the state.

Secondly, we contribute to the development of behavioral public policy by showing the potential use of positive public sector worker stereotypes on the quality of public service delivery. By doing so, we expand the range of potential behavioral interventions to enhance performance. Consequently, this can lead to an improvement in citizen satisfaction with the government and interactions between citizens and the state.

Methodologically, we are answering a call for more field experiments within public administration (Hansen and Tummers, 2020) and adhere to the practice of field experiments to test behavioral public policies (Sanders et al., 2018; Fels, 2022). We conducted an audit experiment. Covert field experiments that record subjects’ behavior without their knowledge allow researchers to make strong causal claims that cannot be made with observational data and provide much less social desirability bias (Gaddis and Crabtree, 2021). Field experiments have high value for practitioners and policy-makers as they allow for causal inference in real-world settings (Hansen and Tummers, 2020).

Field experiments that test the application of behavioral interventions to policy design have become popular in informing policy decisions and behavioral public policy studies (Sanders et al., 2018; Fels, 2022). Yet, most studies on behavioral interventions and field experiments center around nudges and boosting (Fels, 2022; examples of studies include Arroyos-Calvera et al., 2021; Gravert and Kurz, 2021; Keppeler et al., 2022; Van Roekel et al., 2022). We test a different type of low-cost intervention. Furthermore, in various field studies in public policy, high-quality scientific standards are not met, such as transparency by respecting pre-analysis plans (Fels, 2022). We have conducted a pre-registered study and adhere to the open and rigorous research approach (Perry, 2016; Vogel and Willems, 2020).

Theoretical framework

Public sector worker stereotypes

Stereotypes are beliefs about the characteristics, attributes and behaviors of members of specific groups (Stallybrass, 1977). For instance, the idea that public sector workers as a group are lazy is a stereotype. There is a long tradition of studying stereotypes, often regarding race (Vomfell and Stewart, 2021), gender (Régner et al., 2019), nationality (Rad and Ginges, 2018) and age (Levy et al., 2014). When it comes to stereotypes in the workplace, studies have been focusing mostly on ethnic and minority characteristics, age and gender-specific characteristics (Ashton and Esses, 1999; Leach et al., 2017; Willems, 2020). Yet, studies rarely explicitly examine how job stereotypes affect workers.

There are some public sector worker stereotypes – positive and negative (Wilson, 1989; Goodsell, 2004; Chen and Bozeman, 2014; Neo et al., 2023). On the positive side, these studies demonstrate that public sector workers are stereotyped with pro-social traits including warm, caring and helpful (de Boer, 2020; Willems, 2020; Bertram et al., 2022; Neo et al., 2023). Related to this, there has been a long tradition in our field that studies job stereotypes implicitly. To illustrate, within the field of organizational behavior, scholars agree that public sector workers have distinct pro-social traits (see Vogel and Willems, 2020). Public sector work is based on the opportunity to make a positive difference in other people's lives (Bolino and Grant, 2016; Vogel and Willems, 2020). Thus, it is believed that individuals who enter the public sector do so due to a motivation for pro-social impact such as helping others (Lewis and Frank, 2002; Gregg et al., 2011; Cowley and Smith, 2014). Pro-social behavior is characterized by actions intended to benefit others rather than oneself (Resh et al., 2018). Pro-social traits include helpfulness, empathy and positive attitudes such as friendliness (Zhao et al., 2016).

In addition, the public service motivation (PSM) literature provides substantial evidence for public sector workers having high pro-social traits. PSM refers to the intrinsic motivation and individual pro-social pre-dispositions associated with working in the public sector such as compassion, dedication to serve society and communities, and self-sacrifice (Perry and Wise, 1990; Grant, 2008). A large body of empirical research demonstrates that public sector workers – compared to the private sector – are seen to possess higher levels of pro-social traits (Houston, 2000; Lewis and Frank, 2002; John and Johnson, 2008; Cowley and Smith, 2014).

Stereotyping and performance

Stereotype activation

Studying the effects – and activation of stereotypes – has a long tradition in the field of psychology (Shih et al., 2012). Stereotype activation theory posits that making relevant stereotypes cognitively accessible in a particular situation (activating the stereotype) influences the attitudes and behaviors of the stereotyped individual(s) (Marx et al., 1999; Wheeler and Petty, 2001; Gupta et al., 2008). Stereotype activation increases the cognitive accessibility of characteristics ascribed to members of the stereotyped group (Wheeler and Petty, 2001), which influences people's attitudes toward and behaviors on the stereotyped task (Gupta et al., 2008). Notably, stereotype activation is believed to influence attitudes and behaviors even when people may not regard the stereotype as true for themselves or their group (Gupta et al., 2008). Thus, we expect that utilizing a pro-social stereotype about public sector workers serves as a trigger for pro-social behavior, by activating the cognitive accessibility of the characteristic of the worker, which in turn increases the confidence and motivation to follow the given characteristic.

Positive stereotyping effects

Positive stereotype activation and performance studies show mixed results. On the one hand, positive stereotypes are shown to decrease performance. Positive stereotypes are argued to lead to unrealistically high expectations (Ho et al., 1998) and worsen performance on tests (Cheryan and Bodenhausen, 2016). Yet, scholars have suggested that negative effects of positive stereotyping stem from imposing higher expectations that create stress (Cheryan and Bodenhausen, 2016). The extent to which the stereotype heightens stress levels could influence whether the effects of positive stereotyping are positive or negative (Shih et al., 2012).

On the other hand, positive stereotyping is also shown to increase performance. It is associated with self-fulfilling prophecies and confirmation bias (Madon et al., 2001). In other words, activating a positive stereotype can lead individuals to act in accordance with the stereotype. Clark et al. (2017) found that activating positive stereotypes can act as a bolster to a person's belief regarding their abilities and task performance. They found that Asian-Americans performed better in a math test after their ethnic identity was activated with positive traits associated with their group. Levy (1996) has shown that activation of negative terms associated with the elderly (e.g. senile, dementia) produced deficits in the memory abilities of elderly participants. Meanwhile, the activation of positive terms associated with the elderly (e.g. wise, experienced) produced an enhancement of the elderly participants’ memory abilities. Shih et al. (1999) found that Asian American women performed better on a mathematics test when their Asian identity was cued, but worse when their gender identity was cued.

The mixed empirical evidence on activating positive stereotypes raises the question, do positive stereotype effects hold when it comes to job stereotyping for public sector workers? One key difference to consider is that studies about gender, age and race stereotypes are addressing characteristics that an individual does not necessarily choose. An individual will have much more ease in deciding which group to join in terms of professional identity, compared to which group one belongs to on the aforementioned characteristics.

Positive stereotyping effects and public service delivery

Substantial evidence from organizational and social psychology literature demonstrates that the opportunity to make a meaningful difference in the lives of others has a large motivational potential (Grant, 2008) and it increases performance (Hackman and Oldham, 1980; Vogel and Willems, 2020). A meta-analysis shows that the opportunity to help others through one's job positively affects performance (Humphrey et al., 2007). Thus, employees’ opportunity to affect and help the lives of others (i.e. task significance) enhances employees’ perception of job meaningfulness and leads to better performance (Hackman and Oldham, 1980; Vogel and Willems, 2020).

Grant's theory (2008) posits that connecting people to their ‘pro-social’ motivation and impact enhances employee performance. A core purpose of public service work is to make a positive difference in the health, safety and well-being of individuals, groups and communities (Perry, 1996; 1997; Grant, 2008). The individuals, groups and communities that benefit from these jobs depend on pro-socially motivated employees to perform them effectively (Grant, 2008). Indeed, public sector workers are demonstrated to have pro-social traits and motivation (Lewis and Frank, 2002; Gregg et al., 2011; Cowley and Smith, 2014). As presented earlier, empirical evidence shows they are also stereotyped accordingly.

The rationale for our hypotheses is as follows: Since public sector workers often have pro-social traits (Lewis and Frank, 2002; Gregg et al., 2011; Cowley and Smith, 2014), and since making the pro-social aspect of work salient (i.e. reconnecting them to pro-social aspects of their work) is associated with better performance (Humphrey et al., 2007), we expect that activating pro-social stereotypes will remind the workers of their pro-social impact and lead to better performance quality during public service delivery compared to a control group. Thus, based on Grant's theory (2008), we hypothesize that:

H1: Activating pro-social public sector worker stereotypes will lead to better quality of public service delivery in terms of response rate.

H2: Activating pro-social public sector stereotypes will lead to better quality of public service delivery in terms of information provision.
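
As an illustration only, a hypothesis such as H1 compares a proportion (the share of inquiries that receive a reply) between a stereotype-activation group and a control group. The sketch below shows one common way to make such a comparison, a pooled two-proportion z-test. It is not taken from the study: the function name `two_proportion_z` and the counts in the usage lines are hypothetical, and the authors' own pre-registered analysis may well differ.

```python
import math


def two_proportion_z(success_a, n_a, success_b, n_b):
    """Pooled two-proportion z-test (illustrative, not the authors' analysis).

    Returns (difference in proportions, z statistic, two-sided p-value).
    """
    if n_a <= 0 or n_b <= 0:
        raise ValueError("group sizes must be positive")
    p_a = success_a / n_a
    p_b = success_b / n_b
    # Pooled proportion under the null hypothesis of no difference.
    pooled = (success_a + success_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        raise ValueError("standard error is zero; the test is undefined")
    z = (p_a - p_b) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return p_a - p_b, z, p_value


# Hypothetical counts, purely for demonstration: replies out of e-mails sent.
diff, z, p = two_proportion_z(60, 100, 40, 100)
print(f"difference={diff:.2f}, z={z:.2f}, p={p:.4f}")
```

The same comparison could be applied to H2 by counting replies that answer every question instead of counting replies received.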

Public service delivery

We hypothesize that activating positive stereotypes of public sector workers will lead to better quality during public service delivery. Now, we must explain what we mean by public service delivery.

In recent decades, governmental reforms have undergone profound changes in terms of public service delivery, often under the banner of New Public Management (NPM) (Andrews and Van de Walle, 2013; Haddad et al., 2020). The quality of public service delivery has become a prominent criterion within public administration and a standard means of evaluating public service delivery (Andrews and Van de Walle, 2013). Additionally, public service delivery quality is a criterion by which citizens judge the government (Bouckaert and Van de Walle, 2003).

Public service quality has been defined as meeting the expectations of citizens (Wisniewski, 1996; Haddad et al., 2020). Service quality can also be defined as the difference between citizens’ expectations of service and the perceptions of the service after it is received (Paul et al., 2016). For instance, did a civil servant reply to me in one week, as I expected s/he would have? Service quality is recognized as a major factor responsible for citizen satisfaction with public administration (Paul et al., 2016). Furthermore, public service quality is strongly linked to the personnel delivering that service (Haddad et al., 2020).

Similarly, public service performance is defined as the extent to which the organization's performance meets or exceeds citizens’ expectations (Wang et al., 2021). Although quality and performance are distinct concepts, their interdependence is evident, as the perception of service quality is, in part, reliant on performance. Service quality evaluation encompasses five main categories: tangible elements, reliability, responsiveness, assurance and empathy (Parasuraman et al., 1985). Similarly, government performance is assessed based on efficiency, responsiveness, equity and effectiveness (Andrews and Van de Walle, 2013). Citizen satisfaction has become increasingly crucial for evaluating government performance, aligning with the same definition as for service quality – the extent to which the organization's performance meets or surpasses citizens’ expectations (Wang et al., 2021). Responsiveness emerges as a common criterion in the evaluation of both service quality (Parasuraman et al., 1985) and government performance (Andrews and Van de Walle, 2013), underscoring the shared focus on client satisfaction.

The satisfaction of users or consumers hinges on the actual performance of the service, aligning with or exceeding anticipated standards (Parasuraman et al., 1985; Mbassi et al., 2019). Parasuraman et al. (1988) further elaborate that perceived service quality results from a comparative process, where clients evaluate the services offered by an organization in relation to their perception of the organization's performance (Mbassi et al., 2019). Therefore, public service quality can be comprehended as a dimension or derivative of performance, reflecting the intricate relationship between the two concepts in the context of governmental reforms and citizen-centric governance paradigms.

A key theme in bureaucratic encounters in service delivery pertains to public sector workers’ responsiveness (Thunman et al., 2020). Taxpaying citizens expect value for money, which is why responsiveness and efficiency are important aspects of public service delivery quality (Bourgon, 2007). Public service delivery quality is indeed characterized by efficiency, responsiveness and equity (Andrews and Van de Walle, 2013). Parasuraman et al. (1988) conceptualized a five-dimensional model for service quality: reliability, responsiveness, empathy, assurance and tangibility. Today, their quality measuring instrument is a standard for service quality (Paul et al., 2016). Responsiveness in public service delivery is the desire of the organization to efficiently deliver service, to help customers and to offer a prompt service (Parasuraman et al., 1988). Similarly, outcome – did you get what you needed – is one of the main drivers of service satisfaction among citizens (Daniels, 2016). Receiving an answer to an inquiry, for instance about a public-school program or healthcare, is an example of responsiveness and, in turn, of public service delivery quality. We therefore define public service delivery quality in terms of responsiveness: (a) whether we received a response, and (b) whether information was provided for all questions asked (Jilke et al., 2018; Van Dooren and Jilke, 2022).
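
To make this two-part definition concrete, the following minimal sketch codes each inquiry on the two outcomes: (a) whether a response was received, and (b) whether every question asked was answered. The `Reply` record, its field names and the example values are hypothetical and serve only as an illustration; they are not the study's coding scheme or data.

```python
from dataclasses import dataclass


@dataclass
class Reply:
    """Hypothetical record for one inquiry sent in an audit experiment."""
    received: bool           # (a) did the organization respond at all?
    questions_asked: int     # number of questions in the inquiry
    questions_answered: int  # number of those questions the reply addressed


def full_information(reply: Reply) -> bool:
    """(b) True only if a reply was received and covered every question."""
    return reply.received and reply.questions_answered >= reply.questions_asked


def rate(flags) -> float:
    """Share of True values in a sequence of booleans (0.0 if empty)."""
    flags = list(flags)
    return sum(flags) / len(flags) if flags else 0.0


# Made-up example values, purely for demonstration.
replies = [
    Reply(True, 3, 3),
    Reply(True, 3, 1),
    Reply(False, 3, 0),
    Reply(True, 3, 3),
]
response_rate = rate(r.received for r in replies)
information_rate = rate(full_information(r) for r in replies)
print(response_rate, information_rate)
```

Here a non-response counts as a failure on both outcomes. Whether information provision should instead be assessed only among received replies is a design choice that this sketch does not settle.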

Responsiveness in citizen encounters encompasses the timely and supportive assistance provided in specific situations (Hofstetter and Stokoe, 2015; Thunman et al., 2020). Defined literally as being ‘quick to respond or react appropriately or sympathetically; answering’ (Liao, 2018; Webster services dictionary), responsiveness is further elaborated as the organization's commitment to efficiently delivering services, aiding citizens and ensuring prompt assistance (Parasuraman et al., 1988). Framed within the social contract of governance, where governments are elected to serve the people, the importance of responsiveness is underscored by its role in citizens’ evaluations of the government and public administration (Vigoda, 2000).

In performance control, responsiveness is pivotal, representing the speed and accuracy with which service providers address requests for action and information (Thomas and Palfrey, 1996; Vigoda, 2000). The quality of interactions with public sector workers, as demonstrated in studies by Wang et al. (2021) and Brown (2007), influences how citizens assess a service, with poor-quality interactions leading to lower service ratings. Beyond individual encounters, responsiveness during public service delivery is ingrained in the core values that citizens use to evaluate public organizations (Meier and Bohte, 2006; Brown, 2007; Wang et al., 2021), and its manifestation in the services directly shapes citizens’ satisfaction with the organization (Rölle, 2017). In this sense, responsiveness is an important performance metric by which citizens evaluate the government.

Furthermore, responsiveness is an important performance metric beyond performance per se. Indeed, it relates to procedural justice and administrative burden, which in both cases impact citizen–state relations. Procedural justice is defined as citizens being treated with trust, fairness, respect and neutrality (Murphy and Tyler, 2008). In essence, it refers to the fairness of the process by which authorities make decisions and treat citizens (Tyler, 2003). It can be related to the fairness of decision-making and the quality of the treatment of citizens (Blader and Tyler, 2003). Evidence shows that citizens’ evaluation of public sector workers and the services they receive is not only determined by the ultimate outcome (i.e. being able to place your father in a given nursing home), but also by how they are treated during contact with the public sector workers (i.e. did I even receive an answer? Did the public sector worker address all of my questions or concerns?) (Blader and Tyler, 2003; Wells, 2007; Wang et al., 2021).

Moreover, if people are treated fairly, with respect and neutrality, they are more likely to comply with decisions and directives (Hinds and Murphy, 2007; Murphy and Tyler, 2008; Wang et al., 2021). Responsiveness – such as answering inquiries and providing information – is part of fair, respectful and neutral interactions with citizens. Ultimately, cross-national comparisons of citizen trust in government show that a fair and equitable process is more important than the assessments of government performance itself (Van Ryzin, 2011; Moynihan et al., 2015). In sum, citizens care as much or more about the process of their interactions with the state as they do about the outcome (Moynihan et al., 2015).

Additionally, responsiveness is relevant to the administrative burden. Put simply, a lack of responsiveness or a poor quality of responsiveness can increase the administrative burden for citizens, especially regarding learning costs. Administrative burden refers to one's experience of policy implementation as onerous (Burden et al., 2012). In other words, it refers to the costs individuals experience in their interactions with the state (Moynihan et al., 2015). Learning costs appear when individuals engage in search processes to collect information about public services and assess how they are relevant to themselves, for instance when inquiring about the steps to enroll one's parent in a public nursing home (Moynihan et al., 2015). Inconsistent information, lack of response and incomplete information to one's inquiries can, therefore, increase learning costs associated with navigating state policies and programs – the process of public service delivery. In turn, this increases the administrative burden for citizens, negatively impacting citizen–state relations.

Therefore, proper responsiveness to citizens during public service delivery has policy implications. We can improve citizen–state relations by investigating behavioral public policy strategies that enhance responsiveness during public service delivery. The learning costs associated with administrative burden can be altered, and public sector workers play a vital role in this. Their work discretion allows a considerable margin for increasing or decreasing administrative burden (Brodkin and Lipsky, 1983; Lipsky, 1984; Keiser and Soss, 1998; Soss et al., 2011). Their work thus affects the difficulties clients may have in accessing services to which they are entitled. Behavioral public policies that encourage public sector workers to decrease administrative burden, for example by lowering learning costs through replying to requests and providing complete information, are therefore valuable.

NPM ushered in a customer-oriented restructuring, redefining citizens as customers and seeking to enhance public service quality through a client-oriented approach (Haddad et al., 2020; Thunman et al., 2020). NPM aimed to improve public services by emphasizing efficiency, consistency and responsiveness to citizens’ needs, borrowing inspiration from the private sector's customer-service orientation (Hood, 1991; Andrews and van de Walle, 2013).

However, there are fundamental differences between serving and responding in the public sector and private firms. First, public organizations, governed by concerns for social welfare, equity and fair distribution of public goods, face a more intricate role than private businesses (Vigoda, 2000; Daniels, 2016). Unlike private organizations driven by constant competition and financial incentives, public entities may not gain financially from providing better service to citizens, often grappling with budget constraints determined in advance (Milakovich, 2003). Consequently, the public sector is more concerned with equity, fairness and efficient resource utilization, underlining the multifaceted nature of their relationship with citizens (Milakovich, 2003).

Why is public service delivery important?

Speaking to citizen-state interactions more broadly, poor performance and quality in public service delivery can decrease the trust that citizens have in the government. Public service delivery is a representation of the government and its bureaucracy, as it deals directly with a core function of governments: providing services (Bouckaert, 2002; Besley and Ghatak, 2007; Hadian, 2017). As seen, responsiveness is directly related to the quality and performance of these services. Good-quality public service delivery is crucial for a well-functioning public administration, affecting citizen trust and relations with the public sector (Bouckaert, 2002; Van de Walle and Bouckaert, 2003; Hadian, 2017; Jilke et al., 2018). Public distrust toward the government is often associated with the functioning of public services (Van de Walle and Bouckaert, 2003). Poor performance and quality of public service delivery can fuel low trust and negative stereotypes of governments in general (Van de Walle and Bouckaert, 2003). Conversely, good-quality services foster trust and positive stereotypes of government (Van de Walle and Bouckaert, 2003). High public service delivery quality can lead to higher citizen satisfaction (Hung et al., 2003; Mbassi et al., 2019). Thus, a major consequence of poor-quality public service delivery is its impact on citizens’ trust toward the government, which ultimately affects citizen-state relations (Bouckaert and Van de Walle, 2003).

The impact of public service delivery quality can be explained by micro-performance theory, which describes how the functioning of public administrators influences citizen perceptions of the government (Van de Walle and Bouckaert, 2003). Citizens’ evaluations of the government are influenced by the quality of the service delivery from a given administrator. Put simply, good-quality service delivery by administrators leads to satisfied customers (citizens), which in turn positively influences their attitude and trust toward the government. This influence operates not only through the macro-level functioning of the government but also at the micro level, through individual experiences. Improving the quality of public service delivery is a key goal for governments, as public services are a key determinant of quality of life (Besley and Ghatak, 2007).

Performance matters to citizens and is one of the main criteria on which citizens evaluate the government (Bouckaert and van de Walle, 2003). All citizens encounter government administrators, and evaluations of service quality are based on facts and not pre-conceived ideas (Bouckaert and van de Walle, 2003). People's experience of the administrative process is very important in shaping their attitude toward the government (Reisig and Parks, 2003; Tyler, 2006; Wang et al., 2021). There are several criteria by which we can measure government quality and performance. One such approach, as outlined by Boyne (1996), includes three main criteria: efficiency, responsiveness and equity. In this paper, we focus on responsiveness.

The use of positive public sector worker stereotypes to improve public service delivery quality is relevant from a policy perspective, too. Low-cost behavioral interventions, such as emphasizing positive stereotypes, may help enhance citizens’ experience of the administrative process and may improve citizen-state interactions.

Methods

To test our hypotheses, we developed a scalable audit experiment. First, we ran a manipulation check via a survey to assess whether our e-mails successfully activated the positive public sector pro-social stereotype of a ‘helpful worker’ (Willems, 2020; Bertram et al., 2022). Then, we tested the effect of the positive stereotype on public service delivery. Our design can be replicated across sectors, stereotypes and countries. Audit experiments often raise ethical concerns. We therefore obtained ethical approval for the study and its procedures from the ethics committee of the Faculty of Law, Economics and Governance of Utrecht University. For more detailed discussions about the ethics of audit studies, please refer to Crabtree (2018), Lahey and Beasley (2018) and Gaddis and Crabtree (2021).

Our design follows state-of-the-art practices of other audit experiments (Crabtree, 2018; Lahey and Beasley, 2018; Gaddis and Crabtree, 2021). The audit study methodology relies on sending identical information requests that differ by one attribute (in this case, stereotype activation) of the sender. The behavior (in this case, public service delivery) of the audited agents was assessed by comparing response rates and information provision across randomly assigned e-mails (Jilke et al., 2018; Van Dooren and Jilke, 2022). The e-mail itself was kept short to decrease the burden for employees. Each organization received only one e-mail to keep the administrative burden low (Jilke et al., 2018).

To test our manipulation, we evaluated seven e-mails (Appendix A). We based our text on e-mails used in other audit studies (Jilke et al., 2018; Van Dooren and Jilke, 2022). For details about the design and procedure, measures, sample and results, please refer to Appendix B. The manipulation check for stereotype activation was successful. Based on the results, we selected two e-mails: the control e-mail (e-mail 1, M = 1.92) and the highest-scoring e-mail (e-mail 5, M = 4.22) (see Table 1).

Table 1. Selected E-mails

Design and procedure

The purpose of this study was to test the effect of activating a pro-social stereotype on public service delivery. Our study was pre-registered at https://osf.io/wm8j3/?view_only=71135b7ecdbe44f5a61fe49edca06cd9 and supplementary materials, syntax and data are available at https://osf.io/txejk/?view_only=6751981ff1ef4489920396f12d23faf8. We chose nursing homes as the context for our audit study. In the Netherlands, every citizen in need of long-term care (i.e. nursing home) can rely on public funding, as the government finances and safeguards the functioning of the long-term care market (Bos et al., 2020). Similarly, in Belgium, the nursing home sector is a public service market regulated by the central government (Jilke et al., 2018; Van Dooren and Jilke, 2022). Based on residents’ care needs, the government allocates daily amounts to pay for facilities, while the compulsory national health insurance scheme bears the medical and nursing expenses (Van Dooren and Jilke, 2022). Thus, in both cases, nonprofit and public nursing homes are funded by the government, count as a public market and have been entrusted by the government to carry out public services (Van Dooren and Jilke, 2022).

We obtained the e-mail addresses of nursing homes in the Netherlands and Belgium from public records available online. We compiled all e-mail addresses and randomly assigned them to one of two conditions: no stereotype activation (control, e-mail 1) and strong stereotype activation (e-mail 5, three sentences). Each nursing home received one e-mail inquiring about their services and was given two weeks to reply (Jilke et al., 2018).
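The random assignment described above can be illustrated with a minimal Python sketch. It is not the authors' syntax (which is available in the OSF repository); the function name, the seed, the placeholder addresses and the choice of an even split between conditions are all assumptions made for illustration.

```python
import random

def assign_conditions(addresses, seed=2024):
    """Randomly assign each address to control (0) or stereotype activation (1).

    Assumption: an even split between conditions; the paper does not state
    whether assignment was balanced or simple (coin-flip) randomization.
    """
    rng = random.Random(seed)       # fixed seed so the assignment is reproducible
    shuffled = list(addresses)
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    # First half of the shuffled list -> control (0), remainder -> treatment (1)
    return {addr: (0 if i < half else 1) for i, addr in enumerate(shuffled)}

# Hypothetical addresses, for illustration only
homes = [f"home{i}@example.org" for i in range(10)]
assignment = assign_conditions(homes)
```

Each address appears exactly once in the result, which mirrors the rule that every nursing home received a single e-mail.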

Measures

Pro-social stereotype activation

We activated the pro-social stereotype of a ‘helpful worker’ (Table 1, e-mail 5). Our e-mails were randomized between male and female senders to minimize any gender effects of the sender¹ (Grohs et al., 2016). We picked the most common female and male names shared by both the Netherlands (https://forebears.io/netherlands/forenames) and Belgium (https://forebears.io/belgium/forenames) to minimize any socioeconomic status (SES) connotations that could prompt discrimination (Jilke et al., 2018). This resulted in Monique (the most popular name in the Netherlands, fourth in Belgium) for female senders, and Jan (the most popular male name in the Netherlands, sixth in Belgium) for male senders. We also picked the most common surname in each country (de Jong for the Netherlands and Peeters for Belgium). We coded the control condition as 0 and the stereotype activation condition as 1.

Public service delivery

We chose two outcome variables that represent core aspects of responsiveness in public service delivery: response rate and information provision (Jilke et al., 2018). Both outcomes were binary (coded as 0 or 1).

Response rate

We evaluated whether the response rate differed between groups (0 – no response, 1 – response). Automatic replies were excluded (such as ‘thank you for your message, we will get back to you in X working days’), and we only included actual replies from employees.

Information provision

In the e-mail, we asked three questions about the organization's services. Adequate public service delivery was defined as having answered all three questions (coded as 1). If not all questions were answered, the public service delivery performance was coded as 0.
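The coding rules for the two main outcomes can be summarized in a short Python sketch. The function and argument names are hypothetical, and the sketch assumes that information provision is scored only for homes that replied, as the phrasing ‘% of responses’ in the Results suggests.

```python
def code_reply(replied, questions_answered, n_questions=3):
    """Return (response, information_provision) for one nursing home.

    response: 1 if an employee replied, 0 otherwise (automatic replies
    are assumed to have been filtered out beforehand).
    information_provision: 1 if all questions were answered, 0 if not,
    and None when there was no reply (assumption: scored for replies only).
    """
    if not replied:
        return 0, None
    complete = 1 if questions_answered == n_questions else 0
    return 1, complete

print(code_reply(True, 3))   # full answer
print(code_reply(True, 2))   # partial answer
print(code_reply(False, 0))  # no reply
```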

Exploratory measures

We pre-registered exploratory variables to deepen the understanding of our results. We investigated friendliness as a dependent variable, and gender of the sender as a predictor of all three dependent variables. We also explored the effect of country as an exploratory independent variable on our dependent variables. Country exploratory effects are in Appendix C. We included these exploratory variables to capture different facets of public service delivery. Friendliness relates to administrative burdens as psychological costs of the procedure. Gender and country of the sender are treated as characteristics of the worker².

Friendliness

We investigated friendliness as a third dependent variable. We operationalized friendliness as saying ‘thank you’ in the reply. Examples would include ‘thank you for your e-mail/contacting us’ and ‘thank you for your interest in our facility’. We did not count a ‘thank you’ in the signature of the e-mail. We coded e-mails as 0 if there was no in-text ‘thank you’, and 1 if there was. Automatic replies were excluded (such as ‘thank you for your e-mail we will get back to you in X working days’).

Expressing gratitude, such as saying ‘thank you,’ is a form of friendliness or being friendly (Percival and Pulford, 2020). Research suggests that verbal expressions of politeness and gratitude influence interpersonal perceptions, reducing formality and increasing friendliness ratings (Percival and Pulford, 2020). That is, when individuals express ‘thank you’ in social interactions following a compliment, they are perceived as friendlier (Algoe et al., 2020; Percival and Pulford, 2020). The act of thanking someone is linked to the theory of reciprocal altruism, which posits that gratitude helps regulate our response to altruistic acts, fostering positive social interactions (Algoe et al., 2020; Carter et al., 2020).

Furthermore, friendliness during bureaucratic encounters is also relevant to public administration. Studies show that citizens’ satisfaction with the government is not limited to service outcomes, but is also influenced by the process through which citizens receive services (Reisig and Parks, 2003; Tyler, 2006; Brown, 2007; Wells, 2007; Wang et al., 2021). Friendly and polite interactions from public employees can add to a better experience of the bureaucratic encounter, which can result in higher levels of satisfaction with service provision (Wang et al., 2021).

Gender

We coded the gender of the sender as male (0) or female (1) to investigate gender effects.

Sample

We used the G*Power program for our power calculation, based on a Cohen's d of 0.2 (f² = 0.02 in G*Power). The calculation estimated 636 participants required for a power of 0.9 and an alpha of 0.05. We chose a small effect size as the literature does not provide enough evidence for a medium or large effect size.

We e-mailed 849 homes and received 573 replies, a reply rate of 67.5%. A sample of 573 allows for a power of 0.85 instead of 0.9. We aimed to contact all nursing homes in the Netherlands and Flemish Belgium. However, certain nursing homes are part of larger chains that, while offering many locations, provide only one general e-mail address for inquiries about placement. Thus, we excluded all nursing homes that provided the same contact e-mail address while keeping the one general address. By doing so, we limited spillover between our conditions. Larger chains are more common in the Netherlands, leaving us with a sample in which the majority of nursing homes are located in Flemish Belgium. We also removed all homes that had a private for-profit component in both countries, leaving us with a sample of public and nonprofit nursing homes.

Table 2 shows our sample demographics in terms of gender and country and the randomization check. We assessed the sample conditions for homogeneity with a chi-square test on the gender of the sender and country of the experiment. The differences are all non-significant, showing that randomization was successful: our treatment and control groups do not differ significantly on either demographic variable.

Table 2. Demographic comparison across groups and randomization test for gender and country
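A randomization check of this kind can be sketched in Python with the standard library only. The sketch below computes a Pearson chi-square test for a 2×2 table (condition by gender, or condition by country) without continuity correction; whether the authors applied a correction is not stated, and the counts used here are hypothetical rather than the values in Table 2.

```python
import math

def chi_square_2x2(table):
    """Pearson chi-square test of independence for a 2x2 table.

    table: [[a, b], [c, d]] of observed counts.
    Returns (chi2, p). No continuity correction is applied (assumption).
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    # With 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2))
    p = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p

# Hypothetical counts: rows = control / treatment, columns = male / female sender
chi2, p = chi_square_2x2([[210, 215], [212, 212]])
print(round(chi2, 3), round(p, 3))
```

A large p-value indicates that the two conditions do not differ detectably on that characteristic, which is the pattern the text reports.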

We had two exclusion criteria. First, e-mails that were not successfully delivered due to invalid addresses were excluded (Jilke et al., 2018). Second, responses were considered invalid if they were received more than two weeks after the e-mail had been sent (Jilke et al., 2018). We excluded 10 replies in total for answering after two weeks (6 in Belgium and 4 in the Netherlands). We had 15 invalid e-mail addresses in total.

Statistical analysis

We conducted an ordinary least squares (OLS) regression on each main outcome variable: (a) response rate and (b) information provision. We opted for OLS over logistic regression based on experimentalist recommendations (Angrist and Pischke, 2008). We conducted the same regression on our exploratory dependent variable of friendliness. Therefore, we have two pre-registered main outcome variables and one exploratory outcome variable. We also performed OLS regressions with our exploratory independent variable of gender. All of our (exploratory) analyses were pre-registered. We applied the Bonferroni correction to all analyses to account for multiple testing. Summary statistics for the results are in Table 3.
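To make the analysis strategy concrete, the following Python sketch shows an OLS regression of a binary outcome on a single binary treatment indicator, together with a Bonferroni adjustment. With one 0/1 predictor, the OLS slope equals the difference in outcome proportions between conditions, which is why a coefficient such as B = 0.014 corresponds to a reply rate of 68.2% versus 66.8%. The data, the p-values and the function names are hypothetical; the sketch is not the authors' syntax and omits standard errors.

```python
def ols_simple(x, y):
    """Closed-form OLS for y = intercept + slope * x with one predictor."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

def bonferroni(p_values):
    """Multiply each p-value by the number of tests, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]

# Hypothetical data: condition (0 = control, 1 = stereotype activation)
# and a binary outcome (1 = reply received)
condition = [0, 0, 0, 0, 1, 1, 1, 1]
replied = [1, 1, 0, 0, 1, 1, 1, 0]

intercept, slope = ols_simple(condition, replied)
# intercept = control-group proportion; slope = treatment minus control
print(intercept, slope)

# Hypothetical p-values for a family of three tests
print(bonferroni([0.01, 0.04, 0.5]))
```

In this linear probability setup the intercept is the control-group proportion and the slope can be read directly as a difference in percentage points.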

Results

Hypothesis 1

Hypothesis 1 stated that a pro-social stereotype activation would lead to a better quality of public service delivery in terms of response rate. Activating a pro-social stereotype did not affect reply rate (B = 0.014, SE = 0.032, R² = 0.000, p = 0.670). In the stereotype activation condition, the reply rate was 68.2%. In the control condition, the reply rate was 66.8%. Hypothesis 1 is rejected.

Hypothesis 2

Hypothesis 2 stated that a pro-social stereotype activation would lead to a better quality of public service delivery in terms of information provision. Activating a pro-social stereotype did not affect information provision (B = −0.042, SE = 0.041, R² = 0.002, p = 0.314). In the stereotype activation condition, 41.3% of responses provided answers to all three questions, compared with 45.5% in the control condition. Hypothesis 2 is rejected (see Table 4 and Figure 1).

Table 3. Summary statistics for results

Table 4. Ordinary least squares regression of stereotype activation

Figure 1. Stereotype activation effects on response rate, information provision and friendliness. Note: The Y-axis, ranging from 0 to 70, shows the sample percentage. Each condition shows 95% error bars.

Exploratory analyses – stereotype activation on friendliness

We investigated whether a pro-social stereotype activation affected the friendliness of the reply. We find that activating a pro-social stereotype leads to more friendliness in the replies from the workers toward the clients (B = 0.122, SE = 0.041, R² = 0.015, p = 0.003). In the stereotype activation condition, 47.0% of answers were friendly compared to 34.8% in the control condition.

Exploratory analyses – gender effects

For all three dependent variables, we included the gender of the sender and stereotype activation as predictors in an OLS regression (see Table 5)³. Results are represented in Figure 2.

Figure 2. Gender effects on response rate, information provision and friendliness.

Table 5. Exploratory OLS regression results – effects of gender

Response Rate

We explored whether the gender of the sender affected the reply rate. We find that gender affects the reply rate (B = 0.226, SE = 0.031, R² = 0.058, p < 0.001). Female senders receive more replies than male senders, roughly 10% more, consistently in both conditions. In the total sample, 56.5% of replies (324 e-mails) were for female senders, while 43.5% (248 e-mails) were for male senders, out of a total of 573 replies.

Information provision

We investigated whether the gender of the sender affected the information provision. We find that gender does not significantly affect information provision at the 0.05 level, although the effect is significant at the 0.10 level (B = 0.079, SE = 0.042, R² = 0.008, p = 0.059). That is, of the replies with complete information, the share sent to women was 22 percentage points higher than the share sent to men. In total, 249 e-mails provided complete information, of which 97 were for male senders (39%) and 152 were for female senders (61%).

Friendliness

We examined whether the gender of the sender affected friendliness of the reply. We find that gender has no effect on friendliness (B = −0.022, SE = 0.041, R² = 0.016, p = 0.597).

Discussion and conclusion

We investigated whether activating a pro-social stereotype improves the quality of public service delivery. We conducted a field experiment on two aspects of responsiveness of public service delivery: response rate and information provision. Our results lead us to distinguish two parts of public service delivery quality: bureaucratic outcome and bureaucratic process. We find that a pro-social stereotype activation does not affect the bureaucratic outcomes of response rate and information provision. However, in our exploratory analyses, we find that a pro-social stereotype activation does affect the bureaucratic process. Activating a pro-social stereotype led public sector workers to be friendlier toward citizens in the form of gratitude (saying thank you) in their replies, by around 12 percentage points. This distinction between outcome and process is in line with previous research. Citizens’ evaluation of public services is not limited to the outcomes but is also influenced by the process (Reisig and Parks, 2003; Tyler, 2006; Brown, 2007; Wells, 2007; Wang et al., 2021). Friendly and polite interactions from public employees can enhance the experience of bureaucratic encounters (Wang et al., 2021). Additionally, in our exploratory analyses, we find that the personal characteristic of gender does affect bureaucratic outcomes: women receive roughly 10% more replies than men.

Why did stereotype activation affect the process and not the outcome? A first explanation may be found in the task concordance between the stereotype activation and the effect of the activation on our outcome variable. Our stereotype activation was not about employee performance per se, but about the process with the client (being helpful). In the stereotype activation literature, the evaluated outcome task is often straightforwardly connected to the stereotype being induced. To illustrate, stereotypes about being good or bad at math are tested with math tests (Shih et al., 1999; Clark, Thiem and Kang, 2017), stereotypes about being good or bad at a sport are tested with sport performance (Chalabaev et al., 2008), and stereotypes about memory are tested with memory tests (Levy, 1996). It is possible that the stereotype of being helpful in our study was not as directly related to our outcome (reply rate and information provision), but more related to the process with the client. Future research is needed to dissect the relation between task concordance and the stereotype being activated.

A second explanation may be found in identity mechanisms. Stereotype activation literature emphasizes that for a stereotype to have an effect, the stereotyped person must identify with the stereotype domain (Smith and Johnson, 2006). For instance, if one stereotypes women as bad drivers, then for the stereotype to have an effect, one must identify as a woman. It is possible that our stereotype did not activate the professional identity of nursing home workers that would affect our professional outcome measure (i.e. reply rate and information provision) but solely activated a pro-social identity and in turn affected the process outcome measure of being helpful. Pro-social behavior is characterized by actions intended to benefit others rather than oneself (Resh et al., 2018). Friendliness falls under the umbrella of pro-social behaviors (Malti and Dys, 2018). Future research is needed to identify underlying mechanisms when activating stereotypes, such as identity based on professional group belonging. A fruitful start could be to measure the distinctive roles of PSM (Perry, 1996) and pro-social motivation (Francois and Vlassopoulos, 2008) when developing and testing stereotype activation interventions.

Our findings make important contributions to our field. First, most of the current literature investigating public sector worker stereotypes focuses on describing what stereotypes exist (de Boer, 2020; Willems, 2020; Bertram et al., 2022). We show that public sector stereotypes can have effects on citizen-state interactions. Our results demonstrate that positive stereotypes do not alter the outcome of public services (i.e. information provision and response rate remain the same) but do affect the process of public services (i.e. public sector workers are friendlier when stereotyped). This is in line with the recent work of Szydlowski et al. (2022), who demonstrated that showing vulnerability by public sector workers makes citizens behave more compassionately. More research is needed that investigates the consequences of different types of stereotypes for the process of citizen-state interactions.

Second, most of the work on stereotypes focuses on personal characteristics such as gender, age and race (Levy et al., 2014; Régner et al., 2019; Vomfell and Stewart, 2021). Our field is no exception and is almost exclusively focused on studying stereotypes of citizens’ and workers’ personal characteristics (e.g. Keiser, 2010; Jilke et al., 2018; Raaphorst et al., 2018; Harrits, 2019). We show that stereotypes of professional identity also matter. Thus, our results demonstrate that job stereotype effects exist in the public work setting. It is worthwhile to continue to study the effects of stereotypes related to social groups people choose to enter with more ease, while being aware of the stereotypes of these groups. Observational and experimental methods may be a useful combination.

Finally, we showed that positive public sector stereotypes do not affect bureaucratic outcomes in terms of response rate and information provision, which may be reassuring. However, we showed that the citizens’ gender does affect the outcome of bureaucratic procedures: women received around 10% more replies than men. This could be interpreted as men being discriminated against, based on their name, in terms of outcome when requesting elderly care services. However, this interpretation seems too simple when we delve into the discrimination literature in our field. In general, this literature almost exclusively studies name-based discrimination against racial or ethnic minorities (e.g. Jilke et al., 2018; Guul et al., 2019). A notable exception is Grohs et al. (2016), who also studied gender effects on service provision. Contrary to our results, men received more replies than women in their study across two domains: childcare and mobile home requests. Yet, they did not find a clear pattern of gender-based discrimination. They did, however, find indications that the policy context of the service being provided sometimes favors men and sometimes favors women. More specifically, men received more complete information and higher service orientation when requesting childcare, whereas women received more complete information and higher service orientation when requesting mobile homes. Our study was conducted only in the context of nursing home requests.

These two studies together show the relevance of the call for a heterogeneity revolution in behavioral sciences and theory (Bryan et al., 2021). Both field experiments investigating responsiveness in public service provision find different results in terms of outcome and process based on clients’ personal characteristic of gender. The two field experiments test different domains of public service access: nursing home requests, childcare requests and mobile home requests. Depending on the domain, women or men received more answers and more complete answers. Therefore, the personal characteristic of gender does affect bureaucratic outcomes in public service provision; we just cannot yet explain exactly how and why. Theoretically, it is worthwhile to dissect whether policy context explains why sometimes men and sometimes women receive better quality services when interacting with the state. It may also be fruitful to test whether the gender of the public sector workers themselves offers insights into these mixed results on name-based discrimination. The representative bureaucracy literature may be helpful here because there is evidence that shared values (e.g. based on gender or race) improve outcomes for citizens, which could explain differences in service delivery (Guul, 2018; Wright and Headley, 2020).

Our findings have implications for practice. First of all, the bureaucratic process is associated with several costs for the client, including psychological costs (Moynihan et al., 2015). Psychological costs refer to frustrations and stresses that arise from interacting with the state (Moynihan et al., 2015). When individuals depend upon the state for vital resources, such as the provision of health services, uncertainty about the receipt of those benefits, as well as frustrations in the process of seeking them, may increase stress. There is evidence that individuals who care for an older relative experience higher stress (Pinquart and Sörensen, 2003), yet little is known about how interactions with public sector workers to obtain benefits of caregiving (such as healthcare for nursing homes) affect that stress (Moynihan et al., 2015). Psychological costs have been addressed in research in terms of friendliness from the worker (Olsen et al., 2022). Our findings show that we are able to activate a pro-social stereotype in workers that may reduce psychological costs for the clients, in terms of friendliness. Public sector workers in the stereotype activation condition were friendlier to the clients, which can make them feel more welcomed (Olsen et al., 2022).

Secondly, our findings contribute to practice in terms of the importance of the micro-interactions between the state and citizens (Van de Walle and Bouckaert, 2003). The functioning of public administrators influences citizen perceptions of the government (Van de Walle and Bouckaert, 2003). Thus, citizens’ stereotypes, attitudes and trust in the government are influenced by the interactions and quality of the service delivery from a given administrator. Positive interactions between public sector workers and citizens are therefore a key goal. Our findings suggest a low-cost way to achieve this: activating a pro-social stereotype of public sector workers. Future research should test this effect more closely to gain a better understanding of how it could be implemented. One way is to investigate which mechanisms are at play. Practically, there is a considerable research gap in how concrete managerial actions can influence desired employee outcomes (Vogel and Willems, 2020). Thus, applying pro-social stereotypes to workers could be used by managers to influence the public service delivery process – as a sort of micro-intervention (Vogel and Willems, 2020).

Future research should explore the evaluation and perceived value of friendlier replies from public sector workers, particularly those incorporating expressions of gratitude, during the delivery of public services. It would be valuable to investigate whether responses containing phrases such as ‘thank you for your interest in our home’ or ‘thank you for your message’ are regarded as friendlier, more competent or preferable for interaction with a public sector worker, compared to replies that do not include such elements. Future research should assess the direct impact of such gratitude-infused replies on citizens’ perceptions of the quality of public service delivery and the bureaucratic process. Understanding the dynamics of how these responses are evaluated can provide insights into enhancing overall citizen-state interactions.

Thirdly, our findings have practical implications for policy-makers. Future research could compare whether the same bureaucratic outcome and process effects are found across policy domains like finance, healthcare and education. This may be especially relevant in areas where private counterparts are often deemed to provide a better quality service, such as in healthcare (Pongsupap and Van Lerberghe, 2006; Daneshkohan et al., 2020).

As such, policy-makers and scholars should consider developing and testing communication strategies emphasizing positive stereotypes specific to workers in different policy domains. We also encourage future research to replicate our results (Sanders et al., 2018). Additionally, our gender effects have further implications for policy-making. Our results show that citizens’ gender matters when they reach out to the government for services.

Finally, our findings must be considered in the light of some limitations. Our limitations pertain to generalizability and context, measures and design. In terms of generalizability, we are limited in generalizing to the public sector as a whole, as our sample was composed of both public and nonprofit organizations. That is, our population may not have fully identified as public sector workers. We are also limited to our context of testing. We cannot claim that our effect would generalize to other areas of the public sector (e.g. teaching, police and tax officials). Future studies should investigate stereotype activation effects on public sector-specific stereotypes and organizations.

Additionally, we are limited in our measures. We cannot know whether our stereotype activation worked because it activated the stereotype of a helpful worker, or the personal value of helping. Future research should examine which stereotype-relevant domain was at play. We are also limited in our measure of friendliness, and cannot claim how the effect would transfer to face-to-face interactions or tones of interactions. Finally, we are also limited in our measures for public service delivery, and thus cannot completely rule out the potential effects of stereotype activation on bureaucratic outcomes. Future research should examine other aspects of the outcome, such as efficiency, response time and time invested in a client with stereotypes more in line with the outcome.

Lastly, we are also limited in terms of our design. To gain a comprehensive understanding of the causal effects of activating public sector worker stereotypes, further investigation into information equivalence of conditions is necessary. Currently, we face limitations in drawing definitive conclusions about the effects of our e-mails, as we lack insight into the underlying cognitive processes associated with reading them. In order to address this limitation, future studies should focus on assessing participants’ perceptions of the various stereotype-activating materials, for instance, in terms of warmth, friendliness and openness. This evaluation of information equivalence, its constructs and its effects would provide valuable insights on the process of stereotype activation. Moreover, it is important to acknowledge that our study was conducted as a field experiment, which inherently lacks the high level of control typically found in laboratory settings. Consequently, we are limited in making exclusive claims that our results are solely attributed to stereotype activation. Alternative interpretations, such as the possibility that the observed effects stem from general politeness rather than stereotyping, should be considered. Future studies should investigate this nuance in depth.

Furthermore, our experimental design has limitations. The absence of a placebo group is noteworthy. Our study includes only a control group and a positive stereotype group, omitting a third group exposed solely to a positive message without a stereotype. This decision stems from constraints related to the limited statistical power of our sample size, compounded by the unavailability of additional nursing homes for inclusion, as we have already incorporated all such facilities in the Netherlands and Flemish Belgium.

Consequently, our design limitation hinders us from definitively discerning whether the observed increase in friendliness in responses within the stereotyped condition results from activating a positive stereotype or reciprocating a positive message. Although evidence supports the notion that friendliness, kindness and pro-social behavior can elicit reciprocal responses in interactions (Lubell and Scholz, 2001; Pelaprat and Brown, 2012), studies in social dilemmas and behavioral game theory present mixed and inconsistent findings regarding the reciprocity of behavior (Komorita et al., 1992; Sheldon, 1999; Parks and Rumble, 2001). Future research could delve into whether similar effects emerge in a positive message condition compared to a positive stereotype activation condition.

Moreover, another design limitation pertains to our manipulation check. In a survey experiment, we performed the manipulation check on a separate sample of public sector workers. Results may have been different if we had tested our manipulation check in the field. Due to a lack of more possible participants in our study, we could not test our conditions in the field. Future research should take this limitation into account.

One last design limitation pertains to our limited measures. That is, our primary measures of responsiveness in public service delivery are binary measures. Binary measures do have their advantages: they are simple and easy to interpret, they provide better control over study variables, they reduce potential biases and increase the accuracy of results, and they are cost-effective in studying biases and behaviors (Deeks, 2002; Bischof et al., 2022). In our design, we built on the work of other audit studies that also use binary measures (Jilke et al., 2018; Van Dooren and Jilke, 2022). However, binary measures also come with disadvantages: they lack nuance by oversimplifying phenomena, they reduce the available information, and they may be of limited use for understanding more complex situations such as social interactions (Dolnicar, 2003; Bischof et al., 2022). In other words, they limit the range of information that is available. Future research should go beyond binary measures and also incorporate other measures such as response time, e-mail tone, degree of precision of the answers to the questions and amount and type of information in the e-mail.

Conclusion

We demonstrate that activating the positive stereotype of a helpful worker affects the bureaucratic process by increasing the friendliness of the employee. Our results suggest that a positive attitude of citizens toward the public sector worker (i.e. activating a positive stereotype) will generate a positive attitude from the public sector workers toward the client (i.e. being friendly). Positive stereotypes, however, do not affect bureaucratic outcomes regarding responsiveness in public service delivery. Our findings demonstrate that not only positive stereotypes but also citizens’ gender affects the result of the bureaucratic process. Women receive more answers to requests for nursing home placement than men.

Data availability

Data and supplementary materials are available at https://osf.io/txejk/?view_only=6751981ff1ef4489920396f12d23faf8

Funding statement

This research was funded by the Nederlandse Organisatie voor Wetenschappelijk Onderzoek (NWO) for the project ‘Lazy Bureaucrats? Studying Stereotypes of civil servants and its effects across countries’ (VIDI.185.017). Grant number ‘NWO VIDI VIDI.185.017’.

Competing interest

The authors declare no potential conflict of interest.

Ethics approval statement

Ethical approval was obtained from the faculty ethical review committee of the Faculty of Law, Economics, and Governance of Utrecht University.

Appendix A

Table A1. All tested e-mails

Appendix B

Manipulation check

Design and procedure

In order to test our manipulation, we developed seven e-mails: one neutral (control), three with a light stereotype activation (one stereotype activation sentence) and three with a strong stereotype activation (three stereotype activation sentences). These are shown in Appendix A. All e-mails ask the same questions and vary solely in whether none, one, or three stereotype sentences about a helpful worker were integrated. The control condition is suitable because – as opposed to developing positive and negative e-mails – it provides a true baseline. In this way, we can assess the stereotype activation (Lonati et al., 2018). We based our e-mails on e-mails used in other audit studies (Jilke et al., 2018; Van Dooren and Jilke, 2022). Our e-mails had the sole purpose of activating stereotypes. We could not establish whether they mimicked the everyday e-mails that workers receive, since verifying this would require access to the e-mail accounts of elderly care home workers. To avoid fatiguing or boring our respondents, or revealing our manipulation to them, each respondent was randomized to rate three e-mails. In sum, we tested whether our pro-social stereotype of a helpful worker was indeed activated, and whether there was a difference in strength of activation between the conditions. Our study was pre-registered at https://aspredicted.org/BLO_RPI and supplementary materials, syntax and data are available at https://osf.io/txejk/?view_only=6751981ff1ef4489920396f12d23faf8.

Measures

This data collection was integrated as part of a larger survey experiment. In addition to our main measures, we also collected demographics: age, gender and years of experience in the public sector.

Pro-social stereotype activation

We assessed whether the pro-social stereotype of a helpful worker was activated, and the extent to which the activation varied across conditions. We asked participants: ‘To what extent is the worker being stereotyped as “very helpful”?’. Participants rated this question on a 5-point Likert scale (from 1 – not at all to 5 – very much). We chose a popular, unisex name (Alexis) to avoid possible gender and discrimination effects (Bilan et al., 2020).

Sample

Participants were recruited in the United Kingdom through an online survey panel (Prolific). We did a power calculation for seven groups and a MANOVA (for two independent variables). We pre-registered two independent variables. As only one variable is of interest for this paper, we report only one. The second variable attempted to foreshadow our experimental results to aid in the study design. We asked participants their willingness to respond to the client in the e-mail and to answer all questions. The analyses on our second variable are available in the Supplementary Materials on OSF https://osf.io/txejk/?view_only=6751981ff1ef4489920396f12d23faf8. Our sample consisted of 57% females with a median age of 38 years. Table B1 provides the details.

Table B1. Manipulation check sample demographics (n = 718)

We used the G*Power program with a small effect size (f = 0.02). This led us to an estimation of 658 participants with a power of 0.95 and an alpha of 0.05. All participants are workers in the public sector. Participants who did not pass two out of three attention checks were to be excluded from the analysis; in practice, no participant met this exclusion criterion. In the end, our sample consists of 718 participants.

Results

To analyze the results, we performed a MANOVA on the mean scores of each item.

Stereotype activation

The manipulation check for stereotype activation and strength of stereotype activation was successful. That is, there was a statistically significant difference between the neutral condition, the light activation conditions and the strong activation conditions (F(24, 7480.73) = 29.91, p < 0.001; Wilks’ Lambda = 0.726, partial Eta² = 0.08). Please refer to Table B2 for the e-mails’ descriptives and Tables B3 and B4 for the MANOVA results. The post-hoc Tukey HSD showed statistically significant differences in stereotype activation between the control e-mail and the light activation condition [Mean difference = −1.79, 95% CI = (−2.07, −1.51)], the light activation condition and the strong activation condition [Mean difference = −0.50, 95% CI = (−0.79, −0.22)], and between the control condition and the strong activation condition [Mean difference = −2.29, 95% CI = (−2.57, −2.01)]. Table B5 shows the post-hoc test results of the e-mails’ comparison.

Table B2. E-mail descriptives on stereotype activation

Table B3. MANOVA results – multivariate tests

Table B4. MANOVA – tests of between-subjects effects

Table B5. Post-hoc tests (Tukey HSD) for E-mail means comparison

For the main study, we selected e-mail number one (control; M = 1.92, SD = 1.19) and e-mail number five (strong activation; M = 4.22, SD = 1.01), the highest-scoring e-mail. Although the difference in strength of activation between the light and strong conditions was significant, we deemed it too small to justify a separate condition. Thus, we decided to select only two e-mails instead of three for the main study. This also helped to increase the power of the main study.
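The reasoning behind this selection can be traced from the post-hoc results reported above. The following is a minimal sketch, not the authors' analysis code: it copies the three reported mean differences and confidence intervals, checks that each interval excludes zero, and computes what share of the control-to-strong gap is already covered by the light activation. That share is an illustrative summary introduced here and is not a statistic reported in the study.

```python
# Post-hoc (Tukey HSD) results as reported in the text:
# comparison -> (mean difference, 95% CI lower bound, 95% CI upper bound)
comparisons = {
    "control vs light": (-1.79, -2.07, -1.51),
    "light vs strong": (-0.50, -0.79, -0.22),
    "control vs strong": (-2.29, -2.57, -2.01),
}


def excludes_zero(lower, upper):
    """Return True when a confidence interval does not contain zero."""
    return lower > 0 or upper < 0


significant = {
    name: excludes_zero(lower, upper)
    for name, (_, lower, upper) in comparisons.items()
}

# Illustrative summary (an assumption of this sketch, not a reported statistic):
# share of the control-to-strong gap already reached by the light activation.
light_share = comparisons["control vs light"][0] / comparisons["control vs strong"][0]

for name, (diff, lower, upper) in comparisons.items():
    print(f"{name}: diff={diff:+.2f}, CI=({lower:+.2f}, {upper:+.2f}), "
          f"CI excludes zero={significant[name]}")
print(f"Light activation covers {light_share:.0%} of the control-to-strong gap")
```

Read this way, all three contrasts are distinguishable from zero, but most of the total shift in perceived stereotyping is already present in the light condition, which is in line with the decision to carry only the control and the strongest e-mail into the field.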

Appendix C

Exploratory analyses – country effects

For all three dependent variables, we included country and stereotype activation as predictors in an OLS. Results are presented in Table C1.

Response rate

We explored whether there was a difference in response rates between countries. We find that the country affects the reply rate (B = 0.102, SE = 0.047, R² = 0.013, p = 0.03): the reply rate is higher in the Netherlands (74.4%) than in Flemish Belgium (63.5%).

Information provision

We investigated whether the country of the sender affected the information provision. We find that the country affects information provision (B = −0.398, SE = 0.056, R² = 0.141, p < 0.001). Our results show that fewer replies in the Netherlands provided an answer to all three questions (20.1%) compared to Belgium (58.6%).

Friendliness

We examined whether the country of the sender had an effect on the friendliness of the reply. We did not find any differences based on country (B = −0.020, SE = 0.059, R² = 0.018, p = 0.739).

Table C1. Exploratory OLS regression results – country effects
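To make the interpretation of the B coefficients in this appendix concrete, the sketch below fits an OLS model with a binary outcome (a linear probability model) on two binary predictors, country and stereotype activation. It is a minimal illustration under stated assumptions, not the authors' syntax: the reply counts are synthetic, the 0/1 coding of country and treatment is hypothetical, and the model is solved with hand-written normal equations so that the example needs only the standard library.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def ols(rows, y):
    """Return OLS coefficients via the normal equations (X'X) b = X'y."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve(xtx, xty)


# SYNTHETIC data (hypothetical, not the study's data):
# (country, treatment) -> number of replies out of 100 e-mails sent per cell.
cells = {(0, 0): 60, (0, 1): 65, (1, 0): 70, (1, 1): 75}

rows, y = [], []
for (country, treatment), replies in cells.items():
    for i in range(100):
        rows.append([1.0, float(country), float(treatment)])  # leading 1 = intercept
        y.append(1.0 if i < replies else 0.0)                 # 1 = reply received

intercept, b_country, b_treatment = ols(rows, y)
print(round(intercept, 3), round(b_country, 3), round(b_treatment, 3))
```

With a 0/1 outcome, each coefficient reads as a difference in reply probability: in this synthetic example, b_country is the gap in reply rates between the two countries while holding the treatment condition fixed. The reported B for country can be read along the same lines, as a difference in proportions adjusted for stereotype activation.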

Appendix D

Exploratory analyses suggested by the reviewer – Interaction effect of gender and treatment (stereotype activation) on gender analysis models

Table D1. Exploratory OLS regression results – effects of gender including interaction effect of gender and stereotype activation

Footnotes

¹ With gender, we mean that the name of the sender is a marker that could serve as an indicator of someone's gender identity. While we are aware that someone's gender identity is more complex than male–female, for this study, we only studied name markers that either signal male or female gender.

² We have conducted two more exploratory analyses that yielded null results, namely on friendliness of the greeting and on the number of questions asked back to the client. The results and syntax are found in the online Supplementary Materials. No other exploratory analyses were conducted.

³ As per the reviewer's suggestion, we have also conducted analyses adding the interaction effect of treatment*gender on the provided dependent variables. We find null results for the interaction effect on response rate and on friendliness. However, we find an effect on information provision. That is, it appears that the treatment works better for female senders: female senders in the stereotype activation condition receive more e-mails with complete answers to their questions than the other groups. Results for these analyses are in Appendix D.

Note: df, degrees of freedom; Sig., significance; Noncent. parameter, noncentrality parameter.

Note: Based on observed means. The error term is mean square (error) = 0.357.

References

Algoe, S. B., Dwyer, P. C., Younge, A. and Oveis, C. (2020), ‘A new perspective on the social functions of emotions: gratitude and the witnessing effect’, Journal of Personality and Social Psychology, 119(1): 40.

Andrews, R. and Van de Walle, S. (2013), ‘New public management and citizens’ perceptions of local service efficiency, responsiveness, equity and effectiveness’, Public Management Review, 15(5): 762–783.

Angrist, J. D. and Pischke, J.-S. (2008), ‘Parallel worlds: fixed effects, differences-in-differences, and panel data’, in Mostly Harmless Econometrics, Princeton: Princeton University Press, 221–248.

Arroyos-Calvera, D., Drouvelis, M., Lohse, J. and McDonald, R. (2021), ‘Improving compliance with COVID-19 guidance: a workplace field experiment’, Behavioural Public Policy, 1–23.

Ashton, M. C. and Esses, V. M. (1999), ‘Stereotype accuracy: estimating the academic performance of ethnic groups’, Personality and Social Psychology Bulletin, 25(2): 225–236.

Bertram, I., Bouwman, R. and Tummers, L. (2022), ‘Socioeconomic status and public sector worker stereotypes: results from a representative survey’, Public Administration Review, 82(2): 237–255.

Besley, T. and Ghatak, M. (2007), ‘Reforming public service delivery’, Journal of African Economies, 16: 127–156.

Bilan, Y., Mishchuk, H., Samoliuk, N. and Mishchuk, V. (2020), ‘Gender discrimination and its links with compensations and benefits practices in enterprises’, Entrepreneurial Business and Economics Review, 8(3): 189–204.

Bischof, D., Cohen, G., Cohen, S., Foos, F., Kuhn, P. M., Nanou, K., Visalvanich, N. and Vivyan, N. (2022), ‘Advantages, challenges and limitations of audit experiments with constituents’, Political Studies Review, 20(2): 192–200.

Blader, S. L. and Tyler, T. R. (2003), ‘A four-component model of procedural justice: defining the meaning of a “fair” process’, Personality and Social Psychology Bulletin, 29(6): 747–758.

Bolino, M. C. and Grant, A. M. (2016), ‘The bright side of being prosocial at work, and the dark side, too: a review and agenda for research on other-oriented motives, behavior, and impact in organizations’, Academy of Management Annals, 10(1): 599–670.

Bos, A., Margareth Kruse, F. and Theodoor Jeurissen, P. P. (2020), ‘For-profit nursing homes in the Netherlands: what factors explain their rise?’, International Journal of Health Services, 50(4): 431–443.

Bouckaert, G. (2002), ‘Pride and performance in public service: some patterns of analysis’, International Review of Administrative Sciences, 67(1): 15–27.

Bouckaert, G. and Van de Walle, S. (2003), ‘Quality of Public Service Delivery and Trust in Government’, in Salminen, A. (ed.), Governing Networks: EGPA Yearbook, vol. 22, Amsterdam: IOS Press, 299–318.

Bourgon, J. (2007), ‘Responsive, responsible and respected government: towards a new public administration theory’, International Review of Administrative Sciences, 73(1): 7–26.

Boyne, G. A. (1996), ‘Scale, performance and the new public management: an empirical analysis of local authority services’, Journal of Management Studies, 33(6): 809–826.

Boyne, G. A., Farrell, C., Law, J., Powell, M. and Walker, R. M. (2003), Evaluating Public Management Reforms. Buckingham: Open University Press.

Brodkin, E. and Lipsky, M. (1983), ‘Quality control in AFDC as an administrative strategy’, Social Service Review, 57(1): 1–34.

Brown, T. (2007), ‘Coercion versus choice: citizen evaluations of public service quality across methods of consumption’, Public Administration Review, 67(3): 559–572.

Bryan, C. J., Tipton, E. and Yeager, D. S. (2021), ‘Behavioural science is unlikely to change the world without a heterogeneity revolution’, Nature Human Behaviour, 5(8): 980–989.

Burden, B. C., Canon, D. T., Mayer, K. R. and Moynihan, D. P. (2012), ‘The effect of administrative burden on bureaucratic perception of policies: evidence from election administration’, Public Administration Review, 72(5): 741–751.

Carter, G., Chen, M.-H. and Razik, I. (2020), ‘The theory of reciprocal altruism’, in Shackelford, T. K. (ed.), The SAGE Handbook of Evolutionary Psychology: Foundations of Evolutionary Psychology, Sage Publications Ltd., 170–187.

Chalabaev, A., Stone, J., Sarrazin, P. and Croizet, J.-C. (2008), ‘Investigating physiological and self-reported mediators of stereotype lift effects on a motor task’, Basic and Applied Social Psychology, 30(1): 18–26.

Chen, C. A. and Bozeman, B. (2014), ‘Am I a public servant or am I a pathogen? Public managers’ sector comparison of worker abilities’, Public Administration, 92(3): 549–564.

Cheryan, S. and Bodenhausen, G. V. (2016), ‘When positive stereotypes threaten intellectual performance: the psychological hazards of “Model Minority” status’, Psychological Science, 11(5).

Clark, J. K., Thiem, K. C. and Kang, S. (2017), ‘Positive stereotype validation: the bolstering effects of activating positive stereotypes after intellectual performance’, Personality and Social Psychology Bulletin, 43(12): 1630–1642.

Cowley, E. and Smith, S. (2014), ‘Motivation and mission in the public sector: evidence from the world values survey’, Theory and Decision, 76(2): 241–263.

Crabtree, C. (2018), ‘An Introduction to Conducting Email Audit Studies’, in Gaddis, S. (ed.), Audit Studies: Behind the Scenes with Theory, Method, and Nuance, Cham: Springer, 103–117.

Daneshkohan, A., Zarei, E. and Ahmadi-Kashkoli, S. (2020), ‘Health system responsiveness: a comparison between public and private hospitals in Iran’, International Journal of Healthcare Management, 13(sup1): 296–301.

Daniels, A. (2016), ‘Quality in public service delivery’, International Journal of Civil Service Reform and Practice, 1(2): 55–64.

de Boer, N. (2020), ‘How do citizens assess street-level bureaucrats’ warmth and competence? A typology and test’, Public Administration Review, 80(4): 532–542.

Deeks, J. J. (2002), ‘Issues in the selection of a summary statistic for meta-analysis of clinical trials with binary outcomes’, Statistics in Medicine, 21(11): 1575–1600.

Dolnicar, S. (2003), Simplifying three-way questionnaires-do the advantages of binary answer categories compensate for the loss of information?

Douglas, S., ’t Hart, P., Ansell, C., Anderson, L., Flinders, M., Head, B. and Moynihan, D. (2019), Towards Positive Public Administration: A Manifesto. Submitted to Public Administration Review.

Fels, K. M. (2022), ‘Who nudges whom? Expert opinions on behavioural field experiments with public partners’, Behavioural Public Policy, 1–37.

Francois, P. and Vlassopoulos, M. (2008), ‘Pro-social motivation and the delivery of social services’, CESifo Economic Studies, 54(1): 22–54.

Gaddis, S. M. and Crabtree, C. (2021), Correspondence Audit Studies are Necessary to Understand Discrimination. Available at SSRN 3813269.

Goodsell, C. T. (1983; 2004; 2014), The Case for Bureaucracy: A Public Administration Polemic. Chatham, UK: Chatham House Publishers.

Grant, A. M. (2008), ‘Employees without a cause: the motivational effects of prosocial impact in public service’, International Public Management Journal, 11(1): 48–66.

Gravert, C. and Kurz, V. (2021), ‘Nudging à la carte: a field experiment on climate-friendly food choice’, Behavioural Public Policy, 5(3): 378–395.

Gregg, P., Grout, P. A., Ratcliffe, A., Smith, S. and Windmeijer, F. (2011), ‘How important is pro-social behaviour in the delivery of public services?’, Journal of Public Economics, 95(7–8): 758–766.

Grohs, S., Adam, C. and Knill, C. (2016), ‘Are some citizens more equal than others? Evidence from a field experiment’, Public Administration Review, 76(1): 155–164.

Gupta, V. K., Turban, D. B. and Bhawe, N. M. (2008), ‘The effect of gender stereotype activation on entrepreneurial intentions’, Journal of Applied Psychology, 93(5): 1053.

Guul, T. S. (2018), ‘The individual-level effect of gender matching in representative bureaucracy’, Public Administration Review, 78(3): 398–408.

Guul, T. S., Villadsen, A. R. and Wulff, J. N. (2019), ‘Does good performance reduce bad behavior? Antecedents of ethnic employment discrimination in public organizations’, Public Administration Review, 79(5): 666–674.

Hackman, J. R., Hackman, R. J. and Oldham, G. R. (1980), Work Redesign (Vol. 2779). Reading, MA: Addison-Wesley.

Haddad, A., Mssassi, S. and Makkaoui, M. (2020), ‘The public service qualitative dimensions from the citizen-customers perspective: a literature review and conceptual model’, International Journal of Innovation and Scientific Research, 49(2): 230–246.

Hadian, D. (2017), ‘The relationship organizational culture and organizational commitment on public service quality; perspective local government in Bandung, Indonesia’, International Review of Management and Marketing, 7: 1.

Hansen, J. A. and Tummers, L. (2020), ‘A systematic review of field experiments in public administration’, Public Administration Review, 80(6): 921–931.

Harrits, G. S. (2019), ‘Stereotypes in context: How and when do street-level bureaucrats use class stereotypes?’, Public Administration Review, 79(1): 93–103.

Hinds, L. and Murphy, K. (2007), ‘Public satisfaction with police: using procedural justice to improve police legitimacy’, Australian & New Zealand Journal of Criminology, 40(1): 27–42.

Ho, C. P., Driscoll, D. M. and Loosbrock, D. L. (1998), ‘Great expectations: the negative consequences of falling short’, Journal of Applied Social Psychology, 28(19): 1743–1759.

Hofstetter, E. and Stokoe, E. (2015), ‘Offers of assistance in politician–constituent interaction’, Discourse Studies, 17(6): 724–751.

Hood, C. (1991), ‘A public management for all seasons?’, Public Administration, 69(1): 3–19.

Houston, D. J. (2000), ‘Public-service motivation: a multivariate test’, Journal of Public Administration Research and Theory, 10(4): 713–728.

Humphrey, S. E., Nahrgang, J. D. and Morgeson, F. P. (2007), ‘Integrating motivational, social, and contextual work design features: a meta-analytic summary and theoretical extension of the work design literature’, Journal of Applied Psychology, 92(5): 1332.

Hung, Y.-H., Huang, M. L. and Chen, K.-S. (2003), ‘Service quality evaluation by service quality performance matrix’, Total Quality Management & Business Excellence, 14(1): 79–89.

Jilke, S., van Dooren, W. and Rys, S. (2018), ‘Discrimination and administrative burden in public service markets: does a public–private difference exist?’, Journal of Public Administration Research and Theory, 28(3): 423–439.CrossRef Google Scholar

John, P. and Johnson, M. (2008), ‘Is there still a public service ethos’, In Park, A., Curtice, J., Thomson, K., Phillips, M., Johnson, M. and Clery, E. (eds.) British Social Attitudes: The 24^th Report, 105–125, London: Sage Publications.CrossRef Google Scholar

Keiser, L. R. (2010), ‘Understanding street-level bureaucrats’ decision making: determining eligibility in the social security disability program’, Public Administration Review, 70(2): 247–257.CrossRef Google Scholar

Keiser, L. R. and Soss, J. (1998), ‘With good cause: bureaucratic discretion and the politics of child support enforcement’, American Journal of Political Science, 42(4): 1133–1156.CrossRef Google Scholar

Keppeler, F., Sievert, M. and Jilke, S. (2022), ‘Increasing COVID-19 vaccination intentions: a field experiment on psychological ownership’, Behavioural Public Policy, 1–20.CrossRef Google Scholar

Komorita, S. S., Parks, C. D. and Hulbert, L. G. (1992), ‘Reciprocity and the induction of cooperation in social dilemmas’, Journal of Personality and Social Psychology, 62(4): 607.CrossRef Google Scholar

Lahey, J. and Beasley, R. (2018), ‘Technical aspects of correspondence studies’, in Gaddis, S. (ed.), Audit Studies: Behind the Scenes with Theory, Method, and Nuance, Cham: Springer, 81–101.CrossRef Google Scholar

Leach, C. W., Carraro, L., Garcia, R. L. and Kang, J. J. (2017), ‘Morality stereotyping as a basis of women's in-group favoritism: An implicit approach’, Group Processes & Intergroup Relations, 20(2): 153–172.CrossRef Google Scholar

Levy, B. (1996), ‘Improving memory in old age through implicit self-stereotyping’, Journal of Personality and Social Psychology, 71(6): 1092.

Levy, B. R., Pilver, C., Chung, P. H. and Slade, M. D. (2014), ‘Subliminal strengthening: improving elders’ physical function over time through an implicit-age-stereotype intervention’, Psychological Science, 25: 2127–2135.

Lewis, G. B. and Frank, S. A. (2002), ‘Who wants to work for the government?’, Public Administration Review, 62(4): 395–404.

Liao, Y. (2018), ‘Toward a pragmatic model of public responsiveness: implications for enhancing public administrators’ responsiveness to citizen demands’, International Journal of Public Administration, 41(2): 159–169.

Lipsky, M. (1984), ‘Bureaucratic disentitlement in social welfare programs’, Social Service Review, 58(1): 3–27.

Lonati, S., Quiroga, B. F., Zehnder, C. and Antonakis, J. (2018), ‘On doing relevant and rigorous experiments: review and recommendations’, Journal of Operations Management, 64: 19–40.

Lubell, M. and Scholz, J. T. (2001), ‘Cooperation, reciprocity, and the collective-action heuristic’, American Journal of Political Science, 45(1): 160–178.

Madon, S., Guyll, M., Aboufadel, K., Montiel, E., Smith, A., Palumbo, P. and Jussim, L. (2001), ‘Ethnic and national stereotypes: the Princeton trilogy revisited and revised’, Personality and Social Psychology Bulletin, 27(8): 996–1010.

Malti, T. and Dys, S. P. (2018), ‘From being nice to being kind: development of prosocial behaviors’, Current Opinion in Psychology, 20: 45–49.

Marx, D. M., Brown, J. L. and Steele, C. M. (1999), ‘Allport's legacy and the situational press of stereotypes’, Journal of Social Issues, 55(3): 491–502.

Mbassi, J. C., Mbarga, A. D. and Ndeme, R. N. (2019), ‘Public service quality and citizen-client's satisfaction in local municipalities’, Journal of Marketing Development and Competitiveness, 13(3): 110–123.

Meier, K. J. and Bohte, J. (2006), Politics and the Bureaucracy: Policymaking in the Fourth Branch of Government. Belmont, CA: Thomson Wadsworth.

Milakovich, M. M. E. (2003), ‘Balancing customer service, empowerment, and performance with citizenship, responsiveness and political accountability’, International Public Management Review, 4(1): 61–83.

Moynihan, D., Herd, P. and Harvey, H. (2015), ‘Administrative burden: learning, psychological, and compliance costs in citizen-state interactions’, Journal of Public Administration Research and Theory, 25(1): 43–69.

Murphy, K. and Tyler, T. (2008), ‘Procedural justice and compliance behaviour: the mediating role of emotions’, European Journal of Social Psychology, 38(4): 652–668.

Neo, S., Bertram, I., Szydlowski, G., Bouwman, R., de Boer, N., Grimmelikhuijsen, S., Charbonneau, É., Moon, M. J. and Tummers, L. (2023), ‘Working 9 to 5? A cross-national analysis of public sector worker stereotypes’, Public Management Review, 1–30. https://doi.org/10.1080/14719037.2023.2254306

Olsen, A. L., Kyhse-Andersen, J. H. and Moynihan, D. (2022), ‘The unequal distribution of opportunity: a national audit study of bureaucratic discrimination in primary school access’, American Journal of Political Science, 66(3): 587–603.

Parasuraman, A., Zeithaml, V. A. and Berry, L. L. (1985), ‘A conceptual model of service quality and its implications for future research’, Journal of Marketing, 49(4): 41–50.

Parasuraman, A., Zeithaml, V. A. and Berry, L. (1988), ‘Servqual: a multiple-item scale for measuring consumer perceptions of service quality’, Journal of Retailing, 64: 12–40.

Parks, C. D. and Rumble, A. C. (2001), ‘Elements of reciprocity and social value orientation’, Personality and Social Psychology Bulletin, 27(10): 1301–1309.

Paul, J., Mittal, A. and Srivastav, G. (2016), ‘Impact of service quality on customer satisfaction in private and public sector banks’, International Journal of Bank Marketing, 34(5): 606–622.

Pelaprat, E. and Brown, B. (2012), ‘Reciprocity: understanding online social relations’, First Monday, 17(10). https://doi.org/10.5210/fm.v17i10.3324

Percival, N. M. and Pulford, B. D. (2020), ‘Do say “thank you”: verbal expressions of politeness and gratitude influence interpersonal perceptions’, The Journal of General Psychology, 147(3): 228–243.

Perry, J. L. (1996), ‘Measuring public service motivation: an assessment of construct reliability and validity’, Journal of Public Administration Research and Theory, 6(1): 5–22.

Perry, J. L. (1997), ‘Antecedents of public service motivation’, Journal of Public Administration Research and Theory, 7(2): 181–197.

Perry, J. L. (2016), ‘Practicing what we preach! Public Administration Review promotes transparency and openness’, Public Administration Review, 77(1): 5–6.

Perry, J. L. and Wise, L. R. (1990), ‘The motivational bases of public service’, Public Administration Review, 50(3): 367–373.

Pinquart, M. and Sörensen, S. (2003), ‘Differences between caregivers and noncaregivers in psychological health and physical health: a meta-analysis’, Psychology and Aging, 18(2): 250.

Pongsupap, Y. and Van Lerberghe, W. (2006), ‘Choosing between public and private or between hospital and primary care: responsiveness, patient-centredness and prescribing patterns in outpatient consultations in Bangkok’, Tropical Medicine & International Health, 11(1): 81–89.

Raaphorst, N., Groeneveld, S. and Van de Walle, S. (2018), ‘Do tax officials use double standards in evaluating citizen-clients? A policy-capturing study among Dutch frontline tax officials’, Public Administration, 96(1): 134–153.

Rad, M. S. and Ginges, J. (2018), ‘Folk theories of nationality and anti-immigrant attitudes’, Nature Human Behaviour, 2(5): 343–347.

Régner, I., Thinus-Blanc, C., Netter, A., Schmader, T. and Huguet, P. (2019), ‘Committees with implicit biases promote fewer women when they do not believe gender bias exists’, Nature Human Behaviour, 3(11): 1171–1179.

Reisig, M. D. and Parks, R. B. (2003), ‘Neighborhood context, police behavior and satisfaction with police’, Justice Research and Policy, 5(1): 37–65.

Resh, W. G., Marvel, J. D. and Wen, B. (2018), ‘The persistence of prosocial work effort as a function of mission match’, Public Administration Review, 78(1): 116–125.

Responsive. Merriam-Webster.com Dictionary. Merriam-Webster. https://www.merriam-webster.com/dictionary/responsive. Accessed 2 Feb. 2024.

Rölle, D. (2017), ‘What makes citizens satisfied? The influence of perceived responsiveness of local administration on satisfaction with public administration’, Journal of Social and Administrative Sciences, 4(1): 1–13.

Sanders, M., Snijders, V. and Hallsworth, M. (2018), ‘Behavioural science and policy: where are we now and where are we going?’, Behavioural Public Policy, 2(2): 144–167.

Sheldon, K. M. (1999), ‘Learning the lessons of tit-for-tat: even competitors can get the message’, Journal of Personality and Social Psychology, 77(6): 1245.

Shih, M., Pittinsky, T. L. and Ambady, N. (1999), ‘Stereotype susceptibility: identity salience and shifts in quantitative performance’, Psychological Science, 10(1): 80–83.

Shih, M., Pittinsky, T. L. and Ho, G. (2012), ‘Stereotype boost: positive outcomes from the activation of positive stereotypes’, in Inzlicht, M. and Schmader, T. (eds), Stereotype Threat: Theory, Process, and Application, New York: Oxford University Press, 141–156.

Smith, J. L. and Johnson, C. S. (2006), ‘A stereotype boost or choking under pressure? Positive gender stereotypes and men who are low in domain identification’, Basic and Applied Social Psychology, 28(1): 51–63.

Soss, J., Fording, R. C. and Schram, S. F. (2011), ‘The organization of discipline: from performance management to perversity and punishment’, Journal of Public Administration Research and Theory, 21(suppl_2): i203–i232.

Stallybrass, O. (1977), ‘Stereotype’, in Bullock, A. and Stallybrass, O. (eds), The Fontana Dictionary of Modern Thought, London: Fontana/Collins, 601.

Szydlowski, G., de Boer, N. and Tummers, L. (2022), ‘Compassion, bureaucrat bashing, and public administration’, Public Administration Review, 82(4): 619–633.

The London Chamber of Commerce and Industry and Hays (2011), Public to Private: Making the Move. London, UK: Hays.

Thomas, P. and Palfrey, C. (1996), ‘Evaluation: stakeholder-focused criteria’, Social Policy and Administration, 30(2): 125–142.

Thunman, E., Ekström, M. and Bruhn, A. (2020), ‘Dealing with questions of responsiveness in a low-discretion context: offers of assistance in standardized public service encounters’, Administration & Society, 52(9): 1333–1361.

Tyler, T. R. (2003), ‘Procedural justice, legitimacy, and the effective rule of law’, Crime and Justice, 30: 283–357.

Tyler, T. R. (2006), Why People Obey the Law. Princeton: Princeton University Press.

Van de Walle, S. and Bouckaert, G. (2003), ‘Public service performance and trust in government: the problem of causality’, International Journal of Public Administration, 26(8–9): 891–913.

Van Dooren, W. and Jilke, S. (2022), ‘No evidence for ethnic discrimination in the nonprofit sector: an audit study of access to nursing homes’, International Public Management Journal, 27(1): 1–16.

Van Roekel, H., Reinhard, J. and Grimmelikhuijsen, S. (2022), ‘Improving hand hygiene in hospitals: comparing the effect of a nudge and a boost on protocol compliance’, Behavioural Public Policy, 6(1): 52–74.

Van Ryzin, G. G. (2011), ‘Outcomes, process, and trust of civil servants’, Journal of Public Administration Research and Theory, 21(4): 745–760.

Vigoda, E. (2000), ‘Are you being served? The responsiveness of public administration to citizens’ demands: an empirical examination in Israel’, Public Administration, 78(1): 165–191.

Vogel, D. and Willems, J. (2020), ‘The effects of making public service employees aware of their prosocial and societal impact: a microintervention’, Journal of Public Administration Research and Theory, 30(3): 485–503.

Vomfell, L. and Stewart, N. (2021), ‘Officer bias, over-patrolling and ethnic disparities in stop and search’, Nature Human Behaviour, 36(1): 1–10.

Wang, F., Jun, K.-N. and Wang, L. (2021), ‘Bureaucratic contacts and their impact on citizen satisfaction with local government agencies: the influence of expectation’, Public Policy and Administration, 36(1): 41–68.

Wells, W. (2007), ‘Type of contact and evaluations of police officers: the effects of procedural justice across three types of police–citizen contacts’, Journal of Criminal Justice, 35(6): 612–621.

Wheeler, S. C. and Petty, R. E. (2001), ‘The effects of stereotype activation on behavior: a review of possible mechanisms’, Psychological Bulletin, 127(6): 797–826.

Willems, J. (2020), ‘Public servant stereotypes: it is not (at) all about being lazy, greedy and corrupt’, Public Administration, 98(4): 807–823.

Wilson, J. Q. (1989; 2019), Bureaucracy: What Government Agencies Do and Why They Do It. New York, NY: Basic Books.

Wisniewski, M. (1996), ‘Measuring service quality in the public sector: the potential for SERVQUAL’, Total Quality Management, 7(4): 357–365.

Wright, J. E. and Headley, A. M. (2020), ‘Police use of force interactions: is race relevant or gender germane?’, The American Review of Public Administration, 50(8): 851–864.

Zhao, K., Ferguson, E. and Smillie, L. D. (2016), ‘Prosocial personality traits differentially predict egalitarianism, generosity, and reciprocity in economic games’, Frontiers in Psychology, 7: 1137.

Table 1. Selected E-mails

Table 2. Demographic comparison across groups and randomization test for gender and country

Table 3. Summary statistics for results

Table 4. Ordinary least squares regression of stereotype activation

Figure 1. Stereotype activation effects on response rate, information provision and friendliness. Note: The Y-axis, ranging from 0 to 70, shows the sample percentage. Each condition shows 95% error bars.

Figure 2. Gender effects on response rate, information provision and friendliness.

Table 5. Exploratory OLS regression results – effects of gender

Table A1. All tested e-mails

Table B1. Manipulation check sample demographics (n = 718)

Table B2. E-mail descriptives on stereotype activation

Table B3. MANOVA results – multivariate tests

Table B4. MANOVA – tests of between-subjects effects

Table B5. Post-hoc tests (Tukey HSD) for E-mail means comparison

Table C1. Exploratory OLS regression results – country effects

Table D1. Exploratory OLS regression results – effects of gender including interaction effect of gender and stereotype activation
