Introduction
According to the Small Business Administration (SBA), small businesses, defined as businesses with fewer than 500 employees, employ more than half of the entire workforce and account for more than sixty percent of new jobs created in the United States economy, in addition to being responsible for about fifty percent of private domestic gross product (as of 2016).Footnote 1 In this context, it is noteworthy, however, that small business owners in credit markets, in particular minority owners, have difficulties in securing sources of capital for business operations in spite of their economic importance (Ang Reference Ang1991; Ennew and Binks, Reference Ennew and Binks1995; Pettit and Singer, Reference Pettit and Singer1985).
The fact that minority-owned small businesses have difficulty in obtaining loans in credit markets may be attributed to 1) economic and financial differences between minority-owned and non-minority-owned small businesses, 2) lender discrimination against minority owners (based on statistical or preference-based discrimination),Footnote 2 or 3) cultural differences between lenders and borrowers, which may cause lenders to make less effort to collect information on the creditworthiness of minorities than that of White applicants (Calomiris, et al., Reference Calomiris, Kahn and Longhofer1994; Cavalluzzo et al., Reference Cavalluzzo, Cavalluzzo and Wolken2002; Longhofer and Peters, Reference Longhofer and Peters2005).Footnote 3
Discriminating against minority owners who apply for loans in credit markets is prohibited under current fair-lending laws, in particular the Equal Credit Opportunity Act (ECOA) of 1974. According to the United States Department of Justice, ECOA “prohibits creditors from discriminating against credit applications on the basis of race, color, religion, national origin, sex, marital status, [or] age.”Footnote 4 Several studies however, provide evidence that minority-owned businesses face discrimination in loan approval (Blanchard et al., Reference Blanchard, Zhao and Yinger2008; Blanchflower et al., Reference Blanchflower, Levine and Zimmerman2003; Fairlie et al., Reference Fairlie, Robb and Robinson2022).
It follows that studies of lending discrimination for small businesses must be implemented based on statistical approaches (e.g., multivariate regression equations) to detect whether there exists lender discrimination in credit markets. As pointed out by Stephen Ross and John Yinger (Reference Ross and Yinger2002) (also see Blanchard et al., Reference Blanchard, Zhao and Yinger2008, p. 468), studies of lending discrimination based mainly on statistical approaches should address the potential sources of biases: omission of relevant explanatory variables; sample selection issues; endogeneity; and functional misspecification.
However, little research on lending market discrimination has been conducted that deals properly with selection bias problems that may arise in credit markets (Blanchard et al., Reference Blanchard, Zhao and Yinger2008; Cavalluzzo et al., Reference Cavalluzzo, Cavalluzzo and Wolken2002). Phillips Robert and Yezer Anthony (Reference Robert and Anthony1996), in this context, criticize the lending market literature that uses simple single-equation models of credit application rejection and loan default and argue for corrections for sample selection bias. James J. Heckman (Reference Heckman1976, Reference Heckman1979) shows in his seminal work how a nonrandomly selected sample can cause bias in estimating coefficients of interest and how to remedy selection bias problems to get consistent estimates.
This paper adds to the small business lending market literature by 1) combining cross sectional data from the Survey of Small Business Finances (SSBF) for 1993, 1998, and 2003 to get more precise estimates and test statistics with more power; 2) conducting regression analyses with different model specifications to show the robustness of the empirical results; and 3) dealing directly with problems of sample selection based on the Heckman’s approach with particular attention to the assumptions required to justify the identification of the effect (i.e., exclusion restrictions).
Literature Review
The literature on lending market discrimination, which bases its theoretical framework on Gary S. Becker’s model of discriminatory employer preference, has been focused on small business owners’ access to credit markets (Ando Reference Ando1988; Cohn and Coleman, Reference Cohn and Coleman2001; Cole Reference Cole2013; Grown and Bates, Reference Grown and Bates1992). Most studies have analyzed data from either the Characteristics of Business Owners (CBO) or the Survey of Small Business Finances (SSBF).
Early work on lending market discrimination against small business owners used the CBO data to analyze the relationship between small business owners, in particular minority-owned small businesses, and credit accessibility (e.g., loan approval and loan amount). Faith H. Ando (Reference Ando1988), for example, shows that Black-owned small businesses are less likely than White-owned small businesses to obtain commercial bank loans based on an analysis of CBO data. The study estimates a logit model that controls for the characteristics of firms, applicants’ demographic information, and credit risk. Using the same data set as in Ando (Reference Ando1988), and controlling for a similar set of variables, Timothy Bates (Reference Bates1991) finds that Black-owned small businesses receive smaller loan amounts than those owned by Whites.
Ken S. Cavalluzzo and colleagues (Reference Cavalluzzo, Cavalluzzo and Wolken2002) use the 1993 SSBF, which is one of “the most extensive public data sets available on small businesses” (p. 647), to examine differences in loan denial rates and interest rates charged between minority-owned and non-minority-owned small businesses. They find, on the one hand, that there is no evidence that Black-owned small businesses pay more for loans compared to others. Using a logit model, they also find, on the other hand, that Black-owned small businesses are more likely than others to be denied loans even after controlling for a broad set of characteristics.
Other research on small-business lending market discrimination also finds that Black-owned small businesses are more likely than others to be denied loans after controlling for a large number of firm and owner characteristics. For example, David G. Blanchflower and colleagues (Reference Blanchflower, Levine and Zimmerman2003) control for the owner’s education, creditworthiness, type of loan, organizational status, age of firm, firm size, industry, and region to analyze loan denial. An extensive set of variables similar to the ones in Blanchflower et al. (Reference Blanchflower, Levine and Zimmerman2003) are also controlled in existing studies (Blanchard et al., Reference Blanchard, Zhao and Yinger2008; Bostic and Lampani, Reference Bostic, Lampani, Blanton, Williams and Rhine1999; Cavalluzzo and Cavalluzzo, Reference Cavalluzzo and Cavalluzzo1998). On the other hand, other studies find empirical results suggesting that Hispanic- and Asian-owned small businesses may also be discriminated against in credit markets (Cavalluzzo and Wolken, Reference Cavalluzzo and Wolken2005; Cole Reference Cole2008; Coleman Reference Coleman2002).
As pointed out by Robert and Anthony (Reference Robert and Anthony1996) as well as Gangadharrao Maddala and Robert Trost (Reference Maddala and Trost1982), however, using only cases where firms submitted loan applications in lending markets to detect discrimination may produce biased estimates due to selection bias problems (i.e., nonrandomly selected subsample driven by self-selection problems). To deal with self-selection problems that arise in lending markets, Cavalluzzo and colleagues (Reference Cavalluzzo, Cavalluzzo and Wolken2002) use a bivariate probit model to take self-selection into account and find that the correlation between the application decision equation and the outcome equation (i.e., loan denial) is positive and statistically significant. However, they conclude that adding the selection equation does not seriously influence denial estimates. Replicating the work of Cavalluzzo et al. (Reference Cavalluzzo, Cavalluzzo and Wolken2002) using the 1998 SSBF, Lloyd Blanchard and colleagues (Reference Blanchard, Zhao and Yinger2008) also confirm that the selection correction does not alter estimates of the determinants of loan denial.
One of the major limitations in both studies, however, is that regressors used to deal with self-selection are identical in the two equations (i.e., selection and outcome) as mentioned in Cavalluzzo et al. (Reference Cavalluzzo, Cavalluzzo and Wolken2002, p. 673). Although the bivariate sample selection model can be theoretically identified without any restriction on the regressors, it is well known that the results are usually less than convincing due to very high standard errors for coefficients caused by multicollinearity and the functional assumptions that are required (Cameron and Trivedi, Reference Cameron and Trivedi2005; Wooldridge Reference Wooldridge2010).
This paper, in this regard, provides several contributions to the small-business lending market literature. First, unlike the previous literature, this paper uses pooled cross sections—SSBF for 1993, 1998, and 2003—to get more precise estimates and provide statistics with more power. Second, as noted above, although the previous small-business lending market literature addresses self-selection problems, results are generally unconvincing as the same regressors are controlled in the selection and the outcome equation. Here, the paper uses several alternative identifying variables (i.e., exclusion restrictions) and different model specifications for a bivariate probit sample selection model to improve adjustments for sample selection bias.
Theoretical Framework
This paper bases the interpretation of the empirical analysis on theoretical predictions from Becker’s (Reference Becker1971) seminal work on the effects of prejudice in the labor market. In the subsection below, therefore, we briefly review the key implications from his model. In what follows, we apply Becker’s model to a lender’s loan decision process, showing how lender prejudice may influence the likelihood of loan approval for small business owners, in particular, minority-owned small businesses. Last but not least, we will introduce Heckman’s approach, one of the sample selection models used in observational studies, to deal with self-selection problems that arise in lending markets.
Becker’s Discrimination Model
Throughout his analysis, Becker assumes 1) employers may be racially prejudiced, 2) White and Black workers are perfect substitutes in production, 3) a production function is constant returns to scale, and (4) the market is perfectly competitive.Footnote 5 Since employers in Becker’s model have prejudice against hiring Black workers, following Kerwin K. Charles and Jonathan Guryan (Reference Charles and Guryan2008), employer i utility can be written as the function of profit and the disutility ( $ {d}_i $ ) for each Black worker hired:
where $ {\pi}_i\hskip0.35em =\hskip0.35em f\left({L}_W+{L}_B\right)-{w}_W{L}_W-{w}_B{L}_B $ is the employer’s profit; $ {w}_W $ and $ {w}_B $ are White and Black wages, respectively; $ {L}_W $ and $ {L}_B $ are the number of White and Black workers hired by the employer; and $ f\;\left(\bullet \right) $ is the production function, assumed constant returns to scale. Since it is assumed that the firm chooses its inputs (here, the number of White and Black workers) to maximize the employer’s utility, the first-order conditions for the hiring of White and Black workers, respectively, can be written:
Since White and Black workers are assumed to be perfect substitutes in production, for any employer who hires both Black and White workers, it follows that:
Equation (2) means that the employer hires either White or Black labor up to the point at which its marginal product is equal to its marginal impact on the employer’s utility. Since $ {d}_i $ represents the employer’s disutility for hiring a Black worker, Black workers are paid less by $ {d}_i $ . The implications from Becker’s model are that 1) an employer with prejudice behaves as if Black workers’ wages are higher than they actually are, 2) an employer hires White labor if his/her prejudice is such that $ {w}_W $ < $ {w}_B+{d}_i $ and vice versa, and 3) in the labor market, the allocation of either White or Black labor to firms is not random. In the next subsection, we will carry the implications from Becker’s model over to small-business lending markets. In particular, we will consider the lender’s decision process.
Loan Decision ProcessFootnote 6
As described above, Becker’s model assumes an employer with prejudice against hiring, for example, Black workers, and employer utility maximization affects relative earnings of racial groups. Carrying this idea over to small-business lending markets, we can assume a lender has prejudice against approving loan applications from minority-owned small businesses. The lender’s objective is to approve loan applications that can “provide a higher return than other potential uses of the capital” (Blanchard et al., Reference Blanchard, Zhao and Yinger2008, p. 469), taking account of the lender’s discriminatory preferences.
Defining $ {\pi}^{\ast } $ as the lender’s required profitability threshold, the lender’s decision rule for a loan application is as follows:
where $ \pi $ indicates the profit that can be expected from a loan application, which is determined with full information on the loan application. As pointed out by Ross and Yinger (Reference Ross and Yinger2002), however, it is almost inevitable that lenders have incomplete information on loan applicants and are unable to predict loan performance with certainty. In this context, they must estimate loan profitability based on rules of thumb, their past experience, and so on. Therefore, in practice, the loan decision process can be written as follows:
where $ {\pi}^E $ is an estimated loan profitability derived from a lender’s incomplete information based on a loan performance. If we assume that a lender uses limited information on the characteristics of the applicant (A), the firm (F), and the loan (L) in the loan decision process and has prejudice against approving loan applications from minority-owned small businesses, Equation (5) can be changed to the following loan decision rule if a borrower is a member of a certain ethnic group (e.g., Black):
where M is a dummy with value of 1 for members of this ethnic group and d is the disutility the lender experiences if a member of that ethnic group is given a loan. If we assume that the actual loan estimate of profitability has a linear functional form and a normally distributed error term, this estimate can be written as:
Since the actual decision rule creates two exclusive outcomes, loan denial (D = 1) or loan approval (D = 0), Equations (6) and (7) imply a probit model, which can be used to analyze the functional relationship between the likelihood that a loan application is denied and the characteristics of interest (i.e., A, F, L, and M).
where $ \Phi $ ( $ \bullet $ ) is the standard normal cumulative distribution function such that 0 $ <\Phi $ ( $ \bullet $ ) $ < $ 1.
Equation (8) shows that if the coefficient of the race dummy variable ( $ M $ ) is positive even after controlling for the explanatory variables in a probit model, this suggests the existence of discrimination in small-business lending markets. One of the limitations, however, of using a probit model to detect discrimination in lending markets is that self-selection on the part of applicants in their decision to apply for a loan is ignored, which in turn can cause estimated coefficients to be biased. In the next subsection, therefore, we will set up a new identification strategy that takes into account applicants’ self-selection and corrects for self-selection problems.
Sample Selection Model - Heckman’s Approach
As pointed out in Robert and Anthony (Reference Robert and Anthony1996), much of the lending market literature has used “simple single-equation models of rejection and default” (p. 87) to detect discrimination in lending markets, which ignores problems caused by the sample selection. Although the econometrics literature shows that ignoring the sample selection process can be justified under the assumption that the selection process is solely determined by regressors controlled in a regression equation (i.e., exogeneous sample selection), the assumption may not hold in general, which in turn causes the estimates of coefficients to be biased.
In this context, some of the small-business lending market literature deals with the sample selection process by directly taking into account the loan application process in estimating the loan denial decision (Blanchard et al., Reference Blanchard, Zhao and Yinger2008; Cavalluzzo et al., Reference Cavalluzzo, Cavalluzzo and Wolken2002), but they do not address the issue, for example, that using the same regressors in both the outcome equation and the selection equation depends on the details of specification for identification. In what follows, therefore, we will set up a bivariate probit sample selection model based on the logic of Heckman’s estimator that takes into account the loan denial and the loan application decision jointly, and we will then show how the model deals with sample selection problems.Footnote 7
In the small-business lending market literature, we have no way of getting pure random samples of applications for loans. For example, we may expect that a small business owner is more likely to apply for a loan if the loan approval is more likely. Since small business owners self-select in applying for loans, we can only observe a subset of the population—small business owners who applied for loans—which may not be representative of the underlying population of small businesses. Hence using a selected sample without correcting for the selection can cause parameter estimates to be biased. When we estimate the outcome equation (i.e., loan denial), the loan application decision will be directly taken into account in our estimates of, for example, discrimination in small-business lending markets. The bivariate probit sample selection model, therefore, consists of a selection equation (i.e., whether to apply) and an outcome equation (i.e., loan denial) as follows:
where $ {y}_1^{\ast } $ and $ {y}_2^{\ast } $ are latent variables for loan application and loan denial, respectively. y 1 = 1 means that a small business owner, a potential loan applicant, becomes an actual loan applicant (otherwise $ {y}_1 $ = 0) and $ {y}_2 $ = 1 means that the loan application is denied (otherwise $ {y}_2 $ = 0). Subscripts for individuals are suppressed here. In our estimation strategy, we assume that the two latent variables have specific functional forms as follows:
Here, Equation (11) shows that $ {y}_1^{\ast } $ —whether a small business owner applies for a loan— depends on a set of observed variables $ ({X}_S $ ) and the error term $ ({\varepsilon}_S $ ), and Equation (12) shows that $ {y}_2^{\ast } $ —whether a lender denies the loan application—depends on a set of observed variables $ ({X}_O $ ) and the error term $ ({\varepsilon}_O $ ). Following the standard approach to the bivariate probit sample selection model, we assume that $ {X}_S $ and $ {X}_O $ are exogenous to the error terms and $ {\varepsilon}_j $ follows $ N\left(0,{\sigma}_j^2\right),j=S,O $ .
Reintroducing subscript i to identify business owners, if we do not consider the loan application decision, the conditional probability of being denied a loan (i.e., $ {y}_{2i}=1 $ ) is written as follows:
where $ \Phi $ is the cumulative standard normal distribution function. Likewise, the conditional probability that small business owner i, given $ {X}_i $ , is approved for a loan is written as follows:
Hence likelihood function (L) that is used to estimate parameters of interest in a standard probit model without considering the loan application decision can be written as follows:
The first $ {N}_1 $ observations identify small business owners who are denied loan applications while the latter ( $ N-{N}_1 $ ) small business owners are not denied loan applications (i.e., their loans are approved). For clarity, we introduce subscript i to identify cases. However, this specification does not consider the loan application process jointly.
Since we assumed above that a latent variable $ ({y}_2^{\ast } $ ) has a specific functional form, however, the population regression function for $ {y}_2^{\ast } $ can be written as follows:Footnote 8
Based on the loan application decision in the population applying for the loan, the regression function can be written:
Assuming that $ {\varepsilon}_{Oi} $ and $ {\varepsilon}_{Si} $ are bivariate normally distributed with $ \rho $ correlation coefficient between $ {\varepsilon}_{Oi} $ and $ {\varepsilon}_{Si} $ , then we have:
where $ \phi $ and $ \Phi $ are the standard normal population density function and cumulative density function, respectively. Hence the regression equation with the loan application decision considered jointly is:
where $ {v}^2\hskip0.35em =\hskip0.35em 1+{\rho}^2\lambda \left({X}_S{\beta}_S-\lambda \right) $ .Footnote 9
Equation (19) shows, as proved in Heckman (Reference Heckman1979), that using the selected sample of the underlying population can create a functional misspecification if it does not control for the second term ( $ \rho \lambda $ ) of Equation (19), and will in turn cause parameter estimates to be biased in the regression equation if $ \rho \ne 0 $ . Since we can obtain consistent estimates $ \hat{\lambda} $ and $ \hat{v^2} $ from Equation (12) by using a probit model explaining whether or not a small business owner applies for a loan, we can set up the following regression equation:
where $ \xi \hskip0.35em =\hskip0.35em \frac{\eta }{v} $ and E $ (\xi \left|{X}_s{\beta}_s>0\right)\hskip0.35em =\hskip0.35em 0 $ and $ E\Big({\xi}^2\left|{X}_S{\beta}_S>0\right)\hskip0.35em =\hskip0.35em 1 $ .
Therefore, the likelihood function for estimation of Equation (20) can be written as follows:
where the first $ {N}_1 $ observations include small business owners who applied for loans and whose loan applications were approved. The $ N-{N}_1 $ observations include small business owners who applied for loans, but whose loan applications were denied. The $ M-N $ observations include small business owners who did not apply for loans.
Data
Data used for this study are based on the Federal Reserve Board’s 1993, 1998, and 2003 Survey of Small Business Finances (SSBF), which were conducted by the National Opinion Research Center (NORC) for the Board of Governors. In this survey, small businesses are defined as U.S. domestic for-profit, nonsubsidiary, nonfinancial, nonagricultural, nongovernmental businesses that employ fewer than 500 employees. The firms surveyed in each year’s cross-sectional data form a nationally representative sample of small businesses operating in the United States as of the survey year (Bitler et al., Reference Bitler, Robb and Wolken2001; Cole and Wolken, Reference Cole and Wolken1995; Mach and Wolken, Reference Mach and Wolken2006).
The samples were drawn from the Dun & Bradstreet Market Identifier file that is considered as broadly representative of all businesses in the United States (Mach and Wolken, Reference Mach and Wolken2006). Small businesses in this survey are selected according to a stratified random sample design. The samples were stratified by urban/rural status, census division (i.e., East North Central, East South Central, Middle Atlantic, Mountain, New England, Pacific, South Atlantic, West North Central, and West South Central), and total employment size.
The SSBF samples provide comprehensive information on individual small businesses, including detailed demographic and financial data. For example, the survey includes each firm’s recent borrowing experiences with financial institutions (e.g., loan approval/denial), the firm’s location and primary industry (e.g., service, manufacture), and organizational form (e.g., corporation, partnership). The survey also includes the primary owner’s characteristics, which include personal demographic variables (e.g., race, education), credit history, business experience, and the like. The survey further provides information on the characteristics of the lenders that approved or denied the firm’s loan applications, including type of lender (e.g., commercial bank, savings bank), the lender’s location, the length of the relationship between the lender and the firm, and so on.Footnote 10
More interestingly, the SSBF provides information on different reasons for not applying for loans. Small business owners in the survey can be classified into one of four categories of borrower type: non-borrower, discouraged borrower, approved borrower, and denied borrower.Footnote 11 Non-borrowers are those who didn’t apply for loans because they didn’t need credit while discouraged borrowers are those who didn’t apply for loans because they feared rejection although they needed credit. Likewise, approved borrowers are defined as those whose loan applications are approved while denied borrowers are defined as those whose loan applications are denied.Footnote 12
To increase the sample size and obtain more precise estimates, this research pools three waves of the SSBF data (1993, 1998, and 2003). Looking at Table A2 in the Appendix, the descriptive statistics from the pooled SSBF data are presented by borrower types (e.g., approved, denied borrower) and across the survey years. Including the descriptive statistics, all the regression results presented in the paper use sampling weights, which are designed to take account of the stratified sampling design.
When different cross sections are pooled as in this paper, however, there is a caveat that should be pointed out. Pooling can be justified only insofar as the relationship between the outcome variable and at least some of the explanatory variables remains constant over time (Wooldridge Reference Wooldridge2013). To justify pooling different cross sections, we can check whether similar patterns between variables appear across the survey years. For example, the patterns for the proportion of each borrower type is very similar over time. In particular, regardless of the survey year, non-borrowers are the largest proportion, followed by approved borrowers, discouraged borrowers, and denied borrowers. Other patterns are similar as well. Also, each of the three samples was run separately based on Model 8 below (i.e., fully controlled) in Table 1 to check whether the same results can be observed. These show that the coefficients of the African American dummy variable are within sampling error as is that for White females (in each case relative to White males). We do find some statistically significant differences for the coefficients of Hispanic and other races.Footnote 13
Empirical Results
Descriptive Statistics from the pooled SSBF dataFootnote 14
As presented by Rebel A. Cole (Reference Cole2008), the weighted descriptive statistics in Appendix Table A2 are classified by borrower type: non-borrowers, discouraged borrowers, approved borrowers, and denied borrowers. The pooled SSBF samples used for Table A2 include 12,412 observations in total—4637 from the 1993 SSBF, 3551 from the 1998 SSBF, and 4224 from the 2003 SSBF.Footnote 15 By racial group, the pooled SSBF samples are broken into 2302 minority-owned small businesses (825 African American, 671 Hispanic, and 806 other), 8234 small businesses owned by White males, and 1876 small businesses owned by White females.
Since our approach is to identify the existence of discrimination in small business lending markets, the paper uses extensive information on credit history, firm and owner characteristics, loan and lender characteristics, and geographic characteristics in the pooled SSBF samples. These variables are critical in the sense that lenders’ expected profits on the approved loans are based mainly on the probability of loan repayment. Lenders therefore are expected to assess a firm’s profitability and likelihood of loan repayment based on these variables (Bostic and Lampani, Reference Bostic, Lampani, Blanton, Williams and Rhine1999).
We use the natural log of total sales, profits, and firm net worth to measure firm size, which is shown to be closely related to demand for credit (Jovanovic Reference Jovanovic1982). A firm’s organization status (cooperation, partnership, sole proprietorship) and its industry classification (seven categories) also are used since they are likely to identify differences in borrowing constraints. For example, corporations are anticipated to be more willing to take on debt because of their limited liability protection (Ang Reference Ang1991). Owners’ age, education, and managerial experience are also controlled. Prior research on the relationship between entrepreneurship and small business viability shows that an owner’s education and managerial experience are a measure of the owner’s human capital, which is positively related to a firm survival (Bates Reference Bates1990).
There are other variables that may affect a lender’s loan decision. We use lender type (commercial bank, savings bank, finance companies, other), and the length of the relationship between a small business and a lender. Caren Grown and Timothy Bates (Reference Grown and Bates1992), for example, hypothesize that commercial banks compared to other financial institutions tend to approve larger loans to borrowers, and Cole (Reference Cole2008) argues that specialized lenders such as finance companies and savings associations offer only specialized loans (e.g., equipment loans). Allen N. Berger and Gregory F. Udell (Reference Berger and Udell1995) also show that strong relationships (in part measured by the length of time) between small businesses and lenders increase the likelihood of loans. The definitions of all the variables used in the analyses are presented in Appendix Table A1.
One of the main reasons why the weighted descriptive statistics are presented by borrower type in this paper is that the different characteristics among the borrower types provide us with a sense of how sample selection could occur in small-business lending markets. In this context, descriptive statistics in Table A2 suggest that there exist significant differences in characteristics among borrowers. For example, looking at the proportion of business obligations that are delinquent (i.e., Business Delinquency in Table A2), discouraged borrowers are quite similar to denied borrowers across the survey years—their proportions of delinquency are higher than the other two borrower types (i.e., non-borrowers and approved borrowers). Likewise, looking at the proportions of those who had faced bankruptcy (see Table A2), non-borrowers and approved borrowers are similar to one another across the survey years in that their bankruptcy proportions are lower compared with discouraged and denied borrowers.
Although these findings are not observed for all variables, it is confirmed in general that non-borrowers and approved borrowers are similar to one another while discouraged borrowers and denied borrowers are also similar to one another. We can see that discouraged and denied borrowers have poor credit quality (e.g., bankruptcy), small-size businesses (e.g., sales), less education (e.g., college degree), less business experience, and they are younger.
If we calculate descriptive statistics by racial group, similar patterns also arise (results not presented). For example, looking at the proportion of business obligations that are delinquent, Black-owned small businesses have the highest proportions across the survey years. Compared with White-owned small businesses, minority-owned small businesses are generally disadvantaged in terms of credit quality, business size, education, and so on. These differences among racial groups are also observed in the loan denial rate. The rate of loan denial is about 0.51 for African Americans, 0.26 for Hispanics, 0.26 for others, 0.11 for White males, and 0.16 for White females.
Regression Results
Since the purpose of the paper is to identify the existence of discrimination that may occur in small-business lending markets based on lenders’ loan denial decisions, we will first look at regression results obtained when the choice of whether to apply for a loan is not considered. Next, we will see regression results obtained when this sample selection process is considered jointly with the loan denial decision process.
No Correction for Sample Selection
Regression results in Table 1 are based on the pooled SSBFs.Footnote 17 Model 1 in Table 1 controls race dummy variables and the survey fixed effects (i.e., dummy variables for survey years) only and it shows average marginal effects of minority status in small business lending markets. For example, Model 1 shows that Blacks, Hispanics, and other races face, on average, about 31%, 10%, and 13% greater chance of being denied loans compared to White males (the omitted category), respectively. With more controls, the coefficients associated with minority owned businesses dramatically decline, but they are still statistically significant. Model 2 in Table 1 shows that owners’ credit characteristics seem to explain much of the relationship between race and the chance of being denied a loan. However, Model 6 shows that there is little change in the coefficients when firm industry (e.g., manufacture or transportation) is added to Model 5. In Model 8, we see that the relationship declines relatively little when loan and lender characteristics are controlled.
a This table reports average marginal effects of race dummy variables (e.g., Hispanic) and their robust standard errors. Regarding the full list of variables controlled for here, please see Table A3
Following Blanchard et al. (Reference Blanchard, Zhao and Yinger2008), Model 8 controls for all the variables: owners’ credit histories, firms’ characteristics, owners’ characteristics, geographic characteristics, loan characteristics, and lender characteristics. The results in Model 1 – Model 8 confirm the view that minority-owned small businesses face higher chances of being denied loans compared with White male-owned small businesses, even with a large number of factors controlled.Footnote 18 Our findings are consistent with others in the credit market literature (Blanchard et al., Reference Blanchard, Zhao and Yinger2008). We find the coefficients of Hispanic and other races in our model to be statistically significant whereas others do not, which may be mainly due to our greater sample size, which decreases the standard errors of the coefficients of the race dummy variables.
Previous research adds to regression equations a variable that indicates whether the lender was in the same metropolitan area or county as the firm (Blanchard et al., Reference Blanchard, Zhao and Yinger2008; Petersen and Rajan, Reference Petersen and Rajan1994), but we do not control for this variable in any model specification presented in the paper since there might exist a possibility that a small business owner may have choice regarding the location of the lender, which causes endogeneity.
In the same context, loan characteristics (e.g., loan type) and lender characteristics (e.g., lender type) can also be viewed as endogenous variables that borrowers can choose (Smith and Cloud, Reference Smith, Cloud, Goering and Wienk2018). For estimates not to suffer such bias, it must be the case that, controlling for credit history, owner characteristics, and firm characteristics, unobserved factors in the error term of the loan denial equation must be uncorrelated with loan and lender characteristics that explain the lender’s loan denial decision.Footnote 19
Model specifications presented in Table 1, however, do not consider the sample selection process that occurs in small-business lending markets. More specifically, non-borrowers and discouraged borrowers, who didn’t submit loan applications, are not considered in the lender’s loan decision process. As shown by Jeffrey M. Wooldridge (Reference Wooldridge2010), if the selection process can be solely determined by exogenous variables that are controlled for in the loan denial equations, a standard regression approach such as a probit model can produce consistent estimates regardless of whether the sample selection process is considered jointly. However, since it is a very strong assumption that the factors that determine whether a borrower applies for a loan are the same as those that are controlled in our analyses, we consider corrections for the selection bias problems.
Correction for Sample Selection
As briefly mentioned above, the sample selection process—whether a small business owner applies for a loan—could be related to the lender’s loan denial decision. In other words, since the error term in the outcome equation (i.e., the lender’s denial decision process) is potentially correlated to the error term in the selection equation (i.e., a borrower’s loan application decision), a standard probit approach may not produce consistent estimates. Therefore, we must estimate the outcome equation and the selection equation jointly to obtain consistent estimates of the coefficients of interest.
As pointed out in the empirical literature that uses Heckman’s methods, however, the selection equation should normally have exclusion restrictions in the bivariate probit sample selection model (Cameron and Trivedi, Reference Cameron and Trivedi2005; Wooldridge Reference Wooldridge2010). More specifically, there should be at least one regressor included in the selection equation but not in the outcome equation. Otherwise, the inverse Mills ratio in Equation (18) may be strongly correlated with the other measures in the equation, which may jeopardize the validity of the outcome and the selection equation estimation.
We have implemented an alternative model specification for the bivariate probit sample selection model to see how exclusion restrictions can influence the coefficients of interest. The exclusion restrictions used in this paper assume that the interaction terms of industry and region dummy variables affect the selection equation only but do not affect the outcome equation directly as long as owners’ credit histories, firms’ characteristics, owners’ characteristics, geographic characteristics, industry, loan characteristics, and lender characteristics are controlled.
Our justification for this assumption is that a firm’s decision to apply for loans may be based on considerations that matter for a firm’s productivity and profit, which means that the location and the characteristics of industry to which a firm belongs jointly affect a firm’s behavior. Considering cluster effects promote both competition and cooperation among small businesses in an area, a firm’s decision to apply for a loan may be affected by the interaction of region and firm industry (i.e., the interaction terms to which the exclusion restrictions apply), but a bank’s decision to evaluate loan applications (e.g., loan denial) may be simpler—a bank takes into account region and industry in a simple way, (i.e., in accord with the additive terms in the outcome equation as long as an extensive set of variables are controlled).
Looking at Table 2, for example, the model with Heckman Correction (i.e., Panel A) shows that Black-owned small businesses have, on average, a 15% higher chance of being denied loans compared to White-owned businesses (the omitted category), where the outcome equation is the same as Model 8 in Table 1 (i.e., loan and lender characteristics included in the outcome equation), but the selection equation is considered jointly. Looking at the selection equation—whether to apply for a loan—that is the same as the outcome equation except it does not include loan and lender characteristics, we see that Black-owned small businesses are not more likely to apply for a loan compared to White-owned small businesses.
a The model specification can be interpreted as follows: The outcome equation includes all the independent variables as in Model (8) in Table 1. However, the selection equation excludes loan and lender characteristics, but includes identifying variables - the interaction terms of industry and region dummy variables.
b The coefficients in all selection equations are regression estimates, not marginal effects.
What is interesting in Table 2 is that the effects of selection bias seem to underestimate the marginal effects of Model 8 in Table 1, provided that the model specification with Heckman Correction is correct. Regardless of whether exclusion restrictions are implemented (i.e., Panel B), the model specifications in Table 2 show the same consistent pattern, (i.e., that minorities face higher chances of being turned down for loans). These findings also suggest that selection bias can be ignored, and the use of the simple specification can be justified in estimating the effects of race on loan denial if an extensive set of variables controlled as shown in Model 8 in Table 1.
For comparison, regression results in Table 1 based on a standard probit model and those based on the bivariate probit sample selection models in Table 2 show that the coefficients associated with the different minority groups that are statistically significant in a standard probit model remain statistically significant in the bivariate probit sample selection model as well. Also, the sizes of the coefficients for Black and Hispanic ownership in both models are similar to one another, which is consistent with what Blanchard et al. (Reference Blanchard, Zhao and Yinger2008) found. Therefore, it can be argued from the regression results that the effects of selection bias problems are not large in small business lending markets if an extensive set of independent variables are controlled.
Discussion
The first regression results presented in this paper are based on a standard probit regression approach with a variety of control variables (e.g., credit history, firm and owner characteristics) and they show a consistent pattern of differential denial for minorities regardless of model specification. However, in spite of the consistent regression results observed when various measures are controlled in the standard probit regression approach, since sample selection is not considered, one should be careful not to interpret these results as evidence that minorities are discriminated against in small business lending markets.
The paper implements a bivariate probit sample selection model to consider the outcome and the selection equation jointly. To see how different model specifications—for example, a bivariate probit sample selection model with and without exclusion restrictions—can affect the estimates of interest, this paper implements different bivariate probit sample selection models. As presented in Table 2, the regression results are similar to one another regardless of model specification. More specifically, in the four different bivariate probit sample selection models, the sizes of the coefficients are similar to one another, although the results are not presented here.
Regardless of the methods used to correct for sample selection, the results in Table 2 suggest that selection bias has little effect on estimates of coefficients of interest, although the correction factor is statistically significant in all specifications. We can therefore argue that the simple specification that ignores selection is valid in estimating the effects of race on loan denial.
Summary and Conclusions
The literature on credit market discrimination shows consistent results that can be interpreted as evidence that minority-owned small businesses may be discriminated against compared to their counterparts (i.e., White-owned small businesses) in obtaining loans. This paper adds to the literature on small-business credit markets by using pooled cross sections (i.e., the SSBFs) to get more precise estimates and provide statistics with more power. Compared to prior studies, for example, regression estimates in Table 1 have smaller standard errors. In other words, with the large sample used, this paper tries to provide more precise estimates of coefficients using alternative identification strategies. By using a bivariate probit sample selection model, this paper also shows that regression estimates are much the same regardless of whether exclusion restrictions are used.
Regardless of model specification based on either a standard probit or a bivariate probit sample selection model, the paper shows that there exists a positive race differential in loan denial, which indicates that minority-owned small businesses are more likely to be turned down for loans. However, it should be noted that identification strategies used in this paper have some limitations. First, parametric assumptions are used in model specifications. More specifically, it is assumed that we know a specific functional form of a lender’s loan denial and of a borrower’s loan application decision process. Second, there is a possibility that exclusion restrictions used in our analyses may not be valid. Third, there might be other variables that may not be included in the datasets but affect the likelihood of loan denial and loan application decision, and their omission may bias estimates of the effect of race. For example, if we assume that the owner’s bargaining ability is an important factor that can affect a lender’s loan decision process, is correlated with race, but cannot be observed by a researcher, then the estimated coefficients for minority-owned small businesses will be biased.
Our findings have policy implications. First, as pointed out in Blanchard et al. (Reference Blanchard, Zhao and Yinger2008), higher denial rates for minority-owned small businesses can be interpreted as evidence that lenders discriminate against minority-owned small businesses and, therefore, regulators can assume that racial discrimination exists in small business lending markets unless lenders prove that any remaining racial differences in loan approval can be justified by legitimate business considerations.
Second, since the constraints in access to financial resources greatly impact small business operations, monetary tightening is expected to have a large effect on small business sales and cause a contraction of lending to small businesses. In this case, minority-owned small businesses may have a harder time to obtain a loan during a period of tight money (Gertler and Gilchrist, Reference Gertler and Gilchrist1994). Therefore, federal financial regulatory institutions should be required to help minority-owned small businesses to secure sources of capital for their business operations during economic downturns.
Last but not least, considering difficulty in obtaining sources of capital observed in this research, more financial programs need to be implemented that are designed for minority-owned small businesses to secure financial resources. For example, the U.S. Small Business Association’s Minority-Owned Businesses Development Program provides one-on-one counseling sessions, training workshops, and management assistance to help minority-owned firms finance their businesses.Footnote 20 Further, financial regulatory institutions such as the Consumer Financial Protection Bureau need to monitor financial institutions (e.g., banks) to keep minority-owned small businesses safe from unfair practices observed in financial industries.
Supplementary Materials
To view supplementary material for this article, please visit http://doi.org/10.1017/S1742058X23000012.