1. Introduction
At the end of the movie Trading Places, the two protagonists trade in frozen concentrated orange juice futures based on a stolen crop report. In their case, it was easy to know how to trade based on the report. However, that is not always the case. The U.S. Department of Agriculture (USDA) releases monthly World Agricultural Supply and Demand Estimates (WASDE), which contain fundamental market information such as the National Agricultural Statistics Service (NASS) crop production reports and ending stocks estimates. Private firms attempt to predict the WASDE reports using satellite imagery, publicly known supply-and-demand estimates, experimental plots, crop tours, and calls to grain firms, as well as many other ways to gather fundamental information (Milonas, Reference Milonas1987).
We develop trading models based on knowing the WASDE report in advance. The results allow estimating the potential trading returns using WASDE report predictions in the days before the report. Thus, the research provides a benchmark model for firms engaged in forecasting prices and estimates of potential per contract returns for firms considering investment in endeavors to predict the crop reports. Because any private forecasts would have errors, the results give an upper bound on what could be earned using private forecasts. Many agricultural economists have studied the effects of USDA crop production reports on commodity prices. Findings by Adjemian (Reference Adjemian2012), Fortenbery and Sumner (Reference Fortenbery and Sumner1993), Isengildina, Irwin, and Good (2006b), Isengildina-Massa et al. (Reference Isengildina-Massa, Irwin, Good and Gomez2008a, Reference Isengildina-Massa, Irwin, Good and Gomez2008b), McKenzie (Reference McKenzie2008), Milonas (Reference Milonas1987), Dorfman and Karali (Reference Dorfman and Karali2015), and Sumner and Mueller (Reference Sumner and Mueller1989) confirm that WASDE reports contain significant fundamental market information that causes market prices to change after the reports are released. The present research adds to this literature by showing how and when portions of the crop supply/demand information in the reports are reflected in market prices in advance of the reports.
WASDE reports are released on a monthly basis and contain information compiled by several USDA agencies such as NASS; the reports offer information on supply and demand and contain two main components, acres to be harvested and expected yield per acre (Vogel and Bange, Reference Vogel and Bange1999). The NASS crop production report and the WASDE report are developed secretly and are usually released between the 9th and 12th of every month (Vogel and Bange, Reference Vogel and Bange1999). For the 2000 crop, for example, the corn and soybean ending stocks in the WASDE reports are for the end of the crop year on September 1, 2001. The monthly reports that we consider for the 2000 crop year begin with projections in May 2000 and end in March 2001. According to the first report (USDA, 1973), these projections are meant to serve as guideline approximations based on the information currently known. Projections vary with every new report as crop, weather, and economic conditions change. Prices move after these reports are released because the information remains secret until the official report is released. The USDA fiercely guards this information to ensure that nothing is leaked before the report date (Vogel and Bange, Reference Vogel and Bange1999).
Price movements are larger following USDA reports (Isengildina, Irwin, and Good, 2006b). Similarly, Isengildina-Massa et al. (Reference Isengildina-Massa, Irwin, Good and Gomez2008a, Reference Isengildina-Massa, Irwin, Good and Gomez2008b) found that after WASDE reports containing NASS crop production reports were released, implied volatility in corn and soybean markets was reduced. Sumner and Mueller (Reference Sumner and Mueller1989) found that the harvest forecast reports released in the months of August, September, and October cause a greater change in corn and soybean market prices than other reports. Known periods of greater market realignments offer traders a chance to capitalize on market movements. Adjemian (Reference Adjemian2012) also found that information from WASDE reports moved prices, especially those that contained NASS crop production reports. Private agencies already release prediction data in the days prior to a USDA report (McKenzie, Reference McKenzie2008). If these forecasts encompassed the information in the WASDE reports and in turn negated the need for WASDE reports, the release would have no effect on price volatility. This is clearly not the case.
A few notable price forecasting models have been developed to predict grain prices based on information in the WASDE reports. Anderson and Tweeten (Reference Anderson and Tweeten1975), Dunn (Reference Dunn1975), Westcott and Hull (Reference Westcott and Hull1985), Westcott and Hoffmann (1999), and Do (Reference Do2010) all used variations of a regression of price against ending stocks or a stocks-to-use ratio. The model estimated by Do used more recent data, which yielded a lower R-squared for the regression than did Anderson and Tweeten (Reference Anderson and Tweeten1975). Do's findings suggest that there has been structural change. More than a third of the U.S. corn crop is being used in ethanol production (Mallory, Irwin, and Hayes, Reference Mallory, Irwin and Hayes2012), which could create a structural change in the relationship between prices and ending stocks. Westcott and Hull (Reference Westcott and Hull1985) and Westcott and Hoffmann (Reference Westcott and Hoffman1999) find market relationships varied with different periods of government legislation. To predict prices, we develop a forecasting model than can quickly adjust to structural changes by using only recent data to calibrate the model.
Previous literature has shown the effects of WASDE reports on prices immediately after the report release. However, past literature has not considered models of how to use predictions to trade. This research considers trading of U.S. corn and soybean commodities spanning the crop years 1975–2012. Trading is simulated on a daily basis. Information provided includes seasonality of monthly trading returns and evidence regarding structural change in the relationship between prices and ending stocks. This research successfully fills a void in the current literature and serves as a relevant guide to the potential profitability of trading based on WASDE report information.
2. WASDE Reports
The WASDE reports are released on a monthly basis and contain information for many different agricultural commodities. They include information at the U.S. and world levels to provide information on the current supply of commodities and expected acreage and yield data. The WASDE report releases part of its information as projections and part as estimates. We use only projections. The projections can change as more information about economic conditions, availability of inputs, and crop and weather conditions are reported. Each new WASDE contains a new projection that represents the most current information gathered during that report month. The projections generally begin between May and April; the exception is the early WASDE reports from 1973 through 1979. During this time period, there was not a standard in which the projection year began, which allowed it to vary among the early months in the calendar year. All trading is in the March futures contract. A single expiration month is used so that the regression model is always predicting the same price. The only month of the calendar year that would not report returns is April because of the March contract expiration and a new WASDE report for the next contract year not becoming available until May.
The WASDE periodically releases reports that contain corrections for previous WASDE information. These corrections are generally issued soon after an initial report and show a small difference in the previously reported numbers and do not provide a large amount of new information to the market. As such, these reports were not included. Of the 533 WASDE reports released from September 17, 1973, to March 8, 2013, that contained U.S. corn projections, 38 contained corrections to corn ending stocks, and the ending stocks changed on average by 4.24% of the original value. The largest changes from the corrections occurred in WASDE reports that were released in the 1970s.
Insengildina, Irwin, and Good (Reference Isengildina, Irwin and Good2006a) provide evidence that when a large adjustment in a previous prediction is needed, the adjustment is not fully made in the next report, and thus projections are smoothed. Such smoothing could reduce the accuracy of our forecasts if the market responded to the true expected values rather than the smoothed reported values. Prices are not deflated because of the short estimation period and the lack of clear theory about what index to use.
The number of observations between WASDE reports can vary depending on the days that the WASDE reports are released. Because of the varying calendar days each month and the untimely release of various WASDE reports, the number of days between reports varies. Observations more than 31 calendar days are not considered because there are too few of them to be representative. Months are crop reporting months, and years are WASDE projection years (for example, 1975 is the 1974/75 crop year).
3. Forecasting Models
Like past research, the key forecasting tool is a regression of price against ending stocks to use. However, past ending stocks regressions have used annual models. Here, we use monthly data and regress the futures price following the report against the projected ending stocks to use from the WASDE report. The models are reestimated for each report so that projections are out of sample from the estimation. Projections are made using the true ending stocks from the next WASDE report. Either a short or long position is always held. Trading returns are calculated daily and are based on settlement prices. The simulated trading holds a long (short) position when the settlement price is below (above) the price forecast. Two different regression approaches are used. The weighted average model uses a weighted average of the regression coefficients using previous reports on this crop year and the regression coefficients using last year's data. The weights are linearly adjusted based on the number of observations in the current crop year. Once 10 observations are available, only the current crop year is used. The intuition behind the approach is to place more weight on the most current data and to give equal weight to data within a crop year. The rolling regression uses data from a set number of recent months to estimate the regression used to forecast prices. The alternative amounts of data used in the rolling regression are 12, 24, 35, 48, and 60 months.
Similar to previous literature (Anderson and Tweeten, Reference Anderson and Tweeten1975; Do, Reference Do2010; Irwin and Good, Reference Irwin and Good2016; Westcott and Hoffmann, Reference Westcott and Hoffman1999; Westcott and Hull, Reference Westcott and Hull1985), price forecasts are obtained by regressing price against a function of ending stocks. The model is reestimated for each report so that the predictions are based on out-of-sample parameter estimates. To account for structural change, the regression model is estimated only using recent data. The ending stocks regressions are estimated using monthly data:
where Fi is the futures settlement price from the day that the ith WASDE report was released, and xi contains observations of the independent variable, which is the ratio of ending stocks (or yield) and use. We also consider yield rather than ending stocks to determine the potential of using only yield forecasts.
The work of Anderson and Tweeten (Reference Anderson and Tweeten1975), Westcott and Hull (Reference Westcott and Hull1985), Westcott and Hoffman (Reference Westcott and Hoffman1999), and Do (Reference Do2010) provided the inspiration for the use of this model. Their models mostly used a stocks-to-use ratio or a ratio of utilization and ending stocks. They use a ratio in an attempt to correct for structural change in the overall size of the market. Adjemian and Smith (Reference Adjemian and Smith2012) focused on parameter estimation rather than forecasting but also used a stocks-to-use ratio. A ratio is not required here because a relatively short time period is used,Footnote 1 but we go ahead and use a stocks-to-use ratio to be consistent with past literature.
The predicted price equation is
where $\ {\widehat {Price}_{m + 1}}$ is the forecasted price at the next WASDE report release, x m + 1 is the ending stocks from the next WASDE report, and m is the current report month. The slope coefficient for the rolling regression model is forced to be nonnegative:
where ${\hat{\beta }_{2m}}$ is the estimated slope coefficient. The intercept coefficient ${\bar{\beta }_{1{\rm{m}}}}$ is calibrated to predict the current report month price without error:
The calibration of the intercept was inspired by a similar approach used by a private market advisory service. The trading rule for day d is
where I() is an indicator function that is 1 when true and −1 when false, and Positionmd is the desired futures position with 1 representing a long position and −1 a short position.
For the moving average method, the slope coefficient for the weighted average model is calculated using a weighted average of the previous year's last regression slope and the slope from the current year. The decision to limit the use of the previous year's regression slope coefficient in this manner is because the model uses only the current year of data. The model's accuracy is low early in the year when there is limited information. Slowly throughout the year the slope is weighted more to the current year than the previous year and in the last month does not use any of the previous slope estimation.
The weighted average model does not provide a price forecast unless two observations are observed in the current WASDE projection year, and the rolling regression model loses some observations at the start of the sample period.
To determine the effectiveness of the models in capturing all possible trading returns, a perfect foresight method was used for comparison. The forecast price of the perfect foresight model is the actual closing price on the day of the next WASDE report.
Note that transaction costs are not included in the calculations. Trades were generated on approximately 10% of the possible days (10.02% for corn and 10.40% for soybeans). Commission costs should be small for this type of trader such as $4 per round turn. Trading is based on closing prices. Liquidity costs would be incurred. With electronic markets, the bid-ask spread is usually one tick ($12.50) (Wang, Garcia, and Irwin, Reference Wang, Garcia and Irwin2014). So the total costs would be $16.50 per trade or 0.33¢/bu. Incurring this cost 10% of the time would give costs of 0.033¢/bu. per day. There could also be additional costs from managing a margin account.
4. Evaluation Procedure
The regressions involve corn and soybeans closing prices directly regressed on the ratio of WASDE U.S. ending stocks to use. This section describes the heteroskedasticity adjustment used in calculating the mean of trading returns and our efforts in trying to identify a relationship between trading returns and days until the report.
The trading returns displayed heteroskedasticity because of increased prices and the subsequent increased volatility of futures prices starting around 2008. An estimated generalized least squares (EGLS) approach was used to estimate mean returns. The first-stage regression includes only an intercept. The log of the squared residuals is regressed on dummy variables for year. The inverse of the exponential of the predicted value from this auxiliary regression gives the predicted variance, which is used to estimate the mean via weighted least squares. Mathematically, the model estimated is
where rjt is returns from month j and year t, μ is the mean, and ε jt ~ N(0, σ2 t ). The logarithm transformation is used to reduce the influence of outliers.
We also studied how returns changed as the days from the next report changed. We considered both linear parametric regressions and nonparametric regressions of returns on days to maturity. The nonparametric procedure used was locally weighted regression (LOESS), which fits a separate weighted regression for each data point where observations close to that data point receive greater weight (Cleveland and Devlin, Reference Cleveland and Devlin1988). The weaknesses of the LOESS procedure are that like other nonparametric procedures, it requires a large number of observations to be precise, and it can be time consuming to calculate as local fitting occurs at every observation of the independent variable. No statistical significance was obtained. We concluded that the results were too noisy to make any conclusions regarding how returns changed with days until the report, and so none of these results are included here.
Note also that our results provide the returns from trading a single contract. The value of knowing the WASDE report in advance would depend on the number of contracts traded. The optimal number of contracts to trade would depend on market liquidity, supply/demand of futures contracts, and the trader's ability to tolerate risk. Further, an actual forecast would have errors, which would reduce its value. Thus, the results here provide only a partial picture of the value of knowing the WASDE report in advance.
5. Results
The trading returns are reported as averages by WASDE report month and WASDE projection year. The results provide information pertaining to which months would yield the highest returns if the WASDE reports could be anticipated.
5.1. Regression Coefficients
The intercepts and slopes of regressions using only data from one entire WASDE projection year are plotted to show the change in the beta coefficients across WASDE projection years. These are displayed in Figures 1 and 2 for corn and Figures 3 and 4 for soybeans. The intercept coefficients as seen in Figures 1 and 3 stay relatively constant throughout the sample period. The exception is that intercepts for corn have increased in recent years to capture the increase in prices. Note that these intercepts are not used in forecasting. The intercept used in forecasting is calibrated so that the regression line travels through the price following the most recent report.
The coefficients for slope tend to vary more over time. As the market structure has changed in recent years, the variance of the slope coefficients has become large compared with earlier observations. To combat this, the weighted average model uses a moving average approach that combines estimates with the data from the year immediately prior and the current estimates. Both corn and soybeans experience a major change in slope coefficients around the year 2008.
These figures show that the market experienced structural changes. The model is able to adjust to these structural changes quickly because of limiting the data used in the calculation of the slope coefficient.
5.2. Aggregate Results
From Table 1, the weighted average and rolling regression models show similar returns. The rolling regression results with differing window widths also provide some support for using limited historical data. With the weighted average model, the average daily return over the entire data set for corn was 0.16 cents per bushel, which is substantially higher than estimated transactions costs of 0.033 cents per bushel. As reported in Table 2, the perfect foresight trading signal made an average daily return of 1.11 cents per bushel. Therefore, the corn model estimation was successful at capturing 14% of available returns.
Note: There are slight differences in the perfect foresight profits because of using different amounts of data for trading.
Returns per bushel are similar for soybeans and corn, but soybeans have smaller t-values because of the higher price of soybeans. Overall, returns are consistently statistically significant for corn, whereas the soybean returns are not. Using yield instead of ending stocks works poorly. Thus, efforts to forecast yield alone are not going to be enough.
The monthly trading returns calculations from Table 3 show that July, September, and October have the highest returns for corn. The perfect foresight trading signal shows that the largest price movements occurred in July and August (Table 4). Pollination occurs in this time period, and that is a critical period of growth for corn. Mattos and Silveira (Reference Mattos and Silveria2016) also found the effects of crop reports were largest in the months preceding the beginning of harvest. The perfect foresight model shows that for corn the largest opportunity for returns is toward the end of the growing season and on into harvest in the fall.
Note: WASDE, World Agricultural Supply and Demand Estimates.
Note: WASDE, World Agricultural Supply and Demand Estimates.
Soybean results are similar to results for corn. With the weighted average model, the average daily return over the entire data set was 0.15 cents per bushel (Table 1). The perfect foresight model returned an average daily return of 2.94 cents per bushel (Table 2). Therefore, the model for soybeans captured only 5% of the total available returns. The model captured less available returns than the corn model, which could be attributable to the soybean price being more influenced by world events. Also, soybeans have more substitutes than does corn. Thus, ending stocks may not reflect as much information for soybeans as they do for corn.
For soybeans, Table 5 shows that the only months with statistically significant returns are September and November. Ignoring the small sample in May, for soybeans the perfect foresight shown in Table 6 showed the largest potential returns in March, July, and August. The perfect foresight models for soybeans do not show a strong seasonal pattern by month. Work by Isengildina-Massa et al. (Reference Isengildina-Massa, Irwin, Good and Gomez2008a, Reference Isengildina-Massa, Irwin, Good and Gomez2008b) and Adjemian (Reference Adjemian2012) showed the importance of the January crop reports because of the information on harvest data contained in them, but we do not find such a result with our approach. Unlike corn, the soybean market is not influenced as heavily by weather and crop conditions during the growing season. Soybeans are grown with varying maturities and do not have a well-defined critical growth period like the time of pollination for corn. Also, U.S. corn is less important in the world market for soybeans than it is in the world market for corn (USDA, Economics Research Service, 2016a, 2016b).
a The rolling regression results differ little by how many months are included as shown in Table 1. We elected to report 36 months for soybeans because it had slightly higher returns than 24 months.
Note: WASDE, World Agricultural Supply and Demand Estimates.
Note: WASDE, World Agricultural Supply and Demand Estimates.
5.3. Returns across Years
Yearly returns are calculated to study structural change and are displayed in Figures 5 and 6. Sudden changes in market structure because of policy changes, the economic climate, or weather patterns can dramatically affect the returns from a trading model. An increase in slope coefficients from the weighted average model in Figures 2 and 4 coincides with an increase in the volatility of average daily returns. The volatility of recent returns has been high, with the corn model doing well in recent years (Figure 5), whereas the soybean model is less consistent (Figure 6).
6. Summary and Conclusions
This research determines the potential trading returns from perfectly predicting WASDE reports on a daily basis for U.S. corn and soybeans. Any WASDE report forecast available to a trader would have errors, and so the trading returns here represent an upper bound. The WASDE reports used include projections that span from May to March of the next calendar year.
To calculate returns, two models are used to forecast price on the day of a report release. To limit the effect of structural change, one model uses a weighted average to calculate the slope coefficient on the model regression. New regressions are developed for every month, and the intercept coefficient is calibrated so that the most recent month's price is estimated without error. Then, a trading signal determines the buy-sell position depending on whether the previous day's closing price was above or below the price forecast. With the rolling regression model, the number of observations included in the regression is varied over 12, 24, 36, 48, and 60 observations. Similar to the weighted average model, the rolling regression model produces a price forecast that is used to determine the long/short position.
The returns from the model calculations are weighted using EGLS. This accounts for periods of increased volatility to make the returns comparable across the entire data set. Without this weighting, recent market activity would overshadow previous market conditions.
Corn and soybeans returns do not have the same pattern across WASDE report months. Corn experienced increased returns in the summer months. This is attributable to critical growth periods during the hot summer months, and the weather conditions during this time greatly affect the potential yield of the harvested crop. Returns were also higher in the fall, which could be attributable to information about actual yields becoming available. One observation that can be made is that returns are highest when the crop is at critical growth periods and when harvest information becomes available.
Evidence of structural change is observed for corn and soybeans when the average daily returns were sorted by the WASDE projection year. Increased returns and increased price volatility occur around the years 2007 and 2008. To limit the effect of structural change, the price forecasting models were estimated using only recent data.
As expected, it would be valuable to know the WASDE report in advance, but positive returns would by no means be guaranteed. Although even 31 calendar days away from the report release has positive returns, the returns are noisy, and traders would be subject to extreme risk because of the influence of other information sources causing market movement throughout the month. Knowing only ending stocks does not provide enough predictive power for a forecasted price to capture a majority of the market movement. Price forecasting models with more variables may be needed to account for market movement as ending stocks account for a small, yet very important portion of this movement.