Read Marc Nerlove, “Returns to Scale in Electricity Supply” (except paragraphs of equations (6)–(9), the part of section 2 from p.184 on, and Appendix A and C) before doing this exercise. For 145 electric utility companies in 1955, the file NERLOVE.ASC has data on the following:
Column 1: total costs (call it TC) in millions of dollars
Column 2: output (Q) in billions of kilowatt hours
Column 3: price of labor (PL)
Column 4: price of fuels (PF)
Column 5: price of capital .(PK).
They are from the data appendix of his article. There are 145 observations, and the observations are ordered in size, observation 1 being the smallest company and observation 145 the largest. Using the data transformation facilities of your computer software, generate for each of the 145 firms the variables required for estimation. To estimate (1.7.4), for example, you need to generate log(TC), a constant, log(Q), log(PL), log(PK), and log(PF), for each of the 145 firms.
(a) (Data question) Does Nerlove’s construction of the price of capital conform to the definition of the user cost of capital? Hint: Read Nerlove’s Appendix B.4.
(b) Estimate the unrestricted model (1.7.4) by OLS. Can you replicate the estimates in the text?
(c) (Restricted least squares) Estimate the restricted model (1.7.6) by OLS. To do this, you need to generate a new set of variables for each of the 145 firms. For example, the dependent variable is log(TC/PF), not log(TC). Can you replicate the estimates in the text? Can you replicate Nerlove’s results? Nerlove’s estimate ofβ2, for example, is 0.721 with a standard error of 0.0174 (the standard error in his paper is 0.175, but it is probably a typographical error). Where in Nerlove’s paper can you find this estimate? What about the other coefficients?(Warning: You will not be able to replicate Nerlove’s results precisely. One reason is that he used common rather than natural logarithms; however, this should affect only the estimated intercept term. The other reason: the data set used for his results is a corrected version of the data set published with his article.)
As mentioned in the text, the plot of residuals suggests a nonlinear relationship between log(TC) and log(Q). Nerlove hypothesized that estimated returns to scale varied with the level of output. Following Nerlove, divide the sample of 145 firms into five subsamples or groups, each having 29 firms. (Recall that since the data are ordered by level of output, the first 29 observations will have the smallest output levels, whereas the last 29 observations will have the largest output levels.) Consider the following three generalizations of the model (1.7.6):
Model 1: Both the coefficients (β’s) and the error variance in (1.7.6) differ across groups.
Model 2: The coefficients are different, but the error variance is the same across groups.
Model 3: While each group has common coefficients forβ3 andβ4 (price elasticities) and common error variance, it has a different intercept term and a different β2. Model 3 is what Nerlove called the hypothesis of neutral variations in returns to scale.
(d) Estimate Model 1 by OLS. How well can you replicate Nerlove’s reported results? On the basis of your estimates of β2, compute the point estimates of returns to scale in each of the five groups. What is the general pattern of estimated scale economies as the level of output increases? What is the general pattern of the estimated error variance as output increases?
In particular, X is now a block-diagonal matrix. The equivariance restriction can be expressed as . There are now 20 variables derived from the original four regressors. The 145 dimensional vector corresponding to the second variable, for example, has log(Q1),…,log.(Q29) as the first 29 elements and zeros elsewhere. The vector corresponding to the 6th variable, which represents log output for the second group of firms, has log.Q(30),…, log(Q58) for the 30th through 58th elements and zeros elsewhere, and so on.
The stacking operation needed to form the y and X in (*) can be done easily if your computer software is matrix-based. Otherwise, you trick your software into accomplishing the same thing by the use of dummy variables. Define the j –th dummy variable as
Gauss Tip: If x is the F-ratio, the Gauss command cdffc(x,df1,df2) gives the area to the right of F for the F distribution with d f1 and d f 2 degrees of freedom.
TSP Tip: The TSP command to do the same is cdf(f, df1=df1, df2=df2) x. An output of TSP’s OLS command, OLSQ, is @SSR, which is the SSR for the regression.
RATS Tip: The RATS command is cdf ftest x df1 df2. An output of RATS’s OLS command, LINREG, is %RSS, which is the SSR for the regression.
(g) Estimate Model 3. The model is a special case of Model 2, with the hypothesis that the two price elasticities are the same across the five groups. Test the hypothesis at a significance level of 5 percent, assuming normality. (Note: Nerlove’s F-ratio on p183 is wrong.)
As has become clear from the plot of residuals in Figure 1.7, the conditional second moment is likely to depend on log output, which is a violation of the conditional homoskedasticity assumption. This time we do not attempt to test conditional homoskedasticity, because to do so requires large sample theory and is postponed until the next chapter. Instead, we pretend to know the form of the function linking the conditional second moment to log output. The function, specified below, implies that the conditional second moment varies continuously with output, contrary to the three models we have considered above. Also contrary to those models, we assume that the degree of returns to scale varies continuously with output by including the square of log output. Model 4 is
(h) Estimate Model 4 by weighted least squares on the whole sample of 145 firms. (Be careful about the treatment of the intercept; in the equation after weighting, none of the regressors is a constant.) Plot the residuals. Is there still evidence for conditional homoskedasticity or further nonlinearities?