

Explain how the answer is determined, especially the number of observations. Provide calculations. Also provide what other information we can get from this data set, e.g. meaning of R2, adjusted R2 etc.
For this exercise, we use JTRAIN. dta to determine the effect of a job training grant on hours of job training per employee. The basic model for the hours of training per employee in firm i and year t is: hrsempit=β0+δ1d88t+δ2d89t+β1 grant i,t+β2 grant i,t−1+β3log( employ i,t)+αi+ui,t for i=1,…,n and t=1,…,T. All variables are described in the data file. (a) This is a panel dataset. The command xtdescribe shows how the data is organized. Estimate the above equation including firm fixed effects using the command xtreg. Use the variable grant_1 for grant i,t−1. How many firms are used in the FE estimation? How many total observations would be used if each firm had data on all variables (in particular hrsemp) for all three years? Solutions: . xtreg hrsemp grant grant_1 lemploy d88 d89, fe Fixed-effects (within) regression Group variable: fcode R-sq: within =0.4909 between =0.0514 Number of obs =390 Number of groups =135 Obs per group: min = avg= 2.9
Answer: 135 firms are used in the FE estimation. Because there are three years, we would have a total of 405 observations if each firm had data on all variables for all three years. Instead, due to missing data, we can use only 390 observations in the FE estimation. (As an aside, d88 and d89 capture year fixed effects. The year 1987 operates as the reference category in the temporal dimension and is left out of the estimation).