楼主: Lyzjq888
3380 0

[学习分享] 面板数据SAS实现(1) [推广有奖]

  • 0关注
  • 0粉丝

已卖:422份资源

硕士生

21%

还不是VIP/贵宾

-

威望
0
论坛币
439 个
通用积分
10.2916
学术水平
0 点
热心指数
2 点
信用等级
0 点
经验
1379 点
帖子
166
精华
0
在线时间
66 小时
注册时间
2009-9-12
最后登录
2025-7-23

楼主
Lyzjq888 发表于 2013-1-16 15:25:45 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
INTRODUCTION
In recent years, estimation techniques that use time series cross-sectional (panel) data approaches have become widely used. The PANEL procedure in SAS/ETS software fits classes of linear models that arise when time series and cross-sectional data are combined. It is capable of fitting the following models:
[html] view plaincopyprint?

  • • one-way and two-way models  
  • • random-effects and fixed-effects models  
  • • autoregressive and moving average models  
  •    o Parks method  
  •    o dynamic panel method (GMM)  
  •    o Da Silva method  

• one-way and two-way models• random-effects and fixed-effects models• autoregressive and moving average models   o Parks method   o dynamic panel method (GMM)   o Da Silva methodThis paper uses simulated data to compare these techniques and outline their advantages and disadvantages. The paper starts with a brief theoretical overview of panel data methods. Several examples are given to demonstrate these techniques and their implementation in the PANEL procedure. The PANEL procedure is then compared with other SAS procedures.

PANEL MODELS
In this paper, the term panel refers to pooled data on time series cross-sectional bases. Typical examples of panel data include observations on households, countries, firms, trade, etc. For example, in the case of survey data on household income, the panel is created by repeatedly surveying the same households in different time periods
(years). The model is typically written in the following form:

Even though the dynamic nature of the model reflects the real relationship between the independent and dependent variables more accurately, the introduction of the lagged dependent variable can pose a variety of problems. Given the model structure, the dependent variables y(it) and y(it-1) are functions of  μ(i) , and estimation using ordinary least squares (OLS) will result in biased, inefficient, and inconsistent estimates. As discussed by Baltagi (1995), other models specifically designed for panel data also suffer from efficiency issues. This paper uses the PANEL procedure to demonstrate issues that might arise with a model that is dynamic in nature. ODS Graphics plots are used to
demonstrate results from various models.
DATA GENERATING PROCESS
The following AR(1) panel data generating process (DGP) was adopted from Bond, Bowsher, and Windmeijer (2001). One hundred cross sections, each containing six time periods, were generated. The DGP can be described as follows:

[html] view plaincopyprint?

  • data two;  
  • delta = 0.4;  
  • array x[6];  
  • do i=1 to 100;/*i从1到100*/  
  •    e_i = 4*rannor(1234);/*N(O,4)*/  
  •    mu_i = 36*rannor(32444);/*N(0,36)*/  
  •    do t=1 to 6;/*t从1到6*/  
  •       do k = 1 to 6;  
  •          x[k] = 3+4*(rannor(58785));/*系数β=3*/  
  •       end;  
  •    if t = 1 then   
  •       y = mu_i/(1-delta) + x1 + 5*x2 +10*x3 + e_i;/*β=5*/  
  •    else do;  
  •       v_it = 4*rannor(34454);  
  •       y = delta * y_t1 + mu_i + x1 + 5*x2 +10*x3 + v_it;  
  •    end;  
  •    output;  
  •    y_t1 = y;  
  •    end;  
  • end;  
  • run;  

data two;delta = 0.4;array x[6];do i=1 to 100;/*i从1到100*/   e_i = 4*rannor(1234);/*N(O,4)*/   mu_i = 36*rannor(32444);/*N(0,36)*/   do t=1 to 6;/*t从1到6*/      do k = 1 to 6;         x[k] = 3+4*(rannor(58785));/*系数β=3*/      end;   if t = 1 then       y = mu_i/(1-delta) + x1 + 5*x2 +10*x3 + e_i;/*β=5*/   else do;      v_it = 4*rannor(34454);      y = delta * y_t1 + mu_i + x1 + 5*x2 +10*x3 + v_it;   end;   output;   y_t1 = y;   end;end;run;
MODEL ESTIMATION
When introduced to a new method, users are likely to compare it with other estimation frameworks and techniques that are available. In this section, the generated data are used and several different models are estimated. We start with a simple OLS model that ignores the time series cross-sectional nature of the data by using the REG procedure.

[html] view plaincopyprint?

  • ods graphics on;  
  • proc reg data = two plot(unpackpanel)=all;  
  • model y = x1 x2 x3 /noint;  
  • run;  
  • ods graphics off;  

ods graphics on;proc reg data = two plot(unpackpanel)=all;model y = x1 x2 x3 /noint;run;ods graphics off;The model is estimated for three different cross-sectional error specifications, and the new PLOT option is used to obtain fit diagnostics for residuals (Figure 1).


Using the following statements, one-way fixed- and random-effects models are estimated using the PANEL procedure for the same three cross-sectional error specifications:

[html] view plaincopyprint?

  • ods graphics on;  
  • proc panel data = two plot=all;  
  • id i t;  
  • model y =x1 x2 x3 /fixone ranone noint;  
  • run;  
  • ods graphics off;  

ods graphics on;proc panel data = two plot=all;id i t;model y =x1 x2 x3 /fixone ranone noint;run;ods graphics off;

The PLOT=ALL option is used to obtain two diagnostic panels to examine the fit of the model. The panels for the oneway random-effects model are presented in Figures 2 and 3. The first panel was created using all 100 cross sections; the second panel depicts only the first 10 cross sections.


Since the model specification in Equation (2) includes a lagged dependent variable,One-way to correct for the inefficiencies is by using GMM for panel models developed by Arellano and Bond (1991), as follows :

[html] view plaincopyprint?

  • ods graphics on;  
  • proc panel data=two plot(unpackpanel)=all;  
  • id i t;  
  • instrument depvar exogenous = (x4 x5 x6);  
  • model y =  x1 x2 x3 /gmm twostep maxband=5 nolevels noint;  
  • run;  
  • ods graphics off;  

ods graphics on;proc panel data=two plot(unpackpanel)=all;id i t;instrument depvar exogenous = (x4 x5 x6);model y =  x1 x2 x3 /gmm twostep maxband=5 nolevels noint;run;ods graphics off;

It can be clearly seen from Equation (2) that the dependent variable y is not exogenous, since its values depend on its previous realizations. Arellano and Bond (1991) show that the dependent variable can still be used as one of the instruments if properly lagged. This is accomplished by using the DEPVAR option in the INSTRUMENT statement.
The INSTRUMENT statement can include other variables that are not correlated with the error term. In this model, x4, x5, and x6 are considered to be purely exogenous and uncorrelated with the error. In other models, it is possible that future values of available instruments are correlated with the error term but their past and current realizations are not. The fact that the past and present realizations are not correlated with the error enables us to use them as instruments with the PREDETERMINED option. The following INSTRUMENT statement is used to describe a model with two exogenous and one predetermined variables:




二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:面板数据 fixed-effect introduction observations relationship 面板 techniques following procedure

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群
GMT+8, 2025-12-24 17:39