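My data cover the outward direct investment location choices of 758 overseas subsidiaries of Chinese firms during 2007-2012, with 68 host countries. The dependent variable is y: 1 for a country that was chosen, 0 otherwise. The focus of the study is the effect of each country's environmental regulation (ER) on FDI location choice. The explanatory variable is each country's ER in the corresponding year, and the control variables include MS (market size), LC (labor cost), and others. Part of the data is shown below: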
year | country_name | country_group | FDIs | ER | MS | LC |
2007 | DZA | Developing | 0 | 64.80649943 | 135803556324.9200 | 3869.3732 |
2008 | DZA | Developing | 0 | 64.87458215 | 170989269622.0370 | 4786.2132 |
2009 | DZA | Developing | 1 | 63.7969275 | 138119949894.6960 | 3796.2456 |
2010 | DZA | Developing | 0 | 64.94757805 | 161777790125.7830 | 4364.9617 |
2011 | DZA | Developing | 0 | 64.67656322 | 198538802309.6370 | 5257.5008 |
2012 | DZA | Developing | 0 | 64.72920449 | 207955103846.4300 | 5403.9992 |
2007 | ARG | Emerging | 0 | 50.70530535 | 260768678129.1800 | 6630.0453 |
2008 | ARG | Emerging | 0 | 50.54413531 | 326582805854.3570 | 8231.2260 |
2009 | ARG | Emerging | 1 | 50.03219891 | 307155125025.1390 | 7674.3424 |
2010 | ARG | Emerging | 1 | 59.24214109 | 368736093173.6750 | 9132.9580 |
2011 | ARG | Emerging | 1 | 60.0566771 | 446044110969.7610 | 10951.5819 |
2012 | ARG | Emerging | 1 | 60.66871967 | 470532788509.7580 | 11452.1290 |
2007 | EGY | Developing | 0 | 75.92609164 | 130477817194.4120 | 1757.7605 |
2008 | EGY | Developing | 0 | 77.85793873 | 162818181818.1820 | 2156.7630 |
2009 | EGY | Developing | 0 | 80.08773545 | 188984088127.2950 | 2461.5309 |
2010 | EGY | Developing | 1 | 80.88393475 | 218887812549.8510 | 2803.5330 |
2011 | EGY | Developing | 1 | 80.25381868 | 235983523193.6300 | 2972.3667 |
2012 | EGY | Developing | 1 | 89.64502232 | 257285845358.2450 | 3187.3126 |
2008 | ARE | Developed | 1 | 24.47889523 | 314844665222.1970 | 46309.9821 |
2009 | ARE | Developed | 0 | 23.36202282 | 270334929437.5120 | 35025.1045 |
2010 | ARE | Developed | 2 | 22.71611436 | 287421927883.3930 | 34048.5302 |
2011 | ARE | Developed | 0 | 23.43729984 | 348594972517.1050 | 39057.8401 |
2012 | ARE | Developed | 0 | 23.46467331 | 348594972517.1050 | |
2007 | ARE | Developed | 1 | 8.928574553 | 258150041410.7600 | 44528.9960 |
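My questions are:
1. With location choice as the dependent variable, suppose 2 firms chose ARE for investment in 2010. How should y be defined in that case? Can it only be 0 or 1, and if so, how do I reflect that 2 firms made that choice?
2. Does conditional logit in Stata require cross-sectional data? Many papers study only a single year. How should I handle a case like mine, with six years (2007-2012)? Should I add year dummies to control for time effects, and how would I set them up? Do I need 6 dummies (is it 2007, is it 2008, and so on)?
3. Conditional logit requires grouping. I have classified the 68 countries into Developed, Developing, and Undeveloped. How do I represent this? Do I need three dummies (is developed, is developing, is undeveloped)? And how do I set the matching variable, that is, what should group be?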
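For reference, here is a minimal sketch of one common data layout for conditional logit. It assumes each investment is treated as its own choice situation, with every candidate country as an alternative. The sketch uses only the 2010 rows for DZA, ARG, EGY, and ARE from the table above. The names `expand_to_choice_sets` and `choice_id` are hypothetical and used only for illustration. Under this assumed layout, y stays 0/1 within each choice set, and the 2 firms that chose ARE simply appear as two separate choice sets.

```python
import pandas as pd

# Toy country-year panel: the 2010 rows for four countries from the table above.
panel = pd.DataFrame({
    "year": [2010, 2010, 2010, 2010],
    "country": ["DZA", "ARG", "EGY", "ARE"],
    "FDIs": [0, 1, 1, 2],
    "ER": [64.94757805, 59.24214109, 80.88393475, 22.71611436],
})


def expand_to_choice_sets(panel):
    """Return one row per (investment, candidate country).

    Assumption: every investment made in a given year could have gone to any
    country observed in that year. y is 1 for the chosen country, 0 otherwise.
    """
    blocks = []
    choice_id = 0  # hypothetical identifier for one investment decision
    for _, year_df in panel.groupby("year"):
        for _, chosen in year_df.iterrows():
            # Repeat once per investment that went to this country.
            for _ in range(int(chosen["FDIs"])):
                choice_id += 1
                block = year_df[["year", "country", "ER"]].copy()
                block["choice_id"] = choice_id
                block["y"] = (block["country"] == chosen["country"]).astype(int)
                blocks.append(block)
    return pd.concat(blocks, ignore_index=True)


long_df = expand_to_choice_sets(panel)
```

In this layout the grouping variable would be the investment identifier (here `choice_id`) rather than the development-level category. Because year is constant within each choice set, a plain year dummy would not vary across the alternatives being compared.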