- * This code is written in Stata 14MP
- * Run all code together
- *--- Sample data sets ---*
- * Firms' name and address
- clear all
- input id str60 firmname str60 firmadd
- 1 洛阳中品实业有限公司 河南省洛阳市偃师市首阳山镇羊二庄
- 2 全南晶环科技有限责任公司 江西省赣州市全南县车站东路工业园
- 3 福州奋达伟业厨房设备有限公司 福建省福州市闽侯洋里铁岭工业区
- 4 甘肃景丽工贸有限公司 甘肃省白银市景泰县细泉镇兴泉
- end
- save firm, replace
- * Firms' address
- clear all
- input str15 cityname
- 上海市
- 无锡市
- 苏州市
- 杭州市
- 南平市
- 莆田市
- 青岛市
- 广州市
- 三亚市
- 洛阳市
- 赣州市
- 福州市
- 白银市
- end
- save cityname, replace
- *------------------------------*
- use cityname, clear
- gen city = usubinstr(cityname, "市", "", .)
- drop cityname
- sort city
- levelsof city, local(city) // Restore city names in a local macro named city
- use firm, clear
- rename firm* (firm1 firm2)
- reshape long firm, i(id)
- * Since city names can appear either or both from name and address,
- * reshape them into one column is more covenient.
- gen city = ""
- foreach c of local city {
- replace city = "`c'" if strmatch(firm, "*`c'*")
- }
- drop if city == ""
- drop _j firm
- duplicates drop
-
- save firmid_city, replace


雷达卡






京公网安备 11010802022788号







