楼主: evamxn
2088 9

[数据管理求助] 求助大神:stata如何统计当年与往年对比新增加的观测值的数量 [推广有奖]

  • 0关注
  • 0粉丝

本科生

3%

还不是VIP/贵宾

-

威望
0
论坛币
2 个
通用积分
0.7500
学术水平
0 点
热心指数
0 点
信用等级
0 点
经验
85 点
帖子
7
精华
0
在线时间
136 小时
注册时间
2010-6-5
最后登录
2023-7-14

楼主
evamxn 发表于 2022-4-30 23:23:06 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
求助各位大神:我在使用上市公司专利分类号(ipc)统计每年与往年相比新增的ipc类型的数量。比如如下数据,公司00001,2010年申请专利ipc号的类型有G06,G07; 2013年专利号与2010年相比,新增了H04这一类型,则2013年新增为1;2016年与2010、2013年相比没有新增ipc类型,则新增为0。其他公司也是一样的。求教代码应该如何编写?万分感谢!
数据如下:clear
input str12 symbol float year str3 ipc
"000001" 2010 "G06"
"000001" 2010 "G06"
"000001" 2010 "G07"
"000001" 2013 "H04"
"000001" 2013 "G06"
"000001" 2013 "G07"
"000001" 2013 "H04"
"000001" 2013 "G06"
"000001" 2013 "G06"
"000001" 2013 "G06"
"000001" 2016 "G06"
"000001" 2016 "G06"
"000001" 2017 "G06"
"000001" 2017 "G06"
"000001" 2017 "G06"
"000001" 2017 "G06"
"000001" 2017 "H04"
"000001" 2017 "G06"
"000001" 2017 "G06"
"000001" 2017 "G06"
"000001" 2017 "G06"
"000001" 2017 "G06"
"000002" 2002 "E04"
"000002" 2002 "E04"
"000002" 2004 "E06"
"000002" 2004 "A47"
"000002" 2004 "E04"
"000002" 2004 "E06"
"000002" 2004 "E06"
"000002" 2004 "A47"
end





二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Stata 求助大神 tata 观测值 symbol stata 逐年新增观测统计

沙发
雨天的幸福 发表于 2022-5-1 10:57:19
现在还需要吗?我或许可以帮你

藤椅
evamxn 发表于 2022-5-1 11:13:19
雨天的幸福 发表于 2022-5-1 10:57
现在还需要吗?我或许可以帮你
需要需要,求助~

板凳
黃河泉 在职认证  发表于 2022-5-1 21:25:30
根据
  1. clear
  2. input str12 symbol float year str3 ipc
  3. "000001" 2010 "G06"
  4. "000001" 2010 "G06"
  5. "000001" 2010 "G07"
  6. "000001" 2013 "H04"
  7. "000001" 2013 "G06"
  8. "000001" 2013 "G07"
  9. "000001" 2013 "H04"
  10. "000001" 2013 "G06"
  11. "000001" 2013 "G06"
  12. "000001" 2013 "G06"
  13. "000001" 2016 "G06"
  14. "000001" 2016 "G06"
  15. "000001" 2017 "G06"
  16. "000001" 2017 "G06"
  17. "000001" 2017 "G06"
  18. "000001" 2017 "G06"
  19. "000001" 2017 "H04"
  20. "000001" 2017 "G06"
  21. "000001" 2017 "G06"
  22. "000001" 2017 "G06"
  23. "000001" 2017 "G06"
  24. "000001" 2017 "G06"
  25. "000002" 2002 "E04"
  26. "000002" 2002 "E04"
  27. "000002" 2004 "E06"
  28. "000002" 2004 "A47"
  29. "000002" 2004 "E04"
  30. "000002" 2004 "E06"
  31. "000002" 2004 "E06"
  32. "000002" 2004 "A47"
  33. end

  34. // Øyvind Snilsberg
  35. egen tag = tag(symbol ipc)
  36. bysort symbol year: egen wanted = total(tag)
  37. bysort symbol (year): replace wanted = 0 if year == year[1]
复制代码
Øyvind Snilsberg 之建议:

报纸
evamxn 发表于 2022-5-4 10:30:30
黃河泉 发表于 2022-5-1 21:25
根据Øyvind Snilsberg 之建议:
非常感谢黄老师的帮助
黄老师,想再请教您一个问题,如果我想统计过去5年新增的ipc数量,比如,2006年统计2002-2006年的新增ipc数量,2007年统计2003-2007年新增ipc数量,以此类推。应该怎么进一步编辑代码呢?数据如下:
clear
input str6 stkcd str4 year str3 ipc_3
"000002" "2002" "E04"
"000002" "2002" "E04"
"000002" "2004" "A47"
"000002" "2004" "E04"
"000002" "2004" "E04"
"000002" "2004" "E04"
"000002" "2004" "E06"
"000002" "2004" "E04"
"000002" "2004" "E04"
"000002" "2004" "E06"
"000002" "2004" "E04"
"000002" "2004" "E04"
"000002" "2004" "E04"
"000002" "2005" "A01"
"000002" "2005" "E04"
"000002" "2005" "E04"
"000002" "2005" "E06"
"000002" "2005" "E05"
"000002" "2005" "E04"
"000002" "2005" "F23"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "B65"
"000002" "2006" "A47"
"000002" "2006" "A01"
"000002" "2006" "A47"
"000002" "2006" "D06"
"000002" "2006" "E03"
"000002" "2006" "A47"
"000002" "2006" "B65"
"000002" "2006" "A47"
"000002" "2006" "D06"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "E04"
"000002" "2006" "A01"
"000002" "2006" "D06"
"000002" "2006" "A47"
"000002" "2006" "A01"
"000002" "2006" "E04"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "D06"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "D06"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "D06"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "A47"
"000002" "2006" "D06"
"000002" "2006" "A47"
"000002" "2007" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "B28"
"000002" "2008" "E04"
"000002" "2008" "B28"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "B28"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2008" "E04"
"000002" "2009" "G10"
"000002" "2009" "H03"
"000002" "2009" "H05"
"000002" "2009" "A01"
"000002" "2009" "E04"
"000002" "2009" "E04"
"000002" "2009" "B26"
"000002" "2010" "E04"
"000002" "2010" "E04"
"000002" "2010" "E04"
"000002" "2010" "E04"
"000002" "2010" "E04"
"000002" "2010" "E04"
"000002" "2010" "E04"
"000002" "2010" "E04"
"000002" "2010" "B65"
"000002" "2010" "E04"
"000002" "2010" "E04"
end

地板
黃河泉 在职认证  发表于 2022-5-5 07:43:04
evamxn 发表于 2022-5-4 10:30
非常感谢黄老师的帮助
黄老师,想再请教您一个问题,如果我想统计过去5年新增的ipc数量,比如,20 ...
请 ssc install rang
  1. // Part II
  2. clear
  3. input str6 stkcd str4 year str3 ipc_3
  4. "000002" "2002" "E04"
  5. "000002" "2002" "E04"
  6. "000002" "2004" "A47"
  7. "000002" "2004" "E04"
  8. "000002" "2004" "E04"
  9. "000002" "2004" "E04"
  10. "000002" "2004" "E06"
  11. "000002" "2004" "E04"
  12. "000002" "2004" "E04"
  13. "000002" "2004" "E06"
  14. "000002" "2004" "E04"
  15. "000002" "2004" "E04"
  16. "000002" "2004" "E04"
  17. "000002" "2005" "A01"
  18. "000002" "2005" "E04"
  19. "000002" "2005" "E04"
  20. "000002" "2005" "E06"
  21. "000002" "2005" "E05"
  22. "000002" "2005" "E04"
  23. "000002" "2005" "F23"
  24. "000002" "2006" "A47"
  25. "000002" "2006" "A47"
  26. "000002" "2006" "A47"
  27. "000002" "2006" "B65"
  28. "000002" "2006" "A47"
  29. "000002" "2006" "A01"
  30. "000002" "2006" "A47"
  31. "000002" "2006" "D06"
  32. "000002" "2006" "E03"
  33. "000002" "2006" "A47"
  34. "000002" "2006" "B65"
  35. "000002" "2006" "A47"
  36. "000002" "2006" "D06"
  37. "000002" "2006" "A47"
  38. "000002" "2006" "A47"
  39. "000002" "2006" "A47"
  40. "000002" "2006" "A47"
  41. "000002" "2006" "A47"
  42. "000002" "2006" "A47"
  43. "000002" "2006" "E04"
  44. "000002" "2006" "A01"
  45. "000002" "2006" "D06"
  46. "000002" "2006" "A47"
  47. "000002" "2006" "A01"
  48. "000002" "2006" "E04"
  49. "000002" "2006" "A47"
  50. "000002" "2006" "A47"
  51. "000002" "2006" "A47"
  52. "000002" "2006" "D06"
  53. "000002" "2006" "A47"
  54. "000002" "2006" "A47"
  55. "000002" "2006" "A47"
  56. "000002" "2006" "D06"
  57. "000002" "2006" "A47"
  58. "000002" "2006" "A47"
  59. "000002" "2006" "D06"
  60. "000002" "2006" "A47"
  61. "000002" "2006" "A47"
  62. "000002" "2006" "A47"
  63. "000002" "2006" "D06"
  64. "000002" "2006" "A47"
  65. "000002" "2007" "E04"
  66. "000002" "2008" "E04"
  67. "000002" "2008" "E04"
  68. "000002" "2008" "E04"
  69. "000002" "2008" "E04"
  70. "000002" "2008" "E04"
  71. "000002" "2008" "E04"
  72. "000002" "2008" "E04"
  73. "000002" "2008" "E04"
  74. "000002" "2008" "E04"
  75. "000002" "2008" "B28"
  76. "000002" "2008" "E04"
  77. "000002" "2008" "B28"
  78. "000002" "2008" "E04"
  79. "000002" "2008" "E04"
  80. "000002" "2008" "B28"
  81. "000002" "2008" "E04"
  82. "000002" "2008" "E04"
  83. "000002" "2008" "E04"
  84. "000002" "2008" "E04"
  85. "000002" "2008" "E04"
  86. "000002" "2009" "G10"
  87. "000002" "2009" "H03"
  88. "000002" "2009" "H05"
  89. "000002" "2009" "A01"
  90. "000002" "2009" "E04"
  91. "000002" "2009" "E04"
  92. "000002" "2009" "B26"
  93. "000002" "2010" "E04"
  94. "000002" "2010" "E04"
  95. "000002" "2010" "E04"
  96. "000002" "2010" "E04"
  97. "000002" "2010" "E04"
  98. "000002" "2010" "E04"
  99. "000002" "2010" "E04"
  100. "000002" "2010" "E04"
  101. "000002" "2010" "B65"
  102. "000002" "2010" "E04"
  103. "000002" "2010" "E04"
  104. end

  105. egen tag = tag(stkcd ipc_3)
  106. bys stkcd year: egen wanted = total(tag)
  107. bys stkcd (year): replace wanted = 0 if year == year[1]

  108. duplicates drop stkcd year wanted, force
  109. destring year, replace
  110. rangestat (sum) wanted (count) wanted, interval(year -4 0) by(stkcd)
复制代码
estat 并据以更改:

7
evamxn 发表于 2022-5-19 17:11:05
黃河泉 发表于 2022-5-5 07:43
请 ssc install rangestat 并据以更改:
感谢黄老师的耐心帮助,学习啦

8
evamxn 发表于 2022-7-3 19:18:27
黃河泉 发表于 2022-5-5 07:43
请 ssc install rangestat 并据以更改:
黄老师您好,我想再请教一个问题,还是以上数据,如果想统计过去五年ipc的非重复值的个数,比如,2006年的过去5年(2002-2006)非重复值的ipc有E04,A47,E06,A01,E05,F23,B65,D06,E03,那么非重复ipc数量就是9。代码应该是什么样的呢?谢谢黄老师!

9
心情不错吼 发表于 2022-10-8 20:38:14
黃河泉 发表于 2022-5-5 07:43
请 ssc install rangestat 并据以更改:
黄老师您好,我想请教问题的问题与楼上同学类似,数据格式和内容也基本相同。
问题描述如下:统计过去五年ipc的非重复值的个数,比如,2006年的过去5年(2002-2006)非重复值的ipc有E04,A47,E06,A01,E05,F23,B65,D06,E03,那么非重复ipc数量就是9。
请问代码应该如何编写呐,万望黄老师可以指点一二,感谢感谢万分感谢老师!

10
浅笑熠熠 发表于 2022-12-8 11:22:25
求问楼主,这个公司IPC分类号的数据从哪里查?

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群
GMT+8, 2026-1-7 04:03