data c_g;
set c;
survyear=(dieod-dateotype)/365.25; *survyear = survival time in years = date of death - entry date;
age=(dateotype-BOD)/365.25; *age = age in years = entry date - date of birth;

if survyear=. then die=0;
if survyear^=. then die=1;

if -10<=survyear<0 then groupsy=0;
if 0<= survyear <1 then groupsy=1;
if 1<= survyear <2 then groupsy=2;
if 2<= survyear <3 then groupsy=3;
if 3<= survyear <4 then groupsy=4;
if 4<= survyear <5 then groupsy=5;
if 5<= survyear <6 then groupsy=6;
if 6<= survyear <7 then groupsy=7;
if 7<= survyear <8 then groupsy=8;
if 8<= survyear <9 then groupsy=9;
if 9<= survyear <10 then groupsy=10;
if 10<= survyear <11 then groupsy=11;
if 11<= survyear <12 then groupsy=12;
if 12<= survyear <13 then groupsy=13;
if 13<= survyear <14 then groupsy=14;
if 14<= survyear <15 then groupsy=15;
if 15<= survyear <16 then groupsy=16;
run;
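if then;
else if then;
Building the groups this way is too tedious.
It would be nice if there were something like a BY statement that did the grouping automatically.
The dataset is fairly large: several hundred thousand rows × 100+ variables.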
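
One possible way to avoid the long chain of IF statements, offered as a sketch rather than a definitive answer: because every non-negative bin in the code above is exactly one year wide, the group number can be computed directly with the FLOOR function instead of being tested bin by bin. The version below assumes the same input dataset c and the same variables (dieod, dateotype, BOD), and it assumes the intended bins are exactly the ones listed above: values from -10 up to 0 go to group 0, values from 0 up to 16 go to groups 1 through 16, and anything else (including a missing survyear) leaves groupsy missing.

```sas
data c_g;
    set c;
    survyear = (dieod - dateotype) / 365.25;  /* survival time in years */
    age      = (dateotype - BOD) / 365.25;    /* age in years at entry  */

    /* die = 1 when a survival time exists, 0 when it is missing */
    die = not missing(survyear);

    /* Negative survival times between -10 and 0 share group 0.      */
    /* For 0 <= survyear < 16, floor(survyear) + 1 gives 1 to 16,    */
    /* matching the one-year bins written out by hand.               */
    /* A missing survyear fails both tests, so groupsy stays missing. */
    if -10 <= survyear < 0 then groupsy = 0;
    else if 0 <= survyear < 16 then groupsy = floor(survyear) + 1;
run;
```

This is still a single pass over the data, so the size of the table (several hundred thousand rows) should not be a concern. If the bins ever stop being equal in width, a user-defined format created with PROC FORMAT would be another option to look at.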


