[原创博文] 数据比较问题 [推广有奖]

31楼

pinggutu 发表于 2010-4-28 01:53:08

crackman 发表于 2010-4-27 14:32
data a;
input x y z j@;
cards;
1 2 3 5
2 3 4 5
1 2 3 4
8 7 6 5
2 3 4 5
1 2 3 4
9 8 6 5
4 5 6 7
7 8 6 5
2 3 9 8
9 8 6 5
1 0 9 6
8 7 6 5
;
run;
proc sort data=a out=a;
by x y z j;
run;
data b;
set a;
x1=dif(x);
y1=dif(y);
z1=dif(z);
j1=dif(j);
if (x1=y1=z1=j1)=0 then output;
drop x1 y1 z1 j1;
retain n 0;
n=attrn(open('work.b','i'),'nobs');
run;

但是版主，我看运行出来的结果少了一行。你提供的数据有9行不同的数据，我只运行显示出8行不同的数据。
还有就是用select count(distinct *) 来做，编译出错!

32楼

jcjc0602 发表于 2010-4-28 02:50:29

路过~~~~~·

33楼

soporaeternus 发表于 2010-4-28 09:20:18

data aa;
if _N_=1 then do;
declare hash h();
h.definekey("x","y","z","j");
h.definedata("x","y","z","j");
h.definedone();
end;
set a;
rc=h.find();
if rc ^=0 then do;
h.add();
output;
end;
run;

复制代码

热闹热闹，看看对不对

Let them be hard, but never unjust

34楼

rosen123 发表于 2010-4-28 10:26:51

好东西啊，谢谢

35楼

liudeng2005 发表于 2010-4-28 10:45:43

不明白，感觉第一个问题，直接用proc sort 加option nodup就可以搞定
第二个问题，先用symput 计算出最大值
然后在data步中，引用宏变量就可以搞定。

已有 1 人评分	热心指数	收起理由
crackman	+ 1	好的意见建议

总评分: 热心指数 + 1 查看全部评分

36楼

crackman 发表于 2010-4-28 10:55:57

31# pinggutu
少了一个0
if (x1=y1=z1=j1=0)=0 then output;
n=attrn(open('work.b','i'),'nobs');
run;

博客http://crackman.net

37楼

crackman 发表于 2010-4-28 10:59:10

33# soporaeternus
127  data a;
128  input x y z j@;
129  cards;
NOTE: 数据集 WORK.A 有 13 个观测和 4 个变量。
NOTE: “DATA 语句”所用时间（总处理时间）:
   实际时间       0.00 秒
   CPU 时间       0.00 秒

143  ;
144  run;
145  proc sort data=a out=a;
146  by x y z j;
147  run;
NOTE: 有 13 个从数据集 WORK.A 读取的观测。
NOTE: 数据集 WORK.A 有 13 个观测和 4 个变量。
NOTE: “PROCEDURE SORT”所用时间（总处理时间）:
   实际时间       0.00 秒
   CPU 时间       0.00 秒

148  data b;
149  set a;
150  retain n 0;
151  x1=dif(x);
152  y1=dif(y);
153  z1=dif(z);
154  j1=dif(j);
155  if (x1=y1=z1=j1=0)=0 then output;
156  n=attrn(open('work.b','i'),'nobs');
157  run;
NOTE: 有 13 个从数据集 WORK.A 读取的观测。
NOTE: 数据集 WORK.B 有 9 个观测和 9 个变量。
NOTE: “DATA 语句”所用时间（总处理时间）:
  实际时间       0.01 秒
   CPU 时间       0.01 秒

158  data aa;
159
160  if _N_=1 then do;
161
162  declare hash h();
163
164  h.definekey("x","y","z","j");
165
166  h.definedata("x","y","z","j");
167
168  h.definedone();
169
170  end;
171
172  set a;
173
174  rc=h.find();
175
176  if rc ^=0 then do;
177
178       h.add();
179
180       output;
181
182  end;
183
184  run;
NOTE: 有 13 个从数据集 WORK.A 读取的观测。
NOTE: 数据集 WORK.AA 有 9 个观测和 5 个变量。
NOTE: “DATA 语句”所用时间（总处理时间）:
    实际时间       0.29 秒
   CPU 时间       0.03 秒

博客http://crackman.net