SAS matrix - 经管之家


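#1 (original post)

楚颜错 posted on 2015-8-12 09:47:02

Bounty: 50 forum coins

I want to turn every distinct value of the variables in a dataset into a column of an indicator matrix (the entry is 1 if the observation contains that value and 0 otherwise). The original data are: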

ID	y1	y2	y3
1	A	V	E
2	G		X
3	A	C	R

The desired result after the transformation is:

ID	A	V	E	G	X	C	R
1	1	1	1	0	0	0	0
2	0	0	0	1	1	0	0
3	1	0	0	0	0	1	1

How can I do this? Thanks!


#2 (marked as best answer)

chiant posted on 2015-8-12 09:47:03

Here is my solution.

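/* Sample data; the $2. informats keep the blank y2 for ID 2 */
data a;
   input ID y1 $2. y2 $2. y3 $2.;
   cards;
1 A V E
2 G   X
3 A C R
;
run;

%macro trans_data;
   /* Collect the distinct nonblank values of y1-y3 into the macro
      variables value1, value2, ...; SQLOBS holds how many were found */
   proc sql noprint;
         select y into: value1-  from
         (select distinct y1 as y from a where y1 is not null
            union select distinct y2 as y from a where y2 is not null
            union select distinct y3 as y from a where y3 is not null)
         ;
   quit;
   data a_new;
         set a;
         array yvar y1-y3;
         /* Create one indicator variable per distinct value, set to 0 */
         %do i=1 %to &sqlobs.;
            &&value&i=0;
         %end;
         /* Set the matching indicator to 1 for each value in the row */
         do over yvar;
            select(yvar);
                  %do i=1 %to &sqlobs.;
                        when("&&value&i")  &&value&i=1;
                  %end;
                  otherwise;
            end;
         end;
         drop y1-y3;
   run;
%mend;
%trans_data;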


#3

teqel posted on 2015-8-13 03:21:00

It works, but it is not efficient for big data.
data a;
   input ID y1 $2. y2 $2. y3 $2. ;
   cards;
1 A V E
2 G   X
3 A C R
;
run;
data b (keep=ID A V E G X C R);
   set a;
   array aa{*} A V E G X C R;   /* one indicator per known value */
   array bb{*} y1-y3;
   do i=1 to dim(aa);
      aa[i]=0;
      do j=1 to dim(bb);
         /* vname returns the indicator's variable name, e.g. "A" */
         if bb[j]=vname(aa[i]) then do;
            aa[i]=1;
            leave ;
         end;
      end;
   end;
run;


#4

chiant posted on 2015-8-13 05:16:11

Quoting teqel (2015-8-13 03:21):
It works, but it is not efficient for big data.

Your code is not only inefficient but also inflexible. It's not a good idea to hard-code the data values (A, V, E, ..., R).
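
One way to keep the array approach while avoiding hard-coded values might be to build the list of indicator names from the data first. The following is only a sketch under assumptions: every value is a valid SAS variable name, and the names vals, value, and value_list are made up for illustration and do not come from the thread.

```sas
/* Sample data from the question; column input keeps the blank y2 for ID 2 */
data a;
   input ID 1 y1 $ 3 y2 $ 5 y3 $ 7;
   datalines;
1 A V E
2 G   X
3 A C R
;
run;

/* Stack all nonblank values of y1-y3 into one column (hypothetical data set vals) */
data vals (keep=value);
   set a;
   array yvar {*} y1-y3;
   length value $1;
   do k = 1 to dim(yvar);
      if not missing(yvar[k]) then do;
         value = yvar[k];
         output;
      end;
   end;
run;

/* Put the distinct values into one macro variable, separated by blanks */
proc sql noprint;
   select distinct value into :value_list separated by ' '
   from vals;
quit;

/* Same nested loop as the array approach, but the indicator list comes from the data */
data b (drop=i j y1-y3);
   set a;
   array aa {*} &value_list.;
   array bb {*} y1-y3;
   do i = 1 to dim(aa);
      aa[i] = 0;
      do j = 1 to dim(bb);
         if bb[j] = vname(aa[i]) then do;
            aa[i] = 1;
            leave;
         end;
      end;
   end;
run;
```

The nested loop still compares every value against every indicator on each row, so this only addresses the hard-coding concern, not the efficiency one.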

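#5

sniperhgy posted on 2015-8-13 13:25:52

Quoting chiant (2015-8-13 05:17):
Here is my solution.

Friend, your code is indeed flexible, but it uses DISTINCT, which requires sorting, plus three SELECT ... FROM queries in a row. On somewhat larger data that would also be rather painful...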
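
A possible alternative that avoids one SELECT per variable is to stack the y variables into long format in a single pass and let PROC TRANSPOSE create the indicator columns. This is a sketch, not a benchmarked solution. It assumes that every value is a valid SAS variable name. The names long, wide, result, value, and flag are hypothetical. An ID whose y variables are all blank would drop out of the result.

```sas
/* Sample data from the question; column input keeps the blank y2 for ID 2 */
data a;
   input ID 1 y1 $ 3 y2 $ 5 y3 $ 7;
   datalines;
1 A V E
2 G   X
3 A C R
;
run;

/* One pass over the data: one output row per (ID, nonblank value) */
data long (keep=ID value flag);
   set a;
   array yvar {*} y1-y3;   /* widen this list if there are more variables */
   length value $1;
   flag = 1;
   do k = 1 to dim(yvar);
      if not missing(yvar[k]) then do;
         value = yvar[k];
         output;
      end;
   end;
run;

/* Sort by ID and drop repeated values within an ID,
   because PROC TRANSPOSE rejects duplicate ID values in a BY group */
proc sort data=long nodupkey;
   by ID value;
run;

/* Each distinct value becomes a column; values absent for an ID stay missing */
proc transpose data=long out=wide (drop=_name_);
   by ID;
   id value;
   var flag;
run;

/* Replace missing indicators with 0 (ID is also numeric but is never missing here) */
data result;
   set wide;
   array ind {*} _numeric_;
   do k = 1 to dim(ind);
      if missing(ind[k]) then ind[k] = 0;
   end;
   drop k;
run;
```

The columns of result appear in the order in which values are first encountered after sorting, so they may not match the column order shown in the question.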

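#6

chiant posted on 2015-8-13 16:31:50

Quoting sniperhgy (2015-8-13 13:25):
Friend, your code is indeed flexible, but it uses DISTINCT, which requires sorting, plus three SELECT ... FROM queries in a row. On somewhat larger ...

OK, then if you are so capable, give us a solution for big data.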

#7

楚颜错 posted on 2015-8-14 10:25:08

Quoting teqel (2015-8-13 03:21):
It works, but it is not efficient for big data.

Thank you!

#8

楚颜错 posted on 2015-8-14 10:25:43

Quoting chiant (2015-8-12 09:47):
Here is my solution.

Impressive, though I can't follow it! Thanks.

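#9

楚颜错 posted on 2015-8-14 10:32:36

Quoting sniperhgy (2015-8-13 13:25):
Friend, your code is indeed flexible, but it uses DISTINCT, which requires sorting, plus three SELECT ... FROM queries in a row. On somewhat larger ...

It really is big data~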

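#10

sniperhgy posted on 2015-8-14 11:31:37

Quoting chiant (2015-8-13 16:31):
OK, then if you are so capable, give us a solution for big data.

Please don't try to stir things up ^_^

Besides, it is not only about handling big data: three variables already need three UNIONs, so would ten variables need ten?

Here is my solution; corrections are welcome: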

data a;
   input ID y1 $1. y2 $1. y3 $1.;
   cards;
1 AVE
2 G X
3 ACR
;
run;
%macro CreateCountTable;
   /* List the variables of A other than ID */
   proc contents
      noprint
      data = a
      out = var_of_a(keep = NAME where = (NAME ne "ID"));
   run;
   /* Store their names in VAR1, VAR2, ... and their count in COUNT_OF_VAR */
   data _null_;
      set var_of_a end = eof;
      call symput(strip("VAR") || strip(_N_), strip(NAME));
      if eof then
         call symput(strip("COUNT_OF_VAR"), strip(_N_));
   run;
   /* Store the number of observations of A in OBS_OF_A */
   data _NULL_;
      if 0 then set A nobs = n;
      call symput('obs_of_a',trim(left(put(n, 8.))));
      stop;
   run;
   /* For each observation, build a one-row data set PART_i whose
      variables are that row's nonblank values, each set to 1.
      Note: ID is not carried into PART_i. */
   %do i = 1 %to &obs_of_a.;
      data _NULL_;
         set a(firstobs = &i. obs = &i.);
         %do j = 1 %to &COUNT_OF_VAR.;
            call symput(strip("VALUE") || strip(&j.), strip(&&VAR&j.));
         %end;
      run;
      %put _user_;
      data part_&i.;
         %do j = 1 %to &COUNT_OF_VAR.;
            %if &&VALUE&j. ne %str() %then
               &&VALUE&j. = 1;;
         %end;
      run;
   %end;
   /* Stack the parts and replace missing indicators with 0 */
   data wanted;
      set part_1 - part_&obs_of_a.;
      array myArr _numeric_;
      do over myArr;
         myArr = coalesce(myArr, 0);
      end;
   run;
   /* Clean up the intermediate data sets */
   proc datasets;
      delete part_1 - part_&obs_of_a.;
   run;
   quit;
%mend;
%CreateCountTable;
