Original poster: blueran21

Could an expert please show me how to code this problem? (Level 2)

Level 1
Scenario:
There is a file called “Big_Data” and a file called “Needed_Data”. The “Needed_Data” file contains a list of fields that need to be pulled from the “Big_Data” file. When the code runs, it should create a file called “Output_Data” containing all the needed fields plus the primary key of the “Big_Data” file.
Input  files
File one:
•        File Name:        Big_Data
•        File Type:        SAS dataset
•        Records:        10 million records.
•        Variables:        5 thousand variables per record.
•        Primary key:        Account_number


File two:
•        File Name:        Needed_Data
•        File Type:        SAS dataset
•        Records:        1 to X number.
•        Variables:        1
•        Varname:
o        Keep_list: Contains the name of a single variable that would be on the Big_Data file.  

Example data:
  Keep_list
  Apples
  Oranges         
  Grapes


Processing requirement:
Output file “Output_Data” should contain all the fields that were requested in the “Needed_Data” file plus the primary key.

Output and Usage requirement:
None.

Error handling requirement:
None.

Suggestion:
For now assume the “Needed_Data” file will always contain variables that are on the “Big_Data” file.
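
One way to approach Level 1 is to read the requested names into a macro variable and pass it to the KEEP= data set option. This is only a minimal sketch: it assumes both data sets sit in the WORK library and that the combined list of names fits within the 65,534-character limit of a macro variable. The macro variable name keep_vars is made up for illustration.

```sas
/* Collect the requested variable names into one space-separated macro variable. */
proc sql noprint;
    select Keep_list
        into :keep_vars separated by ' '
        from Needed_Data;
quit;

/* KEEP= on the input data set means only the needed columns are brought
   into the DATA step, which matters with 5,000 variables per record. */
data Output_Data;
    set Big_Data (keep=Account_number &keep_vars);
run;
```

Putting KEEP= on the SET statement rather than using a KEEP statement is a deliberate choice here: the unwanted variables never enter the DATA step at all.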

Level 2

All requirements identical to Level 1 except for the following changes.

Input  files
File two:
•        File Name:        Needed_Data
•        File Type:        SAS dataset
•        Records:        1 to X number.
•        Variables:        3
•        Varname:       
o        Keep_list: Name of a single variable that is on the “Big_Data” file.
o        Where_list: The expected value of the variable in the keep list.
o        Rename_list: The name the variable should have in the “Output_Data” file.

Example data:
  Keep_list     Where_list    Rename_list
  Apples        Red           Ambrosia
  Oranges       Orange
  Grapes        Green         Seedless

Processing requirement:
Output file “Output_Data” should contain all the fields that were requested in the “Needed_Data” file plus the primary key. The output fields should be renamed where a rename was requested.

Example Output_data:
  Account_number
  Ambrosia
  Oranges
  Seedless
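
The rename step might be sketched as follows. It assumes both data sets are in the WORK library and, for the moment, that every requested field exists on “Big_Data”. The macro name build_output and the macro variables keep_vars and rename_pairs are hypothetical. Where_list is not used in this sketch, because the processing requirement does not say how the expected values should be applied.

```sas
%macro build_output;
    %local keep_vars rename_pairs;

    proc sql noprint;
        /* Original names of every requested field. */
        select Keep_list
            into :keep_vars separated by ' '
            from Needed_Data;

        /* old=new pairs, only for rows that ask for a rename. */
        select catx('=', Keep_list, Rename_list)
            into :rename_pairs separated by ' '
            from Needed_Data
            where Rename_list is not missing;
    quit;

    data Output_Data;
        set Big_Data (keep=Account_number &keep_vars);
        /* Emit a RENAME statement only when at least one rename was requested. */
        %if %length(&rename_pairs) %then %do;
            rename &rename_pairs;
        %end;
    run;
%mend build_output;

%build_output
```

With the example data above, rename_pairs would resolve to Apples=Ambrosia Grapes=Seedless, while Oranges keeps its original name.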

Output and Usage requirement:
None.

Error handling requirement:
Do not expect every field requested in the “Needed_Data” file to be on the “Big_Data” file. If a field is missing, it should not show up on the “Output_Data” file, and a note should be added to the log indicating that the data field was not available. The code should then continue with the remainder of the fields.

Suggestion:
Do not assume the “Needed_Data” file will always contain variables that are on the “Big_Data” file.
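
The error handling requirement could be met by comparing the requested names against DICTIONARY.COLUMNS before building any keep or rename list. The sketch below assumes “Big_Data” is in the WORK library; the table names Big_Vars, Needed_Valid and Needed_Missing are hypothetical.

```sas
proc sql noprint;
    /* Names of the variables that actually exist on WORK.BIG_DATA. */
    create table Big_Vars as
        select upcase(name) as var_name
        from dictionary.columns
        where libname = 'WORK' and memname = 'BIG_DATA';

    /* Requested rows whose field is present on Big_Data. */
    create table Needed_Valid as
        select *
        from Needed_Data
        where upcase(Keep_list) in (select var_name from Big_Vars);

    /* Requested rows whose field is absent from Big_Data. */
    create table Needed_Missing as
        select Keep_list
        from Needed_Data
        where upcase(Keep_list) not in (select var_name from Big_Vars);
quit;

/* One log note per unavailable field; processing then continues. */
data _null_;
    set Needed_Missing;
    put 'NOTE: Requested field ' Keep_list 'is not available on Big_Data and was skipped.';
run;
```

Needed_Valid can then be used in place of “Needed_Data” when building the keep and rename lists, so a missing field never reaches the KEEP= option and the step does not fail. The comparison is done in upper case because SAS variable names are not case sensitive.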
