楼主: 职业xx
30879 287

[数据] 数据挖掘练习数据—Amazon Access Samples Data Set   [推广有奖]

  • 2关注
  • 9粉丝

博士生

60%

还不是VIP/贵宾

-

威望
0
论坛币
1588 个
通用积分
13.0731
学术水平
14 点
热心指数
17 点
信用等级
12 点
经验
6744 点
帖子
337
精华
0
在线时间
260 小时
注册时间
2007-6-8
最后登录
2024-8-1

相似文件 换一批

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
Amazon Access Samples Data Set Abstract: Amazon's InfoSec is getting smarter about the way Access data is leveraged. This is an anonymized sample of access provisioned within the company.
Data Set Characteristics:  Time-Series, Domain-TheoryNumber of Instances:30000Area:Business
Attribute Characteristics:N/ANumber of Attributes:20000Date Donated2011-09-13
Associated Tasks:Regression, Clustering, Causal-DiscoveryMissing Values?N/ANumber of Web Hits:12454

Source:Dataset creator and donator: Ken Montanez email: kenmonta[at]cal.berkeley.edu institution: Information Security, Amazon Corp.
Data Set Information:This is a sparse data set, less than 10% of the attributes are used for each sample. The link is to a '*.tgz' file which contains two files:
[amzn-anon-access-samples-2.0.csv] this file contains the access for users
[amzn-anon-access-samples-history-2.0.csv] this file contains the access history for a given user

Attribute Information:__amzn-anon-access-samples-2.0.csv__
This is a sparse data set containing users and their assigned access. The file contains 4 categories of attributes.
1) [PERSON_{ATTRIBUTE}] This category describes the 'user' who was given access. The [PERSON_ID] column is the primary key column for the file. There is one row per user.
PERSON_ID: id of the user
PERSON_MGR_ID: id of the user's manager
PERSON_ROLLUP_1: user grouping id
PERSON_ROLLUP_2: user grouping id
PERSON_ROLLUP_3: user grouping id
PERSON_DEPTNAME: department desciption id
PERSON_LOCATION: region id
PERSON_BUSINESS_TITLE: title id
PERSON_BUSINESS_TITLE_DETAIL: description id
PERSON_JOB_CODE: job code id
PERSON_COMPANY: company id
PERSON_JOB_FAMILY: job family id

2) [RESOURCE_{ID}] This category of attributes are the resources that a users can possibly have access to. A user will have a 1 in this column if the have access to it otherwise it will be 0.

3) [GROUP_{ID}] - This category of attributes are the groups that a users can possibly have access to. A user will have a 1 in this column if the have access to it otherwise it will be 0.

4) [SYSTEM_SUPPORT_{ID}] - This category of attributes are the system that a user can possibly be supporting. A user will have a 1 in this column if the have can possibly be supporting it, otherwise it will be 0.

__amzn-anon-access-samples-history-2.0.csv__
Permissions Time series data. Here is a short description of the columns:
ACTION: either 'remove_access' or 'add_access'
TARGET_NAME: either the {RESOURCE_ID} or {GROUP_ID}
LOGIN: the id of the user that is obtaining or losing access
REQUEST_DATE: YYYY-MM-DD HH:MM:SS
AUTHORIZATION_DATE: YYYY-MM-DD HH:MM:SS
数据量很大,不建议所有人都下载,谁的机器牛谁下载跑跑吧。
象征性收取1金币!!!

本帖隐藏的内容

数据挖掘练习数据amzn-anon-access-samples.rar (7.03 MB, 需要: 1 个论坛币)





二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:samples Amazon access Sample Acces getting Values access within about

本帖被以下文库推荐

沙发
ideal_ice_kingz 发表于 2011-12-20 21:54:46 |只看作者 |坛友微信交流群
nothing is impossible to be a willing heart

使用道具

藤椅
chensu 发表于 2011-12-21 00:03:40 |只看作者 |坛友微信交流群
哇,看看去啦

使用道具

板凳
caihl05 发表于 2011-12-22 23:59:15 |只看作者 |坛友微信交流群
学习学习

使用道具

报纸
hy_huiyan 发表于 2011-12-24 13:55:10 |只看作者 |坛友微信交流群
红啊好

使用道具

地板
寂寞落叶 发表于 2011-12-25 11:52:46 |只看作者 |坛友微信交流群

使用道具

7
nxf2498 发表于 2011-12-25 19:28:45 |只看作者 |坛友微信交流群
很想看看,正好要去他家面试

使用道具

8
fjxing 发表于 2011-12-26 11:22:10 |只看作者 |坛友微信交流群
thanks for sharing

使用道具

9
sls 发表于 2011-12-26 11:35:18 |只看作者 |坛友微信交流群
thanks

使用道具

看看

使用道具

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群

京ICP备16021002-2号 京B2-20170662号 京公网安备 11010802022788号 论坛法律顾问:王进律师 知识产权保护声明   免责及隐私声明

GMT+8, 2024-11-25 19:19