楼主: jiandong4388
3289 4

[其他] 【独家发布】spambase data-用来做分类的数据 [推广有奖]

贵宾

一无所有的小博

已卖:1992份资源

学科带头人

79%

还不是VIP/贵宾

-

TA的文库  其他...

数据文库

IRT

meta-analysis

威望
1
论坛币
220561 个
通用积分
8539.5183
学术水平
361 点
热心指数
464 点
信用等级
300 点
经验
72058 点
帖子
2255
精华
2
在线时间
1693 小时
注册时间
2013-7-7
最后登录
2025-12-1

楼主
jiandong4388 学生认证  发表于 2017-8-13 21:56:36 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
附件是用来做分类的数据,特此上传上来供大家学习使用:
| SPAM E-MAIL DATABASE ATTRIBUTES (in .names format)
|
| 48 continuous real [0,100] attributes of type word_freq_WORD  
| = percentage of words in the e-mail that match WORD,
| i.e. 100 * (number of times the WORD appears in the e-mail) /  
| total number of words in e-mail.  A "word" in this case is any  
| string of alphanumeric characters bounded by non-alphanumeric  
| characters or end-of-string.
|
| 6 continuous real [0,100] attributes of type char_freq_CHAR
| = percentage of characters in the e-mail that match CHAR,
| i.e. 100 * (number of CHAR occurences) / total characters in e-mail
|
| 1 continuous real [1,...] attribute of type capital_run_length_average
| = average length of uninterrupted sequences of capital letters
|
| 1 continuous integer [1,...] attribute of type capital_run_length_longest
| = length of longest uninterrupted sequence of capital letters
|
| 1 continuous integer [1,...] attribute of type capital_run_length_total
| = sum of length of uninterrupted sequences of capital letters
| = total number of capital letters in the e-mail
|
| 1 nominal {0,1} class attribute of type spam
| = denotes whether the e-mail was considered spam (1) or not (0),  
| i.e. unsolicited commercial e-mail.   
|
| For more information, see file 'spambase.DOCUMENTATION' at the
| UCI Machine Learning Repository: http://www.ics.uci.edu/~mlearn/MLRepository.html


1, 0.    | spam, non-spam classes

word_freq_make:         continuous.
word_freq_address:      continuous.
word_freq_all:          continuous.
word_freq_3d:           continuous.
word_freq_our:          continuous.
word_freq_over:         continuous.
word_freq_remove:       continuous.
word_freq_internet:     continuous.
word_freq_order:        continuous.
word_freq_mail:         continuous.
word_freq_receive:      continuous.
word_freq_will:         continuous.
word_freq_people:       continuous.
word_freq_report:       continuous.
word_freq_addresses:    continuous.
word_freq_free:         continuous.
word_freq_business:     continuous.
word_freq_email:        continuous.
word_freq_you:          continuous.
word_freq_credit:       continuous.
word_freq_your:         continuous.
word_freq_font:         continuous.
word_freq_000:          continuous.
word_freq_money:        continuous.
word_freq_hp:           continuous.
word_freq_hpl:          continuous.
word_freq_george:       continuous.
word_freq_650:          continuous.
word_freq_lab:          continuous.
word_freq_labs:         continuous.
word_freq_telnet:       continuous.
word_freq_857:          continuous.
word_freq_data:         continuous.
word_freq_415:          continuous.
word_freq_85:           continuous.
word_freq_technology:   continuous.
word_freq_1999:         continuous.
word_freq_parts:        continuous.
word_freq_pm:           continuous.
word_freq_direct:       continuous.
word_freq_cs:           continuous.
word_freq_meeting:      continuous.
word_freq_original:     continuous.
word_freq_project:      continuous.
word_freq_re:           continuous.
word_freq_edu:          continuous.
word_freq_table:        continuous.
word_freq_conference:   continuous.
char_freq_;:            continuous.
char_freq_(:            continuous.
char_freq_[:            continuous.
char_freq_!:            continuous.
char_freq_$:            continuous.
char_freq_#:            continuous.
capital_run_length_average: continuous.
capital_run_length_longest: continuous.
capital_run_length_total:   continuous.
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:spam AMBA MBAs Data Base

spambase.zip
下载链接: https://bbs.pinggu.org/a-2304379.html

122.51 KB

需要: 1 个论坛币  [购买]

本附件包括:

  • spambase.DOCUMENTATION
  • spambase.data
  • spambase.names

本帖被以下文库推荐

喜欢经济学,行为学,投资学,行为学,心理学,and 编程

沙发
西门高(未真实交易用户) 发表于 2017-8-13 22:14:23
谢谢分享

藤椅
军旗飞扬(未真实交易用户) 发表于 2017-8-14 06:25:25
谢谢楼主分享!

板凳
145xyz(未真实交易用户) 发表于 2017-8-17 09:27:30

报纸
xiexie1111(真实交易用户) 发表于 2017-8-27 05:22:47
thanks for your sharing, xie xie

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注jltj
拉您入交流群
GMT+8, 2025-12-6 07:44