楼主: forex95
4631 7

[学习资料] 有谁可以帮我翻译一下spss对于k均值聚类的初始聚类中心的算法吗? [推广有奖]

  • 2关注
  • 32粉丝

已卖:1572份资源

副教授

19%

还不是VIP/贵宾

-

威望
0
论坛币
5953 个
通用积分
8.9440
学术水平
18 点
热心指数
23 点
信用等级
14 点
经验
14262 点
帖子
421
精华
0
在线时间
779 小时
注册时间
2010-3-6
最后登录
2026-1-21

楼主
forex95 发表于 2012-4-16 14:32:25 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
我找到了初始聚类中心的算法啦,可是我不懂他的意思,有高人可以给解释一下吗?唉,我英语水平差啊。以下是初始聚类中心算法的英文解释。
If minid(xk,Mi)>dmn and d(xk,Mm)>d(xk,Mn), then xk replaces Mn. If minid(xk,Mi)>dmn and d(xk,Mm)<d(xk,Mn), then xk replaces Mm; that is, if the distance between xk and its closest cluster mean is greater than the distance between the two closest means (Mm and Mn), then xk replaces either Mm or Mn, whichever is closer to xk.
If xk does not replace a cluster mean in (a), a second test is made:
Let Mq be the closest cluster mean to xk.
Let Mp be the second closest cluster mean to xk.
If d(xk,Mp)>minid(Mq,Mi), then Mq=xk;
That is, if xk is further from the second closest cluster’s center than the closest cluster’s center is from any other cluster’s center, replace the closest cluster’s center with xk.
At the end of one pass through the data, the initial means of all NC clusters are set. Note that if NOINITIAL is specified, the first NC cases with no missing values are the initial cluster means.
二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:翻译一下 SPSS PSS specified clusters

回帖推荐

kuangsir6 发表于4楼  查看完整内容

Model Parameters The primary calculation in k-means is an iterative process of calculating cluster centers and assigning records to clusters. The primary steps in the procedure are: 1. Select initial cluster centers 2. Assign each record to the nearest cluster 3. Update the cluster centers based on the records assigned to each cluster 4. Repeat steps 2 and 3 until either:  In step 3 ...

本帖被以下文库推荐

沙发
yanziwoaini 发表于 2012-4-16 14:36:06
SPSS统计分析从入门到精通这本书说的很详细,电子版的很好找,论坛有的,加油啊。

藤椅
kuangsir6 发表于 2012-4-16 17:35:03
基本过程:
K-Means 的工作原理是根据数据定义一组起始聚类中心。
然后根据记录的输入字段值,将每个记录分配到与其最相似的聚类中。在分配完所有记录后,
更新聚类中心以反映分配到每个聚类的新记录集。然后再次检查记录,以确定是否应将这些
记录重新分配到不同的聚类中,这个记录分配/聚类迭代过程将一直持续,直到达到最大迭代
次数或一次迭代与下次迭代之间的改变不超过指定阈值为止。

板凳
kuangsir6 发表于 2012-4-16 18:23:30
Model Parameters
The primary calculation in k-means is an iterative process of calculating cluster centers and
assigning records to clusters. The primary steps in the procedure are:
1. Select initial cluster centers
2. Assign each record to the nearest cluster
3. Update the cluster centers based on the records assigned to each cluster
4. Repeat steps 2 and 3 until either:
 In step 3, there is no change in the cluster centers from the previous iteration, or
 The number of iterations exceeds the maximum iterations parameter
Clusters are defined by their centers. A cluster center is a vector of values for the (encoded) input
fields. The vector values are based on the mean values for records assigned to the cluster.

Selecting Initial Cluster Centers
The user specifes k, the number of clusters in the model. Initial cluster centers are chosen using amaximin algorithm:
1. Initialize the first cluster center as the values of the input fields for the first data record.
2. For each data record, compute the minimum (Euclidean) distance between the record and each
defined cluster center.
3. Select the record with the largest minimum distance from the defined cluster centers. Add a new
cluster center with values of the input fields for the selected record.
4. Repeat steps 2 and 3 until k cluster centers have been added to the model.
Once initial cluster centers have been chosen, the algorithm begins the iterative assign/update
process.

             --------来自 IBM SPSS Modeler 14.2 算法指南


已有 1 人评分经验 论坛币 收起 理由
bakoll + 3 + 3 精彩帖子

总评分: 经验 + 3  论坛币 + 3   查看全部评分

报纸
forex95 发表于 2012-4-18 08:58:00
yanziwoaini 发表于 2012-4-16 14:36
SPSS统计分析从入门到精通这本书说的很详细,电子版的很好找,论坛有的,加油啊。
看了,没有具体的算法,只是教人如何使用而已。
情绪只是时间的消耗品,所谓非理性行为就是对时间的量化。

地板
forex95 发表于 2012-4-19 13:00:21
我找到了初始聚类中心的算法啦,可是我不懂他的意思,有高人可以给解释一下吗?以下是初始聚类中心算法的英文解释。
If minid(xk,Mi)>dmn and d(xk,Mm)>d(xk,Mn), then xk replaces Mn. If minid(xk,Mi)>dmn and d(xk,Mm)<d(xk,Mn), then xk replaces Mm; that is, if the distance between xk and its closest cluster mean is greater than the distance between the two closest means (Mm and Mn), then xk replaces either Mm or Mn, whichever is closer to xk.

If xk does not replace a cluster mean in (a), a second test is made:
Let Mq be the closest cluster mean to xk.
Let Mp be the second closest cluster mean to xk.
If d(xk,Mp)>minid(Mq,Mi), then Mq=xk;
That is, if xk is further from the second closest cluster’s center than the closest cluster’s center is from any other cluster’s center, replace the closest cluster’s center with xk.

At the end of one pass through the data, the initial means of all NC clusters are set. Note that if NOINITIAL is specified, the first NC cases with no missing values are the initial cluster means.


情绪只是时间的消耗品,所谓非理性行为就是对时间的量化。

7
forex95 发表于 2012-4-25 13:48:58
情绪只是时间的消耗品,所谓非理性行为就是对时间的量化。

8
matlab-007 发表于 2015-12-10 10:17:19
如果minid(xk,Mi)在静息和d(xk,毫米)在d(xk,Mn),然后xk取代Mn。如果minid(xk,Mi)在静息和d(xk,Mm)& lt;d(xk,Mn),然后xk取代毫米;也就是说,如果xk和最亲密的集群意味着之间的距离大于两个最亲密的手段之间的距离(毫米和Mn),然后xk取代Mm或Mn,哪个更接近xk。
如果xk并不取代集群意味着在(a),第二个考验是:
让Mq集群最接近xk的意思。
让议员最近的第二集群意味着xk。
如果d(xk,Mp)在minid(Mq,Mi),然后Mq = xk;
如果xk进一步从第二个最亲密的星团中心比最近的聚类中心与其他集群中心,用xk替换最近的星团中心。
最后一个数据,通过最初的所有数控集群设置。注意,如果指定NOINITIAL,第一个数控例没有缺失值初始集群方式。

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群
GMT+8, 2026-1-27 21:54