人大经济论坛 › 论坛 › 数据科学与人工智能 › 数据分析与数据科学 › SPSS论坛 › Systematic Sampling with Fixed Sample Size using SPS ...

发帖

楼主: ReneeBK

1699 1

[问答] Systematic Sampling with Fixed Sample Size using SPSS Syntax [推广有奖]

1关注
62粉丝

VIP

已卖：4901份资源

学术权威

14%

还不是VIP/贵宾

TA的文库 其他...

R资源总汇

Panel Data Analysis

Experimental Design

威望: 1 级
论坛币: 49675 个
通用积分: 56.3087
学术水平: 370 点
热心指数: 273 点
信用等级: 335 点
经验: 57805 点
帖子: 4005
精华: 21
在线时间: 582 小时
注册时间: 2005-5-8
最后登录: 2023-11-26

楼主

ReneeBK 发表于 2014-4-30 13:23:49 |AI写论文

是否 +2 论坛币

k人参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群

赵安豆老师微信：zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

立即领取

感谢您参与论坛问题回答

经管之家送您两个论坛币！

+2 论坛币

I want to sample cases from a file by systematic sampling with a fixed sample size. Suppose I have N=10,000 cases in the file and want a sample of n=500 cases, choosing 1 case from every 20 cases. The first case sampled is the Kth case, where K is a random number from 1 to 20. The next cases sampled are the (K+20)th case, the (K+40th) case, and so forth. How can I do this in SPSS?

扫码加我拉你入群

请注明：姓名-公司-职位

以便审核进群资格，未注明则拒绝

分享0 收藏0 回帖

关键词：Systematic Sampling SYNTAX Sample System choosing number where file

本帖被以下文库推荐

· SPSS NewOccidental|主题: 197, 订阅: 35

沙发

ReneeBK 发表于 2014-4-30 13:24:55

The general approach is to assign each case a sequence number based on their serial position in the file, assign an interval
number such that the length of each interval equals L=N/n. K is generated as a single random number from 0 to L, and the Kth
case in each interval is chosen. Note that the number of cases in the file, i.e. the population, must be known.
When the sample size is fixed, the size of the intervals from which single cases are sampled will not be an integer if the
population size is not evenly divisible by the sample size. In such cases K is likely to be non integer, perhaps indicating
case 244.34 as the case to be sampled, for example. It is customary to round the case number to be sampled up to the
next integer (Case 245 in this example). The detailed steps are:
1. Determine the population size, i.e. the number of cases in the file to be sampled. This can be determined from running DESCRIPTIVES on a variable or the AGGREGATE and MATCH FILES commands can be combined to save the population size as a
variable, NSIZE. Calculate the length of each sampling interval as population size/sample size and save as L.
2. Create a variable named CASENO which indicates the serial position of each case in the file. In SPSS, this can be a
copy of the SPSS system variable $CASENUM.

3. Create a variable named START and generate a random number between 0 and L for the first case in the file. Copy this
number to all following cases. This constant will serve as K, the starting position.

4. Compute a variable named INTERV which indicates to which interval of length L each case belongs. INTERV starts at
0 rather than 1.

5. Compute a variable named SAMCASE which is (L*INTERV + START). If SAMCASE is not an integer, it is rounded upwards to the next integer.

6. Select the case if SAMCASE = CASENO.

IMPLEMENTING IN SPSS

The following commands will perform these steps in versions 4 and above of SPSS, including SPSS for windows and SPSS for the
Macintosh. Some modifications are required for SPSS/PC+. The first command sets the seed for the random number generator,
using the date and time. (The default value of SEED may vary or be fixed, depending on the version of SPSS). The EXECUTE
command forces a pass of the data, with assignment of CASENO to all cases before any selection takes place. (Otherwise, the first case to be deleted would pass its value for $CASENUM and CASENO to the next case, which would then necessarily be deleted by the same comparison to SAMCASE and pass the same value of $CASENUM to the next case, etc.)

SET SEED = 950203123 .
* Save population size as variable.
COMPUTE DUM = 1.
AGGREGATE OUTFILE = tsize
/BREAK = DUM /NSIZE = N.
MATCH FILES /FILE = * /TABLE = tsize /BY DUM.
* Calculate interval length L (for sample size of 500 in this
* example).
COMPUTE L = NSIZE/500.
COMPUTE CASENO = $CASENUM.
* Generate starting point as random number from 0 to L .
IF (CASENO = 1) START = UNIFORM(L).
IF (MISSING(START)) START = LAG(START).
* Calculate which interval a case falls into .
COMPUTE INTERV = TRUNC(CASENO/L).
IF (INTERV = CASENO/L) INTERV = INTERV - 1.
COMPUTE SAMCASE = INTERV * L + START.
IF (SAMCASE > TRUNC(SAMCASE)) SAMCASE = TRUNC(SAMCASE) + 1.
EXECUTE.
SELECT IF (SAMCASE = CASENO).
EXECUTE.

返回列表

发帖

本版微信群

加好友,备注cda
拉您进交流群

京ICP备16021002号-2 京B2-20170662号京公网安备 11010802022788号论坛法律顾问：王进律师知识产权保护声明免责及隐私声明

[问答] Systematic Sampling with Fixed Sample Size using SPSS Syntax [推广有奖]

经管之家送您一份

经管之家联合CDA

感谢您参与论坛问题回答

扫码加我拉你入群

相关帖子

本帖被以下文库推荐

浏览过的帖子

浏览过的版块

本版微信群

[问答] Systematic Sampling with Fixed Sample Size using SPSS Syntax [推广有奖]

经管之家送您一份

经管之家联合CDA

感谢您参与论坛问题回答

扫码加我 拉你入群

相关帖子

本帖被以下文库推荐

浏览过的帖子

浏览过的版块

本版微信群

扫码加我拉你入群