|
If the intervals between measurements are not equal (or not nearly equal), then employing an autoregressive residual structure is invalid. In fact, I suggest that you forget about the REPEATED statement. Technically, an unstructured residual matrix can never be wrong, but it's likely too complex given the way in which the measurements were collected (unequal time intervals between and within patients). It is worth noting that the MIXED procedure in SAS offers a variety of spatial covariance structures which can handle unequal intervals while accounting for decaying residual correlations as observations become more distant in time, but I'll stick within the confines of SPSS for this post. With that stated, using the MIXED procedure in SPSS, a random coefficient model seems like your best option.
This is not easy to explain over email. Moreover, I'm quite distracted by other pressing work. Having said that, I'm going to try to help get you started. In order to make any movement, I need to make some assumptions:
(1) You have the date associated for when the measurements were taken on each subject.
(2) The first measurement was taken shortly before diagnosis.
(3) Patients you are tracking are getting equivalent forms of treatment that started shortly after diagnosis.
If yes to all 3 assumptions, then create a Time variable that reflects number of days since baseline. The first measurement on each patient will be considered baseline and should be coded as 0, and subsequent measurements will reflect the number of days since the first measurement/baseline. Concretely, if patient 1 was measured three times (baseline, 5 days post-baseline and 25 days post-baseline, then the dataset should look like this:
Patient_ID Time
1 0
1 5
1 25
2
2
.
.
.
Needless to say, if patients are measured more frequently (e.g., multiple times in a single day), then you should make the measurement unit number of hours or minutes since baseline.
With that said, I'd parameterize the model as follows:
MIXED Y BY <categorical predictors> WITH Age Time
/FIXED = <categorical predictors> Age Time <two-way interactions between each predictor and Time> | SSTYPE(3)
/METHOD = REML
/PRINT = G SOLUTION TESTCOV
/RANDOM = INTERCEPT Time | SUBJECT(Patient_ID) COVTYPE(UN).
I am assuming that there is a linear relationship between time and the dependent variable. You can certainly consider exploring other types of relationships. Same goes for Age.
At any rate, with the model proposed above you should be able to answer all sorts of research questions using the TEST sub-command (e.g. is the estimated mean on day X since baseline for males significantly different than females; is the slope for males significantly different for females). Examining the estimates from the random effects covariance (G) matrix could prove useful as well, but no time to discuss this right now.
Write back if you have additional questions and I'll try to respond when time permits.
HTH,
Ryan
|