Peter Flom
2009-03-09 18:34:53 UTC
Dear all,
I'm running a logistic regression model with random effect. The dependent
vaiable is binary. I have around 4000 cases with late_hiv_testing and
9000 are not. Due to the spatial clustering pattern of the cases(moran's I
significant), I include zipcode as a random effect in the logistic model
and use proc glimmix procedure.
My questions is how can I tell if the random effect is significant? from
Covariance parameter estimates?
My question, back to you, is why you care whether the random effect is significant.I'm running a logistic regression model with random effect. The dependent
vaiable is binary. I have around 4000 cases with late_hiv_testing and
9000 are not. Due to the spatial clustering pattern of the cases(moran's I
significant), I include zipcode as a random effect in the logistic model
and use proc glimmix procedure.
My questions is how can I tell if the random effect is significant? from
Covariance parameter estimates?
Suppose it is significant .... what would you do?
Suppose it is not significant ... would you do something different?
One answer is that you might pool the data if the random effect were not sig. To me,
this is not good reasoning. At least in most cases, we should pool the data
(or not pool it) for other reasons:
1) Is it reasonable to pool?
2) Does pooling change estimates of fixed effects and their variances?
So, might the relationship between your DV and IV depend on geography? I don't know.
Perhaps you know, perhaps not.
But you can assess 2) by running both models.
The other problem is that the variance (and hence, the sig.) of the random effects
can be hard to estimate. Proper df is a contentious area. In longitudinal data, one book
(I think it's Hedeker and Gibbons, but can't swear to it) suggests leaving estimates of the sig.
of random effects out of any reporting.
Peter
Peter L. Flom, PhD
Statistical Consultant
www DOT peterflomconsulting DOT com