Chandler Armstrong
2008-08-07 21:07:49 UTC
Hello,
I'm working with a massive time-series cross-section dataset that has a lot
of missing data spread across the cases and time.
when I try to run a TSCSREG program, I first get this "there is only one
cross section of time series observation" error. I think I know how to fix
this using SAS commands to count cases within a time series, but so far I've
just manually sorted through the data and found the cases that only have one
observation in a time series.
Next, I get an error of "not enough observations with non-missing model
variables for model statement in cross section 'csid'='x'; where 'csid' is
the cross section identifier and 'x' is the value of that identifier. I
can't think of a way to possibly use SAS to automatically delete the
offending cases, though I have a feeling there is something really simple to
solve it that I just don't know about. Currently, I am having to run the
program, check the log, open the dataset, delete the offending cases, close
the dataset, run the program, check the log, open the dataset, delete the
offending cases... ad infinitum. Does anybody know of a way to work with
these kinds of errors in the data without having to do so much manual work?
I think programs such as STATA just automatically dump the useless cases.
Why doesn't SAS? I really want to use SAS over STATA, but if it is this
fussy with data, I don't know if I can take much more of it.
Chandler
I'm working with a massive time-series cross-section dataset that has a lot
of missing data spread across the cases and time.
when I try to run a TSCSREG program, I first get this "there is only one
cross section of time series observation" error. I think I know how to fix
this using SAS commands to count cases within a time series, but so far I've
just manually sorted through the data and found the cases that only have one
observation in a time series.
Next, I get an error of "not enough observations with non-missing model
variables for model statement in cross section 'csid'='x'; where 'csid' is
the cross section identifier and 'x' is the value of that identifier. I
can't think of a way to possibly use SAS to automatically delete the
offending cases, though I have a feeling there is something really simple to
solve it that I just don't know about. Currently, I am having to run the
program, check the log, open the dataset, delete the offending cases, close
the dataset, run the program, check the log, open the dataset, delete the
offending cases... ad infinitum. Does anybody know of a way to work with
these kinds of errors in the data without having to do so much manual work?
I think programs such as STATA just automatically dump the useless cases.
Why doesn't SAS? I really want to use SAS over STATA, but if it is this
fussy with data, I don't know if I can take much more of it.
Chandler