SAS中文论坛

Title: missing value

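Author: shiyiming    Time: 2004-3-20 07:47
Title: missing value
How can I handle missing data in SAS? The constraints are that no records may be deleted, and that the gaps should not be filled with zeros or with the mean. Is there a better, more effective approach? I would appreciate detailed guidance from the experts here. Thanks.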
Author: shiyiming    Time: 2004-3-20 10:27
You could try a predictive method.
Author: shiyiming    Time: 2004-3-22 18:19
Which predictive method, exactly? Please spell it out.
Author: shiyiming    Time: 2004-3-22 21:58
Title: Predictive methods
That is too broad a question. You have to start from your own data, common sense, and experience.
Author: shiyiming    Time: 2004-3-26 10:54
One predictive imputation method:

Suppose that in dataset A, variable V1 has missing values, while some other variables V2, V3, V4, ... have none. Then you can build a regression model such as

V1=a1+a2*V2+a3*V3+a4*V4+...

After that, use the estimated value of V1 in place of the original V1, and there will be no missing values.

Be careful when you use this method. Make sure it makes sense to use V2, V3, V4 to estimate V1.
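The regression approach described above can be sketched in a few lines. This is only an illustration under stated assumptions: the thread contains no code, so every function name and the sample data are made up, plain Python is used rather than SAS, and only the rows where V1 is missing are filled (observed values of V1 are left untouched). The model is fitted on the complete rows only.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix, so row operations apply to both sides at once.
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("predictors are collinear; cannot fit the model")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_ols(predictors, target):
    """Least-squares coefficients [a1, a2, ...] for V1 = a1 + a2*V2 + a3*V3 + ..."""
    design = [[1.0] + list(row) for row in predictors]
    k = len(design[0])
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(design, target)) for i in range(k)]
    return solve_linear_system(xtx, xty)


def impute_by_regression(rows):
    """rows: tuples (V1, V2, V3, ...) where V1 may be None. Fills missing V1 only."""
    complete = [r for r in rows if r[0] is not None]
    coefs = fit_ols([r[1:] for r in complete], [r[0] for r in complete])
    filled = []
    for r in rows:
        if r[0] is None:
            estimate = coefs[0] + sum(c * v for c, v in zip(coefs[1:], r[1:]))
            filled.append((estimate,) + tuple(r[1:]))
        else:
            filled.append(tuple(r))
    return filled, coefs


# Made-up data that follows V1 = 1 + 2*V2 + 3*V3 exactly; one V1 is missing.
data = [
    (6.0, 1.0, 1.0),
    (8.0, 2.0, 1.0),
    (11.0, 2.0, 2.0),
    (16.0, 3.0, 3.0),
    (None, 4.0, 1.0),
    (13.0, 3.0, 2.0),
]
filled, coefs = impute_by_regression(data)
print(coefs)
print(filled[4])
```

On real data the fit will not be exact, so it is worth checking the residuals on the complete rows before trusting the imputed values, which is the caution raised in the post.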
Author: shiyiming    Time: 2004-3-28 12:10
Title: missing value
hannaqiu:
     Hello!
     Thank you for your suggestion, but I think your approach requires first making sure that the model is reliable, because a regression model does not always suit the variables. So I think your approach can only be used in simple situations, not routinely.
     I would like to offer an idea here: we can first run a "cluster" analysis on the data with missing values, and then impute the mean value of each group for the missing values in that group.
     What do you think of my idea? Thank you for any opinions.
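The cluster-then-group-mean idea can be sketched as follows. This is a minimal illustration, not anyone's actual procedure: it assumes the clustering is done on a fully observed variable V2, uses a tiny one-dimensional k-means written for this example (a stand-in for whatever clustering procedure you would really use), and all names and data are hypothetical.

```python
def kmeans_1d(values, k=2, iterations=20):
    """Tiny 1-D k-means (k >= 2); centroids start evenly spaced from min to max."""
    lo, hi = min(values), max(values)
    centroids = [lo + (hi - lo) * i / (k - 1) for i in range(k)]
    labels = [0] * len(values)
    for _ in range(iterations):
        # Assign each value to its nearest centroid.
        labels = [min(range(k), key=lambda c: abs(v - centroids[c])) for v in values]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = sum(members) / len(members)
    return labels


def impute_by_cluster_mean(rows, k=2):
    """rows: tuples (V1, V2) where V1 may be None. Cluster on V2, fill V1 per cluster."""
    labels = kmeans_1d([v2 for _, v2 in rows], k)
    filled = []
    for (v1, v2), lab in zip(rows, labels):
        if v1 is None:
            observed = [r[0] for r, l in zip(rows, labels)
                        if l == lab and r[0] is not None]
            if not observed:
                raise ValueError("cluster has no observed V1 to average")
            v1 = sum(observed) / len(observed)
        filled.append((v1, v2))
    return filled, labels


# Made-up data with two obvious groups in V2; V1 is missing once in each group.
data = [(10.0, 1.0), (12.0, 1.2), (None, 0.9),
        (50.0, 8.0), (54.0, 8.5), (None, 7.8)]
filled, labels = impute_by_cluster_mean(data)
print(labels)
print(filled)
```

The imputed values depend entirely on which cluster a row lands in, so a different k or a different starting point can change them.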
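Author: shiyiming    Time: 2004-3-28 13:45
Title: Re
I don't think it needs to be that complicated.

1. Using a scatter plot or your own experience, look at the trend in the data and decide what type of curve it follows;

2. Fit the curve equation and replace the missing values with the predicted values;

3. If you want to be lazy, consider "data fit" software.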
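Steps 1 and 2 above can be sketched like this. The curve type is the part you have to choose yourself; here an exponential curve y = a * exp(b * x) is assumed purely for illustration, and it is fitted by ordinary least squares on ln(y). The function names and the data are hypothetical.

```python
import math


def fit_exponential(xs, ys):
    """Fit y = a * exp(b * x) by least squares on ln(y); requires every y > 0."""
    logs = [math.log(y) for y in ys]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_l = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (l - mean_l) for x, l in zip(xs, logs))
    b = sxy / sxx
    a = math.exp(mean_l - b * mean_x)
    return a, b


def fill_from_curve(points):
    """points: (x, y) pairs where y may be None. Missing y gets the fitted value."""
    known = [(x, y) for x, y in points if y is not None]
    a, b = fit_exponential([x for x, _ in known], [y for _, y in known])
    filled = [(x, y if y is not None else a * math.exp(b * x)) for x, y in points]
    return filled, (a, b)


# Made-up data lying exactly on y = 2 * exp(0.5 * x), with y missing at x = 3.
points = [(x, 2.0 * math.exp(0.5 * x)) for x in (0, 1, 2, 4, 5)] + [(3, None)]
filled, (a, b) = fill_from_curve(points)
print(a, b)
print(filled[-1])
```

If the scatter plot suggests a different shape (linear, polynomial, logistic), the same pattern applies with a different fitting function.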
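Author: shiyiming    Time: 2004-3-28 22:09
Of course you can use clustering for imputation. But since you asked about predictive imputation, I told you how to do that. Clustering actually runs into the same problem: how do you guarantee that your clusters are stable?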

SAS中文论坛 (http://mysas.net/forum/)