| 研究生: |
黃珮菁 Huang, Pei-Ching |
|---|---|
| 論文名稱: |
含遺失值之列聯表最大概似估計量及模式的探討 Maximum Likelihood Estimation in Contingency Tables with Missing Data |
| 指導教授: |
江振東
Chiang, Jeng-Tung |
| 學位類別: |
碩士
Master |
| 系所名稱: |
商學院 - 統計學系 Department of Statistics |
| 論文出版年: | 2000 |
| 畢業學年度: | 88 |
| 語文別: | 中文 |
| 論文頁數: | 76 |
| 中文關鍵詞: | 遺失值 、完全及部分列聯表分析 、單樣本方法 、多樣本方法 、概似方程式因式分解法 、EM演算法 |
| 外文關鍵詞: | Missing data, Completely and partially cross-classified data, Single-sample method, Multiple-sample method, Factorization of the likelihood method, EM algorithm |
| 相關次數: | 點閱:190 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
在處理具遺失值之類別資料時,傳統的方法是將資料捨棄,但是這通常不是明智之舉,這些遺失某些分類訊息的資料通常還是可以提供其它重要的訊息,尤其當這類型資料的個數佔大多數時,將其捨棄可能使得估計的變異數增加,甚至影響最後的決策。如何將這些遺失某些訊息的資料納入考慮,作出完整的分析是最近幾十年間頗為重要的課題。本文主要整理了五種分析這類型資料的方法,分別為單樣本方法、多樣本方法、概似方程式因式分解法、EM演算法,以上四種方法可使用在資料遺失呈隨機分佈的條件成立下來進行分析。第五種則為樣本遺失不呈隨機分佈之分析方法。
Traditionally, the simple way to deal with observations for which some of the variables are missing so that they cannot cross-classified into a contingency table simply excludes them from any analysis. However, it is generally agreed that such a practice would usually affect both the accuracy and the precision of the results. The purpose of the study is to bring together some of the sound alternatives available in the literature, and provide a comprehensive review. Four methods for handling data missing at random are discussed, they are single-sample method, multiple-sample method, factorization of the likelihood method, and EM algorithm. In addition, one way of handling data missing not at random is also reviewed.
封面頁
證明書
目錄
表目錄
圖目錄
致謝詞
論文摘要
第一章 緒論
1.1 研究動機與目的
1.2 資料架構
第二章 單樣本方法
2.1 二維度完全列聯表和部分列聯表分析
2.1.1 符號介紹
2.1.2 樣本來自卜瓦松分配之最大概似估計值
2.1.3 參數之估計
2.1.4 參數之估計
2.1.5 樣本來自多項分配之最大概似估計值
2.1.6 實例
2.2 三維度之完全列聯表和部分列聯表分析
2.2.1 符號介紹
2.2.2 樣本來自多項分配下之最大概似估計值
2.2.3 參數之估計
2.2.4 參數之估計
2.3 適合度檢定
第三章 多樣本方法
3.1 二維度之完全列聯表和部分列聯表分析
3.2 三維度之完全列聯表和部分列聯表分析
3.3 與單樣本方法之比較
3.4 實例
第四章 概似方程式因式分解法
4.1 巢狀型態資料
4.2 概似方程式因式分解法
4.2.1 符號介紹
4.2.2 最大概似估計量
4.3 應用與限制
4.4 實例
第五章 EM演算法
5.1 EM演算法
5.1.1 符號介紹
5.1.2 最大概似估計量
5.2 應用與限制
5.3 範例一分析結果比較
第六章 樣本遺失不呈隨機分佈之分析方法
6.1 引言
6.2 最大概似估計量
第七章 結論
文獻參考
Agresti, A. (1990). Categorical Data Analysis. New York:Wiley.
Agresti, A. (1996). An Introduction to Categorical Data Analysis. New York:Wiley.
Anderson, T.W. (1964). Maximum likelihood estimates for the multivariate normal distribution when some observations are missing. Journal of the American Statistical Association, 52, 200-203.
Blumenthal, S. (1968). Multinomial sampling with partially categorized data. Journal of the American Statistical Association, 63, 542-551.
Chen, T., and S. E. Fienberg (1974). Two-dimensional contingency tables with both completely and partially cross-classified data. Biometrics, 30, 629-642.
Chen, T., and S. E. Fienberg (1976). The analysis of contingency tables with incompletely classified data. Biometrics, 32, 133-144.
Choi, S.C., and D.M. Stablein (1988). Comparing incomplete paired binomial data under non-random mechanisms. Statistics in Medicine, 7, 929-939.
Clogg, C. C., and E. S. Shihadeh (1994). Statistical Models for Ordinal Variables. SAGE PUBLICATION.
Fuchs, C. (1982). Maximum likelihood estimation and model selection in contingency tables with missing data. Journal of the American Statistical Association, 77, 270-278.
Haber, M., and G. D. Williamson (1994). Models for three-dimensional contingency tables with completely and partially cross-classified data. Biometrics, 49, 194-203.
Haber, M., C. C.H. Chen, and G. D. Williamson (1991). Analysis of repeated categorical responses from fully and partially cross-classified data. Communications in statistics, 20, 3293-3313.
Hocking, R.R., and H.H. Oxspring (1971). Maximum likelihood estimation with incomplete multinomial data. Journal of the American Statistical Association, 66, 65-70.
Hocking, R.R., and H.H. Oxspring (1974). The analysis of partially categorized contingency data. Biometrics, 60, 469-483.
Laird, N. M. (1988). Missing data in longitudinal studies. Statistics in Medicine, 7, 305-315.
Lipsitz, S. R., J. G. Ibrahim, and G. M. Fitzmaurice (1999). Likelihood methods for incomplete longitudinal binary responses with incomplete categorical covariates. Biometrics, 55, 214-223.
Little, R. J.A. (1982). Models for nonresponse in sample surveys. Journal of the American Statistical Association, 77, 237-250.
Little, R. J.A., and D. B. Rubin (1987). Statistical Analysis with Missing Data. New York:Wiley.
Nordheim, E. V. (1984). Inference from nonrandomly missing categorical data: an example from a genetic study on Turner’s syndrome. Journal of the American Statistical Association, 79, 772-780.
Rubin, D. B. (1976). Inference and missing data. Biometrics, 63, 581-592.
(限達賢圖書館四樓資訊教室A單機使用)