| 研究生: |
陳彥勳 Chen, Yen-Shiun |
|---|---|
| 論文名稱: |
非監督式新細胞認知機神經網路之研究 Studies on the Unsupervised Neocognitron |
| 指導教授: |
蔡瑞煌
Tsaih, Ray-R. |
| 學位類別: |
碩士
Master |
| 系所名稱: |
商學院 - 資訊管理學系 Department of Management Information System |
| 論文出版年: | 1996 |
| 畢業學年度: | 84 |
| 語文別: | 英文 |
| 論文頁數: | 65 |
| 中文關鍵詞: | 神經網路 、非監督式學習 、新細胞認知機 、印刷體中文字辨識 |
| 外文關鍵詞: | Neural network, Unsupervised learning, Neocognition, Printed chinese character recognition |
| 相關次數: | 點閱:215 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文使用非監督式新細胞認知機(Unsupervised neocognitron)神經網路來便是印刷體中文字。
關於非監督式新細胞認知機,本論文提出兩項修改。第一項,Us1子層的結點不進行學習,而是直接套用人為方式所指定的12個區域特徵,而Us1之後的S子層仍然使用非監督式學習的方式決定其所要偵測的區域特徵。第二項修改則是,在學習過中設定一個上限值來限制代表節點(representative)產生的個數。如此設計的目的是為了避免模板(cell-planes)分配不均的問題。在本研究,採用這兩項修改的新細胞認知機稱為模式一,而使用第二項修改的新細胞認知機稱為模式二。
本論文裡的所有實驗分為兩部分。在第一部分有四個實驗,這些實驗都使用相同的訓練範例與測試範例。訓練範例有兩組,第一組包含“川”,“三”,“大”,“人”,“台”等五個中文字。而第二組包含“零”,“壹”,“貳”,“參”,“肆”等中文字。訓練範例都市採用細明體,而測試範例則是採用其他九種不同字體。第一個實驗的主要目的是測試模式一的績效。實驗結果顯示,模式一很容易學習成功而且辨識率可以接受。另外三個實驗的目的是想要了解某些參數值與系統績效的關係。這些參數包含S-欄的大小(the size of S-column),模板樹(the number of cell-planes),以及節點的接收場大小(the size of cells’ receptive field)。這三個實驗所使用的網路系統是模式一。
第二部分有二個實驗,主要的目的是比較模式一與模式二的系統績效。在第一個實驗,所使用的訓練範例與第一部分實驗相同。實驗結果顯示模式一比較容易成功地學習,而且系統有不錯的表現。第二個實驗,使用17個中文字做為訓練範例。這17個字包括“零”,“壹”,“貳”,“參”,“肆”,“伍”,“陸”,“柒”,“捌”,“玖”,“拾”,“佰”,“仟”,“萬”,“億”,“圓”,“角”。實驗結果顯示,模式一仍然是一個不錯的系統。
In this study, we are investigating the feasibility of applying the unsupervised neocognitron to the recognition of printed Chinese characters.
Two propositions for the unsupervised neocognitron are mentioned. The first on proposes that the input connections of the first layer are manually given, and all subsequent layers are trained unsupervised. The second one concerns the selection of representatives. During the process of learning, the number of cell-planes that send representatives for each training pattern has an upper bound. The unsupervised neocognitron with implementing these two propositions is named as Model 1, and the unsupervised neocognitron with implementing only the second proposition is named as Model 2.
Experiment in this study are grouped into two parts, called Part I and Part II. In Part I, four experiments are conducted. For each experiment, two sets of training patterns will be conducted respectively. The first one, called the simple training set, consists of five printed Chinese characters“川”,“三”,“大”,“人”, and “台” with size of 25*25 in MingLight font. The second one, called the complex training set, contains another five printed Chinese characters“零”,“壹”,“貳”,“參”, and “肆” in the some font and size. After training, these characters of other nine different fonts are presented to test the generalization of the network.
The objective of the first experiment of Part I is to investigate the performance of Model 1. Simulation results shot that Model 1 demonstrates a good ability to achieve a successful learning. In other three experiments, the effect of choosing different value for some parameters in investigated. The parameters include the size of S-column, the number of cell-planes, and the receptive field of cells.
In Part II, a comparison of the performance of Model 1 and Model 2 is made. In the first experiment, Model 1 and Model 2 are trained to recognize the simple and complex training sets described above. Experimental results show that Model 1 shows higher ability to achieve a successful learning, and performance of Model 1 is acceptable. In the second experiment, 17 training patterns are presented during the learning process. These training patterns include “零”,“壹”,“貳”,“參”,“肆”,“伍”,“陸”,“柒”,“捌”,“玖”,“拾”,“佰”,“仟”,“萬”,“億”,“圓”,, and “角”. From the simulation results, Model 1 is a promising approach for the recognition of printed Chinese characters.
ABSTRACT(in Chinese)..........i
ABSTRACT(in English)..........ii
CONTENTS..........iv
LIST OF TABLES..........vi
LIST OF FIGURES..........viii
Chapter 1 Introduction..........1
1.1Motivation..........1
1.2Research topics and goal..........3
1.3The empirical way of analysis..........3
1.4Thesis organization..........4
Chapter 2 Neocognitron..........5
2.1Structure of the network..........5
2.2Behavior of cells..........9
2.2.1Feature extraction by an S-cell..........10
2.2.2The role of C-cell..........12
2.3Unsupervised learning..........13
2.4Review of earlier studies..........15
2.4.1Earlier work on the unsupervised neocognitron..........15
2.4.2Summary of Literature review..........19
Chapter 3 Experiments of Part I..........20
3.1The propositions for the unsupervised neocognitron..........21
3.2The ways of training and testing in experiments of Part I..........21
3.2.1Training process..........22
3.2.2Testing process..........22
3.3Experiment 1..........23
3.4Experiment 2..........24
3.5Experiment 3..........28
3.6The reasons for the divergence of the neocognitron..........31
3.7Experiment 4..........37
Chapter 4 Experiments of Part II..........44
4.1Experiment 1..........44
4.1.1The successful learning rate..........44
4.1.2The recognition rate..........47
4.2Experiment 2..........49
4.2.1Simulation results for SYSTEM A..........49
4.2.2Simulation results for SYSTEM B..........52
Chapter 5 Summary and future work..........54
5.1Summary..........54
5.2Future work..........55
References..........57
Appendix A..........59
Appendix B..........62
Appendix C..........64
[1] K. Fukushima, "Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position", BioI Cybern., Vo1.36, pp .193-202,Apr. 1980.
[2] Y. LeCun, B. Boser, "Backpropagation Applied to Handwritten Zip Code Recognition", Neural Computation, 1, pp.541-551, 1989.
[3] H.Y. Liao, IS. Huang, and S.T. Huang, "Two-Dimentional Neural Networks for Handwritten Chinese Character Recognition", 1992 IEEE IJCNN illS79-S84.
[4] A. Rajavelu, M.T. Musavi, and M.V. Shirvaikar, "A Neural Network Approach to Character Recognition", Neural Networks, Vol 2, pp.387-393 1989.
[5] K. Fukushima, S. Miyake, T. Ito, "Neocognitron: A Neural Network Model for a Mechanism of Visual Pattern Recognition", IEEE Trans. on System, Man, and Cybernetics, Vol. SMC-13,No.S, Sep/Oct 1983. pp.826-834.
[6] K. Fukushima, "Neocognitron: A Hierarchical Neural Network Capable of Visual Pattem Recognition", Neural Networks, Vol.1, pp.1l9-130, 1988.
[7] K. Fukushima, N. Wake, "Handwdtten Alphanumeric Character Recognition by the
Neocognitron", IEEE Trans. Oll Neural Networks, Vol.2, No.3, May 1991, pp.35S-36S.
[8] K. Fukushima, Sei Miyake, "Neocognitron: A New Algorithm For Pattern Recognition Tolerant of Deformation and Shifts In Position", Pattem Recognition, Vol.lS, No.6, pp. 4SS-469,1982.
[9] K. Fukushima, N. Wake, "Improved Neocognitroll with Bend-Detecting Cells", Proc. IEEE IJCNN, Vol.4, pp.190-19S, 1992.
[10] K. Fukushima, "Analysis of the Process of Visual Pattem Recognition by the Neocognitron",Neural Networks, Vol. 2, pp.413-420, 1989.
[11] MuraU M. Menon,Karl G. Heinemann, "Classification of Patterns Using a Self-Organizing Neural Network.", Neural Networks, Vol I, pp.201-21S, 1988.
[12] Glenn S. Himes and Rafael M. Inigo, "Automatic Target Recognition Using a Neocognitron",IEEE Trans. on Knowledge and Data Engineering, Vol.4, No.2, April 1992.
[13] James A. Freeman, "Neural Networks, Algorit~ Applications, and Program.rrring Techniquesll,Addison-Wesley Publishing Company, July 1992.
[14] Hubel, D.H.,Wiesel, T.N.,”Receptive fields, binocular interaction and functional architecture in cat's visual cortexll, 1. Physiol. 160, pp.l06-1S4, 1962.
[IS] Hubel, D.H.,Wiesel, T.N.,"Receptive fields and functional architecture in two nonstriate visual area (18 and 19) of the catll, 1. Neurophysiol. 28,229-289, 1965.
[16] Eun Jin Kim, "Handwritten Hangul Recognition Using a Modified Neocognitron", Neural Networks, Vol 4, pp.743-7S0, 1991.
[17] S. Yamaguchi, H. Itakura, "A Car Detection System Using the Neocognitron", Proc. IEEE IJCNN, Vol.2, pp.1208-1213, 1991.
[18] S.D. Wang, C.C. Pan, "A Neural Network Approach for Chinese Character Recognition", Proc.IEEE IJCNN, Vol.1, pp.416-419, 1990.
[19] F.G. Shieh, “Studies of the Recognition of the Printed Chinese Character Using the
NeocognitfOn model with the Changjei Codes", Master thesis of Computer and Information Engineering, Tatung Institute of Engineering, July 1993.
(限達賢圖書館四樓資訊教室A單機使用)