| 研究生: |
馮書昭 FENG,SHU-CHAO |
|---|---|
| 論文名稱: |
發展社群媒體事件之圖像快篩方法:以2016年美濃地震之Twitter資料為例 Developing a Fast Screening Method for Visual Images in Social Media Events: A Case Study of Twitter Data during 2016 Meinong Earthquake |
| 指導教授: |
陳恭
Chen, Kung |
| 口試委員: |
陳百齡
Chen, Pai-Lin 鄭宇君 Cheng, Yu-Chung |
| 學位類別: |
碩士
Master |
| 系所名稱: |
理學院 - 資訊科學系 |
| 論文出版年: | 2018 |
| 畢業學年度: | 106 |
| 語文別: | 中文 |
| 論文頁數: | 61 |
| 中文關鍵詞: | 2016 高雄美濃地震 、推特 、社群媒體分析 、圖像分析 、文字分析 、圖論 |
| 外文關鍵詞: | 2016 Meinong earthquake, Social media analysis |
| DOI URL: | http://doi.org/10.6814/THE.NCCU.CS.004.2018.B02 |
| 相關次數: | 點閱:329 下載:4 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
近年來,網際網路的普及以及各種社群網站的興盛,使得無論人們身在何處,無時無刻不能夠在社群網路上表達自己的想法,無論是資訊傳播、經驗分享以及意見交流,社群平台上的互動已經逐漸變成人們生活中的一部份,而這些充斥在社群平台上的即時資訊,也使得社群媒體成為蘊含各種最新消息的寶庫,而隨著社群網站的演進以及功能的增加,除了文字外人們更可以透過各種的多媒體,以更豐富的方式傳達內容。在災難發生時,資訊的掌握對於相關單位解析災難的相關情況以及制定後續行動非常重要,此時社群網路中的豐富即時資訊就可以扮演一個快速獲取資料的來源,藉由解析社群媒體上的資訊,可以即時地獲得第一手資訊,專家更可以藉由分析照片或影片等多媒體或是字裏行間中隱含的資訊瞭解到現場的情況,制定更加完備的因應對策。
本研究以2016年高雄美濃地震為例,透過探索研究地震期間在推特上交流的資訊,包含發文內容、附加圖片以及發文者、發文時間、發文語系以及轉推等後設資訊(meta data),瞭解在地震相關的貼文資訊中,附圖的貼文較未附圖的貼文更加的容易被轉推,而後本研究著重在貼文中的附圖,研擬使用圖像辨識的方法,賦予推特上的圖片物件標籤,後以標籤共現嘗試將跟地震相關的圖片分群,確認此方法分析圖片的結果以及其限制,最後回到發佈的貼文本身資訊,本研究分析了地震相關熱門貼文的發布者身份,藉由爬梳推特上的使用者資訊,瞭解到除了台灣本身,包含日本、中國、美國、英國、俄羅斯等多國媒體都有報導的相關資訊,且可能由於推特中日本的使用者較多,且日本位置距離台灣較相近,因此熱門貼文的發文者大多是使用日文的使用者。
Recently, thanks to various social media platforms and availability of mobile web, people have got used to interactive through the internet anywhere anytime. Activities on social media have become everyone's routine, such as searching or sharing information and communication. In the meanwhile, these activities make social media platforms a treasure for getting information. This boundless, real-time information strongly connected to our real life, which means, by locating the information of some position and moment, we are able to analyze specific events of the real world. This feature is especially important at analyzing disasters. At the beginning of disasters, getting information is vital for those authorized. By properly extract messages from social media, experts can realize more and much quickly about the disaster to take the next steps.
This work provides a case study of 2016 Meinong Earthquake happened in Taiwan by exploring the data on Twitter. First of all, this work analyzes the metadata of tweets and show that tweets with images are more likely to be retweeted. Then, after using the computer vision services to label the images, this work provides the results and resistance of using label co-occurrence to cluster images. In the end, by crawling the user information of popular tweets' publisher, we can realize that besides the media of Taiwan, there are also media from other countries caring about this disasters. Moreover, we also know that most personal user publisher of popular tweets use Japanese. The reason might be that Japanese users are the second most in Tweet and location of Japan is relatively near Taiwan than other occidental countries.
第一章 緒論 1
1.1 研究背景與動機 1
1.2 研究目的與問題 2
1.3 研究架構 4
第二章 相關研究 6
2.1 社群媒體事件研究 6
2.2 文字探勘 7
2.2.1 詞袋模型 8
2.2.2 中文斷詞 8
2.3 圖片分析 9
2.3.1 圖片標記服務 10
2.3.2 重複圖片偵測 11
2.4 圖論 11
2.4.1 節點的分群方法 12
2.4.2 分群方法的評估方法 13
第三章 研究資料 15
3.1 研究資料 15
3.2 資料清理 17
3.2.1 時間區間過濾 18
3.2.2 文字過濾 22
3.3 圖片自動下標記 23
3.3.2 不同服務間的標籤分析 26
3.3.3 服務選擇 29
第四章 研究流程及結果 37
4.1 附圖貼文及未附圖貼文的比較 37
4.2 無監督式圖片類別 39
4.2.1 標籤分析 39
4.2.2 分群及其結果 42
4.2.3 結果分析 44
4.3 高轉推貼文發文者身份 51
第五章 結論及未來展望 60
[1] Imran, M., Castillo, C., Diaz, F., & Vieweg, S. (2015). Processing social media messages in mass emergency: A survey. ACM Computing Surveys (CSUR), 47(4), 67.
[2] Daly, S., & Thom, J. A. (2016, May). Mining and Classifying Image Posts on Social Media to Analyse Fires. In ISCRAM.
[3] Mouzannar, H., Rizk, Y., & Awad, M. Damage Identification in Social Media Posts using Multimodal Deep Learning.
[4]Weng, J., & Lee, B. S. (2011). Event detection in twitter. ICWSM, 11, 401-408.
[5]Abel, F., Hauff, C., Houben, G. J., Stronkman, R., & Tao, K. (2012, June). Semantics+ filtering+ search= twitcident. exploring information in social web streams. In Proceedings of the 23rd ACM conference on Hypertext and social media (pp. 285-294). ACM.
[6]Alam, F., Imran, M., & Ofli, F. (2017, July). Image4act: Online social media image processing for disaster response. In Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017 (pp. 601-604). ACM.
[7]Peters, R., & de Albuquerque, J. P. (2015). Investigating images as indicators for relevant social media messages in disaster management. In ISCRAM.
[8] Harris, Z. S. (1954). Distributional Structure. Word, 10(2/3), 146–162
[9] Upadhyaya, N., & Dixit, M. (2016). A Review: Relating Low Level Features to High Level Semantics in CBIR. International Journal of Signal Processing, Image Processing and Pattern Recognition, 9(3), 433-444.
[10] Jain, R., Kasturi, R., & Schunck, B. G. (1995). Machine vision (Vol. 5). New York: McGraw-Hill.
[11]Raghavan, U. N., Albert, R., & Kumara, S. (2007). Near linear time algorithm to detect community structures in large-scale networks. Physical review E, 76(3), 036106.
(label)
[12]Blondel, V. D., Guillaume, J. L., Lambiotte, R., & Lefebvre, E. (2008). Fast unfolding of communities in large networks. Journal of statistical mechanics: theory and experiment, 2008(10), P10008.
[13] Pons, Pascal, and Matthieu Latapy. "Computing communities in large networks using random walks." ISCIS. Vol. 3733. 2005.
[14] Almeida, Hélio, et al. "Is there a best quality metric for graph clusters?." Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, Berlin, Heidelberg, 2011.
[15]Santorini, B. (1990). Part-of-speech tagging guidelines for the Penn Treebank Project (3rd revision). Technical Reports (CIS), 570.
[16]González, F.A., Gelbukh, A.F., & Jiménez, S. (2015). Soft Cardinality in Semantic Text Processing: Experience of the SemEval International Competitions. Polibits, 51, 63-72.
[17]鄭宇君, & 陳百齡. (2017). 香港雨傘運動的眾聲喧嘩:探討Twitter社群的多語系貼文. 傳播與社會學刊 41 頁81-117