Journal of Shanghai Jiao Tong University (Science) ›› 2020, Vol. 25 ›› Issue (3): 325-332.doi: 10.1007/s12204-020-2184-1
WANG Yinglin (王英林)
出版日期:
2020-06-15
发布日期:
2020-05-29
通讯作者:
WANG Yinglin (王英林)
E-mail:wang.yinglin@shufe.edu.cn
WANG Yinglin (王英林)
Online:
2020-06-15
Published:
2020-05-29
Contact:
WANG Yinglin (王英林)
E-mail:wang.yinglin@shufe.edu.cn
摘要: Nowadays, the Internet has penetrated into all aspects of people’s lives. A large number of online customer reviews have been accumulated in several product forums, which are valuable resources to be analyzed. However, these customer reviews are unstructured textual data, in which a lot of ambiguities exist, so analyzing them is a challenging task. At present, the effective deep semantic or fine-grained analysis of customer reviews is rare in the existing literature, and the analysis quality of most studies is also low. Therefore, in this paper a fine-grained opinion mining method is introduced to extract the detailed semantic information of opinions from multiple perspectives and aspects from Chinese automobile reviews. The conditional random field (CRF) model is used in this method, in which semantic roles are divided into two groups. One group relates to the objects being reviewed, which includes the roles of manufacturer, the brand, the type, and the aspects of cars. The other group of semantic roles is about the opinions of the objects, which includes the sentiment description, the aspect value, the conditions of opinions and the sentiment tendency. The overall framework of the method includes three major steps. The first step distinguishes the relevant sentences with the irrelevant sentences in the reviews. At the second step the relevant sentences are further classified into different aspects. At the third step fine-grained semantic roles are extracted from sentences of each aspect. The data used in the training process is manually annotated in fine granularity of semantic roles. The features used in this CRF model include basic word features, part-of-speech (POS) features, position features and dependency syntactic features. Different combinations of these features are investigated. Experimental results are analyzed and future directions are discussed.
中图分类号:
WANG Yinglin . Fine-Grained Opinion Mining on Chinese Car Reviews with Conditional Random Field[J]. Journal of Shanghai Jiao Tong University (Science), 2020, 25(3): 325-332.
WANG Yinglin . Fine-Grained Opinion Mining on Chinese Car Reviews with Conditional Random Field[J]. Journal of Shanghai Jiao Tong University (Science), 2020, 25(3): 325-332.
[1] | KIM S M, HOVY E. Determining the sentiment of opinions [C]//20th International Conference on Computational Linguistics. Geneva, Switzerland: Association for Computational Linguistics, 2004: 1367. |
[2] | KOBAYASHI N, INUI K, MATSUMOTO Y, et al.Collecting evaluative expressions for opinion extraction[C]//International Conference on Natural Language Processing. Berlin, Heidelberg: Springer, 2005:596-605. |
[3] | HU M Q, LIU B??Mining and summarizing customer reviews [C]//10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Seattle, Washington, USA: ACM. 2004: 168-177. |
[4] | YAO T F, NIE Q Y, LI J C, et al. An opinion mining system for Chinese automobile reviews[C]//The Academic Conference of the 25th Anniversary of the Chinese Information Society of China. Beijing,China: Chinese Information Processing Society of China, 2006: 260-281 (in Chinese). |
[5] | POPESCU A M, ETZIONI O. Extracting product features and opinions from reviews [C]//Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing. Vancouver,Canada: Association for Computational Linguistics,2005: 339-346. |
[6] | NI M S, LIN H F. Mining product reviews based on association rule and polar analysis [C]//3th Chinese Conference on Information Retrieval and Content Security.Suzhou, China: Chinese Information Processing Society of China, 2007: 635-641 (in Chinese). |
[7] | WANG R Y. The key technology research on opinion target extraction [D]. Suzhou, China: Suzhou University,2012 (in Chinese). |
[8] | LIMJ H, HWANG Y S, PARK S Y, et al. Semantic role labeling using maximum entropy model [C]//8th Conference on Computational Natural Language Learning.Boston, Massachusetts, USA: Association for Computational Linguistics, 2004: 954-961. |
[9] | LIU T, CHE W X, LI S. Semantic role labeling with maximum entropy classifier [J]. Journal of Software,2007, 18(3): 565-573 (in Chinese). |
[10] | ZHUANG L, JING F, ZHU X Y. Movie review miningand summarization [C]//15th ACM International Conference on Information and Knowledge Management. Arlington, Virginia, USA: ACM, 2006:43-50. |
[11] | JIN W, HO H H. A novel lexicalized HMM-based learning framework for Web opinion mining [C]//26th International Conference on Machine Learning. Montreal,Canada: ACM, 2009: 465-472. |
[12] | LI F T, HAN C, HUANG M L, et al. Structure-aware review mining and summarization [C]//23rd International Conference on Computational Linguistics. Beijing,China: Association for Computational Linguistics,2010: 653-661. |
[13] | KUANG Y, ZHOU Y, HE H. A combination method of CRF with syntactic rules to identify opinion holder[C]//6th International Conference on Natural Language Processing and Knowledge Engineering. Beijing,China: IEEE, 2010: 11568368. |
[14] | LIU X. Research on opinion extraction method based on product reviews [D]. Harbin, China: Heilongjiang University, 2015 (in Chinese). |
[15] | TITOV I, MCDONALD R. A joint model of text and aspect ratings for sentiment summarization[C]//Proceedings of ACL’08. Columbus, USA: Association for Computational Linguistics, 2008: 308-316. |
[16] | BRANAVAN S R K, CHEN H, EISENSTEIN J, et al. Learning document-level semantic properties from free-text annotations [J]. Journal of Artificial Intelligence Research, 2009, 34(1): 569-603. |
[17] | BRODY S, ELHADAD N. An unsupervised aspectsentiment model for online reviews [C]//2010 Annual Conference of the North American Chapter of the ACL. Los Angeles, USA: Association of Computational Linguistics, 2010: 804-812. |
[1] | 蒋祖华1, 周宏明2, 陶宁蓉3, 李柏鹤1. 基于知识的船舶曲面分段建造调度及应用[J]. J Shanghai Jiaotong Univ Sci, 2024, 29(5): 759-765. |
[2] | 于佳琪1,王殊轶1,王浴屺1,谢华2,吴张檑1,付小妮1,马邦峰1. 基于增强现实技术的新型经皮肾穿刺训练可视化工具[J]. J Shanghai Jiaotong Univ Sci, 2023, 28(4): 517-. |
[3] | 姜锐1,朱瑞祥1,蔡萧萃1,苏虎2. 具有增强注意力的前景分割网络[J]. J Shanghai Jiaotong Univ Sci, 2023, 28(3): 360-369. |
[4] | 祝 楷, 熊柏青, 闫宏伟, 张永安, 李志辉, 李锡武, 刘红伟, 温 凯, 闫丽珍, . 辊道传送速度对大规格铝合金厚板应力分布及演变影响的数值模拟研究[J]. J Shanghai Jiaotong Univ Sci, 2023, 28(2): 255-263. |
[5] | . [J]. J Shanghai Jiaotong Univ Sci, 2022, 27(6): 757-767. |
[6] | . [J]. J Shanghai Jiaotong Univ Sci, 2022, 27(2): 190-201. |
[7] | . [J]. J Shanghai Jiaotong Univ Sci, 2022, 27(2): 240-249. |
[8] | . [J]. J Shanghai Jiaotong Univ Sci, 2022, 27(1): 24-35. |
[9] | . [J]. J Shanghai Jiaotong Univ Sci, 2022, 27(1): 99-111. |
[10] | . [J]. J Shanghai Jiaotong Univ Sci, 2022, 27(1): 121-136. |
[11] | . [J]. J Shanghai Jiaotong Univ Sci, 2022, 27(1): 7-14. |
[12] | . [J]. J Shanghai Jiaotong Univ Sci, 2021, 26(5): 577-586. |
[13] | . [J]. J Shanghai Jiaotong Univ Sci, 2021, 26(5): 587-597. |
[14] | . [J]. J Shanghai Jiaotong Univ Sci, 2021, 26(5): 670-679. |
[15] | SHI Lianxing (石连星), WANG Zhiheng (王志恒), LI Xiaoyong (李小勇) . Novel Data Placement Algorithm for Distributed Storage System Based on Fault-Tolerant Domain[J]. J Shanghai Jiaotong Univ Sci, 2021, 26(4): 463-470. |
阅读次数 | ||||||||||||||||||||||||||||||||||||||||||||||||||
全文 33
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||
摘要 587
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||