Tytuł artykułu
Treść / Zawartość
Pełne teksty:
Identyfikatory
Warianty tytułu
Języki publikacji
Abstrakty
One of the factors that improve businesses in business intelligence is summarization systems that can generate summaries based on sentiment from social media. However, these systems cannot produce such summaries automatically; they use annotated datasets. To support these systems with annotated datasets, we propose a novel framework that uses pattern rules. The framework has two procedures: 1) pre-processing, and 2) aspect knowledge-base generation. The first procedure is to check and correct any misspelled words (bigram and unigram) by a proposed method and tag the parts-of-speech of all of the words. The second procedure is to automatically generate an aspect knowledge base that is to be used to produce sentiment summaries by sentiment-summarization systems. Pattern rules and semantic similarity-based pruning are used to automatically generate an aspect knowledge base from social media. In the experiments, eight domains from benchmark datasets of reviews are used. The performance evaluation of our proposed approach shows the highest performance when compared to other unsupervised approaches.
Wydawca
Czasopismo
Rocznik
Tom
Strony
489--516
Opis fizyczny
Bibliogr. 48 poz., rys., tab.
Twórcy
autor
- Prince of Songkla University, Faculty of Science, Artificial Intelligence and Informatics Innovations (AI3 ) Research Lab, Division of Computational Science, Hat Yai, 90110, Thailand
autor
- Prince of Songkla University, Faculty of Science, Artificial Intelligence and Informatics Innovations (AI3 ) Research Lab, Division of Computational Science, Hat Yai, 90110, Thailand
autor
- Prince of Songkla University, Faculty of Science, Artificial Intelligence and Informatics Innovations (AI3 ) Research Lab, Division of Computational Science, Hat Yai, 90110, Thailand
Bibliografia
- [1] Adela L., Ulfeta M.: Improving sentiment analysis for twitter data by handling negation rules in the Serbian language, Computer Science and Information Systems, vol. 16(1), pp. 289–311, 2019.
- [2] Aichner T., Jacob F.: Measuring the Degree of Corporate Social Media Use, International Journal of Market Research, vol. 57(2), pp. 257–276, 2015.
- [3] Asghar M.Z., Khan A., Zahra S.R., Ahmad S., Kundi F.M.: Aspect-based opinion mining framework using heuristic patterns, Cluster Computing, vol. 22, pp. 7181–7199, 2017.
- [4] Bagheri A., Saraee M., de Jong F.: Care more about customers: Unsupervised domain-independent aspect detection for sentiment analysis of customer reviews, Knowledge-Based Systems, vol. 52, pp. 201–213, 2013.
- [5] Bing L.: Sentiment Analysis: Mining Opinions, Sentiments, and Emotions, Cambridge University Press, New York, USA, 2015.
- [6] Clark E., Araki K.: Text Normalization in Social Media: Progress, Problems and Applications for a Pre-Processing System of Casual English, Procedia – Social and Behavioral Sciences, vol. 27, pp. 2–11, 2011.
- [7] Dedić N., Stanier C.: Measuring the Success of Changes to Existing Business Intelligence Solutions to Improve Business Intelligence Reporting. In: Research and Practical Issues of Enterprise Information Systems, pp. 225–236, Cham 2016.
- [8] Erşahin B., Aktaş Ö., Kilinç D., Erşahin M.: A hybrid sentiment analysis method for Turkish, Turkish Journal of Electrical Engineering & Computer Sciences, vol. 27(3), pp. 1780–1793, 2019.
- [9] Feng J., Yang W., Gong C., Li X., Bo R.: Product Feature Extraction via Topic Model and Synonym Recognition Approach. In: Big Data, pp. 73–88, Springer Singapore, 2019.
- [10] Freeman A.T., Condon S.L., Ackerman C.M.: Cross Linguistic Name Matching in English and Arabic: A "One to Many Mapping" Extension of the Levenshtein Edit Distance Algorithm. In: Proceedings of the Main Conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, pp. 471–478, HLT-NAACL ’06, USA, 2006.
- [11] Gerani S., Carenini G., Ng R.T.: Modeling content and structure for abstractive review summarization, Computer Speech & Language, vol. 53, pp. 302–331, 2019.
- [12] Gliwa B., Zygmunt A., Dąbrowski M.: Building sentiment lexicons based on recommending services for the Polish language, Computer Science, vol. 17(2), pp. 163–185, 2016.
- [13] Htay S.S., Lynn K.T.: Extracting product features and opinion words using pattern knowledge in customer reviews, The Scientific World Journal, vol. 2013, pp. 1–5, 2013.
- [14] Hu M., Liu B.: Mining and Summarizing Customer Reviews. In: Proceedings of the 10th International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 168–177, USA, 2004.
- [15] Jakob N., Gurevych I.: Extracting Opinion Targets in a Single- and Cross-Domain Setting with Conditional Random Fields, 2010.
- [16] Jin W., Ho H.H., Srihari R.K.: OpinionMiner: a novel machine learning system for web opinion mining and extraction. In: Proceedings of the 15th International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 1195–1204, ACM, USA, 2009.
- [17] Kang Y., Zhou L.: RubE: Rule-based methods for extracting product features from online consumer reviews, Information & Management, vol. 54(2), pp. 166–176, 2017.
- [18] Khan K., Baharudin B., Khan A.: Identifying Product Features from Customer Reviews Using Hybrid Patterns, International Arab Journal of Information Technology, vol. 11(3), pp. 281–286, 2014.
- [19] Kherwa P., Sachdeva A., Mahajan D., Pande N., Singh P.K.: An approach towards comprehensive sentimental data analysis and opinion mining. In: 2014 IEEE International Advance Computing Conference (IACC), pp. 606–612, 2014.
- [20] Konjengbam A., Dewangan N., Kumar N., Singh M.: Aspect ontology based review exploration, Electronic Commerce Research and Applications, vol. 30, pp. 62–71, 2018.
- [21] Lazhar F.: Implicit feature identification for opinion mining, Internation Journal of Business Information Systems, vol. 30(1), pp. 13–30, 2019.
- [22] Li S., Zhou L., Li Y.: Improving aspect extraction by augmenting a frequency-based method with web-based similarity measures, Information Processing & Management, vol. 51(1), pp. 58–67, 2015.
- [23] Li X., Bing L., Li P., Lam W., Yang Z.: Aspect Term Extraction with History Attention and Selective Transformation. In: Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI’18), pp. 4194–4200, 2018.
- [24] Li X., Lam W.: Deep Multi-Task Learning for Aspect Term Extraction with Memory Interaction. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017), pp. 2886–2892, Copenhagen, Denmark, 2017.
- [25] Liu Q., Gao Z., Liu B., Zhang Y.: Automated rule selection for opinion target extraction, Knowledge-Based Systems, vol. 104(15), pp. 74–88, 2016.
- [26] Maharani W., Widyantoro D.H., Khodra M.L.: Aspect Extraction in Customer Reviews Using Syntactic Pattern, Procedia Computer Science, vol. 59, pp. 244–253, 2015.
- [27] Mai L., Le B.: Aspect-Based Sentiment Analysis of Vietnamese Texts with Deep Learning. In: Intelligent Information and Database Systems (ACIIDS 2018). Lecture Notes in Computer Science, vol. 10751, pp. 149–158, Springer, 2018.
- [28] Marstawi A., Sharef N.M., Aris T.N.M., Mustapha A.: Ontology-Based Aspect Extraction for an Improved Sentiment Analysis in Summarization of Product Reviews (ICCMS ’17). pp. 100–104, 2017.
- [29] Mataoui M., Hacine T.E.B., Tellache I., Bakhtouchi A., Zelmati O.: A new syntax-based aspect detection approach for sentiment analysis in Arabic reviews. In: Proceedings of the 2nd International Conference on Natural Language and Speech Processing (ICNLSP), pp. 1–6, 2018.
- [30] Nawaz A., Asghar S., Naqvi S.H.: A segregational approach for determining aspect sentiments in social media analysis, The Journal of Supercomputing, vol. 75(5), pp. 2584–2602, 2019.
- [31] Niwattanakul S., Singthongchai J., Naenudorn E., Wanapu S.: Using of Jaccard Coefficient for Keywords Similarity. In: International MultiConference of Engineers and Computer Scientists (IMECS 2013), vol. I, 2013.
- [32] Parlar T., Ozel A.S., Song F.: Analysis of data pre-processing methods for the sentiment analysis of reviews, Computer Science, vol. 20(1), pp. 123–141, 2019.
- [33] Piao Z., Park S.M., On B.W., Choi G.S., Park M.S.: Product reputation mining: bring informative review summaries to producers and consumers, Computer Science and Information Systems, vol. 16(2), pp. 359–380, 2019.
- [34] Poria S., Cambria E., Gelbukh A.: Aspect extraction for opinion mining with a deep convolutional neural network, Knowledge-Based Systems, vol. 108(15), pp. 42–49, 2016.
- [35] Poria S., Cambria E., Ku L.W., Gui C., Gelbukh A.: A Rule-Based Approach to Aspect Extraction from Product Reviews. In: Proceedings of the 2nd Workshop on Natural Language Processing for Social Media (SocialNLP), pp. 28–37, 2014.
- [36] Qiu G., Liu B., Bu J., Chen C.: Opinion Word Expansion and Target Extraction through Double Propagation, Computational Linguistics, vol. 37(1), pp. 9–27 2011.
- [37] Ramamonjisoa D., Murakami R., Chakraborty B.: Comments Analysis and Visualization Based on Topic Modeling and Topic Phrase Mining. In: Proceedings of the 3rd International Conference on E-technologies and Business on the Web (EBW2015), p. 1, 2015.
- [38] Rana T.A., Cheah Y.N.: A two-fold rule-based model for aspect extraction, Expert Systems with Applications, vol. 89(15), pp. 273–285, 2017.
- [39] Rana T.A., Cheah Y.N.: Sequential patterns rule-based approach for opinion target extraction from customer reviews, Journal of Information Science, vol. 45(5), pp. 643–655, 2019.
- [40] Ranta A.: Grammatical Framework: Programming with Multilingual Grammars, CSLI Publications, Center for the Study of Language and Information, 2011.
- [41] Samanta P., Chaudhuri B.B.: A simple real-word error detection and correction using local word bigram and trigram. In: Proceedings of the 25th Conference on Computational Linguistics and Speech Processing (ROCLING 2013), pp. 211–220, Taiwan, 2013.
- [42] Singh S.K., Sachan M.K.: SentiVerb system: classification of social media text using sentiment analysis, Multimedia Tools and Applications, vol. 78(22), pp. 31109–32136, 2019.
- [43] Spacy: Spacy Guides, 2020. https://spacy.io/.
- [44] Tran T.A., Duangsuwan J., Wettayaprasit W.: A Novel Automatic Sentiment Summarization from Aspect-based Customer Reviews. In: Proceedings of the 15th International Joint Conference on Computer Science and Software Engineering (JCSSE), pp. 1–6, Nakhonpathom, Thailand, 2018.
- [45] Tubishat M., Idris N., Abushariah M.: Explicit aspects extraction in sentiment analysis using optimal rules combination, Future Generation Computer Systems, vol. 114, pp. 448–480, 2021.
- [46] Wei C.P., Chen Y.M., Yang C.S., Yang C.C.: Understanding what concerns consumers: a semantic approach to product feature extraction from consumer reviews, Information Systems and e-Business Management, vol. 8(2), pp. 149–167, 2010.
- [47] Wilson T., Wiebe J., Hoffmann P.: Recognizing contextual polarity in phrase-level sentiment analysis. In: HLT ’05: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing October 2005, pp. 347–354, 2005.
- [48] Yin W., Kann K., Yu M., Schütze H.: Comparative Study of CNN and RNN for Natural Language Processing, arXiv Preprint, vol. arXiv: 1702.01923, 2017.
Typ dokumentu
Bibliografia
Identyfikator YADDA
bwmeta1.element.baztech-3a59f132-35e7-42ab-ae66-310b2dc591f7