Results found: 63

Search results
Searched for:
in keywords: text mining
EN
This study aims to evaluate the applicability of a text mining approach for extracting UUX-related issues from a dataset of user comments and not to evaluate the Instagram (IG) app. This study analyses textual data mined from reviews in English written by IG mobile application users. The article’s authors used text mining (based on the LDA algorithm) to identify the main UUX-related topics. Next, they mapped the identified topics with known theoretical constructs to place them in their nomological network relevant to the usability (the 5Es framework by Quesenbery) and UX (the Honeycomb model by Morville). Finally, to expand the study with an emotional diagnosis, sentiment analysis was performed on two levels: (i) for each recognised topic, and (ii) for the full dataset to uncover general insights into users’ emotions within all reviews. The case study of the IG app confirms the usefulness of user feedback data for software development and points out that the review data have the potential for the early detection of frustration and negative feelings introduced during the use of the application. Conducting conventional UUX evaluations with users is problematic since they are remotely located, and the user-generated content of a social app undergoes continuous and frequent changes. Thus, the consecutive stages of the proposed methodology, based on text mining algorithms, constitute a proposed framework for examining the user-perceived quality projection of applications from user feedback, and they are the main contribution of this article. The used approach can be valuable for helping developers, designers and researchers to reveal user problems and fulfil user satisfaction regarding UUX aspects for specific software features.
EN
The article aims to analyse and compare experiences gained by tourists visiting three national parks in Poland. The authors focused on the following questions: What are people's experiences visiting national parks in Poland? Do the natural assets of the national parks affect visitors' unique experiences, or are environmentally valuable areas not crucial for their experiences? The authors used mixed quantitative (text mining, correspondence analysis) and qualitative (content analysis) methods. The data for analysis were opinions written by TripAdvisor users. Reviews on TripAdvisor indicate that the most important experiences for tourists in the National Parks studied were Nature appreciation and Physical activity. The other groups of experiences reflected in the reviews were: Aesthetic, Connection, Tension and Excitement. This confirms that nature is the most important feature of national parks for tourists, but it also indicates a trend to maintain good health and the desire to regenerate physical strength in areas of natural beauty.
PL (translated into English)
The aim of the article is to analyse and compare the experiences of tourists visiting three national parks in Poland. The authors focused on the following questions: What are the experiences of people visiting national parks in Poland? Do the natural assets of the national parks affect their unique experiences, or are environmentally valuable areas not important to them? The authors applied mixed methods: quantitative (text mining, correspondence analysis) and qualitative (content analysis). The data analysed were opinions written by TripAdvisor users. The analysis of TripAdvisor reviews indicated that the most important experiences gained by tourists in the national parks studied were nature appreciation and physical activity. The other groups of experiences reflected in the reviews were aesthetic, connection, tension and excitement. The study found that nature is the most important asset of national parks for tourists. It also pointed to a tendency among tourists to look after their health and a desire to regenerate physical strength in areas of outstanding natural value.
EN
Purpose: The study provides a concise overview of the CSR concept in the literature with regard to two market aspects: quality and marketing. The aim of the study is to analyse to what extent CSR maps to the phases of crisis management. Design/methodology/approach: The article presents a literature review of 352 articles published in the years 2006-2021, supported by machine learning. Findings: 15 thematic groups were identified. It is impossible to point to unambiguous thematic trends in relation to the topics in the analysed articles. CSR as a field includes three main elements (economic, environmental, social), but the concept can be considered in many areas and across a wide range of organizational activities, as the individual thematic groups in the article show. Research limitations/implications: As every literature review serves as a snapshot of a particular period, the results of the study are limited. Future research may also include qualitative research (e.g., case studies in individual companies) and quantitative research (e.g., focusing on specific aspects and/or dedicated samples). Practical implications: CSR implementation is not a single, short-term project of an organization, but an element of a long-term strategy and of market trends. This is important practical information for all organizations undertaking activities in corporate social responsibility. Social implications: The results of the presented research help the reader to see a broader perspective of CSR activities. Microeconomic activities within the CSR triad influence the macroeconomic scale: the global increase in awareness of business responsibility towards society. Originality/value: This is the first paper containing the literature review results on CSR considering the aspects of quality and marketing. The crisis management perspective was also considered as an issue related to the intensity of organizational activities in CSR.
EN
Purpose: To identify the duties and responsibilities of project managers by analysing the content of online job advertisements. Design/methodology/approach: Job advertisements were automatically downloaded for 63 countries/areas available on Indeed. A text mining analysis of fragments of the advertisements describing the scope of duties was carried out. The text mining analysis included initial text processing, creating corpora of the documents, creating a document-term matrix, and using classic methods derived from data mining. Findings: The research established the most frequently used words and n-grams in job advertisements. They have been presented in the form of figures. The 2-grams are also presented in the form of a net, a directed graph. The LDA algorithm identified abstract topics describing the duties and responsibilities of project managers. The most frequent words, n-grams, and topics identified by the LDA algorithm were used to identify the duties and responsibilities of project managers. Research limitations/implications: Only job advertisements written in English were analysed. The postings were downloaded only for six days. An attempt to automatically identify the responsibilities section did not yield the expected results. Therefore, it was carried out manually for random advertisements, which reduced the number of analysed documents. The content of the job advertisements was not analysed by country/area. Practical implications: The method applied can be used by organisations training future project managers, to modify and better adapt curricula to the needs of the labour market. Originality/value: Studies have shown that text mining of job advertisements can help determine the duties and responsibilities of project managers.
PL (translated into English)
Purpose: The purpose of this literature review is to identify the main thematic streams present in the scientific literature on organizational resilience. Design/methodology/approach: A literature review (formal text exploration) of 266 articles published between 01.2003 and 11.2022. Findings/conclusions: The analysis of the literature made it possible to identify thematic streams in articles on organizational resilience and to indicate the issues that are most frequently analysed and that deserve attention in ensuring business continuity. Limitations: The results are restricted to the keywords used and to the time of the analysis (a static view), which is a classic limitation of literature reviews. Practical implications: The issues presented can be used by business practitioners in strengthening organizational resilience and thereby maximizing business continuity. For researchers, the identification of organizational resilience streams may indicate directions for further research. Originality/value: The main value of the article lies in demonstrating the importance of the elements of organizational resilience in everyday managerial decisions.
EN
Purpose: The purpose of this literature review is to identify the main thematic trends in the scientific literature in the field of organizational resilience. Design/methodology/approach: Literature review (formal text exploration) of 266 articles published between 01.2003 and 11.2022. Findings/conclusions: The analysis of the literature made it possible to identify thematic streams in articles on organizational resilience and to indicate the issues that are most frequently analyzed and that deserve attention in ensuring the continuity of the organization’s operation. Research limitation: The results are narrowed down to the keywords used and the time of the analysis (static approach), which is a classic limitation in literature reviews. Practical implications: The presented issues can be used by business practitioners in the process of strengthening organizational resilience and thus maximizing business continuity. For scientists, identifying streams of organizational resilience may indicate directions for further research. Originality/value: The main value of the article is to demonstrate the importance of the elements of organizational resilience in everyday management decisions.
EN
This study examines the sustainability reports (SRs) of 200 firms in both developed and emerging economies in order to identify the words most frequently used in disclosing sustainability practices within the Triple Bottom Line (TBL) approach to reporting (which emphasizes economic, environmental, and social dimensions). Its aim is to evaluate these sustainability reports under the umbrella of the GRI framework. It adopts a semi-automated Text-Mining (TM) technique to evaluate the corporate SRs of select firms from the top ten economies by GDP at current prices. Based on the GRI Standards guidelines, a total of 208 keywords were identified for analysis. The disclosures were then scored on the appearance of these keywords: a keyword that appeared in a report earned a score of one, and a keyword that did not appear earned a score of zero for that word. Furthermore, a word cloud was generated in order to better understand the inclination of reporting language towards various TBL reporting categories. This analysis of the SRs of 200 firms from the top ten economies of the world sheds light on the differences in reporting practices and priorities as they relate to various aspects of the GRI Standards guidelines. The results indicate that SR practices grew rapidly in the second half of the period selected for study (2013-2017) as compared to the first half (2008-2012). Canada ranked highest for its disclosure practices in this analysis, followed by the UK, Germany, US, Japan, France, Italy, Brazil, India, and China. This study found that all included countries improved their sustainability performance over the period 2008-2017.
PL (translated into English)
This article analyses the sustainability reports (SRs) of 200 firms in both developed and emerging economies in order to identify the words most frequently used in disclosing sustainability practices within the triple bottom line (TBL) approach to reporting, which emphasizes the economic, environmental and social dimensions. The aim is to evaluate the sustainability reports within the GRI framework. A semi-automated Text-Mining (TM) technique was adopted to evaluate the corporate sustainability reports (SRs) of selected firms from the ten largest economies by GDP at current prices. Based on the GRI Standards guidelines, a total of 208 keywords were selected for analysis. Points were then awarded based on their occurrence, so that a single appearance resulted in a score of one; if a keyword did not appear, the report was scored zero for that word. In addition, a word cloud was created to better understand the inclination of reporting language towards the various TBL reporting categories. This analysis of the reports of 200 firms from the ten largest economies of the world sheds light on differences in reporting practices and priorities as they relate to various aspects of the GRI guidelines. The results indicate that sustainability reporting (SR) practices grew rapidly in the second half of the period selected for study (2013-2017) compared with the first half (2008-2012). In this analysis Canada ranked highest for its disclosure practices, followed by the UK, Germany, the United States, Japan, France, Italy, Brazil, India and China. The study found that all included countries improved their sustainability performance over the period 2008-2017.
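The binary keyword scoring described in this abstract (one point when a keyword appears in a report, zero when it does not) could be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the keyword list and report text are invented placeholders, not the study's 208 GRI-based keywords, and the helper name `score_report` is hypothetical.

```python
import re

def score_report(text, keywords):
    """Return {keyword: 1 or 0}: 1 if the keyword occurs in the text, else 0."""
    lowered = text.lower()
    scores = {}
    for kw in keywords:
        # Whole-word (or whole-phrase) match, case-insensitive via lowercasing.
        pattern = r"\b" + re.escape(kw.lower()) + r"\b"
        scores[kw] = 1 if re.search(pattern, lowered) else 0
    return scores

# Placeholder data (assumption): a tiny keyword list and one report excerpt.
keywords = ["emissions", "biodiversity", "human rights"]
report = "We reduced emissions by 5% and published our human rights policy."

scores = score_report(report, keywords)
total = sum(scores.values())  # disclosure score of this report
```

Summing the per-keyword scores gives one disclosure score per report, which can then be compared across firms or countries.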
EN
Background: Non-Functional Requirements (NFR) have a direct impact on the architecture of the system, thus it is essential to identify NFRs in the initial phases of software development. Aim: The work is based on extraction of relevant keywords from NFR descriptions by employing text mining steps and thereafter classifying these descriptions into one of the nine types of NFRs. Method: For each NFR type, keywords are extracted from a set of pre-categorized specifications using the Information-Gain measure. Then models using 8 Machine Learning (ML) techniques are developed for classification of NFR descriptions. A set of 15 projects (containing 326 NFR descriptions) developed by MS students at DePaul University is used to evaluate the models. Results: The study analyzes the performance of ML models in terms of classification and misclassification rate to determine the best model for predicting each type of NFR description. The Naïve Bayes model performed best in predicting the “maintainability” and “availability” types of NFRs. Conclusion: The NFR descriptions should be analyzed and mapped into their corresponding NFR types during the initial phases. The authors conducted a cost-benefit analysis to appreciate the advantage of using the proposed models.
EN
The goal of this study is to review the literature in the field of meshfree methods using text mining. For this study, the abstracts of around 17330 relevant articles published from 1990 to 2020 were collected from Scopus. Text mining techniques such as the latent Dirichlet allocation (LDA), along with the calculation of term frequencies and co-occurrence coefficients were used to analyze the text. The study identified a few key topics in the field of meshfree methods and helped to see the evolution of the field over the past three decades. Furthermore, the trend in the number of publications and frequency map highlighted research trends and lack of focus in certain areas. The co-author network visualization provided interesting insights about collaboration between different researchers around the world. Overall, this study facilitates a systematic literature review in the field of meshfree methods and provides a broader perspective of the field to the research community.
EN
By analyzing the individual stages of a building's life cycle, it can easily be concluded that the operation phase is the longest and is at the same time the justification for the construction project itself. In the course of the building's operation, various phenomena occur that affect its condition and thus the possibility of its unlimited use. These are natural phenomena, as well as phenomena resulting from external influences, which often lead to deterioration of the building's condition, or even its degradation. In response to these phenomena, maintenance, renovation and modernization activities are undertaken. Technical management involves identifying these phenomena, programming adequate measures and implementing them. The analysis of the results of a survey among property managers makes it possible to state categorically that the process of technical management is relatively little supported by IT tools and is still based on individual analysis and often intuitive actions. The article presents the possibilities of applying an innovative approach to the acquisition and collection of information about the technical condition of buildings, indicating the legitimacy of standardizing information forms and using them to build a case database for a CBR (case-based reasoning) inference system.
PL (translated into English)
The operation period of a building is ultimately the longest stage in its life cycle, and all the preceding stages of the life cycle are subordinate to it. An analysis of the literature gives the impression that proportionally little attention is paid to issues arising at the operation stage. Most attention is focused on issues arising at the design, preparation and construction stages of an investment. To some extent this is appropriate, since correct design and execution of a project should reduce problems at the operation stage. They cannot be eliminated entirely, however, because they also result from external and natural factors. At the same time, problems arising at the operation stage are complex in nature and involve many factors, including non-technical ones. Direct observation of the issues that arise during the operation of residential buildings leads to the conclusion that the technical condition of a building must be monitored continuously and that, as it changes, action must be taken by planning an adequate maintenance and renovation policy. Direct observation also shows that in many cases, despite the need to implement maintenance and renovation measures, they are not implemented for economic reasons. This is because implementing such measures, renovation in particular, requires sufficiently early preparation and the securing of funds for the necessary expenditure. When the technical condition of a building deteriorates drastically, the necessary expenditure often exceeds the financial capacity of owners and users, which in turn leads to even greater degradation of the technical condition. For this reason, it is important to forecast the expenditure required for maintenance and renovation measures, whose scope and justification follow from changes in the building's technical condition.
Given the established need to forecast expenditure and to plan maintenance and renovation policy, the actual state of affairs was analysed by means of a questionnaire survey. The results showed, among other things, that 53.06% of the surveyed entities (property managers) use IT solutions dedicated to property management. It was also found that IT solutions dedicated to property management cover typically technical issues only marginally and instead emphasise administration, the settlement and booking of fees, and communication. Particularly unsatisfactory is the finding, based on the survey and on direct observation, that the advantages of BIM are not being used in property management. This is explained, however, by the fact that BIM technology has so far become widespread in design and construction, so its use in property management will become noticeable only after some time, especially since the foundations of BIM 7D were already laid several years ago.
EN
Purpose: The paper presents an analysis of a scientific publication with regard to the frequency of words and n-grams. The research problem addressed was the extent to which a text mining analysis of a scientific publication allows its content to be inferred. Design/methodology/approach: The main research method is the analysis of tokenized text using word count functions, bigrams, and trigrams in selected sections of a scientific publication. The results of the text mining analysis were compared with a classic, non-automated text analysis of the publication. The presented study is a pilot project in the form of a case study. Findings: The proposed method of analyzing a scientific text using the frequency of words and n-grams enables inference of the content of the paper with regard to the names of the variables involved in the study, the statistical apparatus used and the key literature cited. It should be observed, however, that the discussed method does not make it possible to establish which variables are moderators and which are mediators. Originality/value: In the discussed study, the text mining technique was used differently than in previous works. The publication was not examined in its entirety, as previous researchers did, but text mining analysis was applied to individual parts of the paper, i.e. the part discussing the theoretical foundations of the research and the part presenting the research method, research results, and their discussion. This allowed for obtaining more precise results regarding the content of the publication.
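Counting words, bigrams and trigrams in a tokenized section of text, as this abstract describes, might look roughly like the following Python sketch. It uses only the standard library; the sample sentence and the helper names `tokenize` and `ngram_counts` are illustrative assumptions, and the simple lowercase-letters tokenizer is not the one used in the study.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and keep runs of ASCII letters as tokens."""
    return re.findall(r"[a-z]+", text.lower())

def ngram_counts(tokens, n):
    """Count contiguous n-grams (joined by spaces) in a token list."""
    grams = zip(*(tokens[i:] for i in range(n)))
    return Counter(" ".join(gram) for gram in grams)

# Placeholder section text (assumption), standing in for one part of a paper.
section = "Text mining of text data: text mining finds frequent words."
tokens = tokenize(section)

unigrams = ngram_counts(tokens, 1)
bigrams = ngram_counts(tokens, 2)
trigrams = ngram_counts(tokens, 3)
```

Calling `bigrams.most_common(10)` on each section separately would give the kind of per-section frequency lists the abstract compares against a manual reading.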
EN
Purpose: An attempt to identify the duties and responsibilities of the project manager by analysing job offers from a job website, and to determine whether there were any changes between 2018 and 2019. Design/methodology/approach: Text mining was performed on the fragments of job offers describing the duties and responsibilities. The text mining analysis consisted of initial processing of the text, creation of a corpus of analysed documents, construction of a word frequency matrix and use of classical methods from the data mining area. Findings: The most common words in job offers are presented, as well as their correlation with other words. With the use of a topic modeling algorithm, hidden topics describing the analysed job offers were generated. These topics can also be used to identify the duties and responsibilities of a project manager. Research limitations/implications: Only the job offers meeting the following conditions were analysed: (1) they concerned the job of “project manager”; (2) the content was in Polish; (3) they were provided by the www.pracuj.pl website; (4) they were collected from 09 to 11 April in 2018 and 2019. Practical implications: This method can be used by organizations training project managers, in order to modify and better adjust the curriculum to the needs of the labour market. Originality/value: Research has shown that text mining can be used to determine the responsibilities of a project manager by analysing job offers.
EN
Purpose: The objective of the paper is to use text mining to identify leading research domains concerning the smart city following an analysis of research articles with a high citation index according to the Web of Science. Design/methodology/approach: An original method is proposed for analysing academic texts using the R language, tokenisation, lemmatisation, n-grams and correspondence analysis. The author analysed fifty of the most cited articles indexed in the Web of Science from 2014 to 2019. Findings: The paper presents the advantages and drawbacks of the proposed method of analysing research publications. The assets include automation and repeatability of the analysis of a large number of documents and improved knowledge about links among the articles in terms of research domains. The disadvantage is the loss of information from diagrams and figures. The method identified two leading research domains related to the notion of the smart city, technologies and systems. The analysed publications were categorised by selected keywords. Research limitations/implications: Future work should include further refinement of the assumptions for the method, analyses of a more significant number of research texts and a narrowing down of the domain of the smart city. It is desirable to consider other functional domains of the city, such as energy, public health, environmental protection or transport. Practical implications: The proposed method can complement a standard literature analysis regarding the smart city. The leading research domains related to the smart city in the analysed articles were systems and technologies employed to improve how the city operates. Social implications: Text mining can be employed by various experts focusing on the smart city and constitutes a refreshing complement for other research methods, such as questionnaire surveys, interviews or observations. 
Originality/value: The publication can be useful for researchers from various fields and managers seeking to create and use simple, useful methods and tools for analysing unstructured text documents for decision-making. The paper proposes a separate text mining analysis of abstracts and whole documents using n-grams. This yielded a more precise list of areas relevant to the smart city. The grouping was done using correspondence analysis of the fifty most cited articles indexed in the Web of Science from 2014 to 2019.
EN
Purpose: The objective of this paper is to identify leading technologies in Industry 4.0. Design/methodology/approach: The identification was made with the use of text mining to explore the scientific texts in this field. The assumptions of an original iterative method for analyzing scientific texts were proposed, using the R language, tokenization, lemmatization, n-grams and correspondence analysis. The proposed method was used to analyze the 40 most frequently cited articles indexed in the Web of Science. Findings: On the basis of the obtained results, 4 leading technologies were identified. These are Cloud Computing, Internet of Things, Cyber-physical System and Big Data. Originality/value: The article proposes an original method of identifying the leading technologies used in Industry 4.0. The proposed method is based on text mining and correspondence analysis.
EN
Purpose: An attempt to identify the project manager competencies desired by employers and to determine whether changes have occurred over time. Design/methodology/approach: Job offers were automatically downloaded from a job-offer website. A text mining analysis of the fragments of offers describing competencies was carried out. The text mining analysis included initial text processing, creation of corpora of the analyzed documents, creation of a document-term matrix, a topic modeling algorithm and the use of classic methods derived from data mining. Findings: The most frequently used words/n-grams and the correlation of selected words/n-grams with other words/n-grams were presented in the form of figures. Based on the frequency of words/n-grams and the correlation value, efforts were made to identify the project manager competencies. The topic modeling algorithm was used to generate topics that can also be used to identify expected project manager competencies. Research limitations/implications: Only offers written in Polish, downloaded from one job-offer website, which had the phrase “kierownik projektu” (“project manager”) in their job title, were analyzed. Data were collected from 09 to 11 April 2018 and from 09 to 11 April 2019. Practical implications: The method applied can be used by organizations preparing people for the profession of project manager, to modify and better adapt curricula to the needs of the labor market. Originality/value: Studies have shown that text mining of job offers can, to some extent, help determine the desired project manager competencies.
EN
Purpose: An attempt to identify the scope of duties of the business analyst by analyzing job offers from websites. Design/methodology/approach: Job offers were automatically downloaded from five websites. A text mining analysis of the fragments of offers describing the scope of duties was carried out. The text mining analysis included initial text processing, creation of corpora of the analyzed documents, creation of a document-term matrix and the use of classic methods derived from data mining. Findings: The most frequently used words/n-grams and the correlation of selected words/n-grams with other words/n-grams were presented in the form of figures. Based on the frequency of words/n-grams and the correlation value, efforts were made to identify the responsibilities of the business analyst. Research limitations/implications: Only offers written in Polish, downloaded from five job-offer websites, which had the phrase “business analyst” in their job title, were analyzed. The data were collected between 13 and 15 April 2020. Practical implications: The method applied can be used by organizations preparing people for the profession of business analyst, to modify and better adapt curricula to the needs of the labor market. Originality/value: Studies have shown that text mining of job offers can, to some extent, help determine the responsibilities of a business analyst.
EN
Analyzing User-Generated Content present in social media has become mandatory for companies seeking to maintain competitiveness. These data contain information such as consumer opinions and recommendations, which are seen as rich sources of information for the development of decision support systems. A review of the state of the art found a lack of prior work addressing the analysis of online reviews of Brazilian restaurants. The focus of this work is therefore to fill this gap through a case study of the city of Santarém. The results show that professionals in this segment can use these analyses to improve the user experience and increase their profits.
EN
The purpose of the work was to analyse publications in the area of Business Intelligence. Only bibliometric data was used in the analysis. The analysis was performed using the R programming language. An attempt was made to determine whether by analysing bibliometric data, it is possible to obtain information on Business Intelligence systems. Aiming at achieving the adopted goal, in the second point of the work, selected information on Business Intelligence systems was presented. The third point presents the manner of collecting data. Further stages of the analysis were also presented. The fourth point contains the results of the conducted research. Among others, the number of publications in individual years and the most common words in titles, abstracts and keywords were presented. Using two topic modelling algorithms, topics were generated that can also be used to identify information related to Business Intelligence systems.
Similarity detection based on document matrix model and edit distance algorithm
EN
This paper presents a new algorithm with an objective of analyzing the similarity measure between two text documents. Specifically, the main idea of the implemented method is based on the structure of the so-called “edit distance matrix” (similarity matrix). Elements of this matrix are filled with a formula based on Levenshtein distances between sequences of sentences. The Levenshtein distance algorithm (LDA) is used as a replacement for various implementations of stemming or lemmatization methods. Additionally, the proposed algorithm is fast, precise, and may be implemented for analyzing very large documents (e.g., books, diploma works, newspapers, etc.). Moreover, it seems to be versatile for the most common European languages such as Polish, English, German, French and Russian. The presented tool is intended for all employees and students of the university to detect the level of similarity regarding analyzed documents. Results obtained in the paper were confirmed in the tests shown in the article.
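A similarity matrix built from Levenshtein distances between the sentences of two documents could be sketched in Python as below. This is an illustration only: the abstract does not give the formula used to fill the matrix, so the normalisation `1 - distance / longer_length` is an assumption, as are the helper names and the sample sentences.

```python
def levenshtein(a, b):
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]

def similarity_matrix(sents_a, sents_b):
    """Matrix of normalised similarities in [0, 1] between sentence pairs.

    The normalisation is an assumed formula, not the one from the paper.
    """
    matrix = []
    for sa in sents_a:
        row = []
        for sb in sents_b:
            longest = max(len(sa), len(sb)) or 1  # avoid division by zero
            row.append(1 - levenshtein(sa, sb) / longest)
        matrix.append(row)
    return matrix

# Placeholder documents (assumption), already split into sentences.
doc_a = ["the cat sat", "dogs bark"]
doc_b = ["the cat sits", "birds sing"]
m = similarity_matrix(doc_a, doc_b)
```

Because edit distance tolerates small differences in word endings, near-identical sentences such as the first pair score high without any stemming or lemmatization step, which is the role the abstract assigns to the distance measure.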
EN
This article proposes a way of identifying project managers’ competences in line with employers’ requirements. For this purpose, it presents an analysis of job advertisements using text analysis. The structure of the paper follows from this aim. The second part describes the competences of project managers: it defines competence, shows how competences can be divided, and indicates which are the most important according to the authors. The third part of the article is divided into three sections. The first shows how the data for analysis were collected. The second shows the procedure of the text mining analysis, and the last shows the results of the analysis: the words that appeared most frequently in the job offers and the correlation of those words with other words in the text.
PL (translated into English)
The article attempts to identify the project manager competencies expected by employers. To this end, the content of job offers was analysed using text mining. The structure of the paper follows from this aim. The second section presents the most important information on project manager competencies. Four competency models are listed, and the section explains what competencies are, into which groups they can be divided, and which of them are the most important according to selected authors. The third section is divided into three parts. The first presents how the data for analysis were collected. The next describes the text mining procedure that was carried out. The last presents the results of the analysis. It establishes which words appeared most frequently in the analysed offers and how these words were correlated with other words.
EN
This work proposes a software environment implementing a methodology for acquiring and exploiting the collective perception (CP) of Points of Interest (POIs) in a Smart City, which is meant to support decision makers in urban planning and management. This environment relies upon semantic knowledge discovery techniques and fuzzy computational approaches, including natural language processing, sentiment analysis, POI signatures and Fuzzy Cognitive Maps, turning them into a cohesive architectural blend in order to effectively gather the realistic perception of a user community towards given areas and attractions of a Smart City. The environment has been put to the test through thorough experimentation against a massive user base of an online community with respect to a large metropolitan city (the City of Naples). This experimentation yielded consistent results, useful for providing decision makers with a clear awareness of the positive as well as critical aspects of urban areas, and thus helping them shape the measures to be taken for improved city management and development.