Data Mining

Major: Software Engineering
Code of subject: 7.121.01.E.016
Credits: 6.00
Department: Software
Lecturer: Dr. Science, professor Hrytsiuk Yuri Ivanovych
Semester: 2 семестр
Mode of study: денна
Мета вивчення дисципліни: Providing students with both fundamental theory and practical implementation of analyzing and processing large volumes of information, extracting the necessary knowledge from large databases.
Завдання: The ability to effectively solve specialized tasks and practical problems of an innovative nature during professional activities related to all aspects of software development from the initial stages of specification creation to system support after commissioning. Knowledge of modern mathematical methods and algorithms of Data Mining technology for analysis and processing of large volumes of information used in software engineering (ФКС2.1).
Learning outcomes: РНС2.1. Be able to develop methods for analysis and processing large data sets using known Data Mining tools.
Required prior and related subjects: Prerequisites: Research Methods and Tools in Software Engineering Corequisites: Master’s Thesis Preparation and Defence
Summary of the subject: The discipline involves the study of the basic concepts of Data Mining technology, the methods, tools and applications of Data Mining are considered in detail. The description of each method is accompanied by a concrete example of its use. The concept of Web Mining is introduced. The analytical software market is analyzed, products from leading Data Mining manufacturers are described, and their capabilities are discussed. Special attention is focused on data analysis technology Data Mining, Text Mining, Visual Mining, as well as visual (Visual Mining) and text (Text Mining) data analysis, process analysis (Process Mining), analysis of Web resources (Web mining) and analysis are considered in real time (Real-Time Data Mining). A description of methods and algorithms for solving the main tasks of analysis: classification, clustering, etc. is given. The description of the idea of ??each method is supplemented by a specific example of its use.
Опис: Data Mining: application specifics. Data Mining technology and its Ukrainian market. Practical usage of Data Mining methods and tools. Models and methods of Data Mining. Knowledge detection in datasets. Knowledge Management. Object regression and classification. Methods of constructing rules, decision trees and mathematical functions. Prediction of time series. Associative rules. Basic clustering algorithms. Adaptive data clustering methods. Visual Mining: usage problems and visualization tools. Methods of data visualization. Visualization as a way of data comprehension. Text Mining. Extracting key concepts from a text. Text document classification. Methods of clustering text documents. Web Mining. Methods of extracting Web-content. Process Mining. PROM library. Distributed data analysis. Systems of mobile agents. Real-Time Data Mining. Recommendation machines. Data Mining standards: CWM, CRISP, PMML and others. Xelopes library. Performance of genetic algorithms for processing big data.
Assessment methods and criteria: 1. Questioning in laboratory classes. 2. Control tests in laboratory classes. 3. Labs defense. 4. Examination control (written component (tests), oral component).
Критерії оцінювання результатів навчання: Current control: 45% (laboratory works – 30% (6 labs 5% each), interims tests on classes 15% (2 tests – 5% + 10%)) Exam: 55% (written part 50%, spoken part 5%) Each laboratory work takes two weeks. If its defence is delayed, a student loses one point per each week of delay.
Порядок та критерії виставляння балів та оцінок: 100-88 points - certified with an “excellent” grade - High level: the student demonstrates an in-depth mastery of the conceptual and categorical apparatus of the discipline, systematic knowledge, skills and abilities of their practical application. The mastered knowledge, skills and abilities provide the ability to independently formulate goals and organize learning activities, search and find solutions in non-standard, atypical educational and professional situations. The applicant demonstrates the ability to make generalizations based on critical analysis of factual material, ideas, theories and concepts, to formulate conclusions based on them. His/her activity is based on interest and motivation for self-development, continuous professional development, independent research activities, implemented with the support and guidance of the teacher. 87-71 points - certified with a grade of “good” - Sufficient level: involves mastery of the conceptual and categorical apparatus of the discipline at an advanced level, conscious use of knowledge, skills and abilities to reveal the essence of the issue. Possession of a partially structured set of knowledge provides the ability to apply it in familiar educational and professional situations. Aware of the specifics of tasks and learning situations, the student demonstrates the ability to search for and choose their solution according to the given sample, to argue for the use of a particular method of solving the problem. Their activities are based on interest and motivation for self-development and continuous professional development. 70-50 points - certified with a grade of “satisfactory” - Satisfactory level: outlines the mastery of the conceptual and categorical apparatus of the discipline at the average level, partial awareness of educational and professional tasks, problems and situations, knowledge of ways to solve typical problems and tasks. The applicant demonstrates an average level of skills and abilities to apply knowledge in practice, and solving problems requires assistance, support from a model. The basis of learning activities is situational and heuristic, dominated by motives of duty, unconscious use of opportunities for self-development. 49-00 points - certified with a grade of “unsatisfactory” - Unsatisfactory level: indicates an elementary mastery of the conceptual and categorical apparatus of the discipline, a general understanding of the content of the educational material, partial use of knowledge, skills and abilities. The basis of learning activities is situational and pragmatic interest.
Recommended books: 1. ЕНМК з дисципліни "Інтелектуальний аналіз даних" сертифікат № 04498. Доступний з : https://vns.lpnu.ua/course/view.php?id=4785 2. Грицюк Ю.І. Інтелектуальний аналіз даних і процесів : навч. посібник / Ю. І. Грицюк. – Львів : Вид-во НУ "Львівська політехніка", 2018. – 440 с. (рукопис) 3. Грицюк Ю.І. Інтелектуальний аналіз даних : лаборат. практикум / Ю. І. Грицюк. – Львів : Вид-во НУ "Львівська політехніка", 2016. – 160 с. (рукопис) 4. Грицюк Ю.І. Обчислювальні методи та моделі в наукових дослідженнях : монографія / Ю.І. Грицюк. – Львів : Вид-во ЛДУ БЖД, 2014. – 288 с.
Уніфікований додаток: Lviv Polytechnic National University ensures the realization of the right of persons with disabilities to obtain higher education. Inclusive educational services are provided by the Service of accessibility to learning opportunities "Without restrictions", the purpose of which is to provide permanent individual support for the educational process of students with disabilities and chronic diseases. An important tool for the implementation of the inclusive educational policy at the University is the Program for improving the qualifications of scientific and pedagogical workers and educational and support staff in the field of social inclusion and inclusive education. Contact at: St. Karpinsky, 2/4, 1st floor, room 112 E-mail: nolimits@lpnu.ua Websites: https://lpnu.ua/nolimits https://lpnu.ua/integration
Академічна доброчесність: The policy regarding the academic integrity of the participants of the educational process is formed on the basis of compliance with the principles of academic integrity, taking into account the norms "Regulations on academic integrity at the Lviv Polytechnic National University" (approved by the academic council of the university on June 20, 2017, protocol No. 35).