Data Mining and Deep Learning (DAT550)
The purpose of this course is for students to gain knowledge and practical experience of data mining and deep learning techniques. The course will prepare the students with a deep knowledge of technologies and be able to prepare large-scale data for data mining (pre-processing), feature extraction, dimensionality reduction and use a number of supervised and unsupervised methods for classification, regression and clustering tasks that can help to extract actionable knowledge. The course will provide the opportunity for students to learn state-of-the-art data mining and deep learning algorithms and tools. The students will get hands-on experience to try these tools on real data through lab assignments and a project.
Course description for study year 2024-2025. Please note that changes may occur.
Course code
DAT550
Version
1
Credits (ECTS)
10
Semester tution start
Spring
Number of semesters
1
Exam semester
Spring
Language of instruction
English
Content
Unsupervised learning
- Data cleansing, transformation and preparation
- Clustering
- Dimensionality reduction, SVD, PCA
Supervised learning
- Classification
- Neural Networks and Deep learning
- Recommendation systems
Learning outcome
Knowledge
Candidate:
- has advanced knowledge in the field and specialized in the theory and practice of data preparation, selection and mining.
- has in-depth knowledge of the scientific or art theory of the subject area and methods to gain insight from large data collections.
- can apply knowledge in new areas of data mining and deep learning
- can analyze knowledge extraction issues on the basis of data mining and deep learning
Skills
Candidate can:
- Be able to analyze and critically relate to various sources of information and use these to structure and formulate professional reasoning for various data mining tasks
- Be able to analyze existing theories, methods and interpretations within the area of data mining and deep learning and work independently with problem-solving on data mining and deep learning tasks
- Be able to use relevant data mining methods such as clustering, classification, graph, stream mining, frequent pattern mining, association rule mining, deep learning for research and professional application development
- Be able to carry out an independent, limited research or development project under supervision and in accordance with current research ethics standards which involves preparing data mining pipelines, evaluation, and tune parameters for various data mining models and deep learning using state-of-the-art tools.
General competencies
Candidate:
- can analyze relevant professional, research and ethical issues in data mining and deep learning
- can apply their knowledge and skills in new areas to carry out advanced tasks and projects
- can provide extensive independent work on data mining and deep learning issues
- can communicate about data mining and deep learning issues, analyzes and conclusions within the subject area, both with specialists and to the general public.
- can contribute to research and innovation in data mining and deep learning.
- identify the theoretical and practical issues behind various data mining and deep learning techniques. Being able to list and describe strengths, limitations and trade-offs among various data mining techniques and choose the appropriate techniques for solving data science problems for various applications.
Required prerequisite knowledge
Recommended prerequisites
Exam
Written exam and project report
Form of assessment | Weight | Duration | Marks | Aid |
---|---|---|---|---|
Written exam | 3/5 | 4 Hours | Letter grades | - 1) |
Project report | 2/5 | Letter grades |
1) Textbooks and Lecture notes
Three obligatory assignments are graded, amounting to 3/20 (15%) of the final grade.Project consisting one large assignment. The project is to be performed in a group. The grade for the project will be based on the submitted program code, project report document and an oral hearing in groups of the submitted program code and report. Both parts must be done before final grade is given. If a student fails the project, she/he has to take this part next time the subject is lectured.The written exam will be digital (Inspera).Both exam units must be passed in order to receive a final grade in the course.
Coursework requirements
Three mandatory graded (A-F) exercises/programming assignments amounting to 3/20 (15%) of the final grade.
All programming exercises must be passed to attend for the written exam and to get project approved. Completion of mandatory lab assignments are to be made on time. Absence due to illness or for other reasons must be communicated as soon as possible to the laboratory personnel. One cannot expect that provisions for completion of the lab assignments at other times are made unless prior arrangements with the laboratory personnel have been agreed upon. Failure to complete the assigned labs on time or not having them approved will result in barring from taking the exam of the course.
Course teacher(s)
Course coordinator:
Vinay Jayarama SettyCourse teacher:
Mina FarmanbarHead of Department:
Tom RyenMethod of work
Overlapping courses
Course | Reduction (SP) |
---|---|
Web Search and Data Mining (DAT630_1) | 5 |