KURSPLAN
Data Science, 7,5 högskolepoäng
Data Science, 7.5 credits
Kursplan för studenter höst 2019
Kurskod: TDSR29
Fastställd av: VD 2019-06-01
Gäller fr.o.m.: 2019-08-01
Version: 1
Utbildningsnivå: Avancerad nivå
Utbildningsområde: Tekniska området
Ämnesgrupp: DT1
Fördjupning: A1N
Huvudområde: Informatik

Lärandemål

After a successful course, the student shall

Kunskap och förståelse

- Display knowledge of the fundamental tasks in data analysis, i.e., classification, regression, clustering and association rules
- Display knowledge of the phases in a data analysis project, i.e. preprocessing, modeling and evaluation
- Display knowledge of tools and techniques for large scale data analysis
- Show familiarity with key research directions in data analysis and machine learning

Färdighet och förmåga

- Demonstrate the ability to use a software tool for all parts of a data analysis project; i.e., preprocessing, modeling and evaluation
- Demonstrate the ability to identify and apply a suitable technique based on a problem description
- Demonstrate the ability to analyse results and models from a data analysis project

Värderingsförmåga och förhållningssätt

- Demonstrate an understanding of how data analysis can be used as a tool to support business and technical decision-making

Innehåll

The exponential growth of the digital universe, particularly in the form of storage and computing power in recent decades, enables companies to accumulate huge amounts of data at moderate cost. Accompanying this technological shift is a widespread realization that collected data contain potentially valuable information. Data science is an interdisciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from data in various forms, both structured and unstructured, through data analysis.

The course includes the following elements:
- Fundamental tasks in data analysis, i.e., classification, regression, clustering and association rules
- Basic machine learning techniques for classification, regression, clustering and association rules
- Organization of a data analysis project, i.e., preprocessing, modeling and evaluation
- Tools and techniques for large scale data analysis
- Software tools for data analytics
- Data analytics applied in different engineering and business domains

Undervisningsformer

The course consists of lectures, seminars and assignments with tutoring.

Undervisningen bedrivs på engelska.

Förkunskapskrav

The applicant must hold the minimum of a bachelor’s degree (i.e the equivalent of 180 ECTS credits at an accredited university) with at least 90 credits in computer engineering, electrical engineering (with relevant courses in computer engineering), or equivalent. The bachelor’s degree should comprise a minimum of 15 credits in mathematics. Proof of English proficiency is required.

Examination och betyg

Kursen bedöms med betygen 5, 4, 3 eller Underkänd.

The final grade for the course is based on a balanced set of assessments. The final grade will only be issued after satisfactory completion of all assessments.

Poängregistrering av examinationen för kursen sker enligt följande system:
ExaminationsmomentOmfattningBetyg
Examination12,5 hp5/4/3/U
Predictive modeling assignment2,5 hpU/G
Descriptive modeling assignment2,5 hpU/G
1 Bestämmer kursens slutbetyg vilket utfärdas först när samtliga moment godkänts.

Kurslitteratur

The literature list for the course will be provided one month before the course starts.

Principal text:

Authors: Linoff G. and Berry M.
Title: Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management (third edition)
Publisher: Wiley (2011)
ISBN: 9780470650936 (paperback)

Additional texts:

Author: Demšar, J.
Title: Statistical comparisons of classifiers over multiple data sets.
Journal: Journal of Machine learning research, 7(Jan), 1-0. (2006)

3-5 additional research articles