DATA MINING
cod. 1001891

Academic year 2019/20
1st year of course - Second semester
Professor
Piero GANUGI
Academic discipline
Economic statistics (SECS-S/03)
Field
Student's choice
Type of training activity
Student's choice
48 hours of face-to-face activities
6 credits
Course unit in Italian

Learning objectives

The aim of the course is to study in depth some statistical models of particular relevance to firms: design of experiments, discriminant analysis, and tree-based machine learning methods.
The course also develops a comparison between machine learning and some statistical models.

Prerequisites

- - -

Course unit content

Part 1. Analysis of variance and design of experiments.

The problem of curvature in the response surface.
Method of steepest ascent.
Two-factor random effects model.
Two-factor mixed model.
Analysis of variance for nested designs.
Analysis of covariance.
Analysis of variance with stochastic factors.

Part 2. Evaluation of firms with discriminant models.

Fisher Discriminant Analysis.
Logistic Discriminant Analysis.
Quadratic Discriminant Analysis.
ML Discriminant Analysis.


Part 3. Machine learning and the evaluation of firms.

The Nearest Neighbor.
Classification Trees.
Regression Trees.
Random Forests.
Support Vector Machine.
Neural Networks.

Full programme

Part 1. Analysis of variance and design of experiments.

The problem of curvature in the response surface with missing values.
Method of steepest ascent.
Two-factor random effects model.
Two-factor mixed model: one random and one fixed factor.
Analysis of variance for nested designs.
Analysis of covariance.
Analysis of variance with stochastic factors.

Part 2. Evaluation of firms.

Different methods of Financial Statement analysis.

ISTAT Statistics on firms.
Flow of Funds Analysis.
ESA (European System of Accounts)/Eurostat analysis.

Discriminant Models for the evaluation of firms.

Fisher Discriminant Analysis.
Logistic Discriminant Analysis.
Quadratic Discriminant Analysis.
ML Discriminant Analysis.


Part 3. Machine learning and the evaluation of firms.

The Nearest Neighbor.
Classification Trees.
Regression Trees.
Random Forests.
Support Vector Machine.
Neural Networks.

Bibliography

Montgomery, D. C. (2006). Design and Analysis of Experiments. McGraw-Hill, New York. (Chapters indicated during the course.)

Flury, B. (1997). A First Course in Multivariate Statistics. Springer, New York. (Chapters indicated during the course.)

Lantz, B. (2015). Machine Learning with R. Packt Publishing, Birmingham. Open source. (Chapters indicated during the course.)

Teaching methods

Lectures and laboratory sessions with R.

Assessment methods and criteria

Oral exam.
During the exam, the student must demonstrate knowledge of the models listed in the programme and developed during the course.

Other information

- - -

2030 agenda goals for sustainable development

- - -
