Strumenti Utente

Strumenti Sito


magistraleinformatica:dmi:start

Differenze

Queste sono le differenze tra la revisione selezionata e la versione attuale della pagina.

Link a questa pagina di confronto

Entrambe le parti precedenti la revisioneRevisione precedente
Prossima revisione
Revisione precedente
magistraleinformatica:dmi:start [09/12/2022 alle 14:15 (22 mesi fa)] – [First Semester] Anna Monrealemagistraleinformatica:dmi:start [19/09/2024 alle 09:58 (14 ore fa)] (versione attuale) – Table header Mattia Setzu
Linea 1: Linea 1:
-<html> +====== Data Mining (309AA- 9 CFU A.Y2024/2025 ======
-<!-- Google Analytics --> +
-<script type="text/javascript" charset="utf-8"> +
-(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function()+
-(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o), +
-m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m) +
-})(window,document,'script','//www.google-analytics.com/analytics.js','ga');+
  
-ga('create', 'UA-34685760-1', 'auto', 'personalTracker', {'allowLinker': true}); +**Instructors:**
-ga('personalTracker.require', 'linker'); +
-ga('personalTracker.linker:autoLink', ['pages.di.unipi.it', 'enforce.di.unipi.it', 'didawiki.di.unipi.it'] ); +
-   +
-ga('personalTracker.require', 'displayfeatures'); +
-ga('personalTracker.send', 'pageview', 'courses/dminf/'); +
-setTimeout("ga('send','event','adjusted bounce rate','30 seconds')",30000);  +
-</script> +
-<!-- End Google Analytics --> +
-<!-- Global site tag (gtag.js) - Google Analytics --> +
-<script async src="https://www.googletagmanager.com/gtag/js?id=G-LPWY0VLB5W"></script> +
-<script> +
-  window.dataLayer = window.dataLayer || []; +
-  function gtag(){dataLayer.push(arguments);+
-  gtag('js', new Date()); +
- +
-  gtag('config', 'G-LPWY0VLB5W'); +
-</script> +
-<!-- Global site tag (gtag.js) - Google Analytics --> +
-<script async src="https://www.googletagmanager.com/gtag/js?id=G-LPWY0VLB5W"></script> +
-<script> +
-  window.dataLayer = window.dataLayer || []; +
-  function gtag(){dataLayer.push(arguments);+
-  gtag('js', new Date()); +
- +
-  gtag('config', 'G-LPWY0VLB5W'); +
-</script> +
-<!-- Capture clicks --> +
-<script> +
-jQuery(document).ready(function(){ +
-  jQuery('a[href$=".pdf"]').click(function() { +
-    var fname = this.href.split('/').pop(); +
-    ga('personalTracker.send', 'event',  'DMINF', 'PDFs', fname); +
-  }); +
-  jQuery('a[href$=".r"]').click(function() { +
-    var fname = this.href.split('/').pop(); +
-    ga('personalTracker.send', 'event',  'DMINF', 'Rs', fname); +
-  }); +
-  jQuery('a[href$=".zip"]').click(function() { +
-    var fname = this.href.split('/').pop(); +
-    ga('personalTracker.send', 'event',  'DMINF', 'ZIPs', fname); +
-  }); +
-  jQuery('a[href$=".mp4"]').click(function() { +
-    var fname = this.href.split('/').pop(); +
-    ga('personalTracker.send', 'event',  'DMINF', 'Videos', fname); +
-  }); +
-  jQuery('a[href$=".flv"]').click(function() { +
-    var fname = this.href.split('/').pop(); +
-    ga('personalTracker.send', 'event',  'DMINF', 'Videos', fname); +
-  }); +
-}); +
-</script> +
-</html> +
-====== Data Mining (309AA) - 9 CFU A.Y. 2022/2023 ====== +
- +
-**Instructor:**+
   * **Anna Monreale**   * **Anna Monreale**
     * KDDLab, Università di Pisa     * KDDLab, Università di Pisa
     * [[anna.monreale@unipi.it]]        * [[anna.monreale@unipi.it]]   
 +  * **Mattia Setzu**
 +    * KDDLab, Università di Pisa
 +    * [[mattia.setzu@unipi.it]]   
 +
 **Teaching Assistant:** **Teaching Assistant:**
-  * **Francesca Naretto** 
-    * KDDLab, SNS, Pisa 
-    * [[francesca.naretto@sns.it]]   
   * * **Lorenzo Mannocci**   * * **Lorenzo Mannocci**
     * University of Pisa     * University of Pisa
Linea 75: Linea 15:
  
 ====== News ====== ====== News ======
-  * [28.10.2022] ** The lectures on 16 and 17 November will be suppressed. **  +  * [14.09.2024] ** The lectures will start on 19th September 2024** 
-  * [09.09.2022] he lectures will be only in presence and will NOT be live-streamed, but recordings of the lecture or of the previous years will be made available here for non-attending students.+
    
 ====== Learning Goals ====== ====== Learning Goals ======
Linea 90: Linea 29:
      * Ethical Issues      * Ethical Issues
  
-====== Hours and Rooms ======+====== Schedule ======
  
 **Classes** **Classes**
  
 ^  Day of Week  ^  Hour  ^  Room  ^  ^  Day of Week  ^  Hour  ^  Room  ^ 
-|  Wednesday |  09:00 - 11:00  |  Room  |  +|  Tuesday   |  11:00 - 13:00  |  Room C1  |  
-|  Thursday  |  11:00 - 13:00  |  Room C  |  +|  Thursday  |  09:00 - 11:00  |  Room C  |  
-|  Friday    |  09:00 - 11:00  |  Room  +|  Friday    |  09:00 - 11:00  |  Room C1  
  
  
  
 **Office hours - Ricevimento:** **Office hours - Ricevimento:**
-Anna Monreale: Tuesday11:00-13:00 by online using Teams or at the Department of Computer Science, room 374/E (Please ask an appointment by email). +  * Anna Monreale: TBD 
-Francesca Naretto: TDB+  * Mattia SetzuInfos on [[https://unimap.unipi.it/cercapersone/dettaglio.php?ri=177323&template=dett_didattica.tpl|Unimap]]
  
-**[[https://teams.microsoft.com/l/team/19%3aU9V_a8O2AkYl6KAcYiVMyOx_UfVD4SXKE2bwYRdOQ581%40thread.tacv2/conversations?groupId=ecc43c6e-29fe-4819-bc96-9bc6b906491f&tenantId=c7456b31-a220-47f5-be52-473828670aa1|Teams Channel]]** will be used ONLY to post news, Q&A, and other stuff related to the course. The lectures will be only in presence and will **NOT** be live-streamed, but recordings of the lecture or of the previous years will be made available here for non-attending students.   +A [[https://teams.microsoft.com/l/team/19%3Aq8IK5DrzMwEE5TxVhuw4QdYEVFJ06KVITI5jSJTmaJ81%40thread.tacv2/conversations?groupId=5fae2fa6-38fd-414f-a0c9-ffbd8e6f0710&tenantId=c7456b31-a220-47f5-be52-473828670aa1|Teams Channel]] will be used ONLY to post news, Q&A, and other stuff related to the course. The lectures will be only in presence and will **NOT** be live-streamed, but recordings of the lecture or of the previous years will be made available here for non-attending students.
-====== Learning Material -- Materiale didattico ======+
  
-===== Textbook -- Libro di Testo =====+====== Teaching Material ======
  
-  Pang-Ning Tan, Michael Steinbach, Vipin Kumar. **Introduction to Data Mining**. Addison Wesley, ISBN 0-321-32136-7, 2006 +**Books** 
-    [[http://www-users.cs.umn.edu/~kumar/dmbook/index.php]] +^ Title ^ Authors ^ Edition ^ 
-    * Chapters 4,6 and 8 are also available at the publisher's Web site. +[[http://www-users.cs.umn.edu/~kumar/dmbook/index.php|Introduction to Data Mining]] | Pang-Ning TanMichael SteinbachVipin Kumar | 2nd | 
-  * BertholdM.R., Borgelt, C., Höppner, F., Klawonn, F. **GUIDE TO INTELLIGENT DATA ANALYSIS.** Springer Verlag, 1st Edition., 2010. ISBN 978-1-84882-259-+| [[https://link.springer.com/book/10.1007/978-3-031-48956-3|Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications]] | Laura Igual,  Santi Seguí | 2nd | 
-  * Laura Igual et al.** Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications**. 1st ed. 2017 Edition. +[[http://shop.oreilly.com/product/0636920034919.do| Python Data Science Handbook: Essential Tools for Working with Data]] | Jake VanderPlas | 1st 
-   Jake VanderPlas. **[[http://shop.oreilly.com/product/0636920034919.do| Python Data Science Handbook: Essential Tools for Working with Data.]]** 1st Edition.  +| [[https://github.com/janishar/mit-deep-learning-book-pdf|Deep Learning]] | Ian Goodfellow, Yoshua Bengio, Aaron Courville | | 
-   For Python Notions{{ :magistraleinformatica:dmi:python_basics.ipynb.zip Very basic notions on Python}} +| [[https://math.mit.edu/~gs/linearalgebra/ila5/indexila5.html|Introduction to Linear Algebra]] | Gilbert Strang | 5th |
  
  
-===== Slides =====+**Online tutorials**
  
-  * The slides used in the course will be inserted in the calendar after each class. Most of them are part of the slides provided by the textbook's authors [[http://www-users.cs.umn.edu/~kumar/dmbook/index.php#item4|Slides per "Introduction to Data Mining"]]+^ ^ Authors ^ 
-   +[[https://brianmcfee.net/dstbook-site/content/intro.html|Digital Signals Theory]] | Brian McFee | 
 +| [[https://rtavenar.github.io/blog/dtw.html|An introduction to Dynamic Time Warping]] | Romain Tavenard | 
 +| [[https://github.com/msetzu/intro_to_ds_and_ml/blob/master/python/notebooks/Python.ipynb|Introduction to Python]] | Mattia Setzu |
  
-   
-===== Software===== 
  
-  Python - Anaconda (at least 3.7 version!!!): Anaconda is the leading open data science platform powered by Python. [[https://www.anaconda.com/distribution/| Download page]] (the following libraries are already included) +**Slides**
-  Scikit-learn: python library with tools for data mining and data analysis [[http://scikit-learn.org/stable/ | Documentation page]] +
-  Pandas: pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language. [[http://pandas.pydata.org/ | Documentation page]]+
  
-  +The slides used in the course will be inserted in the calendar after each class. Some are part of the slides provided by the textbook'authors [[http://www-users.cs.umn.edu/~kumar/dmbook/index.php#item4|Slides per "Introduction to Data Mining"]].
-====== Class Calendar (2022/2023) ====== +
- +
-===== First Semester  ===== +
- +
-^ ^ Day ^ Topic ^ Learning material ^ References ^ Video Lectures ^ +
-|1.  |  15.09  11:00‑13:00 | Overview. Introduction to KDD   |{{ :magistraleinformatica:dmi:1-overview.pdf |}} {{ :magistraleinformatica:dmi:1-intro-dm.pdf |}}|Chap. 1 Kumar Book | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EShM9RVJWTNHq_V8gj-abVgB86BMP0QX-iNMLzrrNP_sPg?e=6uigHe|Video 1: Course Overview]];[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/ESccksnIm5lFp9ROw18otfsBBhl3Ybeus-OjqqMcl2xCmQ?e=kF1KNe|Video 2: Introduction DM]] (the recording of the Introduction had some audio issue so I published the part of the lecture of the a.y. 2021/22)| +
-|2.  |  16.09  09:00-11:00 | Data Understanding | {{ :magistraleinformatica:dmi:2-data_understanding.pdf |}} |Chap.2 Kumar Book and additioanl resource of Kumar Book:[[https://www-users.cs.umn.edu/~kumar001/dmbook/data_exploration_1st_edition.pdf|Exploring Data]] If you have the first ed. of KUMAR this is the Chap 3 | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EXYTYX_JE4dDsqGwmBTKyBcBCkd7IxaFhUTn-MRc7LyJlA?e=gfCgnI|Video 1: Data Understanding - Part 1]]; [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EeMWAfDY__BKvQ2RHadlmZsBgKCXuAV2mxjUiY3GuWK9Zg?e=7wIafq |Video 2: Data Understanding - Part 2]] | +
-|3.  |  21.09  09:00-11:00 | Data Understanding & Data Preparation |  {{ :magistraleinformatica:dmi:3-data_preparation.pdf |}} |Chap.2 Kumar Book and additioanl resource of Kumar Book:[[https://www-users.cs.umn.edu/~kumar001/dmbook/data_exploration_1st_edition.pdf|Exploring Data]] If you have the first ed. of KUMAR this is the Chap 3 | [[ https://unipiit.sharepoint.com/:v:/s/a__td_54794/EaqvlZGIKvdMi8j8r7TIbHkB76b2K8gMsEPVtDouO5waYw?e=IUYEb6 +
-|Video: Data Understanding & Data Preparation]] | +
-|4.  |  22.09   11:00-13:00 | Data Preparation + Data Similarities.|{{ :magistraleinformatica:dmi:4-data_similarity.pdf |}}       | Data Similarity is in Chap. 2  |[[ https://unipiit.sharepoint.com/:v:/s/a__td_54794/EarSFVCS5MFJnFSi9dMT2y0BJgqS_YIVLX9fenQV9GyrjQ?e=eZshby +
-|Video 1: Data Preparation + Data Similarities - Part 1]]; [[ https://unipiit.sharepoint.com/:v:/s/a__td_54794/EQMiVHPB8hlKuZ7ntw8Km-IBe4HtW_hz5VvYefLvHDfDLQ?e=bKJOn7|Video 2: Data Preparation + Data Similarities - Part 2]]   | +
-|5.  |  23.09  09:00-11:00 | Introduction to Clustering. Center-based clustering: kmeans| {{ :magistraleinformatica:dmi:5-basic_cluster_analysis-intro.pdf |}}  {{ :magistraleinformatica:dmi:6.1-basic_cluster_analysis-kmeans.pdf |}} | Clustering is in Chap. 7  |[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EV-fDd75MIxGmazA79kFHCYBI78yYwqy7AFE5h9MN2rRqg?e=YVgdjS|Video 1: Introduction to Clustering + K-means - Part 1]];[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/ETySd1UWIzxCoAKilzaXO_MBW8oXZZCjf5FEhyywGIdJBg?e=Xq2jdo|Video 2: Introduction to Clustering + K-means - Part 2]]] | +
-|6.  |  28.09  09:00-11:00 | Python Lab: Data Understanding & Data Preparation |{{ :undefined:dataund.zip | Notebook DU tips}} | | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/Eb0GU3ScaudIuh3kmNVw5_EBvgFRME5hnkOyZCetW55vwg?e=fDgxnE|Video 1: Python Lab: DU - Part1]];[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EfTSxKrV45lIhWCySWDflM4BXjD7WSKj6X3Se5Dv7UEb2Q?e=vf2E2h|Video 2: Python Lab: DU - Part2]]| +
-|7.  |  29.09  11:00-13:00 | Hierarchical clustering | {{ :magistraleinformatica:dmi:7.basic_cluster_analysis-hierarchical.pdf |}}| | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EcBupaEx_HNFiEMYa_dQ2m8BNe25abzcmTKZ3JrVOtibCQ?e=SSgN9T| Video: Project Description + Hierarchical Clustering]]| +
-|  |  30.09  09:00-11:00 | Lecture Canceled | | | | +
-|8.  |  05.10  09:00-11:00 | Density based clustering. Clustering validity. | {{ :magistraleinformatica:dmi:8.basic_cluster_analysis-dbscan-validity.pdf |}} | Chap. 7 Kumar Book  |   | +
-|9.|   06.10  11:00-13:00 | Center-based clustering: Bisecting K-means, Xmeans, EM       | {{ :magistraleinformatica:dmi:6.2-basic_cluster_analysis-kmeans-variants.pdf |}} | Chap. 7 Kumar Book, {{ :magistraleinformatica:dmi:clusteringmixturemodels.pdf |}} {{ :magistraleinformatica:dmi:xmeans.pdf |}}| [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EQTbbvqF2kJOgEsFQ1WF48cBjWf2wgTCbOjxcQzn9MyVzw?e=KQ7gEZ|Video 1: Center-based clustering - Bisecting K-means, Xmeans, EM ]]; [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EYc-39UkUhlCsL_huteYf7YBJjTruY207hGwBJmBRobACg?e=ixfbJ2|Video 2: Clustering Lab.]] | +
-|10.|   07.10  09:00-11:00 |Python Lab - Clustering|  {{ :magistraleinformatica:dmi:tips_clustering.ipynb_complete.zip | Notebook CLustering Tips}}    |[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EUl3UMFUoixPo9FzX7nC9e4BWS1SFpIGPVdZbwyfdZBgCw?e=BexKor|Video: Clustering Lab. - Part2]] | +
-|11.|   12.10  09:00-11:00 |Classification Problem. Decision Trees|  {{ :magistraleinformatica:dmi:9.chap3_basic_classification-2022.pdf |}}  | Chap. 3 Kumar Book | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EWVzwCQWC7BKmRlG69Regg4BEmeqRwin9GZ0VJIcV_wtsw?e=YDkrrf| Video Lecture - Part1]]; [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EQUgcfNC2XdPn6QHy3rlgJcBO9UTsgmoMYFJbhP8vM5UIA?e=uWDpVA|Video Lecture - Part 2]]| +
-|12.|   13.10  11:00-13:00 |Decision Trees & Classifier Evaluation|  same slides previous lecture | Chap. 3 Kumar Book | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EaJcTBiLgh1DiErGVCZAovoBlLOaHuCrabxtNOTqXYRg-A?e=IQJDGE|Video Lecture - Part 1]] [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EbXLwpQCdNNMt4ClRGPAivMBFJa-qoH8N9TKpp9OgD8mlw?e=TjOAsx|Video lecture - Part 2]]| +
-|13.|   14.10  09:00-11:00 |Classifier Evaluation|  same slides previous lecture | Chap. 3 Kumar Book |[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EXT98sRBqL9Dpf-pAzfUnkIBy2zDxh86kI2D8ouBXH1zxQ?e=gw3zzM|Video Lecture]] | +
-|14.|   19.10  09:00-11:00 |Rule based Classifiers |  {{ :magistraleinformatica:dmi:10-rule-based-clussifiers-2022.pdf |}}{{ :magistraleinformatica:dmi:10-knn-2022.pdf |}} | Rule based classifiers: Chap. 5.1, KNN: Chap. 4.2 - Kumar Book | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EYWGSnZlI1BMr8CO6QZDMJEBdKQhL5_GAx0YqMAgR-49Fg?e=JSvQPJ|Video 1: Rule based classifiers]][[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EYWGSnZlI1BMr8CO6QZDMJEBdKQhL5_GAx0YqMAgR-49Fg?e=1qMLMf|Video 2: KNN]] | +
-|15.|   20.10  11:00-13:00 |DT - simulation of the learning algorithm |  {{ :magistraleinformatica:dmi:2021-dt-ex.pdf | DT Exercise}}|  | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EaFgtNLiCONAnV6NF7DpZioB8W09o2HQcxJsSzvm65Pb_w?e=SvNSgj|Video 1: DT-EX]]; [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EaOBQeDA_QtLtRYQVHoDFgABilFcAgpYsm01aUdUNjvBGA?e=bFpIpz|Video 2: DT-EX]]| +
-|16.|   21.10  09:00-11:00 |Naive Bayesian Classifier. SVM. Ensemble Classifiers | {{ :magistraleinformatica:dmi:11_2022-naive_bayes.pdf |}} {{ :magistraleinformatica:dmi:14_svm_2022.pdf |}} {{ :magistraleinformatica:dmi:13_ensemble_2022.pdf |}}| Chap. 4 - Kumar Book |[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EX8DmHrg7d5HgaOstPoOjNcBwtfdTk9vbNoCeLQZSAWXYA?e=l3p65j|Video1]]; [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EeVLe8Gr5-lFtAwfPipFTSQB95Wo0HzeLvo2O9aAN3a8_w?e=V2EmcA|Video2]]| +
-|17.|   26.10  09:00-11:00 |Ensemble Classifiers + NN Classifiers + Project Discussion| same slides of the previous lecture  | Chap. 4 - Kumar Book  | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EZWw_m9HxzlKlIss5vsPeIgBl70TAqJW_I0j5x6rtkTOZg?e=kPTZFu|Video1]]| +
-|18.|   27.10  11:00-13:00 | NN Classifiers + Python Lab: Classification| {{ :magistraleinformatica:dmi:15_neural_networks_2021.pdf |}} {{ :magistraleinformatica:dmi:adult_classification_2021.ipynb.zip | Classificaton Notebook}} {{ :magistraleinformatica:dmi:adult.data.zip | Adult Dataset}}  | | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EVt50MbiGTpKsiO_t7WIeP0B8_ocGvnD7zEeUyRQ_d5wwQ?e=squ80v|Video1]]; [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EcN1nMk11VBKgLbOg-2KWakBStYobApkO45gI9FZg1E_2Q?e=WP2MqH|Video2]]| +
-|19.|   28.10  09:00-11:00 |Python Lab: Classification | {{ :magistraleinformatica:dmi:adult_classification_2021.ipynb.zip | Classificaton Notebook}} (same as previous lecture) |  | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EVBxay2OtPhLmVgn658oHqEBMyVzNKp2Ju5aMq3ecTrOZw?e=e2uONt|Video]] | +
-|20.|   02.11  09:00-11:00 |Python Lab: NN & Imbalanced Classification | |  | Unfortunately Video is not available for technical issues +
-|21.|   03.11  11:00-13:00 | Association Rule Mining| {{ :magistraleinformatica:dmi:17_association_analysis2021.pdf |}} | Chap. 5 - Kumar Book | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EWKi0iWArRhAgi8T5pKNsiMB5llNPPF5xnvsnTKmw3c97Q?e=6whkRx|Video]] +
-|22.|   04.11  09:00-11:00 | FP-Growth - Sequential Pattern Mining | {{ :magistraleinformatica:dmi:17_2021-fp-growth.pdf |}} {{ :magistraleinformatica:dmi:18_sequential_patterns_2021.pdf |}}|Chap. 5 &  Chap. 6 - Kumar Book | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EajQMV7eT3VOnZzh698x08UBKIMNqNdpOFep_1l-43Iprw?e=hR8G5L|Video1]];[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EfZbzQAqsCBFrHP4MHpx5twBKsoOhZtF6GJ7dKAaYxxWzg?e=ceNFjT|Video2]] | +
-|23.|   09.11  09:00-11:00 | Sequential Pattern Mining. Intro to Time Series|Slides on SPM (see previous lecture) |  |  [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/ERRAWko36o1KroWob_cbUMoBZ7wgxVU3NbQK_Yz-jZadog?e=sbOrJR|Video1]];[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EQy8x2XPoEdBgq-bZWa6DdoBLGwHiWm-4xuzkMuZaGDaRg?e=8iIh7e|Video2]] | +
-|24.|   10.11  11:00-13:00 | Time Series Similarities| {{ :magistraleinformatica:dmi:22_time_series_similarity_2022.pdf |}}  | [[https://cs.gmu.edu/~jessica/BookChapterTSMining.pdf| Overview on Time Series]] | [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/ER3cHQ0J4YBAsZqo0xzOz4UBefOxnjGELu9SuuMTfo95Eg?e=00wnp0|Video]] | +
-|25.|   11.11  09:00-11:00 | Time Series Transformations - Clustering - Classification|  Slides on transformations (previous lecture) {{ :magistraleinformatica:dmi:23_time_series_motif-2022_2.pdf |}}| |[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/Eff6vI08BjNKkDogZSY6p2kBVNrltI4bZM1d-d8TE0bCdw?e=eqbHKc|Video]]| +
-|26.|   18.11  09:00-11:00 | Shapelets & Motif. Lab: Association Rules|  Slides on shapelets & motif (previous lecture) {{ :magistraleinformatica:dmi:arm-spm.zip |}} | {{ :magistraleinformatica:dmi:matrixprofile.pdf |}}  [[https://www.cs.ucr.edu/~eamonn/MatrixProfile.html|Papers on Matrix Profile]]{{ :magistraleinformatica:dmi:shaplet.pdf |}}|[[https://unipiit.sharepoint.com/:v:/s/a__td_54794/ETW4Mo2LBm9CkSPZljVqplsBxOpJuXJxA5nugiKh-eRZ4Q?e=zjMzVY |Video 1: Shapelets & Motif]]; [[https://unipiit.sharepoint.com/:v:/s/a__td_54794/EaH053-TvQpAkdacrfGUDxYBtVPbFdNYJ8IMyhKyKpK3uA?e=xUYRB6|Video 2: Lab ARM]] | +
-|27.| 23.11  09:00-11:00 | Python: Sequential Pattern Mining & Time Series | For SPM see notebooks of previous lecture. {{ :magistraleinformatica:dmi:timeseries-py.zip |}}|  |  [[https://unipiit.sharepoint.com/:v:/s/Registrazioni628/EdPYuw_W-y9In9AeoHJm1a0BdBeUYhPb6z_3VFZDR9TMCQ?e=bAHAC1|Video]]| +
-|28.| 24.11  11:00-13:00 | Python: Time Series. Ethics & Privacy| {{ :magistraleinformatica:dmi:19_ethics_privacy2021.pdf |}} |  | [[https://unipiit.sharepoint.com/:v:/s/Registrazioni628/EQCiARRNIYtNsvZCucPyX30BvP-KDZBXEMcpYFVeS107eQ?e=Jor8lE|Video 1]]; [[https://unipiit.sharepoint.com/:v:/s/Registrazioni628/EeM4qF4IbRVHrQEyEJFhfRYB3HnapYKARglqiZdGfBgEGA?e=1ah2IV| Video 2]]| +
-|29.| 25.11  09:00-11:00 | Privacy  | same slides off the last lecture |  | [[https://unipiit.sharepoint.com/:v:/s/Registrazioni628/ETMsw1ltw1RMhxXTIRLrb18BTNOx5koRTvQllhM295EDgg?e=wEgRnp|Video]]| +
-|30.| 30.11  09:00-11:00 |Explainability | {{ :magistraleinformatica:dmi:20_explainability_2021.pdf |}} |  | [[|Video]]| +
-|31.| 01.12  11:00-13:00 |Anomaly Detection + Python: XAI | {{ :magistraleinformatica:dmi:lezione-xai.zip |XAI Notebook}} |  | [[|Video]]| +
-|32.| 02.12  09:00-11:00 |Python: XAI + AD|  {{ :magistraleinformatica:dmi:anomalydetection-1.ipynb.zip | Anomaly Detection Notebook}}|  | [[https://unipiit.sharepoint.com/:v:/s/Registrazioni628/EUuzBGfYmYVCqUDk-1GkWvsBWL_QiaRRjUUu5Yj4YLGr7Q?e=VDFf7x|Video]]| +
-|33.| 07.12  09:00-11:00 |Paper Presentation|  |  | | +
-|34.| 09.12  09:00-11:00 |Paper Presentation|  |  | | +
-|35.| 14.12  09:00-11:00 |Paper Presentation|  |  | | +
-|36.| 15.12  11:00-13:00 |Paper Presentation|  |  | |+
  
  
 +  
 +**Software**
  
 +Software material available in the [[https://github.com/data-mining-UniPI/teaching23|Github repository]] (available in the coming days).
  
 + 
 +====== Class Calendar (2024/2025) ======
  
 +===== First Semester  =====
  
 +^ ^ Day ^ Topic ^ Teaching material ^ References ^ Video Lectures ^ Teacher ^
 +|    |  17.09  | Candeled    |   |
 +|1.  |  19.09  | Overview. Introduction to KDD    |Chap. 1 Kumar Book | | |
  
 +  
 ====== Exams ====== ====== Exams ======
-**Project **+TBD
  
-A project consists in data analyses based on the use of data mining tools.  +====== Previous years ===== 
-The project has to be performed by a team of 3 students. It has to be performed by using Python. The guidelines require to address specific tasks. Results must be reported in a unique paper. The total length of this paper must be max 25 pages of text including figures. The students must deliver both: paper (single column) and  well commented Python Notebooks.+[[DM-INF 2023-2024]]
  
-  * First part of the project consists in the **assignments** described here: {{ :magistraleinformatica:dmi:projectdescriptiondm2022-new.pdf | Project Description}} +[[DM-INF 2022-2023]]
-  - **Dataset:[[https://unipiit.sharepoint.com/:u:/s/a__td_54794/ERsHd0L8ZbtCvAjWsHdzmfkBb-B2EvkiQLU09e22b0xsTQ?e=VfSaNW|Twitter Data]]**  +
-  - **Deadline**: the fist part has to be delivered within  November <del>5th 2022</del> 12, 2022. Send an email to: anna.monreale@unipi.it, francesca.naretto@sns.it, lorenzo.mannocci@phd.unipi.it +
-  +
-  * Second part of the project consists in the assignment described here: {{ :magistraleinformatica:dmi:project_description_dm2022-updated.pdf |Updated Project Description}} +
-     - **Deadline**: Jan 8, 2023+
  
-  * Third part of the project consists in the assignment described here: {{ :magistraleinformatica:dmi:project_description_dm2022-complete.pdf |Complete Project Description}} 
-   - Note that the document contains also rules for the delivery and final exam! 
-   - **Deadline**:   Jan 8, 2023 
- 
- 
-**Students who did not deliver the above project within **Jan 8, 2023** need to ask by email a new project to the teachers. The project that will be assigned will require about 2 weeks of work and after the delivery it will be discussed during the oral exam. ** 
- 
-** Paper Presentation (OPTIONAL)** 
- 
-Students need to present a research paper (made available by the teacher) during the last week of the course. This presentation is OPTIONAL: Students that decide to do the paper presentation can avoid the oral exam with open questions. They only need to present the project (see next point). The paper presentation can be done by the group or by a single person. 
- 
-**Oral Exam** 
-  * **Project presentation** (with slides) – 10-15 minutes: mandatory for all the students 
-  * ** Open questions ** on the entire program: optional only for students opting for paper presentation. 
-  
-  
- 
- 
- 
-====== Previous years ===== 
 [[DM-INF 2021-2022]] [[DM-INF 2021-2022]]
  
magistraleinformatica/dmi/start.1670595302.txt.gz · Ultima modifica: 09/12/2022 alle 14:15 (22 mesi fa) da Anna Monreale

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki