====== Text Analytics (635AA) A.Y. 2022/23 ======

^ Date ^ Lecture ^ Slides ^ Material / Reference ^
| 2022/09/16 | Introduction to the course, NLP & Text Analytics. | [[https://drive.google.com/file/d/1wc6yvn6Y5QrFXyFw53xeB4M6MsMmWssS/view?usp=sharing| 1 - Introduction to the Text Analytics course]] | J. Eisenstein. Introduction to Natural Language Processing. MIT Press. [[https://drive.google.com/file/d/17T4zo2uGssKBa_MrHsLW-uSmyP_ZJvpj/view?usp=sharing| Chp. 1]]. |
| 2022/09/19 | Review of Probability. Language and Probability. | [[https://drive.google.com/file/d/1-exk-JS0_Oa3Eg1ApTGlxonQlL3KbTQG/view?usp=sharing| 2 - Reminds on Probability]] | |
| 2022/09/23 | Introduction to Python. | [[https://drive.google.com/file/d/1lpyA0N4K0d0ZTrJgokot1NwC_w4HG6gG/view?usp=sharing| 3 - Introduction to Python]] | [[https://drive.google.com/file/d/1BubwKtByCankjnbClWErvSsw9EjCLnte/view?usp=sharing|Introduction to Python - Notebook.]] |
| 2022/09/30 | Introduction to Python (continued). Project Presentation and Important Dates. | [[https://drive.google.com/file/d/1FjCYvOkZDWomEsJuXD32Vl_155kxnKik/view?usp=sharing|Project and Dates]] | |
| 2022/10/03 | Probabilistic Language Models. | [[https://drive.google.com/file/d/1B5HfPtPgK41Ig_NWrPim6YxK3mCF-XSj/view?usp=sharing| 5 - Probabilistic Language models]] | D. Jurafsky, J.H. Martin. [[https://drive.google.com/file/d/1OXSjwE0-ZN6DZ4MELOMp8JVy-tP2_4Iw/view?usp=sharing| Chp. 3]]. [[https://drive.google.com/file/d/1osuyJi5ZbBMghOrQz_IVqMsfxi2-1Vzj/view?usp=sharing| Probabilistic Language Models - Notebook]]. |
| 2022/10/14 | Text Indexing: Collocations with Gensim. Stanza. spaCy. Feature selection. | [[https://drive.google.com/file/d/13RDX2D2m8Bhkv0_qddvpWndBYQWoKYpY/view?usp=sharing| 6 - Text Indexing-3]] | [[https://drive.google.com/file/d/12L7nHe9TvZJPSS4RaiyGyIaPnx8cXkrN/view?usp=sharing| L6.3.4 - collocations - stanza - spacy - Notebooks]]. |
| 2022/10/17 | Text Indexing: Vector space models. | [[https://drive.google.com/file/d/1AhhYq-1mCGqtVcUnvoiAb7c2WYs4CSm2/view?usp=sharing| 6 - Text Indexing-4]] | D. Jurafsky, J.H. Martin. [[https://drive.google.com/file/d/1A1aKTIQh8CnEU8QBkmet1iADpTBAdUHR/view?usp=sharing| Chp. 6]]. [[https://drive.google.com/file/d/1dyn540ISuJ8wMlBkUoFH5J9ctNIcHj54/view?usp=sharing| L6.5 - Vector space model - toy example - Notebook]]. |
| 2022/10/21 | Machine Learning for Text Analytics. | [[https://drive.google.com/file/d/1eHQR4GhtPjgN7muIRLfyQBXcgM4oQmUK/view?usp=sharing| 10 - Machine Learning for Text Analytics]] | |
| 2022/10/24 | Student project presentations: proposal, brainstorming, discussion. | | |
| 2022/10/28 | Student project presentations: proposal, brainstorming, discussion. | | |
| 2022/11/04 | Machine Learning for Text Analytics. Experiments and Practice. | [[https://drive.google.com/file/d/1HXC4pHde9D7bYYAw4vM6ihS2u7kChgO8/view?usp=share_link| 13 - Experiments]] | [[https://drive.google.com/file/d/1xbRmZ-HudXRIqBbQDpXjOlUNyrq7-yop/view?usp=share_link| Classification sklearn - Notebook.]] |
| 2022/11/07 | Topic Modeling. | [[https://drive.google.com/file/d/1ytnJjLHtLT97gCNbzBp_I2TCcY7bfjMN/view?usp=share_link| 14 - Topic modeling]] | Zhai and Massung (2016) Text Data Management and Analysis. [[https://drive.google.com/file/d/1iJ71WZIpWP-cWxLtvsf5L4vp_epJH0uV/view?usp=share_link| Chp 17]]. [[https://drive.google.com/file/d/1fKpyNYs9kNlPJpiYiDkzO_j6TkyHM8sS/view?usp=share_link| Topic Modeling - Notebooks.]] |
| 2022/11/11 | A primer on Neural Networks. | [[https://drive.google.com/file/d/1_snMjfUb1z5YLBEHft6HJo65w4EWYD-v/view?usp=share_link|15 - A Primer on Neural Networks]] | |
| 2022/11/14 | A primer on Neural Networks (continued). Practice. | | [[https://drive.google.com/file/d/1UEKJ_E1hD92E4OPw5HUOhvf1NKxxTR2T/view?usp=share_link| From SVM to NN, Classification with Keras - Notebooks.]] |
| 2022/11/18 | Neural Language Models. Word2vec. | [[https://drive.google.com/file/d/1Juf8aMqg_c5wW1KvQxfzvz2A6diV4a4A/view?usp=share_link| 17 - Neural Language Models-1]] | [[https://drive.google.com/file/d/1ffEsnsmb_o3iX9YBkS095UMPrMOMGrdO/view?usp=share_link|Word2vec with Gensim - Notebook.]] |
| 2022/11/21 | Neural Language Models. Doc2vec. Transformer. BERT. | [[https://drive.google.com/file/d/10_VjJacKzajp7yNuSOhZo-nNUJjChkcN/view?usp=share_link| 18 - Neural Language Models-2]] | D. Jurafsky, J.H. Martin. Chps. [[https://drive.google.com/file/d/14oI6vsl4KCpGyamBbVjeYPTuzWOSNEtV/view?usp=share_link|7]], [[https://drive.google.com/file/d/1wonZ08i0etFhEMSjQEVU2vKf6UyUjEHb/view?usp=share_link|9]], [[https://drive.google.com/file/d/1BsCfRzp3t6xAe4GfUTZbHfBujXyxtdAA/view?usp=share_link|11]]. [[https://drive.google.com/file/d/1hs6ffqsn1gLM6RXSsFcTYjfh-AjFYXDu/view?usp=share_link|Doc2vec with Gensim - Notebook.]] |
| 2022/11/25 | Seminar (Alessandro Bondielli). Evaluating strategies for Automatic Profiling of Résumés. | | [[https://drive.google.com/file/d/1dopSg44-kSGhIo3nv2wLFis7IRePp6xM/view?usp=share_link|A case study.]] |
| 2022/12/02 | Student project presentations: ongoing experiments. Discussion. | | |
| 2022/12/05 | Student project presentations: ongoing experiments. Discussion. | | |
| 2022/12/09 | Fine-tuning BERT. Advanced applications (Conversational Agents, Affective Computing). | [[https://drive.google.com/file/d/1RdiNnhM5he2ZIfLBFZ-dPdhlDZMZEHjO/view?usp=share_link| 22 - Advanced applications]] | [[https://drive.google.com/file/d/1q7ZsRYoA4fL4e0VRytezq1-b6s-FpkJD/view?usp=share_link|BERT finetune - Notebooks]]. Recommended chapters: D. Jurafsky, J.H. Martin. [[https://drive.google.com/file/d/1BWfVPq4HiTWzvUHGaqaEkJEWieQxhf-g/view?usp=share_link|20]]; [[https://drive.google.com/file/d/148pdYBYtUCwCHR349-HDjEMJMS11ONDi/view?usp=share_link|24]]. |

==== Exam ====

The exam for non-attending students will consist of a written exam with open questions and exercises, and an oral discussion on the topics of the course.

Written test [[https://drive.google.com/file/d/1Q-NVz_x-UjllTG-CPAKGV4aKmK4Hz5af/view?usp=share_link|example]].