magistraleinformatica:ir:ir21:start

Differences

These are the differences between the selected revision and the current version of the page.

magistraleinformatica:ir:ir21:start [23/11/2021 at 15:20 (2 years ago)]
Paolo Ferragina [Lectures]
magistraleinformatica:ir:ir21:start [18/08/2022 at 08:16 (20 months ago)] (current version)
Paolo Ferragina [Information Retrieval - Academic Year 2020/2021]
Line 1: Line 1:
-====== Information Retrieval - Academic Year 2020/2021 ======
+====== Information Retrieval - Academic Year 2021/2022 ======
 
 ====== General Information ======
Line 42: Line 42:
 ^ Date  ^ Room  ^ Text ^ Notes |
 | 09/11/21, start at 09:00 | room A1 and virtually | {{ :magistraleinformatica:ir:ir21:ir211109.pdf |text}}, {{ :magistraleinformatica:ir:ir21:informationretrieval-2022-comp1.pdf |results}}, {{ :magistraleinformatica:ir:ir21:ir_2021_11_09_tutto_.pdf |solution}} | The **midterm exam** will consist of a set of exercises and will last 45 minutes. The part of the program covered by the exercises is detailed in the list of lectures below.   |
-| 14/12/2021, start at 09:00 | room C and virtual | text, results, solution | The **FinalTerm exam** will have the same structure as the midterm, but only students who scored >=18 in the first MidTerm may participate. |
+| 14/12/2021, start at 09:00 | room C and virtual | {{ :magistraleinformatica:ir:ir21:ir211214.pdf |text}}{{ :magistraleinformatica:ir:ir21:results-finalterm-2021.pdf |results}}{{ :magistraleinformatica:ir:ir21:ir_14.12.2021_soluzione_.pdf |solution}} | The **FinalTerm exam** will have the same structure as the midterm, but only students who scored >=18 in the first MidTerm may participate. Students have to register at the [[https://forms.office.com/r/gFmFVWVWzp|following form]] by December 7th, 2021.\\ The oral exam will take place remotely on December 20th, starting at 9:00, in the Teams room of the course. |
-| 17/01/2022, start at 09:00 | room C1 | text, results, solution |  |
+| 17/01/2022, start at 09:00 | room C1 | {{ :magistraleinformatica:ir:ir21:ir220117.pdf |text}}{{ :magistraleinformatica:ir:ir21:ir-jan2022.pdf |results}}{{ :magistraleinformatica:ir:ir21:ir220117_soluzione_.pdf |solution}} |  |
-| 07/02/2022, start at 09:00 | room C1 | text, results, solution |  |
+| 07/02/2022, start at 09:00 | room A1 | {{ :magistraleinformatica:ir:ir21:ir220207.pdf |text}}{{ :magistraleinformatica:ir:ir21:ir090222_res.pdf |results}}{{ :magistraleinformatica:ir:ir21:ir220207_soluzione_.pdf |solution}} |  |
+| 13/06/2022, start at 09:00 | room L1 | {{ :magistraleinformatica:ir:ir21:ir220613.pdf |text}} |  |
  
 ====== Materials for study ======
Line 55: Line 56:
 ====== Lectures ======
  
-Students that are not able to attend the lectures, can refer to the [[https://web.microsoftstream.com/group/d6ebab24-d618-46c4-93d7-ced9fb302f65|video-lectures of the last academic year]] of the last academic year (click on “Videos”, and then sort them by name). For the video-lectures of this year, please look at the following agenda. \\
+For the video-lectures of this year, please look at the following agenda.  [[https://web.microsoftstream.com/group/d6ebab24-d618-46c4-93d7-ced9fb302f65|Video-lectures of the last academic year]] are available too (click on “Videos”, and then sort them by name). \\
  
 ^ Date         ^ Argument ^ Refs  ^
Line 77: Line 78:
 | 16.11.2021 | Posting list compression, codes: gamma, variable bytes (t-nibble), PForDelta, Elias-Fano indexing. Text-based ranking: dice, jaccard, tf-idf. Vector space model and cosine similarity doc-doc and query-doc. | Sect. 5.3 of [MRS] and {{:magistraleinformaticanetworking/ae/ae2014/chap_9.pdf|Ferragina's notes}} (only the coders presented in class).\\ {{ :magistraleinformatica:ir:ir21:lect_11_-_compression_integers_new_.ppt |Slides}} and [[https://unipiit.sharepoint.com/sites/a__td_50472/Shared%20Documents/General/Recordings/Lezione%20del%202021_11_16-20211116_092024-Meeting%20Recording.mp4?web=1|Video]]. |
 | 22.11.2021 | Storage of tf-idf and use for computing document-query similarity. Fast top-k retrieval: high idf, champion lists, many query-terms, fancy hits, clustering. | Sect 6.2 and 6.3 and 7 from [MRS].\\ [[https://www.dropbox.com/s/iyrlc81wuzbtewu/lect%2012-text%20ranking.ppt?dl=0|Slides]] |
-| 23.11.2021 | Exact Top-K: WAND and blocked-WAND. Relevance feedback, Rocchio, pseudo-relevance feedback, query expansion. Performance measures: precision, recall, F1 and user happiness. | Chap 8 and 9 [MRS].\\ [[https://unipiit.sharepoint.com/sites/a__td_50472/Shared%20Documents/General/Recordings/Lezione%20del%202021_11_23-20211123_090451-Meeting%20Recording.mp4|Video]] |
-
-| 00.00.0000 | Random Walks. Link-based ranking: pagerank, topic-based pagerank, personalized pagerank. | Chap 21 of [MRS]. Slides |
-| 00.00.0000 | HITS. Projections to smaller spaces: Latent Semantic Indexing (LSI). | Chap 18 from [MRS].\\ Slides. |
-| 00.00.0000 | Lecture on GraphDBs or on Elastic Search |  |
-
-| 14.12.2021 | **FinalTerm exam**. Topics will be the ones that we have dealt with after the MidTerm exam. Students have to register at the [[https://forms.office.com/r/gFmFVWVWzp|following form]] by December 7th, 2021. |  |
+| 23.11.2021 | Exact Top-K: WAND and blocked-WAND. Relevance feedback, Rocchio, pseudo-relevance feedback, query expansion. Performance measures: precision, recall, F1 and user happiness. | Sect 8.1-8.3 and 9 [MRS].\\ [[https://unipiit.sharepoint.com/sites/a__td_50472/Shared%20Documents/General/Recordings/Lezione%20del%202021_11_23-20211123_090451-Meeting%20Recording.mp4|Video]] |
+| 29.11.2021 | Random Walks. Link-based ranking: pagerank, topic-based pagerank, personalized pagerank. Application to Text Summarization. | Chap 21 of [MRS]. [[https://www.dropbox.com/s/mb6y2k93lba9j10/lect%2013-Web%20ranking.ppt?dl=0|Slides]] and [[https://unipiit.sharepoint.com/sites/a__td_50472/Shared%20Documents/General/Recordings/Lezione%20del%202021_11_29-20211129_110458-Meeting%20Recording.mp4|video]] |
+| 30.11.2021 | HITS. Projections to smaller spaces: Latent Semantic Indexing (LSI). Sketch of the ideas underlying Entity Linkers and Knowledge Graphs. | Chap 18 from [MRS].\\ [[https://www.dropbox.com/s/ylcnklittbc4wne/lect%2014-LSI%20and%20random%20proj%20-%20shorter.ppt?dl=0|Slides]] and [[https://unipiit.sharepoint.com/sites/a__td_50472/Shared%20Documents/General/Recordings/Lezione%20del%202021_11_30-20211130_090133-Meeting%20Recording.mp4?web=1|video]]. |
+| 06.12.2021 | Elastic Search, with lab: students are required to bring their own laptop to class, with [[https://docs.docker.com/get-docker/|Docker]] already installed and the [[https://www.elastic.co/guide/en/elasticsearch/reference/current/docker.html|ElasticSearch]] image already downloaded via **docker pull** (i.e. the first step of "Pulling the image"). | |
+| 07.12.2021 | Exercises | [[https://unipiit.sharepoint.com/sites/a__td_50472/Shared%20Documents/General/Recordings/Lezione%20del%202021_12_07-20211207_090823-Meeting%20Recording.mp4?web=1|Video]] |
+| 13.12.2021 | Exercises | [[https://unipiit.sharepoint.com/sites/a__td_50472/Shared%20Documents/General/Recordings/Meeting%20del%202021_12_13-20211213_110529-Meeting%20Recording.mp4?web=1|Video]] |
+| 14.12.2021 | **FinalTerm exam**. Topics will be the ones that we have dealt with **after** the MidTerm exam. Students have to register at the [[https://forms.office.com/r/gFmFVWVWzp|following form]] by December 7th, 2021. |  |
  
magistraleinformatica/ir/ir21/start.1637680850.txt.gz · Last modified: 23/11/2021 at 15:20 (2 years ago) by Paolo Ferragina