Entrambe le parti precedenti la revisioneRevisione precedenteProssima revisione | Revisione precedente |
magistraleinformatica:ir:ir10:start [16/12/2010 alle 17:55 (14 anni fa)] – [Content of the Lectures] Paolo Ferragina | magistraleinformatica:ir:ir10:start [28/11/2011 alle 18:54 (13 anni fa)] (versione attuale) – [Exam] Paolo Ferragina |
---|
===== General Information ===== | ===== General Information ===== |
* ** Teacher **: [[http://www.di.unipi.it/~ferragin/|Paolo Ferragina]] | * ** Teacher **: [[http://www.di.unipi.it/~ferragin/|Paolo Ferragina]] |
* **Course ID**: 346AA | * **Course ID**: 289AA |
* **CFU:** 6 (first semester) | * **CFU:** 6 (first semester) |
* **Language:** English | * **Language:** English |
Project assigned during the course, plus an oral discussion concerning with the project and the course material. | Project assigned during the course, plus an oral discussion concerning with the project and the course material. |
| |
Exam dates: February 1 and 16, time slot 9-11, Room A and C at Polo Fibonacci. | ^ Date ^ Text of the exam ^ |
| | 01/02/2011 | {{:magistraleinformatica:ir:ir10:ir110201.doc|text}} | |
| | 21/02/2011 | {{:magistraleinformatica:ir:ir10:ir110221.doc|text}} | |
| | 24/06/2011 | {{:magistraleinformatica:ir:ir10:ir110624.doc|text}} | |
| | 20/07/2011 | {{:magistraleinformatica:ir:ir10:ir110720.doc|text}} | |
| | 01/09/2011 | {{:magistraleinformatica:ir:ir10:ir110901.doc|text}} | |
===== Books, notes, ... ===== | ===== Books, notes, ... ===== |
| |
| 06/12/2010 | Scoring, term weighting and the vector space model. Relevance feedback and pseudo-relevance. Query expansion. top-k retrieval | chap 6, 7 and 9 in [MRS] | | | | 06/12/2010 | Scoring, term weighting and the vector space model. Relevance feedback and pseudo-relevance. Query expansion. top-k retrieval | chap 6, 7 and 9 in [MRS] | | |
| 09/12/2010 | Zone indexes. Recommendation Systems (sketch). Quality of the results: Precision, Recall, F-measure. | chap 8 in [MRS] and {{:magistraleinformatica:ir:ir10:11._tfidf.ppt|slides}} | | | | 09/12/2010 | Zone indexes. Recommendation Systems (sketch). Quality of the results: Precision, Recall, F-measure. | chap 8 in [MRS] and {{:magistraleinformatica:ir:ir10:11._tfidf.ppt|slides}} | | |
| 13/12/2010 | Lucene in action. | {{:magistraleinformatica:ir:ir10:lucene_in_action.ppt|slides}} and [[ http://lucene.apache.org/java/docs/ | web site]] | Ugo Scaiella | | | 13/12/2010 | Lucene in action. | {{:magistraleinformatica:ir:ir10:lucene_in_action3.ppt|slides}} and [[ http://lucene.apache.org/java/docs/ | web site]] | Ugo Scaiella | |
| 16/12/2010 | Self-evaluation at home on Lucene: a small project. | | | | | 16/12/2010 | Self-evaluation at home on Lucene: a small project. | | | |
| 20/12/2010 | Self-evaluation at home on Lucene: a small project. | | | | | 20/12/2010 | Self-evaluation at home on Lucene: a small project. | | | |
| 10/01/2011 | The Web Graph: Properties, storage (compression). | 19.1-19.2 and 20.4 in [MRS] | | | | 10/01/2011 | Web search-engines: Web features and structure, advertising (sketch), crawling, consistent hash. | 19.1-19.4 in [MRS], also this {{:magistraleinformatica:ir:ir10:capitolo5.pdf|note}}, and chap 20 | | |
| | **Lab:** the Web-Graph library | [[http://webgraph.dsi.unimi.it/ | web site]] | Ugo Scaiella | | | 13/01/2011 | Web search-engines: connectivity server, doc-duplicate detection (shingling and min-hash) and link-based ranking. | 19.6 and 21 in [MRS], {{:magistraleinformatica:ir:ir10:12._websearch-esteso.ppt|slides}} | | |
| 13/01/2011 | Web search-engines: structure, crawling, link-based ranking. | chap 20 and 21 in [MRS] | | | | 17/01/2011 | Text Classification via Supervised Learning: definition and applications. | | [[http://nmis.isti.cnr.it/sebastiani/ | Fabrizio Sebastiani]] | |
| 6 | Automated text categorization | [[http://nmis.isti.cnr.it/sebastiani/Publications/ACMCS02.pdf | paper]] | [[http://nmis.isti.cnr.it/sebastiani/ | Fabrizio Sebastiani]] | | | 20/01/2011 | Automated text categorization: the machine learning approach, the indexing step. | | [[http://nmis.isti.cnr.it/sebastiani/ | Fabrizio Sebastiani]] | |
| | 24/01/2011 | Automated text categorization: Methods for the inductive construction of a classifier | {{:magistraleinformatica:ir:ir10:slidescorsoferraginashort.pdf|slides}} | [[http://nmis.isti.cnr.it/sebastiani/ | Fabrizio Sebastiani]] | |
| |
| 2 | Latent Semantic Indexing and Random Projections. | chap 18 in [MRS] | | | |
| 4 | Clustering: sketch of k-means, MST-based, max-cut, spectral. | chap 16 and 17 in [MRS] | | | |
| |