====== Big Data Analytics ======

  * **Anna Monreale**\\ Università di Pisa, Knowledge Discovery and Data Mining Lab\\ [[anna.monreale@unipi.it]]

====== Textbooks ======

  * Slides (see Calendar).
  * Berthold, M.R., Borgelt, C., Höppner, F., Klawonn, F. **GUIDE TO INTELLIGENT DATA ANALYSIS.** Springer Verlag, 1st Edition, 2010. ISBN 978-1-84882-259-7
  * Pang-Ning Tan, Michael Steinbach, Vipin Kumar. //Introduction to Data Mining//. Addison Wesley, ISBN 0-321-32136-7, 2006
    * [[http://www-users.cs.umn.edu/~kumar/dmbook/index.php]]

====== Reading about the "data analyst" job ======

  * Data, data everywhere. The Economist, Feb. 2010 {{:dm:economist--010.pdf|download}}
  * Data scientist: The hot new gig in tech, CNN & Fortune, Sept. 2011 [[http://tech.fortune.cnn.com/2011/09/06/data-scientist-the-hot-new-gig-in-tech/|link]]
  * Welcome to the yotta world. The Economist, Sept. 2011 {{:dm:economist-2012-dm.pdf|download}}

====== Topics and Material ======

^ ^ Topic ^ Slides ^ Code and Data ^
|01. | Introduction to Big Data and Data Understanding| {{ :dm:introduction-du-bigdata.pdf |}}| {{ :dm:tips_data_understanding.ipynb.zip |}} {{ :dm:tipsdata.zip |}}|
|02. | Big Data Technologies: Clustering| {{ :dm:dataminingtech.pdf |}}|{{ :dm:tips_clustering.ipynb.zip |}} |
|03. | Big Data Technologies: Classification| | {{ :dm:tips-classification.ipynb.zip |}}|
|04. | Big Data Technologies: Patterns | | {{ :dm:ex_pattern_python.zip |}}|