CloseHelpPrint
Kies de Nederlandse taal
Course module: 201600038
201600038
Data analysis and visualisation
Course info
Course code201600038
EC7.5
Course goals
After successfully completing this course, you will be able to:
  • Understand and explain the different approaches to data analysis;
  • Given a practical data science problem, select appropriate techniques to tackle this problem;
  • Apply various data analysis techniques, including regression, trees, clustering, PCA, correspondence analysis, etc. in R;
  • Implement generic Data Science tools such as train/validation/test sets, crossvalidation, bagging, boosting, and error evaluation in R ;
  • Interpret and evaluate the results of such analyses;
  • Explain these evaluations in layman's terms;
  • Understand and explain the basic principles of data visualization and the grammar of graphics;
  • Construct appropriate visualizations in connection with each of the data analysis techniques in R.
 
Content
What puts former criminals on the right track? How can we prevent heart disease? Can Twitter predict election outcomes? What does a violent brain look like? How many social classes does 21st century society have? Are hospitals spending too much on health care, or too little? When is a series of spikes in hundreds of website logfiles an operational problem?

Data analysis is the art and science of tackling questions like these by looking at data. Just as cartographers make maps to see what a country looks like, data analysts explore the hidden structures of data by creating informative pictures and summarizing relationships among variables. And just as doctors diagnose sick patients and advise healthy ones on how to stay healthy, data analysts predict important events and variables so we can act on this knowledge. Methods from statistics, machine learning, and data mining play an important part in this process, as well as visualizations that allow the analyst and other humans to better understand what we can conclude from the available facts.

During this course, participants will actively learn how to apply the main statistical methods in data analysis and how to use machine learning algorithms and visualizing techniques. The course has a strongly practical, hands-on focus: rather than focusing on the mathematics and background of the discussed techniques, you will gain hands-on experience in using them on real data during the course and interpreting the results.
This course covers both classical and modern topics in data analysis and visualization:
  1. Exploratory data analysis (EDA);
  2. Supervised machine learning and statistical learning;
  3. Unsupervised learning and data mining techniques;
  4. Visualization (throughout the course).
 Note that you need to register for this course via OSIRIS STUDENT and during the UU registration periods. Note also that if you are not an FSW student, the registration period may differ from your habitual one. This course is essential as a basis for each track of the Master of Applied Data Science. If you want to register for this course, please also register for the Applied Data Science profile via http://studyguidelifesciences.nl/profiles/applied-data-science
Please take notice: 7,5 ECTS instead of 5 ECTS.
CloseHelpPrint
Kies de Nederlandse taal