Applied Data Science in R - 3-Day Training Course

Date:

01/03/2017 - 03/03/2017

Organised by:

Mind Project Ltd

Presenter:

Simon Walkowiak MBPsS

Level:

Entry (no or almost no prior knowledge)

Contact:

Simon Walkowiak MBPsS, 0203 322 3786
info@mindproject.co.uk
http://www.mindproject.co.uk

Map:

View in Google Maps  (EC1A 9HA)

Venue:

CAP House, 1st Floor, 9-12 Long Lane, London, EC1A 9HA

Description:

1. Course description.

 

This course will introduce participants to all basic concepts of Data Analysis in R environment. More specifically participants will learn how to input different types of data, prepare, transform and manage datasets and their variables, export/import data files, create simple graphical representations of the data (bar plots, histograms, box plots etc.), run basic statistical tests (e.g. correlations, t-tests etc.), obtain descriptive statistics from a dataset and formulate the results. The course will also provide an introduction to Regression analyses and ANOVAs. Methods of data visualisation will be presented for each statistical test.

Throughout the course the attendees will learn the following concepts:

  • R environment: what is R? Starting R environment;Basic settings and functions; Introduction to IDEs e.g. RStudio,
  • Mathematical functions and control flow operators: R-related help and support; Installing and running third-party packages,
  • R data structures: creating scalars, vectors, matrices, arrays, lists and other data objects in R; Creating simple data frames,
  • Data input and export: adding/deleting observations; Sampling; Flagging/identifying specific cases based on conditional search; Sorting cases; Adding/editing value and variable labels; Dealing with missing data; Reshaping data from long/narrow into wide formats,
  • Exploratory Data Analysis: inspecting the structure of data objects; Cross-tabulations and descriptive statistics (measures of central tendency and dispersion); Vertical/horizontal merging of data frames and other R objects; Basic EDA plots: histograms, density plots, scatterplots, box plots, bar plots, line graphs etc.,
  • Tests of differences and correlations; Testing for normality assumptions: QQ, density plots and test-specific normality measurements; One-sample, matched-sample and independent t tests; Correlations and simple regressions; Test-specific visualisation functions/packages; Effect size and power estimation,
  • Data modelling: ANOVA and multiple regressions; Testing for normality assumptions in GLMs; Introduction to logistic and Poisson regressions; Model optimisation techniques in R,
  • Introduction to data visualisations: creating informative data visualisations using R core and third-party packages; Using graphical parameters for adding/editing text, titles, lines, fonts, colours, axes, background and other elements of plots; Introduction to ggplot2 syntax and an rCharts example,
  • Creating a simple data product with R; Data cleaning, EDA, data management, data "crunching" and analysis, data visualisation, model optimisation and debugging.

 

2. Programme.

 

The course will run for three days (Wednesday to Friday) between 9:30am and 5:00pm and will consist of alternating lecture-style presentations and practical tutorials. The example datasets used during tutorial sessions will come from social sciences, psychology and business fields, however the contents may vary depending on specific interests of participants (based on the Participant's Skills Inventory). There will be two 15-minute coffee/tea breaks and one 1-hour lunch break on each day of the course.

 

3. What is included?

 

Apart from the contents of the course, Mind Project will provide the participants with the following:

 

  • a digital (USB memory stick) Course Manual including all presentation slides, R course codes and a list of reference books and online resources,
  • additional home exercises and all data sets available to download,
  • Wi-Fi access,
  • Central London location - at the CAP House, next to the Barbican station,
  • networking opportunity,
  • Mind Project course attendance certificate.

 

4. Further instructions.

 

  • In order to benefit from the contents of the course it is recommended that attendees have the most recent version of R and R Studio software installed on their personal laptops (any operating system). As R is a free environment you can download it directly from www.r-project.org website and R Studio is available at https://www.rstudio.com/products/rstudio/#Desktop. Please contact us should you have any questions or issues with the installation process. No specific R packages are required before the course (the course tutors will explain this during the training).
  • No prior knowledge of R is required from participants enrolling on this course, however a keen interest in data analysis is assumed.
  • Participants are encouraged to complete the online Participant's Skills Inventory available at http://mindproject.co.uk/skillsinventory.html to allow Mind Project and our course tutors to customise the contents of the course depending on the level of participants' knowledge and their areas of interest. The data obtained through the Participant's Skills Inventory will be held fully-confidential and will only be used to provide a quality data analysis training.
  • By purchasing a place on one of our courses you agree to the Terms and Conditions available at http://mindproject.co.uk/trainingterms.html. Please read the Terms and Conditions before making a booking.
  • The deadline for registrations on this training course is Monday, 27th of February 2017 at 16:00 London (UK) time. However, Mind Project reserves the right to end the registration process earlier if all places are booked before the deadline.

 

Should you have any questions please contact Mind Project Ltd at info@mindproject.co.uk or by phone on 0203 322 3786. Please visit the course website at http://mindproject.co.uk/applied-data-science-london-mar17.html.

Cost:

£475 + VAT (£570) per person for the whole course (normal fee).
£325 + VAT (£390) per person for the whole course for UK registered undergraduate and postgraduate students, and representatives of registered charitable organisations (discounted fee).

Website and registration:

Region:

Greater London

Keywords:

Quantitative Data Handling and Data Analysis, Descriptive Statistics, Statistical Theory and Methods of Inference, Regression Methods, Mixed Methods Data Handling and Data Analysis, Quantitative Software, R, Technology, ICT and Software (other), Data Visualisation

Related publications and presentations:

Quantitative Data Handling and Data Analysis
Descriptive Statistics
Statistical Theory and Methods of Inference
Regression Methods
Mixed Methods Data Handling and Data Analysis
Quantitative Software
R
Technology
ICT and Software (other)
Data Visualisation

Back to archive...