Applied Data Science with R - 6-week tutor-led online course

Date:

16/09/2022 - 21/10/2022

Organised by:

Mind Project Ltd

Presenter:

Simon Walkowiak MSc, MBPsS

Level:

Entry (no or almost no prior knowledge)

Contact:

Simon Walkowiak
Mind Project Ltd
Phone: 02033223786
Email: info@mindproject.co.uk


Venue: Online

Description:

1. Course description.

During the “Applied Data Science with R” open-to-public, instructor-led training course you will learn how to apply the R programming language to carry out essential data management, wrangling and statistical analysis operations. The mixture of weekly live webinars, additional on-demand instruction videos and several homework exercises throughout the course will enable you to apply the R language to your own data and research questions in a matter of weeks.

This course will introduce you to all the basic concepts of data processing and analysis in the R environment. More specifically, you will learn to understand the different types of data and common data structures available in the R language; prepare, transform and manage datasets and their variables; import and export data in various file formats (Excel spreadsheets, csv, tab, txt etc.); create simple graphical representations of the data (bar plots, histograms, box plots etc.); obtain summaries, data aggregations, cross-tabulations, frequency and pivot tables; and run and explain the results of basic statistical tests, e.g. correlations and t-tests. The course will also provide an introduction to modelling using multiple linear regression methods and will introduce you to data visualisation techniques available in R for data reporting and research communication.

The course will cover modern approaches in applied data science using the R language and its rich ecosystem of external libraries, including the tidyverse family of packages (e.g. dplyr, ggplot2, tidyr, readr, tibble) and other essential R libraries for data wrangling and statistics.

This course has already been run multiple times and tested in both academic and industry settings. For testimonials from our past learners please visit the course website at: https://www.mindproject.io/product/applied-data-science-with-r-6-week-tutor-led-online-course-september-2022/

 

2. Course programme.

This instructor-led course is planned over 6 teaching weeks. You will attend weekly live webinars with our tutors during which you can ask questions and discuss different R language and data science problems. 

Between the six weekly live online tutorials (2.5 hours each) you will improve your skills by watching pre-recorded instruction videos on our Mind Project Learning Platform and by working through set tasks (e.g. quizzes) and homework coding exercises, which will require 4-6 hours of your time per week (24-36 hours in total). We estimate that the total time commitment is 40-50 hours over the 6 teaching weeks.

Start date: Friday, 16th of September 2022 @ 10:00 am London (UK) time
Schedule of sessions: Every Friday at 10:00 am London (UK) time for 6 weeks
Deadline for registrations: Wednesday, 14th of September 2022 @ 17:00 London (UK) time

 

Week 1: First steps with the R language

  • Introduction to R language, RStudio and the ecosystem of packages in R,
  • Generating random data; logical and mathematical operations in R,
  • Built-in R types and data structures,
  • Data import/export to/from various file formats.

 

Week 2: Data wrangling with R

  • Working with data frames, matrices, arrays and lists in R,
  • Converting data between different types and classes; factors and ordered factors,
  • Essential data wrangling operations, e.g. subsetting, filtering, renaming variables, recoding values and creating new data,
  • Introduction to working with strings, dates and time stamps.

 

Week 3: Exploratory data analysis with R

  • Measures of central tendency, dispersion/variability and other basic descriptive and summary statistics,
  • Value counts, cross-tabulations and data aggregations with tidyverse,
  • Plotting descriptives with ggplot2: basic examples of bar plots, line graphs and boxplots,
  • Faceting - grouped and aggregated plots; multiplots (multiple plots on the same page); additional graphical settings, grid layouts and themes of plots produced with ggplot2 and associated R packages.

 

Week 4: Inferential statistics and hypothesis testing with R - Part 1

  • Understanding hypothesis testing and traditional test assumptions, e.g. normality and homogeneity of variance,
  • Parametric and non-parametric tests of differences,
  • Power and effect size calculation for inferential tests.

 

Week 5: Inferential statistics and hypothesis testing with R - Part 2

  • Parametric and non-parametric tests of relationships,
  • Introduction to linear and non-linear models,
  • Analysis of Variance (ANOVA),
  • Main effects, random effects and interactions.

 

Week 6: Linear and non-linear models with R

  • Understanding multiple linear regression,
  • Regression metrics and evaluation of multiple linear regression models,
  • Non-linearity in regression models,
  • Comparing regression models. 

 

3. Course pre-requisites and further instructions

  • We recommend that you have the most recent versions of R and RStudio installed on your PC (any operating system). R is a free and open-source environment and you can download it directly from the https://cloud.r-project.org/ website. RStudio Desktop (also free) is available at https://rstudio.com/products/rstudio/download/. Please contact us should you have any questions or issues with the installation process. No specific R packages are required before the course (the course tutors will explain this during the training).
  • No prior knowledge of R is required from delegates enrolling on this course; however, a keen interest in data analysis and some experience with data processing are assumed.
  • Your PC needs to be connected to a stable WiFi/Internet network (either home or office-based) and have the Zoom video-conferencing application installed.
  • You will need at least one commonly used web browser installed on your PC (e.g. Chrome, Safari, Firefox, Edge etc.) to access our Mind Project Learning Platform.

Should you have any questions please contact Mind Project Ltd at info@mindproject.co.uk or by phone on 0203 322 3786. Please visit the course website at https://www.mindproject.io/product/applied-data-science-with-r-6-week-tutor-led-online-course-september-2022/

Cost:

By 26th of August 2022 (Early Bird offer): £450 (normally £600) per person for the whole 6-week course (regular fee), or £300 (normally £420) per person for the whole course for undergraduate and postgraduate students, representatives of registered charitable organisations and NHS employees only (discounted fee). Additional discounts are available for multiple bookings and groups.

Website and registration:

Region:

Greater London

Keywords:

Data Management, Descriptive Statistics, Correlation, Effect size, Statistical Theory and Methods of Inference, Parametric statistics, Non-parametric statistics, Regression Methods, Ordinary least squares (OLS), ANOVA, ANCOVA, Linear regression, R, Data Visualisation, Creating graphs and charts

