Introduction to Hadoop - London - 2-Day Training Course

Date:

16/10/2017 - 17/10/2017

Organised by:

Mind Project Ltd

Presenter:

Simon Walkowiak MBPsS

Level:

Entry (no or almost no prior knowledge)

Contact:

Mind Project Ltd
Simon Walkowiak MBPsS
Phone: 02033223786
Email: info@mindproject.co.uk

Map:

View in Google Maps  (EC1A 9HA)

Venue:

CAP House, 1st Floor, 9-12 Long Lane, London, EC1A 9HA

Description:

1. Course description.

This two-day training course explores major characteristics and functionalities of Apache Hadoop platform and its ecosystem of tools for Big Data processing and analysis. The course provides a first-hand practical experience in Hadoop Distributed File System (HDFS) and MapReduce frameworks by the means of a number of presentations and tutorials during which the attendees will learn how to design and perform simple MapReduce programs to process the data and calculate a set of statistics. The course can also serve as a gentle introduction to the basics of Java programming language and essential Hadoop File System shell commands. During the course you will learn to:

 

  • understand the features, major characteristics, architecture and operations of Hadoop and its ecosystem including, Yet Another Resource Negotiator (YARN), Hadoop Distributed File System, MapReduce programming framework and other Hadoop-related tools e.g. HBase, Hive, Cassandra, Mahout and Pig etc.,
  • monitor and diagnose the performance of Apache Hadoop clusters and their resources using Apache Ambari and control the deployed services through Apache ZooKeeper,
  • manage large datasets in Hadoop Distributed File System (HDFS) using Hadoop File System shell commands,
  • design and execute simple MapReduce parallel programs (written in Java) to calculate various statistics and control their performance in real-time,
  • implement learnt skills to build and provision Hadoop-based Big Data applications.

 

During the course the attendees will perform several simple MapReduce jobs on a Linux-based Mind Project Hadoop cluster.

 

2. Programme.

The course will run for two days from 9:30am until ~5:00pm on each day and will consist of alternating lecture-style presentations and practical tutorials. The example datasets used during tutorial sessions will come from social sciences, economics and business fields, however the contents may vary depending on specific interests of participants (based on the Participant's Skills Inventory). There will be two 15-minute coffee/tea breaks and one 1-hour lunch break on each day.

 

3. What is included?

Apart from the contents of the course, Mind Project will provide the participants with the following:

  • a digital (USB memory stick) Course Manual including all presentation slides, Hadoop course script files and a list of reference books and online resources,
  • additional home exercises and all data sets and code scripts available to download,
  • Wi-Fi access,
  • Central London location - at the CAP House, next to the Barbican station,
  • networking opportunity,
  • Mind Project course attendance certificate.

 

4. Further instructions.

  • In order to benefit from the contents of the course, we recommend that attendees bring their personal WiFi-enabled laptops to the session with at least one of the following web browsers installed: Chrome, Safari, Mozilla Firefox and/or Internet Explorer. Also, the laptops should be equipped with a simple text editor suitable for code/script typing e.g. Notepad++ (for Windows users) or TextWrangler/BBEdit (for Mac users). Please be advised that we do not recommend the following applications: WordPad, Gedit or TextEdit. 
  • This course is targeted at IT literate users with interest in Big Data processing and architecture. No prior exposure to the Hadoop ecosystem and its tools is required.
  • Participants are encouraged to complete the online Participant's Skills Inventory at https://www.mindproject.io/participants-skills-inventory/ to allow Mind Project and our course tutors to customise the contents of the course depending on the level of participants' knowledge and their areas of interest. The data obtained through the Participant's Skills Inventory will be held fully-confidential and will only be used to provide a quality data analysis training.
  • By purchasing a place on one of our courses you agree to the Terms and Conditions. Please read the Terms and Conditions available at https://www.mindproject.io/services/data-science-training/training-tcs/ before making a booking.
  • The deadline for registrations on this training course is Friday, 13th of October 2017 at 16:00 London (UK) time, however Mind Project reserves the right to end the registration process earlier if all places are booked before the deadline.

 

Should you have any questions please contact Mind Project Ltd at info@mindproject.co.uk or by phone on 0203 322 3786. Please visit the course website at https://www.mindproject.io/product/introduction-to-hadoop-london-october-2017/. 

Cost:

£450 per person for the whole course (regular fee).
£300 per person for the whole course for UK registered undergraduate and postgraduate students, and representatives of registered charitable organisations (discounted fee).
For group bookings of 4 and more participants, please contact us directly.

Website and registration:

Region:

Greater London

Keywords:

Quantitative Software, Technology, ICT and Software (other), Big Data , HDFS , distributed file system , Hive , HBase , Cassandra , Java language , distributed computing , cloud computing , parallel computing , Hadoop , Spark , MapReduce

Related publications and presentations:

Quantitative Software
Technology
ICT and Software (other)

Back to archive...