Working with Big Data
21

Tuesday, 21 February

6:15 8:15 pm GMT

GA London, The Relay Building

1st Floor, 114 Whitechapel High Street
London, E1 7PT
£35 GBP
Regular Ticket
£35 GBP
Total

Questions? Read our FAQs

Working with Big Data

John Sandall Photo


Data Science Consultant

21

Tuesday, 21 February

6:15 8:15 pm GMT

GA London, The Relay Building

1st Floor, 114 Whitechapel High Street
London, E1 7PT
£35 GBP
Regular Ticket
£35 GBP
Total

Questions? Read our FAQs

About This Class

Big data is ever-growing in importance, but operationalising giant sets of data into repositories of easily-accessed insight seems confusing and daunting to most. This brief deep dive into big data seeks to provide you with a better understanding of tools and techniques which help anyone summarise and analyse very large sets of data with elegance and ease.

We’ll begin by setting the stage for Big Data, describing how data sources have proliferated in the past decade, and how analytical tools have changed and evolved to handle these ever-growing data sets. We’ll then dive into the fundamentals of big data systems covering technologies such as distributed architectures, Hadoop, MapReduce and Hive in-depth before exploring the dizzying landscape of big data tools, with examples and case studies to illustrate the way.

This class is ideal for anyone:

  • Who works with large data sets daily or weekly
  • Responsible for (and frustrated with) recurring reporting on large data sets
  • Responsible for driving standards around data + communication in their organisation

Takeaways

  • Understand what big data is, what it isn't, and when big data tools are needed.
  • Learn the fundamentals of distributed architectures for big data analysis, including tools such as Hadoop and MapReduce.
  • Learn where to start with analysis of a big data set.

Prereqs & Preparation

  • Basic familiarity with Excel or Google Sheets
  • Basic understanding of CSV and other “flat” file formats
  • Bonus but not required: familiarity with data analysis flows and ETL processes

About the Instructor

John Sandall Photo

Data Science Consultant

Over the years, John has added a number of strings to his bow. Statistician, geneticist, business analyst, startup founder, software developer, educator, non-profit strategist, data scientist, multi-instrumentalist! He currently runs a data science consultancy focused on helping businesses to identify and solve the challenges they face through a combination of research-grade statistical techniques, strategic analysis and a lean-startup engineering mentality.

Until recently he was the Lead Data Scientist at YPlan. Prior experience includes business analytics at Apple Inc., genomics research at Imperial College London, building an ed-tech startup at Knodium, developing strategy & technological infrastructure for international non-profit startup STIR Education, and losing sleep to many a hackathon along the way.

He's also been known to dabble in violin; a concerto here, a pop gig there, occasionally conducting youth ensembles, and usually playing with his folk music band.

Refund Policy

Plans change. We get it. But if you can't make it to a class/workshop, please email us at least 7 days before the scheduled event date. No refunds will be given after this timeframe.

Coming up near you

Let’s Keep You Updated

Enter your email to start following

By providing us with your email, you agree to the terms of our Privacy Policy and Terms of Service.