Map/Reduce with R, Hadoop and AWS

Anette Bergo (Bouvet)
Siv Midtun Hollup (Bouvet)

Half-day workshop - in English

Approved_talk approved

Come along to get up and running with your first Map/Reduce job in R, and learn about a language that is elegant, complicated and just plain strange all at once. Unless you happen to have a degree in statistics, in which case everything will make sense. Once up and running with a simple example we will build on that to do more advanced analysis of a larger data sets, in order to gain some valuable insight.

You will learn the basics of R, and how to emulate a map/reduce job locally in order to set up and debug it. You will learn how to run a map/reduce job via the amazon console, for both a scripted and a packaged version of R, and you will also learn how to bootstrap your cluster to make sure it is has the packages and tools you need.

Primarily for: Developers, Architects

Participant requirements: You will need to bring a computer. To get up and running fast, you should preferably pre-install R and RStudio. You should also have set up an Amazon AWS account beforehand. Note that we will be running on paid instances, and your creditcard will be charged a few dollars for the session.