Loading…
RMACC 2021 HPC Symposium
Attending this event?
Back To Schedule
Thursday, May 20 • 1:35pm - 3:05pm
Parallel R

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

R is a programming language and environment used for statistical computing, data mining, data analysis, and graphics. There is interest in increasing R’s performance. One way to do this is via parallelism, using multiple cores on a single computer or multiple nodes in a distributed memory machine. There are a number of parallel packages available for use with R. This talk will briefly present some of them including the libraries: parallel, dplyr, doParallel, and the R subset of the Message Passing Interface (Rmpi). Rmpi allows distributed memory computing using multiple nodes. We will look at issues associated with these packages that can reduce performance, such as affinity and operation ordering and how to overcome some of them. We will look at combining packages such as MPI and parallel foreach. The primary context of the presentation will a be computationally intensive statistical analysis of earthquakes to develop a hazard map. We will also look at Bag of Task programming. Recipes will be provided for installing many of the packages discussed.

Speakers

Thursday May 20, 2021 1:35pm - 3:05pm MDT
Feedback form isn't open yet.

Attendees (3)