Parallel evaluation and memory managment for high performance computing

Parallel evaluation

Approaches

Interfaces

ATLAS, ACML BLAS library

'parallel' package

Rmpi

foreach

BatchJobs

BiocParallel

Case study: parallel linear algebra

Motivation: StackOverflow question about calculating correlation coefficients between columns in a large (1M x 400) numeric matrix.

Memory management

Basic observations

Approach

Common solutions 1. Restrict data input to just that required 2. Draw a sample and infer statistical properties if appropriate, e.g., QA 3. Iterate through large data

Case study: Counting reads overlapping regions of interest, Intermediate Sequence Analysis 2013 Chapter 7.