README.md 537 Bytes
Newer Older
DARROUS Jad's avatar
DARROUS Jad committed
1
# Hadoop traces under Erasure Coding
DARROUS Jad's avatar
DARROUS Jad committed
2

DARROUS Jad's avatar
DARROUS Jad committed
3
4
5
6
7
8
9
10
11
12
This repository contains traces of Hadoop MapReduce jobs under replication and erasure coding. Sort, Wordcount, and K-means applications are included. These traces are compiled of different software (overlapping and non-overlapping shuffle, disk persistency, failure) and hardware configurations (HDD, SSD, DRAM, 1 Gbps and 10 Gbps network).

Plotting scripts are available in `scripts` directory.

Plotting scripts are written in Python 3.

Required python libraries
+ pyyaml
+ pandas
+ matplotlib