Machine learning for population genetics
private
robustness_pipeline

Repository

graph LR
A[base data] -- conversion --> B[gargammel and post-treatment]
B -- conversion --> C[damaged data]
C -- compute distance --> A
C --> D[machine learning algorithm]
A --> D
D --> E[compare predictions]
graph LR
A[Split individuals, and create <br>fasta sequence for each,<br>according to the SNP Matrix] --> B[use gargammel]
B --> C["remove adapters<br>(Trimgalore)"]
C --> D["map reads on reference<br>(option -Q : quality treeshold )"]
D --> E[call the variants]
E --> F[merge<br>individuals]
graph LR
A["fragSim : simulation of ancient DNA fragments<br> being retrieved at random from the sequence<br>(option -l : specify size of fragments, <br>-c : specify coverage)"] --> B["deamSim : simulation of damage (deamination)<br>(option -m : specify mapdamage matrix)"]
B --> C[adptSim : add adapters <br>to create raw Illumina reads]
C --> D["ART : add sequencing errors<br>(option -s : change error rate )"]
  sbatch pipeline.sh -c 20 -l 2000000 -f Expansion-015_00001_005.npz