Commit 4df0f4d1 authored by Mathieu Giraud's avatar Mathieu Giraud Committed by Vidjil Team

doc/vidjil-algo.md: details, typos

parent aaa4e622
Pipeline #44838 passed with stages
in 55 minutes and 14 seconds
......@@ -648,15 +648,15 @@ See also [What is a clone ?](vidjil-format/#what-is-a-clone).
| consensus_ratio (+) | number | *Ratio of the number of reads gathered in the clone against the total number of reads analyzed with recombinations.*
| sequence_id | string | Unique query sequence identifier within the file. Most often this will be the input sequence header or a substring thereof, but may also be a custom identifier defined by the tool in cases where query sequences have been combined in some fashion prior to alignment. <br />*This identifier is the (50 bp by default) window extacted around the junction.* |
| clone_id | string | Clonal cluster assignment for the query sequence. <br />*This identifier is again the (50 bp by default) window extacted around the junction.*
| warnings (+) | string | *Warnings associated to this clone. See XXXXXX.*
| warnings (+) | string | *Warnings associated to this clone. See <https://gitlab.vidjil.org/blob/dev/doc/warnings.md>.*
| sequence | string | The query nucleotide sequence. Usually, this is the unmodified input sequence, which may be reverse complemented if necessary. In some cases, this field may contain consensus sequences or other types of collapsed input sequences if these steps are performed prior to alignment. <br />*This contains the consensus/representative sequence of each clone.*
| rev_comp | boolean | True if the alignment is on the opposite strand (reverse complemented) with respect to the query sequence. If True then all output data, such as alignment coordinates and sequences, are based on the reverse complement of 'sequence'. <br />*Set to null, as vidjil-algo gather reads from both strands in clones* |
| v_call, d_call, j_call | string | V/D/J gene with allele. For example, IGHV4-59*01. <br /> *In the case of uncomplete/unexpected recombinations (locus with a `+`, we still use* `v/d/j_call`. |
| v_call, d_call, j_call | string | V/D/J gene with allele. For example, IGHV4-59\*01. <br /> *In the case of uncomplete/unexpected recombinations (locus with a `+`), we still use* `v/d/j_call`. |
| junction_aa | string | Junction region amino acid sequence. | <br />*null*
| productive | boolean | True if the V(D)J sequence is predicted to be productive. <br /> *true, false, or null when no CDR3 has been detected* |
| sequence_alignment | string | Aligned portion of query sequence, including any indel corrections or numbering spacers, such as IMGT-gaps. Typically, this will include only the V(D)J region, but that is not a requirement. <br /> *null* |
| germline_alignment | string | Assembled, aligned, fully length inferred germline sequence spanning the same region as the sequence_alignment field (typically the V(D)J region) and including the same set of corrections and spacers (if any). <br />*null*
| v_cigar, d_cigar, j_cigar | string | CIGAR strings for the V/D/J gene <OBbr />*null*.
| v_cigar, d_cigar, j_cigar | string | CIGAR strings for the V/D/J gene <br />*null*.
Currently, we do not output alignment strings.
Our implementation of .tsv may evolve in future versions.
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment