hashclone.Rd
This function executes HashClone algorithm developed to identify the set of clonality markers during the patient follow-up in order to quantify the minimal residual disease.
a character string. Two options: sudo or docker, depending to which group the user belongs
a character string indicating the path of the scratch folder
a character string indicating the folder where input data are located and where output will be written
an integer that define the size of the substrings (k-mer) encoded in the hash table (this must be a value between 1 and 32)
a prime number indicating the size of the hash table. Increasing this value reduces the execution time but increases the memory utilization. Ideally, this value should be close to the number of different k-mers stored in the hash table;
an integer that define the maximum number of different k-mers that the tool might need to store in the hash table.
(tau) this value is the threshold used to select significant k-mers. We suggest to set tau equal to 1
IGH (immunoglobulin heavy chain) or IGK (immunoglobulin kappa locus)
a character string indicating the path of the spike in file (if you don't want the spike in research, please set this parameter as 'null')
a character string indicating the path of the input files
if (FALSE) {
library(docker4seq)
downloadContainers(group="docker","docker.io/qbioturin/hashclone")
hashclone(group="docker",scratch.folder="/home/scratch_folder", data.folder="/home/output_folder", kmer=26, hash=10999997, coll=10999997, threshold=1, type= IGH, spike="/home/spike_in.fa", input.files=c('/home/input_file1.fastq', '/home/input_file2.fastq'))
}