wrapperAutoencoder.Rd
This function executes the whole autoencoder pipeline
wrapperAutoencoder(
group = c("sudo", "docker"),
scratch.folder,
file,
separator,
nCluster,
bias = c("mirna", "TF", "CUSTOM", "kinasi", "immunoSignature", "ALL"),
permutation,
nEpochs,
patiencePercentage = 5,
cl,
seed = 1111,
projectName,
bN = "NULL",
lr = 0.01,
beta_1 = 0.9,
beta_2 = 0.999,
epsilon = 1e-08,
decay = 0,
loss = "mean_squared_error",
clusterMethod = c("GRIPH", "SIMLR", "SEURAT", "SHARP"),
pcaDimensions = 5,
permAtTime = 3,
largeScale = FALSE,
Sp = 0.8,
threads = 1,
X = 0.15,
K = 2,
counts = c("False"),
skipvis = c("False"),
regularization = 10,
variational = FALSE
)
a character string. Two options: sudo or docker, depending to which group the user belongs
a character string indicating the path of the scratch folder
a character string indicating the path of the file, with file name and extension included
separator used in count file, e.g. '\t', ','
number of cluster in which the dataset is divided
bias method to use : "mirna" , "TF", "CUSTOM", kinasi,immunoSignature,ALL
number of permutations to perform the pValue to evaluate clustering
number of Epochs for neural network training
number of Epochs percentage of not training before to stop.
Clustering.output file. Can be the output of every clustering algorithm from rCASC or can be customized with first column cells names, second column cluster they belong.All path needs to be provided.
important value to reproduce the same results with same input
might be different from the matrixname in order to perform different analysis on the same dataset
name of the custom bias file. This file need header, in the first column has to be the source and in the second column the gene symbol.All path needs to be provided,
learning rate, the speed of learning. Higher value may increase the speed of convergence but may also be not very precise
look at keras optimizer parameters
look at keras optimizer parameters
look at keras optimizer parameters
look at keras optimizer parameters
loss of function to use, for other loss of function check the keras loss of functions.
clustering methods: "GRIPH","SIMLR","SEURAT","SHARP"
number of dimensions to use for Seurat Pca reduction.
number of permutation in parallel
boolean for SIMLR analysis, TRUE if rows are less then columns or if the computational time are huge
minimun number of percentage of cells that has to be in common between two permutation to be the same cluster.
integer refering to the max number of process run in parallel default 1 max the number of clusters under analysis, i.e. nCluster
from 0 to 1 argument for XL-mHG default 0.15, for more info see cometsc help.
the number of gene combinations to be considered., possible values 2, 3, 4, default 2. WARNING increasing the number of combinations makes the matrices very big
if set to True it will graph the log(expression+1). To be used if unlogged data are provided
set to True to skip visualizations
this parameter balances between reconstruction loss and enforcing a normal distribution in the latent space
TRUE or FALSE if you want to use a variational autoencoder or the standard autoencoder
folders the complete autoencoder analysis.
if (FALSE) {
wrapperAutoencoder(group="sudo",scratch.folder=scratch.folder,file="/home/lucastormreig/test/setA.csv",separator=",",nCluster=5,bias="mirna",permutation=10,nEpochs=10,cl="/home/lucastormreig/test/setA_clustering.output.csv",projectName="mirna",clusterMethod="GRIPH")
}