integrationPsblk.Rd
This function execute integrationPsblk analysis which search for correspondence between clusters of two different experiments using clusters-pseudo-bulks, z-scored on rows, and a subset of randomly selected genes. Thus, the function clustersBulk has to be run on the two datasets before their comparison.
integrationPsblk(
group = c("sudo", "docker"),
scratch.folder,
fileX,
fileY,
separatorX,
separatorY,
max.genes = 500,
split.by = 100,
outputFolder
)
a character string. Two options: sudo or docker, depending to which group the user belongs
a character string indicating the path of the scratch folder
a character string indicating the path of the pseudobulkRow file, with file name and extension included.
a character string indicating the path of the pseudobulkRow file, with file name and extension included.
separator used in count file, e.g. '\t', ','
separator used in count file, e.g. '\t', ','
MAX number of random genes to be used for each cluster, default 500
value indication the splitting range default 100. I.e. if max.genes= 500 with split.by set to 100 there will be 5 sets of genes selected 100, 200, 300, 400, 500
where results are placed
A folder called XYpb with all the results generated. The final frequency table is saved in final_score.csv, which is made by the frequency of having Pearson >= 0.5 between X and Y datasets for each possible comparisons between the clusters of the X and Y experiment. Pearson correlation is calculated on 10000 random selections of genes for each threshold.