Installation
Support: utap@weizmann.ac.il
Install UTAP2 on a local server
Requirements
The application should be installed on a Linux server.
If the server supports load sharing facility (LSF)/Portable Batch System (PBS)/Slurm/Sun Grid Engine (SGE)/LoadLeveler cluster, it is recommended to run UTAP2 pipelines on the cluster in order to improve computational efficiency (note that UTAP2 was executed and tested only on LSF and Slurm clusters). Otherwise, if the server does not support one of the above clusters, the UTAP2 pipelines will need to be executed locally.
The host server and/or each compute node in the relevant queue(s) requires ~40GB of RAM memory.
The server requires the following: SingularityCE version > 3.10.4
For downloading SingularityCE refer to: https://sylabs.io/singularity/
The user must have full permissions to singularity commands. If the application is run on cluster, the user is also required to have permissions to run cluster commands.
The user should then do the following:
Create a directory for UTAP2 software and its output
Note: Since user output data will be written in this folder, please verify that you have sufficient disk space - approximately 400G per analysis.
export HOST_MOUNT=<the relevant path>
mkdir $HOST_MOUNT
cd $HOST_MOUNT
Download the UTAP2 installation folder
- The UTAP2 installation folder includes the following files:
install_UTAP_singularity.sh
optional_parameters.conf
ports.conf
required_parameters.conf
Singularity_sed.def
update-db.sh
install_UTAP_image.sh
run_UTAP_sandbox.sh
You can download it as noted below, and then unpack it in the $HOST_MOUNT folder.
#Download the zipped folder into $HOST_MOUNT folder:
cd $HOST_MOUNT && git clone https://github.com/utap2/utap2.git
mv $HOST_MOUNT/utap2/scripts/* $HOST_MOUNT
Pull UTAP2 image from the public repository
cd $HOST_MOUNT; wget --no-check-certificate https://dors4.weizmann.ac.il/utap/UTAP_installation_files/utap_latest.sif
Download genomes indexes
The genomes folder contains indexes for human (hg38), mouse (mm10), and zebrafish (danRer11). You have the option to download any one of these genomes individually, as specified below. If you require a genome that is not provided, please refer to the instructions in the “Generate new genome index and annotation file” section.
You can download the genomes folder as indicated below. After downloading, unpack the folder in the $HOST_MOUNT directory under the “genomes” folder. If you choose to download the genomes to a different location, you will need to overwrite the GENOMES_DIR parameter in the optional_parameters.conf file.
If you are using multiple genomes, it is important to ensure that they are synchronized under the same directory. You can achieve this by utilizing the “rsync” command as demonstrated below.
#Download the zipped folder into $HOST_MOUNT folder:
#For Zebrafish genome:
wget --no-check-certificate https://dors4.weizmann.ac.il/utap/UTAP_genomes/Zebrafish.tar.gz
tar -xvzf Zebrafish.tar.gz
rsync -a Zebrafish/* genomes/
#For Mouse genome:
wget --no-check-certificate https://dors4.weizmann.ac.il/utap/UTAP_genomes/Mouse.tar.gz
tar -xvzf Mouse.tar.gz
rsync -a Mouse/* genomes/
#For Human genome:
wget --no-check-certificate https://dors4.weizmann.ac.il/utap/UTAP_genomes/Human.tar.gz
tar -xvzf Human.tar.gz
rsync -a Human/* genomes/
Execute UTAP2
Fill up all the parameters in files required_parameters.conf and optional_parameters.conf.
The parameters listed in the required_parameters.conf file are all mandatory and must be provided.
The parameters mentioned in the optional_parameters.conf file are not mandatory and can be left unchanged. These optional parameters serve the purpose of overriding the default parameters already set in the file.
All the parameters are described below under the section parameters.
Note: UTAP2 can be installed either as an instance container or as a sandbox container. If the user has “fakeroot” privileges and there is approximately 36 GB of available space in the server’s temp directory (default location: /tmp but can be modified using SINGULARITY_TMP_DIR in the optional_parameters.conf) or in the “HOST_MOUNT” directory, and the corresponding directory is not mounted as a GPFS or NFS mount, then UTAP2 will be installed as an instance container. Otherwise, UTAP2 will be installed as a sandbox container.
Run the following shell command in order to run UTAP2:
cd $HOST_MOUNT
./install_UTAP_singularity.sh -a required_parameters.conf -b optional_parameters.conf
If UTAP2 was installed as an instance, an image named utap.SIF (approximately 7GB in size) will be created in your $HOST_MOUNT directory, along with additional folders and files necessary for running UTAP2.
Alternatively, if UTAP2 was installed as a sandbox, a folder named utap.sandbox (around 17GB in size) will be generated in your $HOST_MOUNT directory, containing the required folders and files for running UTAP2.
Upon completion of the run, you will be able to access the application through your web browser using the following address:
http://DNS_HOST:HOST_APACHE_PORT or http://host_ip:7000 if the default values for DNS_HOST and HOST_APACHE_PORT were not modified. To log in to UTAP2 for the first time,use the following credentials:
Username: admin
Password: the password is the ADMIN_PASS required parameter
Install UTAP2 on Google Cloud platform (GCP)
UTAP2 can be deployed on Google Cloud by following the installation process. The installation is carried out using the Terraform application, which deploys a Slurm cluster comprising a login node of n1-standard-2 type, a controller node of n1-standard-4 type, and 20 potential nodes of type N2-high memory. Each node is equipped with 2 CPUs and 32GB of memory on Google Cloud.
Once the installation process is complete, all UTAP2 software and applications are configured and prepared for execution. To install UTAP2 on Google Cloud, refer to the following instructions.
Requirments
Gmail Account:
You need a Gmail account to sign in to Google Cloud Console.
Active Project on Google Cloud:
You should have an active project on Google Cloud with an associated billing account. To create a project on Google Cloud, you can refer to the official documentation at: https://cloud.google.com/free?_ga=2.113028676.-1500123117.1693913602&hl=en.
Quotas on Google Cloud Project:
Ensure that your Google Cloud project has the following default quotas:
-At least 32 CPUs (across all regions).
-At least 8 N2 CPUs for the region us-central1.
These are the default quotas and it’s recommended to request more quotas for better performance. If using more than the default quotas, adjust MAX_CPU and MAX_MEMORY optional parameters accordingly. You can find information on how to request additional quotas in the Google Cloud documentation on quotas at: https://cloud.google.com/docs/quotas#docs.
Install Google cloud Slurm cluster VMs
Enter google cloud shell as described in the bellow image.
Clone from our git repository the scripts for installing Google cloud Slurm cluster login and controller VMs:
cd ~ && git clone https://github.com/utap2/utap2.git && mv ~/utap2/GCP_installation_scripts/required_parameters.conf ~/utap2/GCP_installation_scripts/optional_parameters.conf ~/utap2/scripts/
chmod +X ~/utap2/GCP_installation_scripts/*
In this installation the required paraemeters get also default values. The default parameters can be modified in the files ~/utap2/scripts/required_parameters.conf ~/utap2/scripts/optional_parameters.conf.
Execute Slurm cluster installation on Google Cloud Platform:
bash ~/utap2/GCP_installation_scripts/install_GCP_slurm.sh -i <project_id>
Don’t forget to replace ‘project_id’ parameter with your actual project id.
Upon running the above command, two compressed raw images are fetched from our public server. One image serves as the login node, and the other serves as the controller node. These images encompass the entire UTAP2 installation and include software that streamlines the installation process, ensuring stability. Initially, the images are stored in your Google Storage bucket, created during the installation process. Subsequently, the images are transferred to your Google Cloud project as bootable images. Please note that this procedure can take around 10 hours to complete.
Click on authorize to give permissions to GCP shell as described in the image bellow:
When prompted for authentication, enter ‘Y’ as described in the image bellow:
Click on the link to authenticate with your google account:
Make sure to select the Google account with which your project has been created.
Click on continue to sign in Google cloud SDK.
Click on Allow to access your Google Account.
Click on copy in order to copy the indicated password.
Paste the code in the Google shell console and click on “ENTER”.
If the Google project already contains UTAP2 images, the installation will take a few minutes (approximately 15 minutes) to complete. Otherwise, the installation will take approximately 20 hours to complete. This breakdown includes:
15-16 hours for downloading and uploading the controller image (96GB) and the login image (17GB) to your bucket.
An additional 3 hours for importing the images to your project as bootable images.
Approximately 15 minutes for the installation process.
Upon successful installation, two new VMs (controller and login nodes) will be created on your Google project as shown in the image bellow:
The login VM is assigned an external IP address (as displayed in the image). Please copy this IP address to access the UTAP2 site using the following link: https://<external login node IP>:7000 (the default Apache2 port is 7000 unless the HOST_APACHE_PORT optional parameter has been altered).
To log in to UTAP2 for the first time,use the following credentials:
Username: admin
Password: Admin1234 (if the ADMIN_PASS required parameter was not changed)
Upload data to UTAP2
To upload data, such as fastq files, either from your local PC or a mounted location, utilize the upload feature on the UTAP2 site.
For admin users exclusively, if your data resides in a Google bucket, execute the following commands in your Google Shell, replacing <bucket_name> and <input> with your actual bucket name and input files (the default HOST_MOUNT directory is $HOME, if you changed it in the optional_parameters.conf during UTAP2 installation, you will need to change it here as well), to copy the data from the bucket to the cluster:
export bucket_name=<bucket_name>
export input=<input>
export HOST_MOUNT='~'
export USER_LOGIN=`gcloud compute os-login describe-profile --format json|jq -r '.posixAccounts[].username'`
export LOGIN_IP=`gcloud compute instances list --sort-by=~creationTimestamp --format="value(EXTERNAL_IP)" | sed -e '/^[[:space:]]*$/d' | head -n 1`
ssh -i ~/.ssh/google_compute_engine "$USER_LOGIN"@"$LOGIN_IP" "mkdir -p ~/input_data && fusermount -u ~/input_data; gcsfuse -o rw -file-mode=777 -dir-mode=777 --debug_fuse_errors --debug_fuse --debug_fs --debug_gcs --implicit-dirs \"$bucket_name\" ~/input_data && cp -r ~/input_data/$input $HOST_MOUNT/utap-output/admin"
If your data is stored in an AWS S3 bucket, utilize the Google Transfer Data service to move the data from the AWS S3 bucket to the Google bucket ” slurm-us-central1-simple” generated during the UTAP2 installation process. Refer to the official documentation at https://cloud.google.com/storage-transfer/docs/overview for detailed instructions. After completing the data transfer to the Google bucket, run the aforementioned commands in your Google Shell.
Test UTAP2
Run RNA-Seq pipeline with example data
For testing UTAP2, you can download fastq files and test files for RNA-Seq pipeline folder as noted below.
export HOST_MOUNT=<the relevant path>
cd $HOST_MOUNT
wget --no-check-certificate -nH --cut-dirs=3 -r --reject='index.html*' --exclude-directories=/20230613_081343_test_Transcriptome_RNA-Seq -P $HOST_MOUNT/utap-output/admin/example_and_data_for_testing_hg38_RNA-seq https://dors4.weizmann.ac.il/utap/UTAP_test_and_example_data/example_and_data_for_testing_hg38_RNA-seq/
Once the download is finished, log in to the UTAP2 site as the admin USER and select the Transcriptome RNA-Seq pipeline from the “Choose pipeline” scroll-bar. Proceed to complete the form according to the instructions provided below.
1.Select the folder $HOST_MOUNT/utap-output/admin/example_and_data_for_testing_hg38_RNA-seq/fastq using the browser in the “Input folder” field.
2.Select “Run DESeq2” in “DESeq2 run” choice field
3.Select Homo_sapiens(hg38) in the “Genome” field
4.Select Homo_sapiens(genecode) in the “Annotation” field
5.Fill in your email in the “User email” field, if the field is empty
6.Fill in the DESeq2 category boxes as follow:
- siTAZ
IQ-siTAZ
IY-siTAZ
IZ-siTAZ
- siC
IQ-siC
IY-siC
IZ-siC
It is crucial to ensure that all category names are identical to the names mentioned above as seen in the screenshot below. This is of utmost importance for verifying the successful completion of the UTAP2 run test.
7.Add batch effect by clicking on “Add Batch Effect” button.
Select IQ-siTAZ and IQ-siC samples and mark them as Batch 1 (red color) by clicking on the button “Batch 1”.
Select IY-siTAZ and IY-siC samples and mark them as Batch 2 (brown color) by clicking on the button “Batch 2”.
Select IZ-siTAZ and IZ-siC samples and mark them as Batch 3 (green color) by clicking on the button “Batch 3”.
8.click on “Run analysis” button
Here is a screen shot of the RNA-Seq pipeline form for the example data.
View pipeline output
After submitting the run, you will be directed to the “User Datasets” page, which can also be accessed by navigating to “User Datasets” in the site navigation bar. This page allows you to track the progress of all the runs. Within a few seconds of starting the run, a folder named $HOST_MOUNT/utap-output/admin/<run_id>_<run_name>_Transcriptome_RNA-Seq will be generated. This folder contains the pipeline output for each step, organized in separate folders.
The folder “4_reports/<report_name>_<run_id>” contains graphs, statistics, and additional information for all the pipeline steps. Once the run is completed, you will receive an email with links to the results report. For a detailed interactive explanation of the report, you can utilize the relevant e-learning module available in the site navigation bar.
An example of the pipeline output can be found at: https://dors4.weizmann.ac.il/utap/UTAP_test_and_example_data/example_and_data_for_testing_hg38_RNA-seq/20230613_081343_test_Transcriptome_RNA-Seq
For further details, please refer to the “Help” tab in the site navigation bar.
Check pipeline output
After the run is finished, you can verify the successful completion of the test run by executing the script test_UTAP.sh. This script compares the results from your pipeline with the example results available at https://dors4.weizmann.ac.il/utap/UTAP_test_and_example_data/example_and_data_for_testing_hg38_RNA-seq/20230613_081343_test_Transcriptome_RNA-Seq.
To run the script, follow the instructions below (no need to download the results example):
export HOST_MOUNT=<the relevant path>
cd $HOST_MOUNT
chmod +x $HOST_MOUNT/utap-output/admin/example_and_data_for_testing_hg38_RNA-seq/test_files/test_UTAP.sh
. $HOST_MOUNT/utap-output/admin/example_and_data_for_testing_hg38_RNA-seq/test_files/test_UTAP.sh
If the run is successfully completed, the output message “UTAP test run succeeded” will be displayed. In case any issues arise during the run or testing process, please contact us for further assistance (utap@weizmann.ac.il).
Generate new genome index and annotation file
Only admin users have the authority to generate new genome indexes and annotation files. For RNA-Seq, MARS-Seq, and SCRB-Seq pipelines, please adhere to the following instructions to generate STAR indexes and GTF files. For ATAC-Seq and ChIP-Seq pipelines, please refer to the instructions provided to generate Bowtie2 indexes. It’s important to note that currently, new indexes and annotation files cannot be generated for the Ribo-Seq pipeline. Additionally, it’s worth mentioning that all demultiplexing pipelines do not require any genome indexes or annotation files.
Generate STAR (v2.7.10.a) index and GTF for RNA-Seq, MARS-Seq and SCRB-Seq pipelines
To generate a new index for an organism other than human, mouse and zebrafish, make sure to download first the genome fasta file and annotation file and then run the following command, replacing <organism_name>, <organism_alias>, <host_mount>, <version> (only numbers), <source>, <fasta_full_path>, <gtf_full_path> and <gtf_annotation> (such as: GENCODE/RefSeq/Ensembl etc) with your actual values:
export ORGANISM=<organism_name>
export ALIAS=<organism_alias>
export VERSION=<version> #only numbers
export SOURCE=<source>
export HOST_MOUNT=<host_mount>
export FASTA=<fasta_full_path>
export GTF=<gtf_full_path>
export GTF_ANNOTATION=<gtf_annotation>
Then run the following command:
cd $HOST_MOUNT
source all_parameters
source singularity_variables
export FASTA_BS=$(basename "$FASTA")
export FASTA_DN=$(dirname "$FASTA")
export GTF_NO_EXT=$(basename "$GTF" | cut -d. -f1)
export GTF_DN=$(dirname "$GTF")
export GTF_BS=$(basename "$GTF")
export GTF_PATH=$GENOMES_DIR/star/$ORGANISM/annotation/$GTF_BS
eval $SINGULARITY_HOST_COMMAND; singularity exec -B $FASTA_DN -B $GTF_DN $IMAGE_PATH bash -c "/opt/miniconda3/envs/utap/bin/STAR --runMode genomeGenerate --runThreadN 30 --genomeDir $GENOMES_DIR/star/$ORGANISM/star_index --genomeFastaFiles $FASTA && echo \"from analysis.models import StarGenome, StarAnnotation; \
fk_star = StarGenome(creature=\"\\'$ORGANISM\\'\", alias=\"\\'$ALIAS\\'\", version=\"\\'$VERSION\\'\", source=\"\\'$SOURCE\\'\", path=\"\\'$GENOMES_DIR/star/$ORGANISM/star_index\\'\"); \
fk_star.save(); \
StarAnnotation(genome=fk_star, creature=\"\\'$ALIAS\\'\", alias=\"\\'$GTF_ANNOTATION\\'\", version=$VERSION, source=\"\\'$SOURCE\\'\", path=\"\\'$GTF_PATH\\'\", path3p=\"\\'$GTF_PATH\\'\").save()\" \ | /opt/miniconda3/envs/utap-Django/bin/python /opt/utap/manage.py shell"; unset all_parameters
Generate Bowtie2 index
To generate a new index for an organism other than human, mouse, make sure to download first the genome fasta file and then run the following command, replacing <organism_name>, <organism_alias>, <host_mount>, <version> (only numbers), <source>, <fasta_full_path> with your actual values:
export ORGANISM=<organism_name>
export ALIAS=<organism_alias>
export VERSION=<version> #only numbers
export SOURCE=<source>
export HOST_MOUNT=<host_mount>
export FASTA=<fasta_full_path>
Then run the following command:
cd $HOST_MOUNT
source all_parameters
source singularity_variables
export FASTA_BS=$(basename "$FASTA")
export FASTA_DN=$(dirname "$FASTA")
eval "$SINGULARITY_HOST_COMMAND"; singularity exec -B $FASTA_DN $IMAGE_PATH bash -c "mkdir -p $GENOMES_DIR/bowtie2/$ORGANISM/bowtie2_index && /opt/miniconda3/envs/utap/bin/bowtie2-build $FASTA $GENOMES_DIR/bowtie2/$ORGANISM/bowtie2_index && echo \"from analysis.models import Bowtie2Genome; \
Bowtie2Genome(creature=\"\\'$ORGANISM\\'\", alias=\"\\'$ALIAS\\'\", version=\"\\'$VERSION\\'\", source=\"\\'$SOURCE\\'\", path=\"\\'$GENOMES_DIR/bowtie2/$ORGANISM/bowtie2_index\\'\", fasta=\"\\'$FASTA\\'\").save()\" \ | /opt/miniconda3/envs/utap-Django/bin/python /opt/utap/manage.py shell"; unset all_parameters
Parameters
Required parameters
- HOST_MOUNT
Mount point from the singularity on the host (full path of the folder).
This is the folder that contains all UTAP2 installation files.
All input and output data for all of the users will be written into this folder.
- ADMIN_PASS
Password of an admin in the UTAP2 database.
(The password must contain at least one uppercase character, one lowercase character, and one digit).
- REPLY_EMAIL
Support email for users. Users receive emails from this address. If you provide a Gmail address, please ensure that you input your correct Gmail app password in the field “MAIL_PASSWORD” within the optional_parameters.conf file. Refer to https://support.google.com/accounts/answer/185833?hl=en for getting gmail app password.
- MAX_CORES
Maximum cores in the host computer or in each node of the cluster.
- MAX_MEMORY
Maximum memory in MB in the host computer or in each node of the cluster.
Optional parameters
- USER
User in host server that has permission to run cluster commands (if run with cluster), run singularity commands and write into the $HOST_MOUNT folder (user can have “fakeroot” permissions).
The default is: USER=$USER
- DNS_HOST
DNS address of the host server.
For example: http://servername.ac.il or servername.ac.il
The default is the IPv4 address of the host server (can be obtained with the command ‘hostname -I’)
- MAIL_SERVER
Domain name of the mail server.
For example: mg.weizmann.ac.il
The default is: REPLY_EMAIL= None
- MAIL_PASSWORD
Password associated to the REPLY_EMAIL address in required_parameters.conf file.
The default is: MAIL_PASSWORD=None
- HOST_APACHE_PORT
Any available port on the host server for the singularity Apache.
For example: 8080
The default is: HOST_APACHE_PORT= 7000
- INSTITUTE_NAME
Your institute name or lab
(the string can contain only A-Za-z0-9 characters without whitespaces).
The default is: INSTITUTE_NAME=None
- MAX_UPLOAD_SIZE
Maximum file/folder size that a user can upload at once (Megabytes).
For example: 314572800 (i.e. 300*1024*1024 = 314572800Mb = 300Gb)
The default is: MAX_UPLOAD_SIZE =314572800
- CONDA
Full path to miniconda’s env root folder.
A full miniconda3 env exist inside the container .
For example: /miniconda3
The default is: CONDA=None
When default parameter is used the env at /opt/miniconda3 inside the container will be used.
- PROXY_URL
UTAP2’s URL if you are using proxy server. default: DNS_HOST:HOST_APACHE_PORT
- RUN_NGSPLOT
Set to 1 if for running NGS-plot.
The default is: RUN_NGSPLOT=1
- HOST_HOME_DIR
The USER home directory on the host.
For example: /home/username
The default is: $HOME
- DB_PATH
Full path to the folder where the DB will be located.
$USER needs to have write permission for this folder.
The DB is very small, so it is will not create disk space problems.
For example: mkdir /utap-db; chown -R $USER/utap-db;
The default is: DB_PATH=$HOST_MOUNT/UTAP_DB
- GENOMES_DIR
The full path to the genomes directory.
The default is: GENOMES_DIR =$HOST_MOUNT/genomes
- SINGULARITY_TMP_DIR
Singularity uses a temporary directory to build the squashfs filesystem, and this temp space needs to be at least 36GB large to hold the entire resulting Singularity image.
If you use “fakeroot” privileges, make sure that the tmp directory is local and not NFS or GPFS mounted disc.
The default is: SINGULARITY_TMP_DIR=/tmp
- FAKEROOT
Set to 1 If USER has “fakeroot” privileges.
The default is: FAKEROOT=None
- SINGULARITY_HOST_COMMAND
Singularity command on the host.
for example: if singularity is installed as module named Singularity on the host then the command will be: ”ml Singularity”
The default is: SINGULARITY_HOST_COMMAND=None
Additional optional parameters for installing on a cluster:
- CLUSTER_TYPE
Type of the cluster.
For example: lsf/pbs/slurm/sge/load leveler/local.
The commands will be sent to the cluster. Currently, UTAP2 supports load sharing facility (LSF), Portable Batch System (PBS), Slurm, Sun Grid Engine (SGE) and LoadLeveler clusters (note that UTAP2 was executed and tested onlt on LSF and Slurm clusters).
When “local” parameter is used, UTAP2 pipelines will be run on the local host inside the container.
The default is: CLUSTER_TYPE=local
- CLUSTER_QUEUE
Queue name in the cluster. $USER must have permissions to run on this queue.
The default is: CLUSTER_QUEUE=None
- SINGULARITY_CLUSTER_COMMAND
Singularity command on the cluster.
For example: if singularity is installed as module named Singularity on the cluster, then command will be: ”ml Singularity”
The default is: SINGULARITY_CLUSTER_COMMAND=None
Additional example for testing
Run MARS-Seq pipeline with example data
For testing UTAP2, you can download the fastq and additional test files for the MARS-Seq pipeline folders noted below.
cd $HOST_MOUNT
wget --no-check-certificate -nH --cut-dirs=3 -r --reject='index.html*' --exclude-directories=/20230520_231819_test_Transcriptome_MARS-Seq -P $HOST_MOUNT/utap-output/admin/example_and_data_for_testing_mm10_MARS-seq https://dors4.weizmann.ac.il/utap/UTAP_test_and_example_data/example_and_data_for_testing_mm10_MARS-seq/
Once the download has finished, log in to the UTAP2 site as the admin USER and select the Transcriptome MARS-Seq pipeline from the “Choose pipeline” scroll-bar. Proceed to complete the form according to the instructions provided below.
1.Select the folder $HOST_MOUNT/utap-output/admin/example_and_data_for_testing_mm10_MARS-seq/fastq using the browser in the “Input folder” field.
2.Select “Run DESeq2: in “DESeq2 run” choice field
3.Select Mus_musculus(mm10) in the “Genome” field
4.Select Mus_musculus(genecode) in the “Annotation” field
5.Fill in your email in the “User email” field, if the field is empty
- 6.Fill in the DESeq2 category boxes as follow:
- MG_cont
GFAP_reporter_12hLPS_MG3_cont_1
- MG_LPS
GFAP_reporter_12hLPS_MG1_LPS_1
GFAP_reporter_12hLPS_MG2_LPS_2
- astro_cont
GFAP_reporter_12hLPS_astro3_cont_1
- astro_LPS
GFAP_reporter_12hLPS_astro1_LPS_1
GFAP_reporter_12hLPS_astro2_LPS_2
- CD45_cont
GFAP_reporter_12hLPS_CD45_3_cont_1
- CD45_LPS
GFAP_reporter_12hLPS_CD45_1_LPS_1
GFAP_reporter_12hLPS_CD45_2_LPS_2
It is crucial to ensure that all category names (e.g “MG_cont”, “MG_LPS” etc. or refer to the screenshot below) are identical to the names mentioned above. This is of utmost importance for verifying the successful completion of the UTAP2 run test.
7.click on “Run analysis” button
Here is a screen shot of the MARS-Seq pipeline form for the example data.
View pipeline output
After submitting the run, you will be directed to the “User Datasets” page, which can also be accessed by navigating to “User Datasets” in the site navigation bar. This page allows you to track the progress of all the runs. Within a few seconds of starting the run, a folder named $HOST_MOUNT/utap-output/admin/<run_id>_<run_name>_Transcriptome_MARS-Seq will be generated. This folder contains the pipeline output for each step, organized in separate folders.
The folder “10_reports/<report_name>_umi_counts_<run_id>” contains graphs, statistics, and additional information for all the pipeline steps. Once the run is completed, you will receive an email with links to the results report. For a detailed interactive explanation of the report, you can utilize the relevant e-learning module available in the site navigation bar.
An example of the pipeline output can be found at: https://dors4.weizmann.ac.il/utap/UTAP_test_and_example_data/example_and_data_for_testing_mm10_MARS-seq/20230520_231819_test_Transcriptome_MARS-Seq
For further details, please refer to the “Help” tab in the site navigation bar.
Check pipeline output
After the run is finished, you can verify the successful completion of the test run by executing the script test_UTAP.sh. This script compares the results from your pipeline with the example results available at https://dors4.weizmann.ac.il/utap/UTAP_test_and_example_data/exammple_and_data_for_testing_mm10_MARS-seq/20230520_231819_test_Transcriptome_MARS-Seq.
To run the script, follow the instructions below:
cd $HOST_MOUNT
chmod +x $HOST_MOUNT/utap-output/admin/example_and_data_for_testing_mm10_MARS-seq/test_files/test_UTAP.sh
. $HOST_MOUNT/utap-output/admin/example_and_data_for_testing_mm10_MARS-seq/test_files/test_UTAP.sh
If the run is successfully completed, the output message “UTAP test run succeeded” will be displayed. In case any issues arise during the run or testing process, please contact us for further assistance.
REMARKS
PBS cluster installation was prepared but not tested.
Within the $DB_PATH folder, a file named db.sqlite3 will be created.
The db.sqlite3 file serves as the application’s database, storing user details and links to results within the $HOST_MOUNT folder.
The $HOST_MOUNT folder contains data for all users, including input and output files.
It is important to note that the db.sqlite3 database and $HOST_MOUNT folder are located on the host server, outside of the container. Consequently, deleting or stopping the “utap” container will not remove the database or $HOST_MOUNT folder.
In the event of a temporary Singularity deletion, it is advised to retain the database (“db.sqlite3”) and the corresponding $HOST_MOUNT folder. Upon rerunning Singularity using the install_UTAP_singularity.sh script, the existing database (“db.sqlite3”) and $HOST_MOUNT folder will be utilized.