Zaloguj się

Aby wyświetlić tę treść, wymagana jest subskrypcja JoVE. Zaloguj się lub rozpocznij bezpłatny okres próbny.

W tym Artykule

  • Podsumowanie
  • Streszczenie
  • Wprowadzenie
  • Protokół
  • Wyniki
  • Dyskusje
  • Ujawnienia
  • Podziękowania
  • Materiały
  • Odniesienia
  • Przedruki i uprawnienia

Podsumowanie

The protocol submitted here explains the complete in silico pipeline needed to predict and functionally characterize circRNAs from RNA-sequencing transcriptome data studying host-pathogen interactions.

Streszczenie

Circular RNAs (circRNAs) are a class of non-coding RNAs that are formed via back-splicing. These circRNAs are predominantly studied for their roles as regulators of various biological processes. Notably, emerging evidence demonstrates that host circRNAs can be differentially expressed (DE) upon infection with pathogens (e.g., influenza and coronaviruses), suggesting a role for circRNAs in regulating host innate immune responses. However, investigations on the role of circRNAs during pathogenic infections are limited by the knowledge and skills required to carry out the necessary bioinformatic analysis to identify DE circRNAs from RNA sequencing (RNA-seq) data. Bioinformatics prediction and identification of circRNAs is crucial before any verification, and functional studies using costly and time-consuming wet-lab techniques. To solve this issue, a step-by-step protocol of in silico prediction and characterization of circRNAs using RNA-seq data is provided in this manuscript. The protocol can be divided into four steps: 1) Prediction and quantification of DE circRNAs via the CIRIquant pipeline; 2) Annotation via circBase and characterization of DE circRNAs; 3) CircRNA-miRNA interaction prediction through Circr pipeline; 4) functional enrichment analysis of circRNA parental genes using Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG). This pipeline will be useful in driving future in vitro and in vivo research to further unravel the role of circRNAs in host-pathogen interactions.

Wprowadzenie

Host-pathogen interactions represent a complex interplay between the pathogens and host organisms, which triggers the hosts' innate immune responses that eventually result in the removal of invading pathogens1,2. During pathogenic infections, a multitude of the host immune genes is regulated to inhibit the replication and release of pathogens. For example, common interferon-stimulated genes (ISGs) regulated upon pathogenic infections include ADAR1, IFIT1, IFIT2, IFIT3, ISG20, RIG-I, and OASL3,4. Besides protein-coding genes, studies have also reported that non-coding RNAs such as long non-coding RNAs (lncRNAs), microRNAs (miRNAs), and circular RNAs (circRNAs) also play a role and are regulated concurrently during pathogenic infections5,6,7. In contrast to protein-coding genes that mainly encode proteins as functional molecules, non-coding RNAs (ncRNAs) are known to function as regulators of genes at transcriptional and post-transcriptional levels. However, studies involving the participation of non-coding RNAs, particularly circRNAs, in regulating the hosts' immune genes are not well reported compared to the protein-coding genes.

CircRNAs are widely characterized by their covalently closed continuous loop structure, which is generated through a non-canonical splicing process called back-splicing8. The process of back-splicing, unlike the splicing process of cognate linear RNAs, involves the ligation of the downstream donor site to the upstream acceptor site, forming a circular-shaped structure. Currently, three different back-splicing mechanisms for the biogenesis of circRNAs have been proposed. These are RNA binding protein (RBP) mediated circularization9,10, intron-pairing-driven circularization11, and lariat-driven circularization12,13,14. Given that circRNAs are connected end-to-end in a circular structure, they tend to be naturally resistant to normal exonuclease digestions and, thus, are considered to be more stable than their linear counterparts15. Another common characteristic exhibited by circRNAs includes the cell or tissue type-specific expression in hosts16.

As implied by their unique structure and cell or tissue-specific expression, circRNAs have been discovered to play important biological functions in cells. To date, one of the prominent functions of circRNAs is their role as microRNA (miRNA) sponges17,18. This regulatory role of circRNAs occurs through the complementary binding of circRNA nucleotides with the seed region of miRNAs. Such a circRNA-miRNA interaction inhibits the miRNAs' normal regulatory functions on target mRNAs, thus regulating the expression of genes19,20. Additionally, circRNAs are also known to regulate gene expression by interacting with RNA binding proteins (RBPs) and forming RNA-protein complexes21. Although circRNAs are classified as non-coding RNAs, there is also evidence that circRNAs can act as templates for protein translation22,23,24.

Recently, circRNAs have been demonstrated to play pivotal roles in regulating the host-pathogen interactions, particularly between the hosts and viruses. Generally, host circRNAs are assumed to assist in regulating the host's immune responses to eliminate the invading pathogens. An example of circRNA that promotes host immune responses is circRNA_0082633, reported by Guo et al.25. This circRNA enhances type I interferon (IFN) signaling within A549 cells, which helps to suppress influenza virus replication25. Moreover, Qu et al. also reported a human intronic circRNA, called circRNA AIVR, that promotes immunity by regulating the expression of CREB-binding protein (CREBBP), a signal transducer of IFN-β26,27. However, circRNAs that are known to promote the pathogenesis of disease upon infection also exist. For example, Yu et al. recently reported the role played by a circRNA spliced from the GATA zinc finger domain containing the 2A gene (circGATAD2A) in promoting the H1N1 virus replication through the inhibition of host cell autophagy28.

To effectively study circRNAs, a genome-wide circRNA prediction algorithm is usually implemented, followed by an in silico characterization of the predicted circRNA candidates before any functional studies can be carried out. Such a bioinformatics approach to predict and characterize circRNAs is less costly and more time efficient. It helps to refine the number of candidates to be functionally studied and could potentially lead to novel findings. Here, we provide a detailed bioinformatics-based protocol for the in silico identification, characterization and functional annotation of circRNAs during the host-pathogen interactions. The protocol includes the identification and quantification of circRNAs from RNA-sequencing datasets, annotation via circBase, and the characterization of the circRNA candidates in terms of circRNA types, number of overlapping genes, and predicted circRNA-miRNA interactions. This study also provides the functional annotation of the circRNA parental genes through Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis.

Protokół

In this protocol, de-identified ribosomal RNA (rRNA)-depleted RNA-seq library datasets prepared from the influenza A virus-infected human macrophage cells were downloaded and used from the Gene Expression Omnibus (GEO) database. The entire bioinformatics pipeline from prediction to functional characterization of circRNAs is summarized in Figure 1. Each part of the pipeline is further explained in the sections below.

1. Preparation, download, and setup before data analysis

NOTE: All software packages used in this study are free and open-source.

  1. Downloading the required tools on the Linux platform
    1. Download and install the required software and tools listed in the Table of Materials on a Linux high-performance computer using the instructions provided by the developer.
      NOTE: Most of the tools and software have their own online GitHub pages or documentation containing instructions on installing and using their tools (refer to the Table of Materials).
    2. Download the desired RNA-seq datasets for circRNA detection and analysis from sequence archive websites (e.g., European Nucleotides Archive and Gene Expression Omnibus).
    3. Download the reference genome(s) (FASTA format) and annotation files (GTF/GFF3 format), compatible with the host from which the RNA-seq dataset was prepared. Host reference genome(s) and annotation files are usually found on online genome browsers such as the National Center for Biotechnology Information (NCBI), the University of California Santa Cruz (UCSC), and the Ensembl websites.
  2. Quality checking of RNA-seq
    1. Input the FASTQ files into the FASTQC program to determine the quality of RNA sequences. If the quality of the FASTQ files are low (e.g., <Q20) or there is a presence of adapter sequences, further trimming might be needed using tools such as Trimmomatic29,30.

2. Prediction and differential expression analysis of circRNAs using CIRIquant

NOTE: A more detailed manual on installing and performing differential expression analysis can be found in the code availability section of the CIRIquant paper31. The supplementary data also includes some of the basic commands used in this protocol.

  1. CircRNA predictions
    1. Index the host's reference genome first using BWA and HISAT2 aligners. Then, on a Linux terminal, execute the commands bwa index32 and hisat2-build33 in the directory of the host's reference genome to index it.
    2. Next, prepare a YML config file containing the name of the file, the path of tools (BWA, HISAT2, stringtie34, samtools35), the path to the downloaded reference files (host's reference genome FASTA files, annotation files), and the path to the index files from step 2.1.1.
    3. Execute the CIRIquant tool from terminal using either the default or manual parameters. The user can specify the library type (either stranded or non-stranded) of the RNA-seq data when executing the CIRIquant tool.
      NOTE: The library type of the RNA-seq data can be determined by knowing the type of library preparation kit used. If the identity of the library preparation kit is unknown, an RNA-seq control bioinformatic package called RSeQC36 can be used to determine the strandedness of RNA-seq data.
  2. Differential expression analysis
    NOTE: CIRIquant package includes prep_CIRIquant, prepDE.py, and CIRI_DE_replicate; therefore, no additional downloads are needed for these three tools.
    1. Prepare a text file (.lst) with a list of data containing the following:
      1st column: IDs of the RNA-seq data used in step 2.1.3
      2nd column: path to the GTF files outputted by CIRIquant
      3rd column: grouping of the RNA-seq data, whether it is a control or treated group.
    2. For an example, refer to Table 1 below.
      NOTE: It is not necessary to put in the headers as they are just for reference.
    3. On the Linux terminal, run prep_CIRIquant with the text file (.lst) prepared in step 2.2.1 as an input. The run will generate a list of files: library_info.csv, circRNA_info.csv, circRNA_bsj.csv, and circRNA_ratio.csv.
    4. Prepare a second text file with a list of data containing the RNA-seq IDs and the path to their respective StringTie output. The file layout must be similar to the text file in step 2.2.1 without the grouping column.
    5. Run prepDE.py with the text file prepared in step 2.2.4 as an input to generate the gene count matrix files.
    6. Execute CIRI_DE_replicate with the library_info.csv and circRNA_bsj.csv files from step 2.2.3 and the gene_count_matrix.csv file from step 2.2.5 as inputs to output the final circRNA_de.tsv file.
  3. Filtering of DE circRNAs
    1. Use R (in the computer terminal or RStudio) or any spreadsheet software (e.g., Microsoft Excel) to open the circRNA_de.tsv file generated from step 2.2.6 to filter and determine the number of differentially expressed (DE) circRNAs.
    2. Filter the DE circRNAs according to the criteria LogFC > |2| and FDR < 0.05.
    3. Create a file named DE_circRNAs.txt to store the information of DE circRNAs.

3. Characterization and annotation of predicted DE circRNAs

  1. Annotation status of DE circRNAs
    1. Load the file named DE_circRNAs.txt in RStudio , which consists of the list of DE circRNAs filtered from step 2.3.3. Include other information such as the genomic positions (Chr, Start, End), strand orientations (+ or -), gene name, and circRNA type. Before proceeding, convert the circRNA genomic start coordinates from CIRIquant to 0-based by subtracting 1 base pair.
      NOTE: The other information stated above can be obtained from the GTF files outputted by CIRIquant (Supplementary File 1).
    2. Determine the annotation status of the predicted DE circRNAs by downloading a library containing the genomic positions of the circRNA-database (e.g., circBase) deposited circRNAs.
      NOTE: Assure that the genome version used to predict the circRNAs is identical to the circRNA database library before making the comparison. The circBase data file used here is freely available in the drive folder provided in Github (https://github.com/bicciatolab/Circr)37.
    3. Once both the files from step 3.1.1 and step 3.1.2 are prepared, run the R script given in Supplementary File 1. Chromosomal locations of DE circRNAs are queried to the library before assigning the status Annotated or Unannotated.
  2. Characterization of DE circRNAs
    1. Use R and other spreadsheet software to summarize the number of circRNAs according to the circRNA types (i.e., exon, intron, intergenic, and antisense) and the number of genes that the circRNAs span across (1 or >1) (Supplementary File 1).​NOTE: CIRIquant can only detect four types of circRNAs (exon, intron, intergenic, and antisense). Exon-intron circRNAs, also known as ElciRNAs, cannot be detected by CIRIquant.

4. Predicting the circRNA-miRNA interaction using Circr

NOTE: A more detailed manual on how to install and use Circr for the circRNA-miRNA interaction analysis can be found at: https://github.com/bicciatolab/Circr37.

  1. Preparation of files
    1. Unzip and extract the contents of the Circr.zip file after downloading it from the Circr GitHub page using the relevant software such as "WinRar" or "7-zip" into a new directory where the analysis will be conducted.
    2. Install the prerequisite software applications (miRanda, RNAhybrid, Pybedtools, and samtools) before conducting the circRNA-miRNA analysis.
    3. Reference genomes and annotation files for several organisms of interest, rRNA coordinates file, validated miRNA interaction file, and circBase circRNA files are provided by the Circr author in the Github page (https://github.com/bicciatolab/Circr)37. Upon clicking on the support files in drive folder, select the folder for the organism of interest, miRNA folder, and the circBase text file and download it.
    4. After downloading the necessary files in step 4.1.3, create a new directory named support_files in the directory mentioned in step 4.1.1. Then, unzip and extract the content into the support_files directory.
    5. Index the reference genome file of the organism of interest using the samtools faidx command (Supplementary File 1).
    6. Prepare an input file consisting of the coordinates of DE circRNAs of interest in a tab-delimited BED file, as shown in Table 2.
      NOTE: Because circRNAs predicted by CIRIquant are not 0-based, it is necessary to minus 1 bp at the starting coordinate (as mentioned in step 3.1.1) before converting them to the BED format. The headers shown in Table 2 are just for reference and are not needed in the BED files.
    7. At this point, ensure that the expected folder tree structure for Circr analysis is as in Figure 2.
  2. Running Circr.py
    1. Execute Circr.py using Python 3, and as arguments, specify the circRNA input file, the FASTA genome of the organism of interest, the genome version of the selected organism, the number of threads, and the name of the output file in the command line.
    2. If the organism of interest is not provided in the drive folder listed in step 4.1.3 or if the user prefers to have a custom set of files to run the analysis, additional commands specifying the location of these files need to be included when executing Circr.py.
    3. After the Circr analysis is complete, the program outputs a circRNA-miRNA interaction file in the csv format.
    4. Filter the circRNA-miRNA interaction results according to the user-specific preference. For this study, the predictions are filtered using Rstudio according to the criteria below:
      -Detected by all three software tools
      -Two or more binding sites reported by both Targetscan and miRanda
      -Identified in either the "AGO" or "validated" columns
      -​Filter out no seed region interactions
    5. Write the circRNAs that pass the filtered conditions from step 4.2.3 into a new text file named circRNA_miRNA.txt. Such filtering can increase the confidence of the predicted interactions.

5. Construction of the ceRNA network

NOTE: A detailed manual on how to use Cytoscape can be found at: http://manual.cytoscape.org/en/stable/ and https://github.com/cytoscape/cytoscape-tutorials/wiki#introduction 

  1. Download and preparation
    1. Download the latest version of Cytoscape38 from: https://cytoscape.org/download.html.
    2. Execute the installer wizard downloaded in step 5.1.1 and select the file location for the Cytoscape software.
    3. Prepare a tab-delimited file containing the circRNAs of interest and their target miRNA. The first column consists of the circRNA name; the second column specifies the type of RNA from the first column; the third column is the target miRNA; and the fourth column specifies the type of RNA from the third column. An example of the file is shown in Table 3.
  2. Constructing the ceRNA network map
    1. Open the Cytoscape software installed in step 5.1.2.
    2. In Cytoscape, navigate to File > Import > Network from File. Select the file that has been prepared in step 5.1.3.
    3. In the new tab, select the first and second column as "Source Node" and "Source Node Attribute" while select the third and fourth column as "Target Node" and "Target Node Attribute" respectively. Click OK and the network will show up on the upper right side of Cytoscape.
    4. To change the visual style of the network, press the Style button on the left side of Cytoscape.
    5. Press the arrow on the right side of Fill Color. Choose Type for the column and Discrete Mapping for the mapping type. Then, select the color desired for each of the RNA types.
    6. After changing the color, change the shape of the nodes by navigating to Shape and following step 5.2.5.

6. Functional enrichment analysis

  1. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis for the parental gene of the circRNAs
    1. Ensure the clusterProfiler39,40 and org.Hs.eg.db41 packages have been installed in Rstudio. The org.Hs.eg.db41 package is a genome-wide annotation package only for humans. If the organism of interest is another species, refer to: https://bioconductor.org/packages/release/BiocViews.html#OrgDb
    2. Import the DE_circRNA information from step 2.3.1 into the Rstudio workspace.
    3. Use the parental gene of the circRNAs provided in this file for enrichment analysis in the upcoming steps. However, if the user wishes to convert the gene symbol to other formats, such as the Entrez ID, use a function such as "bitr".
    4. By using the gene ID as an input, run the GO enrichment analysis using the enrichGO function within the clusterProfiler39,40 package using default parameters.
    5. By using the gene ID as an input, run the KEGG enrichment analysis using the enrichKEGG function within the clusterProfiler39,40 package using the default parameters.

Wyniki

The protocol enlisted in the previous section was modified and configured to suit the Linux OS system. The main reason is that most module libraries and packages involved in the analysis of circRNAs can only work on the Linux platform. In this analysis, de-identified ribosomal RNA (rRNA)-depleted RNA-seq library datasets prepared from the Influenza A virus-infected human macrophage cells were downloaded from the GEO database42 and used to generate the representative results.

Dyskusje

To illustrate the utility of this protocol, RNA-seq from influenza A virus-infected human macrophage cells was used as an example. CircRNAs functioning as potential miRNA sponges in host-pathogen interactions and their GO and KEGG functional enrichment within a host were investigated. Although there are a variety of circRNA tools available online, each of them is a standalone package that does not interact with one another. Here, we put together few of the tools that are required for circRNA prediction and quantification...

Ujawnienia

The authors have nothing to disclose.

Podziękowania

The author would like to thank Tan Ke En and Dr. Cameron Bracken for their critical review of this manuscript. This work was supported by grants from Fundamental Research Grant Scheme (FRGS/1/2020/SKK0/UM/02/15) and University of Malaya High Impact Research Grant (UM.C/625/1/HIR/MOE/CHAN/02/07).

Materiały

NameCompanyCatalog NumberComments
BedtoolsGitHubhttps://github.com/arq5x/bedtools2/Referring to section 4.1.2. Needed for Circr.
BWABurrows-Wheeler Alignerhttp://bio-bwa.sourceforge.net/Referring to section 2.1.1 and 2.1.2. Needed to run CIRIquant, and to index the genome
CircrGitHubhttps://github.com/bicciatolab/CircrReferring to section 4. Use to predict the miRNA binding sites
CIRIquantGitHubhttps://github.com/bioinfo-biols/CIRIquantReferring to section 2.1.3. To predict circRNAs
ClusterprofilerGitHubhttps://github.com/YuLab-SMU/clusterProfilerReferring to section 7. For GO and KEGG functional enrichment
CPUIntel Intel(R) Xeon(R) CPU E5-2620 V2 @ 2.10 GHz   Cores: 6-core CPU Memory: 65 GB Graphics card: NVIDIA GK107GL (QUADRO K2000) Specifications used to run this entire protocol.
CytoscapeCytoscapehttps://cytoscape.org/download.htmlReferring to section 5.2. Needed to plot ceRNA network
FastQCBabraham Bioinformaticshttps://www.bioinformatics.babraham.ac.uk/projects/fastqc/Referring to section 1.2.1. Quality checking on Fastq files
HISAT2http://daehwankimlab.github.io/hisat2/Referring to section 2.1.1 and 2.1.2. Needed to run CIRIquant, and to index the genome
LinuxUbuntu 20.04.5 LTS (Focal Fossa)https://releases.ubuntu.com/focal/Needed to run the entire protocol. Other Ubuntu versions may still be valid to carry out the protocol.
miRandahttp://www.microrna.org/microrna/getDownloads.doReferring to section 4.1.2. Needed for Circr
Pybedtoolspybedtools 0.8.2https://pypi.org/project/pybedtools/Needed for BED file genomic manipulation
PythonPython 2.7 and 3.6 or aboverhttps://www.python.org/downloads/To run necessary library modules
RThe Comprehensive R Archive Networkhttps://cran.r-project.org/To manipulate dataframes
RNAhybridBiBiServhttps://bibiserv.cebitec.uni-bielefeld.de/rnahybridReferring to section 4.1.2. Needed for Circr
RStudioRStudiohttps://www.rstudio.com/A workspace to run R
samtools SAMtoolshttp://www.htslib.org/Referring to section 2.1.2. Needed to run CIRIquant
StringTieJohns Hopkins University: Center for Computational Biologyhttp://ccb.jhu.edu/software/stringtie/index.shtmlReferring to section 2.1.2. Needed to run CIRIquant
TargetScanGitHubhttps://github.com/nsoranzo/targetscanReferring to section 4.1.2. Needed for Circr

Odniesienia

  1. Raman, K., Bhat, A. G., Chandra, N. A systems perspective of host-pathogen interactions: predicting disease outcome in tuberculosis. Molecular BioSystems. 6 (3), 516-530 (2010).
  2. Casadevall, A., Pirofski, L. A. Host-pathogen interactions: basic concepts of microbial commensalism, colonization, infection, and disease. Infection and Immunity. 68 (12), 6511-6518 (2000).
  3. Yang, E., Li, M. M. H. All About the RNA: Interferon-stimulated genes that interfere with viral RNA processes. Frontiers in Immunology. 11, 605024 (2020).
  4. Schneider, W. M., Chevillotte, M. D., Rice, C. M. Interferon-stimulated genes: A complex web of host defenses. Annual Review of Immunology. 32 (1), 513-545 (2014).
  5. Shirahama, S., Miki, A., Kaburaki, T., Akimitsu, N. Long non-coding RNAs involved in pathogenic infection. Frontiers in Genetics. 11, 454 (2020).
  6. Chandan, K., Gupta, M., Sarwat, M. Role of host and pathogen-derived microRNAs in immune regulation during infectious and inflammatory diseases. Frontiers in Immunology. 10, 3081 (2019).
  7. Chen, X., et al. Circular RNAs in immune responses and immune diseases. Theranostics. 9 (2), 588-607 (2019).
  8. Kristensen, L. S., et al. The biogenesis, biology and characterization of circular RNAs. Nature Reviews Genetics. 20 (11), 675-691 (2019).
  9. Ashwal-Fluss, R., et al. circRNA biogenesis competes with pre-mRNA splicing. Molecular Cell. 56 (1), 55-66 (2014).
  10. Conn, S. J., et al. The RNA binding protein quaking regulates formation of circRNAs. Cell. 160 (6), 1125-1134 (2015).
  11. Zhang, X. O., et al. Complementary sequence-mediated exon circularization. Cell. 159 (1), 134-147 (2014).
  12. Robic, A., Demars, J., Kuhn, C. In-depth analysis reveals production of circular RNAs from non-coding sequences. Cells. 9 (8), 1806 (2020).
  13. Eger, N., Schoppe, L., Schuster, S., Laufs, U., Boeckel, J. N. Circular RNA splicing. Advances in Experimental Medicine and Biology. 1087, 41-52 (2018).
  14. Barrett, S. P., Wang, P. L., Salzman, J. Circular RNA biogenesis can proceed through an exon-containing lariat precursor. eLife. 4, 07540 (2015).
  15. Memczak, S., et al. Circular RNAs are a large class of animal RNAs with regulatory potency. Nature. 495 (7441), 333-338 (2013).
  16. Misir, S., Wu, N., Yang, B. B. Specific expression and functions of circular RNAs. Cell Death and Differentiation. 29 (3), 481-491 (2022).
  17. Bai, S., et al. Construct a circRNA/miRNA/mRNA regulatory network to explore potential pathogenesis and therapy options of clear cell renal cell carcinoma. Scientific Reports. 10 (1), 13659 (2020).
  18. Sakshi, S., Jayasuriya, R., Ganesan, K., Xu, B., Ramkumar, K. M. Role of circRNA-miRNA-mRNA interaction network in diabetes and its associated complications. Molecular Therapy - Nucleic Acids. 26, 1291-1302 (2021).
  19. Hansen, T. B., et al. miRNA-dependent gene silencing involving Ago2-mediated cleavage of a circular antisense RNA. The EMBO Journal. 30 (21), 4414-4422 (2011).
  20. Lu, M. Circular RNA: functions, applications, and prospects. ExRNA. 2 (1), 15 (2020).
  21. Liu, K. S., Pan, F., Mao, X. D., Liu, C., Chen, Y. J. Biological functions of circular RNAs and their roles in occurrence of reproduction and gynecological diseases. American Journal of Translational Research. 11 (1), 1-15 (2019).
  22. Pamudurti, N. R., et al. Translation of CircRNAs. Molecular Cell. 66 (1), 9-21 (2017).
  23. Legnini, I., et al. Circ-ZNF609 Is a circular RNA that can be translated and functions in myogenesis. Molecular Cell. 66 (1), 22-37 (2017).
  24. Weigelt, C. M., et al. An insulin-sensitive circular RNA that regulates lifespan in Drosophila. Molecular Cell. 79 (2), 268-279 (2020).
  25. Guo, Y., et al. Identification and characterization of circular RNAs in the A549 cells following Influenza A virus infection. Veterinary Microbiology. 267, 109390 (2022).
  26. Qu, Z., et al. A novel intronic circular RNA antagonizes influenza virus by absorbing a microRNA that degrades CREBBP and accelerating IFN-β production. mBio. 12 (4), 0101721 (2021).
  27. Kawarada, Y., et al. TGF-β induces p53/Smads complex formation in the PAI-1 promoter to activate transcription. Scientific Reports. 6 (1), 35483 (2016).
  28. Yu, T., et al. Circular RNA GATAD2A promotes H1N1 replication through inhibiting autophagy. Veterinary Microbiology. 231, 238-245 (2019).
  29. FastQC: A quality control tool for high throughput sequence data. Available from: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (2010)
  30. Bolger, A. M., Lohse, M., Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30 (15), 2114-2120 (2014).
  31. Zhang, J., Chen, S., Yang, J., Zhao, F. Accurate quantification of circular RNAs identifies extensive circular isoform switching events. Nature Communications. 11 (1), 90 (2020).
  32. Li, H., Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 26 (5), 589-595 (2010).
  33. Kim, D., Paggi, J. M., Park, C., Bennett, C., Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nature Biotechnology. 37 (8), 907-915 (2019).
  34. Pertea, M., et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nature Biotechnology. 33 (3), 290-295 (2015).
  35. Li, H., et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 25 (16), 2078-2079 (2009).
  36. Wang, L., Wang, S., Li, W. RSeQC: quality control of RNA-seq experiments. Bioinformatics. 28 (16), 2184-2185 (2012).
  37. Dori, M., Caroli, J., Forcato, M. Circr, a computational tool to identify miRNA:circRNA associations. Frontiers in Bioinformatics. 2, 852834 (2022).
  38. Shannon, P., et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research. 13 (11), 2498-2504 (2003).
  39. Wu, T., et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. The Innovation. 2 (3), 100141 (2021).
  40. Yu, G., Wang, L. G., Han, Y., He, Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS: A Journal of Integrative Biology. 16 (5), 284-287 (2012).
  41. . org.Hs.eg.db: Genome wide annotation for human. 2022. R package version 3.15.0 Available from: https://bioconductor.org/packages/release/data/annotation/html/org.Hs.eg.db.html (2022)
  42. Barrett, T., et al. NCBI GEO: archive for functional genomics data sets-update. Nucleic Acids Research. 41, 991-995 (2012).
  43. Gao, Y., Zhang, J., Zhao, F. Circular RNA identification based on multiple seed matching. Briefings in Bioinformatics. 19 (5), 803-810 (2018).
  44. Zhang, X. O., et al. Diverse alternative back-splicing and alternative splicing landscape of circular RNAs. Genome Research. 26 (9), 1277-1287 (2016).
  45. Memczak, S., et al. Circular RNAs are a large class of animal RNAs with regulatory potency. Nature. 495 (7441), 333-338 (2013).
  46. Wang, K., et al. MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Research. 38 (18), 178 (2010).
  47. Song, X., et al. Circular RNA profile in gliomas revealed by identification tool UROBORUS. Nucleic Acids Research. 44 (9), 87 (2016).
  48. Hansen, T. B. Improved circRNA identification by combining prediction algorithms. Frontiers in Cell and Developmental Biology. 6, 20 (2018).
  49. Robinson, M. D., McCarthy, D. J., Smyth, G. K. edgeR: A bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 26 (1), 139-140 (2010).
  50. Ma, X. K., et al. CIRCexplorer3: A CLEAR pipeline for direct comparison of circular and linear RNA expression. Genomics Proteomics Bioinformatics. 17 (5), 511-521 (2019).
  51. Gaffo, E., Buratin, A., Dal Molin, A., Bortoluzzi, S. Sensitive, reliable and robust circRNA detection from RNA-seq with CirComPara2. Briefings in Bioinformatics. 23 (1), (2022).
  52. Glažar, P., Papavasileiou, P., Rajewsky, N. circBase: a database for circular RNAs. RNA. 20 (11), 1666-1670 (2014).
  53. Tan, S., et al. Circular RNA F-circEA-2a derived from EML4-ALK fusion gene promotes cell migration and invasion in non-small cell lung cancer. Molecular Cancer. 17 (1), 138 (2018).
  54. Guarnerio, J., et al. Oncogenic role of Fusion-circRNAs Derived from cancer-associated chromosomal translocations. Cell. 165 (2), 289-302 (2016).
  55. McGeary, S. E., et al. The biochemical basis of microRNA targeting efficacy. Science. 366 (6472), (2019).
  56. Enright, A. J., et al. MicroRNA targets in Drosophila. Genome Biology. 5 (1), 1 (2003).
  57. Rehmsmeier, M., Steffen, P., Hochsmann, M., Giegerich, R. Fast and effective prediction of microRNA/target duplexes. RNA. 10 (10), 1507-1517 (2004).
  58. Zhang, D., et al. AllEnricher: a comprehensive gene set function enrichment tool for both model and non-model species. BMC Bioinformatics. 21 (1), 106 (2020).
  59. Zhou, Y., et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nature Communications. 10 (1), 1523 (2019).

Przedruki i uprawnienia

Zapytaj o uprawnienia na użycie tekstu lub obrazów z tego artykułu JoVE

Zapytaj o uprawnienia

Przeglądaj więcej artyków

CircRNAsHost pathogen InteractionsRNA AnalysisRNA PredictionFunctional EnrichmentMicro RNA InteractionCCE RNA NetworkProgramming LanguagesLinux TerminalGenome IndexingCiriquant ToolRNA Sequence DataGene Count MatrixDifferentially Expressed CircRNAs DE CircRNAs

This article has been published

Video Coming Soon

JoVE Logo

Prywatność

Warunki Korzystania

Zasady

Badania

Edukacja

O JoVE

Copyright © 2025 MyJoVE Corporation. Wszelkie prawa zastrzeżone