Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Feb 28;48(4):e21.
doi: 10.1093/nar/gkz1198.

Disentangling sRNA-Seq data to study RNA communication between species

Affiliations

Disentangling sRNA-Seq data to study RNA communication between species

José Roberto Bermúdez-Barrientos et al. Nucleic Acids Res. .

Abstract

Many organisms exchange small RNAs (sRNAs) during their interactions, that can target or bolster defense strategies in host-pathogen systems. Current sRNA-Seq technology can determine the sRNAs present in any symbiotic system, but there are very few bioinformatic tools available to interpret the results. We show that one of the biggest challenges comes from sequences that map equally well to the genomes of both interacting organisms. This arises due to the small size of the sRNAs compared to large genomes, and because a large portion of sequenced sRNAs come from genomic regions that encode highly conserved miRNAs, rRNAs or tRNAs. Here, we present strategies to disentangle sRNA-Seq data from samples of communicating organisms, developed using diverse plant and animal species that are known to receive or exchange RNA with their symbionts. We show that sequence assembly, both de novo and genome-guided, can be used for these sRNA-Seq data, greatly reducing the ambiguity of mapping reads. Even confidently mapped sequences can be misleading, so we further demonstrate the use of differential expression strategies to determine true parasite-derived sRNAs within host cells. We validate our methods on new experiments designed to probe the nature of the extracellular vesicle sRNAs from the parasitic nematode Heligmosomoides bakeri that get into mouse intestinal epithelial cells.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Factors that influence the number of ambiguous k-mers between pairs of genomes. X-axes represent the k-mer size and Y-axes the fraction of shared or ambiguous k-mers. (A) Random genomes of sizes equivalent to those of A. thaliana and B. cinerea. (B) Fixed A. thaliana genome, compared to full B. cinerea genome or a sample corresponding to 50% or 10% of the complete genome. (C) All genomes were subsampled to the size of the smallest, that of B. japonicum. (D) Real fractions of ambiguous k-mers in each pair of complete genomes. Insets correspond to a zoomed in area of k-mer sizes 18–23.
Figure 2.
Figure 2.
Genomic origin of ambiguous reads from libraries of A. thaliana stems 4cm above a C. campestris haustorium. Each bar represents the sequenced reads of one size between 18 and 50 nucleotides. Bar height represents the actual number of reads (top) or the fraction of reads (bottom). (A) Mapping categories are: host (green), symbiont (blue) or ambiguous (purple). (B) Genomic annotation of ambiguous reads only: intergenic (light green), miRNA (yellow), rRNA (light purple), tRNA (red) or other annotation (orange).
Figure 3.
Figure 3.
Percent of ambiguous and symbiont reads before and after assembly. The name of the two interacting species is shown for each experiment above three bars. All 18–50nt reads were classified and the percent of each category were averaged across each experiment's samples. The first bar of each group represents unassembled reads, the second de novo contigs, the third genome-guided clusters. The Y-axes are independently zoomed and cut to highlight the percent of symbiont (blue) and ambiguous (purple) reads. Host reads (green) always represent the remainder of 100%.
Figure 4.
Figure 4.
Evaluation of differential expression results. Reads were categorized as 22G (yellow), 22U (red), or other (grey) based on length and first-nucleotide. (A) sRNA profiles of control samples: purified H. bakeri Extracellular Vesicles (EVs) and untreated MODE-K cells. Bar height represents the fraction of all reads. (B) sRNA profiles of unassembled reads, de novo contigs and genome-guided clusters. For each of these sets, there are two bars, the first one represents differentially expressed up-regulated elements (Up) and the second, elements that lack evidence for differential expression in this direction (non-Up). Bar height represents the number of reads (top) or the fraction of reads (bottom) belonging to these categories. (C) Percent of reads from pure H. bakeri EV libraries, recovered during differential expression analysis of MODE-K cells treated with EVs. Each circle represents the total reads from EV libraries. The recovered fractions are indicated in blue. The numbers of H. bakeri differentially expressed elements are shown to the right, as well as total read counts (from B).

References

    1. Benner S.A. Extracellular ‘communicator RNA’. FEBS Lett. 1988; 233:225–228. - PubMed
    1. Taylor D.D., Gercel-Taylor C.. The origin, function, and diagnostic potential of RNA within extracellular vesicles present in human biological fluids. Front. Genet. 2013; 4:142. - PMC - PubMed
    1. Chen X., Liang H., Zhang J., Zen K., Zhang C.-Y.. Secreted microRNAs: a new form of intercellular communication. Trends Cell Biol. 2012; 22:125–132. - PubMed
    1. Hoy A.M., Buck A.H.. Extracellular small RNAs: what, where, why. Biochem. Soc. Trans. 2012; 40:886–890. - PMC - PubMed
    1. Turchinovich A., Samatov T.R., Tonevitsky A.G., Burwinkel B.. Circulating miRNAs: cell-cell communication function. Front. Genet. 2013; 4:119. - PMC - PubMed

Publication types