From single-cell transcriptomics to single-molecule counting
Author: Islam, Saiful
Date: 2013-11-19
Location: Sal Gustaf Retzius (Cesar), Retzius väg 20, Karolinska institutet, Solna Campus
Time: 09.00
Department: Inst för medicinsk biokemi och biofysik / Dept of Medical Biochemistry and Biophysics
Abstract
RNA-sequencing (RNA-seq) technology has been progressing so fast in the last few years and made it possible to perform transcriptome analysis at single-cell level that was even unimaginable a few years before. Nowadays, the importance of gene expression analysis at the single-cell level is increasingly appreciated for the study of complex heterogeneous tissue. Also, in order to solve the obscure and no consensus definition of cell types, the single-cell gene expression analysis approach will be important. In this thesis we described a novel approach of single-cell gene expression profiling, called Single-cell Tagged Reverse Transcription (STRT). Here we can analyse 96 single cells at the same time by using a DNA barcode to tag individual single-cell. We analysed both 41 embryonic stem cells (ES) and 44 mouse embryonic fibroblast (MEF) cells in a 96-well PCR plate using STRT and successfully separated the distinct cell types from each other by cluster analysis using only the gene expression profile. We believe that the strategy to discriminate cell types using gene expression profiling will empower the unbiased discovery and analysis of heterogeneous cell populations in both normal and diseases tissue. However, the low efficiency of cDNA synthesis (~ 10%) and PCR amplification bias of this first STRT version reduced throughput and also decreased the possibility to detect genes that were expressed at lower level.
To improve the efficiency of cDNA synthesis, we significantly improved the template switching (TS) mechanism – the mechanism to make secondary oligonucleotide from RNA during first strand cDNA synthesis. Since most of the present single-cell methods use TS mechanism, the base preference of terminal transferase activity was studied in detail. We found that an NGG motif at the 3 ́ end of the template switching oligonucleotide (TSO) will work better than a GGG motif.
To remove the amplification bias we introduced a molecular barcode, a 5 bp short random degenerate sequence, which entirely eliminate the amplification bias. As this short random sequence uniquely labels each single molecule, their exact number can be determined. By introducing a microfluidic sample preparation, Fluidigm C1, we ensure the quality of cells that undergo sample processing until sequencing. In this updated method, called STRT-C1, we used 96 ES cells and split it as two final libraries consisting of one single strand cDNA library and one double strand cDNA obtained by amplifying the first single strand cDNA. We also used and analysed spike-in RNA in the same experiment. Both the reproducibility (correlation coefficient >99.5 within and between sample) and efficiency of reverse transcription (~48%, 5 folds more than previous version of STRT) of the improved method are excellent. Correlation coefficient for the endogenous gene within and between samples is much better at the molecule level than at the read level, especially for the genes of low abundance.
By analysing only the spike-in RNAs, we found that technical noise is minimized and we can observe true biological noise for endogenous genes. Some biological noise is intrinsic at the single-cell level but most of the genes showed only low level of noise. However, we detected ~173 genes in ES cells which shows significant noise. Our data also revealed that biologically noisy genes have a significant function to give a resonant state of the embryonic stem cell.
In conclusion, the single-cell molecule counting method makes it possible to count molecules in single- cell accurately and without any bias. So we believe our STRT-C1 version of the single-cell method is a significant step forward for all expression analysis.
To improve the efficiency of cDNA synthesis, we significantly improved the template switching (TS) mechanism – the mechanism to make secondary oligonucleotide from RNA during first strand cDNA synthesis. Since most of the present single-cell methods use TS mechanism, the base preference of terminal transferase activity was studied in detail. We found that an NGG motif at the 3 ́ end of the template switching oligonucleotide (TSO) will work better than a GGG motif.
To remove the amplification bias we introduced a molecular barcode, a 5 bp short random degenerate sequence, which entirely eliminate the amplification bias. As this short random sequence uniquely labels each single molecule, their exact number can be determined. By introducing a microfluidic sample preparation, Fluidigm C1, we ensure the quality of cells that undergo sample processing until sequencing. In this updated method, called STRT-C1, we used 96 ES cells and split it as two final libraries consisting of one single strand cDNA library and one double strand cDNA obtained by amplifying the first single strand cDNA. We also used and analysed spike-in RNA in the same experiment. Both the reproducibility (correlation coefficient >99.5 within and between sample) and efficiency of reverse transcription (~48%, 5 folds more than previous version of STRT) of the improved method are excellent. Correlation coefficient for the endogenous gene within and between samples is much better at the molecule level than at the read level, especially for the genes of low abundance.
By analysing only the spike-in RNAs, we found that technical noise is minimized and we can observe true biological noise for endogenous genes. Some biological noise is intrinsic at the single-cell level but most of the genes showed only low level of noise. However, we detected ~173 genes in ES cells which shows significant noise. Our data also revealed that biologically noisy genes have a significant function to give a resonant state of the embryonic stem cell.
In conclusion, the single-cell molecule counting method makes it possible to count molecules in single- cell accurately and without any bias. So we believe our STRT-C1 version of the single-cell method is a significant step forward for all expression analysis.
List of papers:
I. Saiful Islam*, Una Kjällquist*, Annalena Moliner, Pawel Zajac, Jian-Bing Fan, Peter Lönnerberg, and Sten Linnarsson. Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq. Genome Research. (2011) Jul; 21(7): 1160-7.
Fulltext (DOI)
Pubmed
View record in Web of Science®
II. Saiful Islam, Una Kjällquist, Annalena Moliner, Pawel Zajac, Jian-Bing Fan, Peter Lönnerberg, & Sten Linnarsson. Highly multiplexed and strand-specific single-cell RNA 5′ end sequencing. Nature Protocol. (2012). Volume 7, No. 5 (813-828).
Fulltext (DOI)
Pubmed
View record in Web of Science®
III. Pawel Zajac, Saiful Islam, Hannah Hochgerner, Peter Lönnerberg, and Sten Linnarsson. Base preferences in non-templated nucleotide incorporation by MMLV derived reverse transcriptases. [Manuscript]
IV. Saiful Islam, Amit Zeisel, Simon Joost, Gioele La Manno, Pawel Zajac, Maria Kasper, Peter Lönnerberg & Sten Linnarsson. Quantitative single-cell RNA-seq. [Manuscript]
I. Saiful Islam*, Una Kjällquist*, Annalena Moliner, Pawel Zajac, Jian-Bing Fan, Peter Lönnerberg, and Sten Linnarsson. Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq. Genome Research. (2011) Jul; 21(7): 1160-7.
Fulltext (DOI)
Pubmed
View record in Web of Science®
II. Saiful Islam, Una Kjällquist, Annalena Moliner, Pawel Zajac, Jian-Bing Fan, Peter Lönnerberg, & Sten Linnarsson. Highly multiplexed and strand-specific single-cell RNA 5′ end sequencing. Nature Protocol. (2012). Volume 7, No. 5 (813-828).
Fulltext (DOI)
Pubmed
View record in Web of Science®
III. Pawel Zajac, Saiful Islam, Hannah Hochgerner, Peter Lönnerberg, and Sten Linnarsson. Base preferences in non-templated nucleotide incorporation by MMLV derived reverse transcriptases. [Manuscript]
IV. Saiful Islam, Amit Zeisel, Simon Joost, Gioele La Manno, Pawel Zajac, Maria Kasper, Peter Lönnerberg & Sten Linnarsson. Quantitative single-cell RNA-seq. [Manuscript]
Institution: Karolinska Institutet
Supervisor: Linnarsson, Sten
Issue date: 2013-10-28
Rights:
Publication year: 2013
ISBN: 978-91-7549-313-8
Statistics
Total Visits
Views | |
---|---|
From ...(legacy) | 1219 |
From ... | 295 |
Total Visits Per Month
March 2024 | April 2024 | May 2024 | June 2024 | July 2024 | August 2024 | September 2024 | |
---|---|---|---|---|---|---|---|
From ... | 2 | 1 | 1 | 3 | 3 | 3 | 2 |
File Visits
Views | |
---|---|
Main Thesis_Saiful.pdf(legacy) | 1088 |
Main Thesis_Saiful.pdf | 648 |
Spikblad_Saiful.pdf(legacy) | 179 |
Spikblad_Saiful.pdf | 40 |
Top country views
Views | |
---|---|
United States | 617 |
Sweden | 141 |
China | 121 |
Germany | 111 |
Japan | 57 |
United Kingdom | 48 |
Russia | 26 |
Ukraine | 24 |
Canada | 21 |
Finland | 19 |
Top cities views
Views | |
---|---|
Ashburn | 91 |
Stockholm | 46 |
Sunnyvale | 38 |
Kiez | 33 |
Guangzhou | 28 |
Kiev | 24 |
Wako | 22 |
Beijing | 20 |
Cambridge | 20 |
Romeo | 16 |