2019-10-07-文献解读(一)

题目:《综合单细胞异质性分析方法来定义人类心脏核心转录因子层次结构》

截止2019-10月引用率6次!

摘要

诱导人多能干细胞分化为心肌细胞(hiPSC-CMs)的技术已成为相关疾病建模和治疗测试的有力工具。然而,由于不成熟和异质性的的存在使得推广仍然受到的限制。为了阐明这种异质性的原因,作者在hiPSC心肌诱导分化和成人心脏组织细胞中应用了单细胞转录组和常规转录组测序技术。通过整合及拼接数据等分析,观察到了超过六个的不同单细胞亚群,其中几群细胞在分化的某个时间点(第30天不同)被重复观测到。为了剖析与每个细胞群相关的不同心脏核心转录因子的调控作用,本文使用了single-cell 和 bulk RNA-seq、CRISPR技术、ChIP-seq,同时配合电生理、钙成像和CyTOF分析检测到三个转录因子(NR2F2、TBX5和HEY2)的上调或下调产生的影响。汇总这些靶标、数据和基因组分析方法为理解体外细胞异质性提供了一个强大的平台。

首先是样品,建库测序,RNA-seq上游分析概况

样品来源
  • Two hiPSC lines were obtained from the Stanford Cardiovascular Institute biobank (CVI0076, CVI0059). CVI0059 was processed for single cell RNA-seq at day 5, day 14, and day 45 of the cardiomyocyte differentiation protocol using the 10X Genomics single-cell RNA-seq v1 kit.CVI0076 was processed for single cell RNA-seq at day 0, day 5, day 14, and day 45 use v2 kit.
  • Libraries were quantified using Bioanalyzer (Agilent) and qPCR (KAPA) analysis. Libraries were sequenced on the NextSeq 500 (Illumina).
  • Unsupervised cell population discovery analyses were performed with Seurat-CCA and the software ICGS available in AltAnalyze version 2.1.1 (http://www.altanalyze.org)
  • For these analyses, only protein-coding genes were considered, applying a correlation cutoff of 0.3 and Euclidean column HOPACH clustering. Associated t-SNE visualizations were obtained in AltAnalyze using ICGS obtained dynamically regulated genes.
  • ERCC spike-ins were included for further evaluation of sample quality.
  • libraries were pooled and sequenced using Illumina’s HiSeq 2000 using 2 × 100 paired-end sequencing (Macrogen, South Korea)
  • Filtered reads were aligned to the reference genome hg19 using STAR
  • Using STAR BAM files, AltAnalyze was used to generate exon read counts for gene expression analysis and junction read counts for splicing analysis
  • All retained single-cell libraries were required to have a minimum of 1 million uniquely aligning paired-end fragments and > 40% aligned fragments, based on STAR analysis. The retained libraries had an average of ~3 million aligned fragments.
  • To calculate RPKM values for each gene, AltAnalyze was run on the junction and exon BED files using default settings
  • To identify discrete cell states, unsupervised clustering was initially performed to define predominant populations (ICGS module of AltAnalyze, Pearson correlation coefficient > 0.4).
  • Although this analysis identified three initial populations, we augmented these results using a supervised analysis of cardiac transcription factors from our 10X Genomics identified using the ICGS supervised correlation option.
  • In agreement with our Fluidigim C1 microscopy analyses, no gene expression signatures with evident “doublet cell” profiles (more than one cell population signature) were discerned from this analysis.
  • Furthermore, ERCC spike-in expression (ERCC92.fa, Kallisto TPM) ratios indicated single-cell transcriptome profiles were being assessed.
  • the MarkerFinder algorithm in AltAnalyze was run to identify additional genes with population- restricted expression profiles (Pearson correlation coefficient > 0.4).
  • Additional differentiations were performed on NR2F2GE1 (N = 2), TBX5GE1 (N = 2), HEY2GE1 (N = 2), NR2F2GE2 (N = 4), TBX5GE2 (N = 3), and HEY2GE2 (N = 2) lines and sequenced using Illumina’s HiSeq 4000 2 × 150 paired end sequencing (Novogene).
  • Pseudotemporal ordering of these cells with the software Monocle designated SF1-expressing cardiomyocytes as the “earliest” population and HOPX as the latest, suggesting that cardiomyocyte subpopulations underlie distinct cardiac maturation states
  • Data availability
    GSE81585;
    10x Genomics synapse ID: syn7818379.

然后是质量控制情况,最后的表达矩阵是多少个基因多少个细胞

  • 200 hiPSC-CMs at day 30 were run throuth Fluidigm C1 microfluidic chip to capture single hiPSC-CMs (site 8shown) and processed for single-cell RNA-seq.
  • Cells were labeled using a viability dye(Calcein-AM) to ensure RNA for live cells were processed. IHC TNNT2,MYL2,ACTC1,MYL7 marker
  • 54 hiPSC-CMs were successfully sequenced which expressed cardiac markers
  • single cell 10X genomics RNA-seq clusters called transcription factor and GO terms related to cardiac developmental progression
  • Monocle applied to single- cell RNA-seq was used to identify a pseudotime progression of different populations of hiPSC-CMs in relation to each other.

接着介绍作者是如何挑选重要的基因和降维

  • To visualize and interpret the high- dimensional dataset generated, we applied the t-SNE algorithm based on seven cardiac markers preselected for the dataset, in which individual cells in the high-dimensional space were pro- jected onto a two-dimensional map but their neighboring rela- tionship was preserved.


    Heatmap of gene markers specific for each day of differentiation. Selected cardiac specific genes are overlaid in the right panel.

降维后的聚类以及对每个类的注释

tSNE聚类后注释

与bulk RNA 测序得到的 基因marker进行重新注释分组

Single cells from day 30 of differentiation were profiled using an independent technology (Fluidigm C1) to resolve coincident mid-to-late state differentiation heterogeneity.
PCA聚类Single-cell population-specific genes(ICGS/MarkerFinder) and expression profiles were used to populate the LineageProfiler signature database

Evaluation of single-cell population heterogeneity among replicate bulk time-course samples is preformed using K-nearest neghbor-based classification of bulk RNA-seq time-course samples with the software LineageProfiler

Overlay of tSNE maps of cardiomyocytes derived from hESCs undergoing cardiac differentiation from day 8 to day 18 to day 30.Each point represents a single cell, and different colors represent samples from different colors represent samples from different time points.

An extended panel of relevant cardiac marker expression patterns.cells are colored based on the intensity of expression of the indicated markers. Higher expression of MLC2A was noted throughout each time point of differentiation, whereas high levels of MLC2V could only be observed at day 30

Bulk RNA seq: RNA-seq expression from each day of differentiation was analyzed for transcription factors demonstration a greater than two-fold change in expression from each day. A two-fold increase in expression was denoted as the start of transcription factor expression, whereas the corresponding two-fold decrease in expression was denoted as acessation of expression.

类的下游分析(差异分析或者实验验证等)

Chip-seq 验证 marker基因
类的差异基因分析
  • Given that our single-cell RNA-seq of the wildtype and genome-edited lines suggested that NR2F2, TBX5, and HEY2 can regulate atrial-like and ventricular-like signatures, we next quantified the expression of these transcription factors within the adult heart
  • RNA-seq of the human atria confirmed that NR2F2 and TBX5 are specifically enriched within the atria, and HEY2 is highly enriched within the ventricle.(Supplementary Fig. 5E).
  • RNA-seq quantification demonstrated that MYL2 is highly expressed within ventricular tissue, while MYL7 is enriched within atrial tissue(Supplementary Fig. 5F).
  • differentiating hiPSC-CMs reveals that MYL2 is only observed at later differentiation time points (e.g. day 30 and day 90) (Supplementary Fig. 5G).


    supFigure5

总结一下

  • 本文作者通过对human embryonic stem cell-derived cardiomyocytes (hESC-CMs) 以及 human induced pluripotent stem cell-derived cardiomyocytes (hiPSC- CMs)取不同时间点及相应的转录因子上调下调表达后选取特定时间的样本进行single-cell 和 bulk RNA-seq的分析,确定了由不同基因表达谱富集的hiPSC-CM的亚种群。意义是由于心肌细胞的再生性差,损伤修复较困难,而且受损后严重危害人群健康,科学家们研究了hiPSC- CMs来治疗心肌损伤,但是hiPSC- CMs自身的混杂导致了预后的异质性,因此本文用单细胞测序的技术找到这个混杂的干细胞分化的的心肌细胞的特殊分化时期亚型所高表达的细胞标记基因,从而实现分类富集相应的亚群的心肌细胞,降低混杂差异提高治疗效果非常值得期待。
最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 217,277评论 6 503
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 92,689评论 3 393
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 163,624评论 0 353
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 58,356评论 1 293
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 67,402评论 6 392
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 51,292评论 1 301
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 40,135评论 3 418
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 38,992评论 0 275
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 45,429评论 1 314
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 37,636评论 3 334
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 39,785评论 1 348
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 35,492评论 5 345
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 41,092评论 3 328
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 31,723评论 0 22
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 32,858评论 1 269
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 47,891评论 2 370
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 44,713评论 2 354

推荐阅读更多精彩内容