Single-Cell Epigenomics and Functional Fine-Mapping of Atherosclerosis GWAS Loci

Tiit Örd, Kadri Õunap, Lindsey K Stolze, Redouane Aherrahrou, Valtteri Nurminen, Anu Toropainen, Ilakya Selvarajan, Tapio Lönnberg, Einari Aavik, Seppo Ylä-Herttuala, Mete Civelek, Casey E Romanoski, Minna U Kaikkonen.
Circ Res.. 2021-05-24;129(2):240-258.
Abstract
Genome-wide association studies have identified hundreds of loci associated with coronary artery disease (CAD). Many of these loci are enriched in cisregulatory elements but not linked to cardiometabolic risk factors nor to candidate causal genes, complicating their functional interpretation. Objective: Single-nucleus chromatin accessibility profiling of the human atherosclerotic lesions was used to investigate cell type–specific patterns of cisregulatory elements, to understand transcription factors establishing cell identity, and to interpret CAD-relevant, noncoding genetic variation. Methods and Results: We used single-nucleus ATAC-seq (assay for transposase-accessible chromatin with sequencing) to generate DNA accessibility maps in >7000 cells derived from human atherosclerotic lesions. We identified 5 major lesional cell types including endothelial cells, smooth muscle cells, monocyte/macrophages, natural killer/T cells, and B cells and further investigated subtype characteristics of macrophages and smooth muscle cells transitioning into fibromyocytes. We demonstrated that CAD-associated genetic variants are particularly enriched in endothelial and smooth muscle cell–specific open chromatin. Using single-cell coaccessibility and cis–expression quantitative trait loci information, we prioritized putative target genes and candidate regulatory elements for ≈30% of all known CAD loci. Finally, we performed genome-wide experimental fine-mapping of the CAD variants identified in genome-wide association studies using epigenetic quantitative trait loci analysis in primary human aortic endothelial cells and self-transcribing active regulatory region sequencing (STARR-Seq) massively parallel reporter assay in smooth muscle cells. This analysis identified potential causal single-nucleotide polymorphisms (SNPs) and the associated target gene for over 30 CAD loci. We present several examples where the chromatin accessibility and gene expression could be assigned to one cell type predicting the cell type of action for CAD loci. Conclusions: These findings highlight the potential of applying single-nucleus ATAC-seq to human tissues in revealing relative contributions of distinct cell types to diseases and in identifying genes likely to be influenced by noncoding genome-wide association study variants. Keywords: atherosclerosis, coronary artery disease, genetics, genome-wide association study, myocardial infarction
Consortium data used in this publication
The data used for the analyses described in this article were obtained from the Genotype-Tissue Expression Portal on May 1, 2020 (phs000424.v8.p2). Anonymized data and materials have been made publicly available at the figshare repository and can be accessed at https://doi.org/10.6084/m9.figshare.14501985. Please see the Data Supplement for detailed methods.
Datasets
DSR132DPW

Related data

Available data
website
File format
other format
Data summary
Single-cell ATAC-Seq data from human atherosclerosis. The files contain cells from endarterectomy samples from three donors, yielding approximately 7000 cells total. snATAC-Seq libraries were generated using the 10x Genomics scATAC-Seq v1 kit. The files were produced by the cellranger-atac v1.1 pipeline commands 'count' and 'aggr' (without normalization). The genome reference used was hg19 Reference v1.1.0.