Visualizing High-Dimensional Single-Cell RNA-seq Data via Random Projections and Geodesic Distances
Fecha
2019Language
en
Materia
Resumen
The recent advent in Next Generation Sequencing has created a huge data source which offers a great potential for elucidating complex disease mechanisms and biological processes. A recent technology is the single-cell RNA sequencing, which allows transcriptomics measurements in individual cells, having promising results. However, such studies measure the entire genome for thousands of cells, creating datasets with extremely high dimensionality and complexity. Following this perspective, we propose a dimensionality reduction approach, called RGt-SNE, which visualizes single-cell RNA-seq data in two dimensions. Initially, RGt-SNE defines a cell-cell distance matrix based on Random Projections and Geodesic Distances. The first is used to define the pairwise cells distances in a low dimensional projected space avoiding the difficulties that exist in data with ultra-high dimensionality. The latter is used to better define the large pairwise cells distances. Subsequently, the t-SNE method is applied in the customized distance matrix for two dimensional visualization. RGt-SNE was evaluated in two real experimental single-cell RNA-seq data against three well-known methods, such as t-SNE, Multidimensional scaling, and ISOMAP. Outcomes provide the superiority of RGt-SNE suggesting it as a reliable tool for single-cell RNA-seq data analysis and visualization. © 2019 IEEE.