Afficher la notice abrégée

dc.creatorGrigoriadis D., Perdikopanis N., Georgakilas G.K., Hatzigeorgiou A.en
dc.date.accessioned2023-01-31T08:27:17Z
dc.date.available2023-01-31T08:27:17Z
dc.date.issued2020
dc.identifier10.1007/978-3-030-45385-5_55
dc.identifier.isbn9783030453848
dc.identifier.issn03029743
dc.identifier.urihttp://hdl.handle.net/11615/73703
dc.description.abstractThe spread, distribution and utilization of transcription start sites (TSS) experimental evidence within promoters are poorly understood. Cap Analysis of Gene Expression (CAGE) has emerged as a popular gene expression profiling protocol, able to quantitate TSS usage by recognizing the 5′ end of capped RNA molecules. However, there is an increasing volume of studies in the literature suggesting that CAGE can also detect 5′ capping events which are transcription byproducts. These findings highlight the need for computational methods that can effectively remove the excessive amount of noise from CAGE samples, leading to accurate TSS annotation and promoter usage quantification. In this study, we present an annotation agnostic computational framework, DIANA Signal-TSS (DiS-TSS), that for the first time utilizes digital signal processing inspired features customized on the peculiarities of CAGE data. Features from the spatial and frequency domains are combined with a robustly trained Support Vector Machines (SVM) model to accurately distinguish between peaks related to real transcription initiation events and biological or protocol-induced noise. When benchmarked on experimentally derived data on active transcription marks as well as annotated TSSs, DiS-TSS was found to outperform existing implementations, by providing on average ~11k positive predictions and an increase in performance by ~5% based on in the experimental and annotation-based evaluations. © Springer Nature Switzerland AG 2020.en
dc.language.isoenen
dc.sourceLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)en
dc.source.urihttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85085186554&doi=10.1007%2f978-3-030-45385-5_55&partnerID=40&md5=69731804cd4dc3b49017be48ceebdc10
dc.subjectBioinformaticsen
dc.subjectBiomedical engineeringen
dc.subjectDigital signal processingen
dc.subjectRNAen
dc.subjectSupport vector machinesen
dc.subjectComputational frameworken
dc.subjectExperimental evidenceen
dc.subjectGene expression profilingen
dc.subjectInduced noiseen
dc.subjectRNA moleculesen
dc.subjectSpatial and frequency domainen
dc.subjectTranscription initiationen
dc.subjectTranscription start siteen
dc.subjectTranscriptionen
dc.subjectSpringeren
dc.titleDiS-TSS: An Annotation Agnostic Algorithm for TSS Identificationen
dc.typeconferenceItemen


Fichier(s) constituant ce document

FichiersTailleFormatVue

Il n'y a pas de fichiers associés à ce document.

Ce document figure dans la(les) collection(s) suivante(s)

Afficher la notice abrégée