Logo
    • English
    • Ελληνικά
    • Deutsch
    • français
    • italiano
    • español
  • Ελληνικά 
    • English
    • Ελληνικά
    • Deutsch
    • français
    • italiano
    • español
  • Σύνδεση
Προβολή τεκμηρίου 
  •   Ιδρυματικό Αποθετήριο Πανεπιστημίου Θεσσαλίας
  • Επιστημονικές Δημοσιεύσεις Μελών ΠΘ (ΕΔΠΘ)
  • Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ.
  • Προβολή τεκμηρίου
  •   Ιδρυματικό Αποθετήριο Πανεπιστημίου Θεσσαλίας
  • Επιστημονικές Δημοσιεύσεις Μελών ΠΘ (ΕΔΠΘ)
  • Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ.
  • Προβολή τεκμηρίου
JavaScript is disabled for your browser. Some features of this site may not work without it.
Ιδρυματικό Αποθετήριο Πανεπιστημίου Θεσσαλίας
Όλο το DSpace
  • Κοινότητες & Συλλογές
  • Ανά ημερομηνία δημοσίευσης
  • Συγγραφείς
  • Τίτλοι
  • Λέξεις κλειδιά

DeepTSS: multi-branch convolutional neural network for transcription start site identification from CAGE data

Thumbnail
Συγγραφέας
Grigoriadis D., Perdikopanis N., Georgakilas G.K., Hatzigeorgiou A.G.
Ημερομηνία
2022
Γλώσσα
en
DOI
10.1186/s12859-022-04945-y
Λέξη-κλειδί
Bioinformatics
Computational methods
Convolution
Convolutional neural networks
Deep learning
Learning systems
Molecular biology
Proteins
Signal processing
Signal to noise ratio
Cap analyse of gene expression
Deep learning
Gene Expression Data
Genes expression
Genomic-signal-processing
Machine-learning
Promoter
Protein-coding genes
Technical noise
Transcription start site
Transcription
chromatin
promoter region
software
transcription initiation site
Chromatin
Neural Networks, Computer
Promoter Regions, Genetic
Software
Transcription Initiation Site
BioMed Central Ltd
Εμφάνιση Μεταδεδομένων
Επιτομή
Background: The widespread usage of Cap Analysis of Gene Expression (CAGE) has led to numerous breakthroughs in understanding the transcription mechanisms. Recent evidence in the literature, however, suggests that CAGE suffers from transcriptional and technical noise. Regardless of the sample quality, there is a significant number of CAGE peaks that are not associated with transcription initiation events. This type of signal is typically attributed to technical noise and more frequently to random five-prime capping or transcription bioproducts. Thus, the need for computational methods emerges, that can accurately increase the signal-to-noise ratio in CAGE data, resulting in error-free transcription start site (TSS) annotation and quantification of regulatory region usage. In this study, we present DeepTSS, a novel computational method for processing CAGE samples, that combines genomic signal processing (GSP), structural DNA features, evolutionary conservation evidence and raw DNA sequence with Deep Learning (DL) to provide single-nucleotide TSS predictions with unprecedented levels of performance. Results: To evaluate DeepTSS, we utilized experimental data, protein-coding gene annotations and computationally-derived genome segmentations by chromatin states. DeepTSS was found to outperform existing algorithms on all benchmarks, achieving 98% precision and 96% sensitivity (accuracy 95.4%) on the protein-coding gene strategy, with 96.66% of its positive predictions overlapping active chromatin, 98.27% and 92.04% co-localized with at least one transcription factor and H3K4me3 peak. Conclusions: CAGE is a key protocol in deciphering the language of transcription, however, as every experimental protocol, it suffers from biological and technical noise that can severely affect downstream analyses. DeepTSS is a novel DL-based method for effectively removing noisy CAGE signal. In contrast to existing software, DeepTSS does not require feature selection since the embedded convolutional layers can readily identify patterns and only utilize the important ones for the classification task. This study highlights the key role that DL can play in Molecular Biology, by removing the inherent flaws of experimental protocols, that form the backbone of contemporary research. Here, we show how DeepTSS can unleash the full potential of an already popular and mature method such as CAGE, and push the boundaries of coding and non-coding gene expression regulator research even further. © 2022, The Author(s).
URI
http://hdl.handle.net/11615/73704
Collections
  • Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ. [19735]

Related items

Showing items related by title, author, creator and subject.

  • Thumbnail

    Transcriptional regulatory networks of the proteasome in mammalian systems 

    Kapetanou M., Athanasopoulou S., Gonos E.S. (2022)
    The tight regulation of proteostasis is essential for physiological cellular function. Mammalian cells possess a network of mechanisms that ensure proteome integrity under normal or stress conditions. The proteasome, being ...
  • Thumbnail

    Hypoxia-induced Changes in SUMO Conjugation Affect Transcriptional Regulation under Low Oxygen 

    Chachami G., Stankovic-Valentin N., Karagiota A., Basagianni A., Plessmann U., Urlaub H., Melchior F., Simos G. (2019)
    Hypoxia occurs in pathological conditions, such as cancer, as a result of the imbalance between oxygen supply and consumption by proliferating cells. HIFs are critical molecular mediators of the physiological response to ...
  • Thumbnail

    Transcription of the NKG2D ligand MICA is suppressed by the IRE1/XBP1 pathway of the unfolded protein response through the regulation of E2F1 

    Obiedat A., Seidel E., Mahameed M., Bernani O., Tsukerman P., Voutetakis K., Chatziioannou A., Mcmahon M., Avril T., Chevet E., Mandelboim O., Tirosh B. (2019)
    The unfolded protein response (UPR) is an adaptive signaling pathway activated in response to endoplasmic reticulum (ER) stress. The effectors of the UPR are potent transcription activators; however, some genes are suppressed ...
htmlmap 

 

Πλοήγηση

Όλο το DSpaceΚοινότητες & ΣυλλογέςΑνά ημερομηνία δημοσίευσηςΣυγγραφείςΤίτλοιΛέξεις κλειδιάΑυτή η συλλογήΑνά ημερομηνία δημοσίευσηςΣυγγραφείςΤίτλοιΛέξεις κλειδιά

Ο λογαριασμός μου

ΣύνδεσηΕγγραφή (MyDSpace)
Πληροφορίες-Επικοινωνία
ΑπόθεσηΣχετικά μεΒοήθειαΕπικοινωνήστε μαζί μας
Επιλογή ΓλώσσαςΌλο το DSpace
EnglishΕλληνικά
htmlmap