• English
    • Ελληνικά
    • Deutsch
    • français
    • italiano
    • español
  • español 
    • English
    • Ελληνικά
    • Deutsch
    • français
    • italiano
    • español
  • Login
Ver ítem 
  •   DSpace Principal
  • Επιστημονικές Δημοσιεύσεις Μελών ΠΘ (ΕΔΠΘ)
  • Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ.
  • Ver ítem
  •   DSpace Principal
  • Επιστημονικές Δημοσιεύσεις Μελών ΠΘ (ΕΔΠΘ)
  • Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ.
  • Ver ítem
JavaScript is disabled for your browser. Some features of this site may not work without it.
Todo DSpace
  • Comunidades & Colecciones
  • Por fecha de publicación
  • Autores
  • Títulos
  • Materias

Pose recognition using convolutional neural networks on omni-directional images

Thumbnail
Autor
Georgakopoulos S.V., Kottari K., Delibasis K., Plagianakos V.P., Maglogiannis I.
Fecha
2018
Language
en
DOI
10.1016/j.neucom.2017.08.071
Materia
Cameras
Computer vision
Convolution
Convolutional neural networks
Feature extraction
Gesture recognition
Multilayer neural networks
Network architecture
Transfer learning
Well testing
Calibration model
Computer vision applications
Fish-eye cameras
Omni-directional
Omnidirectional cameras
Pose classifications
Threedimensional (3-d)
Zernike moments
Image enhancement
Article
artificial neural network
body image
body position
calibration
convolutional neural network
geodesically corrected Zernike moment
human
image analysis
machine learning
measurement accuracy
priority journal
recognition
three dimensional imaging
transfer of learning
Elsevier B.V.
Mostrar el registro completo del ítem
Resumen
Convolutional neural networks (CNNs) are used frequently in several computer vision applications. In this work, we present a methodology for pose classification of binary human silhouettes using CNNs, enhanced with image features based on Zernike moments, which are modified for fisheye images. The training set consists of synthetic images that are generated from three-dimensional (3D) human models, using the calibration model of an omni-directional camera (fisheye). Testing is performed using real images, also acquired by omni-directional cameras. Here, we employ our previously proposed geodesically corrected Zernike moments (GZMI) and confirm their merit as stand-alone descriptors of calibrated fisheye images. Subsequently, we explore the efficiency of transfer learning from the previously trained model with synthetically generated silhouettes, to the problem of real pose classification, by continuing the training of the already trained network, using a few frames of annotated real silhouettes. Furthermore, we propose an enhanced architecture that combines the calculated GZMI features of each image with the features generated at CNNs’ last convolutional layer, both feeding the first hidden layer of the traditional neural network that exists at the end of the CNN. Testing is performed using synthetically generated silhouettes as well as real ones. Results show that the proposed enhancement of CNN architecture, combined with transfer learning improves pose classification accuracy for both the synthetic and the real silhouette images. © 2017
URI
http://hdl.handle.net/11615/72060
Colecciones
  • Δημοσιεύσεις σε περιοδικά, συνέδρια, κεφάλαια βιβλίων κλπ. [19735]

Ítems relacionados

Mostrando ítems relacionados por Título, autor o materia.

  • Thumbnail

    Deep Endoscopic Visual Measurements 

    Iakovidis D.K., DImas G., Karargyris A., Bianchi F., Ciuti G., Koulaouzidis A. (2019)
    Robotic endoscopic systems offer a minimally invasive approach to the examination of internal body structures, and their application is rapidly extending to cover the increasing needs for accurate therapeutic interventions. ...
  • Thumbnail

    Resource-efficient TDNN Architectures for Audio-visual Speech Recognition 

    Koumparoulis A., Potamianos G., Thomas S., da Silva Morais E. (2021)
    In this paper, we consider the problem of resource-efficient architectures for audio-visual automatic speech recognition (AVSR). Specifically, we complement our earlier work that introduced efficient convolutional neural ...
  • Thumbnail

    Design Space Exploration of a Sparse MobileNetV2 Using High-Level Synthesis and Sparse Matrix Techniques on FPGAs 

    Tragoudaras A., Stoikos P., Fanaras K., Tziouvaras A., Floros G., Dimitriou G., Kolomvatsos K., Stamoulis G. (2022)
    Convolution Neural Networks (CNNs) are gaining ground in deep learning and Artificial Intelligence (AI) domains, and they can benefit from rapid prototyping in order to produce efficient and low-power hardware designs. The ...
htmlmap 

 

Listar

Todo DSpaceComunidades & ColeccionesPor fecha de publicaciónAutoresTítulosMateriasEsta colecciónPor fecha de publicaciónAutoresTítulosMaterias

Mi cuenta

AccederRegistro
Help Contact
DepositionAboutHelpContacto
Choose LanguageTodo DSpace
EnglishΕλληνικά
htmlmap