Enhancing accessibility: deep learning-based image description for individuals with visual impairments

Nidhi B. Shah, Amit P. Ganatra

Abstract


Technological developments in artificial intelligence, namely in the area of deep learning, have created new avenues for enhancing accessibility for those with visual impairments. In order to improve the capacity of people who are blind or visually impaired to understand and interact with visual material, this research investigates the creation and use of deep learning-based image description systems. We provide a comprehensive method that uses recurrent neural networks (RNNs) to generate natural language descriptions and convolutional neural networks (CNNs) and Autoencoders for extracting picture features. Our technology automatically creates comprehensive, context-aware descriptions of photographs by incorporating these models, giving users a better knowledge of their surroundings. We show the accuracy and reliability of the system on a wide range of photos through comprehensive testing. According to our research, deep learning-based picture description systems and converting the description in audio and making a promise to empower people who are visually impaired and foster diversity in the digital sphere.

Keywords


CNN; Deep learning; Image processing; LSTM; RNN

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v38.i2.pp1051-1060

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

shopify stats IJEECS visitor statistics