Software Application for Assisting Visually Impaired Persons at Public Transport Stops
DOI:
https://doi.org/10.18664/ikszt.v30i3.351322Ключові слова:
vision impairment; computer vision; YOLOv8; OCR; Tesseract; synthesis; object detection; assistive technologies; deep learningАнотація
The paper addresses the urgent problem of social adaptation of visually impaired individuals in Ukraine, where the number of people with visual disabilities has significantly increased due to the consequences of military actions. The research presents the development of an intelligent software application designed to assist visually impaired persons in independent orientation at public transport stops. The system integrates deep learning, computer vision, and text-to-speech synthesis to automatically detect public transport vehicles (bus, trolleybus, tram) and recognize their route numbers in real time. The YOLOv8 model was employed for object detection, Tesseract OCR for route number recognition, and pyttsx3 for offline speech synthesis. Data preprocessing included dataset annotation, augmentation with simulated weather conditions, and the use of OpenCV-based filters to enhance OCR accuracy. Testing under both real and simulated conditions (fog, rain, snow, blur, low lighting) demonstrated consistently high detection accuracy (100%) and acceptable classification performance, though recognition robustness decreased under low-visibility scenarios. The results confirm the practical value of the proposed approach, while further improvements will focus on expanding realworld datasets, enhancing preprocessing methods, and integrating stronger deep learning models. The system holds promise as a foundation for wearable assistive technologies aimed at improving inclusivity and mobility for visually impaired users in urban environments.
Посилання
Berad, A. S. (2025). Emerging Technologies for Enhancing Vision in People with Blindness: Innovations, Applications, and Future Directions. Central India Journal of Medical Research, 4(01), 18-20. https://doi.org/10.58999/cijmr.v4i01.218
Serdechnyi, V., Barkovska, O., Kovalenko, A., Martovytskyi, V., & Havrashenko, A. (2025). Knowledge systematization about the characteristics of existing technological means for assisting people with visual impairments. Advanced Information Systems, 9(1), 13-23. https://doi.org/10.20998/2522-9052.2025.1.02.
Нечітайло О. В. Інтелектуальний модуль навігації для реабілітації осіб з вадами зору на зупинках громадського транспорту / О. В.Нечітайло, А. О. Гаврашенко // Сучасні напрями розвитку інформаційно-комунікаційних технологій та засобів управління : тези доп. 15-ї міжнар. наук.-техн. конф., 24-25 квітня 2025 р., Баку–Харків–Жиліна : [у 4 т.]. Т. 3 : секція 3,4 / Нац. ун-т оборони Азербайджанської Республіки [та ін.]. – Харків : Impress., 2025. – C. 123. https://doi.org/10.32620/ICT.25.t3
Tulashvili Y. Humanization of society through integration and inclusive education: Challenges and prospects for individuals with visual impairments in Ukraine //Forum for Education Studies. – 2024. – Т. 2. – №. 3. – С. 279-279. https://doi.org/10.59400/fes.v2i3.279
Khan, S., Nazir, S., & Khan, H. U. (2021). Analysis of navigation assistants for blind and visually impaired people: A systematic review. IEEE access, 9, 26712-26734. https://doi.org/10.1109/ACCESS.2021.3052415.
Hmidani, O., & Alaoui, E. I. (2022, December). A comprehensive survey of the R-CNN family for object detection. In 2022 5th International Conference on Advanced Communication Technologies and Networking (CommNet) (pp. 1-6). IEEE. https://doi.org/10.1109/CommNet56067.2022.9993862.
Ali, M. L., & Zhang, Z. (2024). The YOLO framework: A comprehensive review of evolution, applications, and benchmarks in object detection. Computers, 13(12), 336. https://doi.org/10.3390/computers13120336.
Avyodri, R., Lukas, S., & Tjahyadi, H. (2022, September). Optical character recognition (ocr) for text recognition and its post-processing method: A literature review. In 2022 1st International Conference on Technology Innovation and Its Applications (ICTIIA) (pp. 1-6). IEEE. https://doi.org/10.1109/ICTIIA54654.2022.9935961.
Thorat, C., Bhat, A., Sawant, P., Bartakke, I., & Shirsath, S. (2022). A detailed review on text extraction using optical character recognition. ICT Analysis and Applications, 719-728. https://doi.org/10.1109/I2CT61223.2024.10544007.
Kumar, Y., Koul, A., & Singh, C. (2023). A deep learning approaches in text-to-speech system: a systematic review and recent research perspective. Multimedia Tools and Applications, 82(10),15171-15197. https://doi.org/10.1007/s11042-022-13943-4.
Praveen Borra. ‘A Survey of Google Cloud Platform (GCP): Features, Services, and Applications’. International Journal of Advanced Research in Science, Communication and Technology, Jun. 2024, pp. 191–99. DOI.org (Crossref) https://doi.org/10.48175/IJARSCT-18922.
S. Khan, S. Nazir and H. U. Khan, "Analysis of Navigation Assistants for Blind and Visually Impaired People: A Systematic Review," in IEEE Access, vol. 9, pp. 26712-26734, 2021, doi:10.1109/ACCESS.2021.3052415
Mukhiddinov, M.; Cho, J. Smart Glass System Using Deep Learning for the Blind and Visually Impaired. Electronics 2021, 10, 2756. https://doi.org/10.3390/electronics10222756
Lo Valvo, A.; Croce, D.; Garlisi, D.; Giuliano, F.; Giarré, L.; Tinnirello, I. A Navigation and Augmented Reality System for Visually Impaired People. Sensors 2021, 21, 3061 https://doi.org/10.3390/s21093061
U. Masud, T. Saeed, H. M. Malaikah, F. U. Islam and G. Abbas, "Smart Assistive System for Visually Impaired People Obstruction Avoidance Through Object Detection and Classification," in IEEE Access, vol. 10, pp. 13428-13441, 2022, https://doi.org/10.1109/ACCESS.2022.3146320.
J. K. Mahendran, D. T. Barry, A. K. Nivedha and S. M. Bhandarkar, "Computer Vision-based Assistance System for the Visually Impaired Using Mobile Edge Artificial Intelligence," 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, USA, 2021, pp. 2418-2427, https://doi.org/10.1109/CVPRW53098.2021.00274.
##submission.downloads##
Опубліковано
Номер
Розділ
Ліцензія

Ця робота ліцензується відповідно до Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.