Abstract
Navigation assistance is essential for visually impaired and elderly individuals, because traditional tools often lack the feedback needed for safe and independent mobility. This paper introduces a smart navigation system that integrates text-to-speech (TTS) with real-time scene analysis. The system uses optical character recognition (OCR) with Tesseract to extract and read text from the environment, specifically medicine labels, and YOLOv8 for object detection to identify the user's surroundings. The detected objects are passed to Bootstrapping Language-Image Pre-training (BLIP), which generates a scene caption that the TTS module converts into speech. The system provides real-time auditory feedback on both objects and text, thereby enhancing mobility and safety. Experimental results show a TTS word error rate (WER) of 9.2% and a scene recognition accuracy of 92.6%. These results indicate that the system provides reliable and informative navigation support.
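The abstract reports a word error rate but does not say how it was computed. The following is a minimal sketch, assuming the standard definition of WER: the word-level edit distance (substitutions, deletions, and insertions) between a reference text and the system output, divided by the number of reference words. The function name `word_error_rate` and the example strings are illustrative assumptions, not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumes the standard WER definition; the paper's exact evaluation
    protocol (tokenization, case handling) is not stated in the abstract.
    """
    # Assumption: lowercase and split on whitespace before comparing.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from output
            insertion = curr[j - 1] + 1  # extra word in output
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical medicine-label example: one substituted word out of four.
label_text = "take two tablets daily"
spoken_text = "take two tablet daily"
wer = word_error_rate(label_text, spoken_text)
```

Under this definition, a WER of 9.2% corresponds to roughly nine word-level errors per hundred reference words. WER can exceed 100% when the output contains many inserted words.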
| Original language | English |
|---|---|
| Title of host publication | 15th IEEE International Conference on Control System, Computing and Engineering (ICCSCE) |
| Publisher | IEEE |
| Pages | 24-29 |
| Number of pages | 6 |
| ISBN (Electronic) | 9798331515270 |
| Publication status | Published - 6 Oct 2025 |
| Event | 15th IEEE International Conference on Control System, Computing and Engineering 2025, Batu Ferringhi, Malaysia. Duration: 22 Aug 2025 → 23 Aug 2025 |
Conference
| Conference | 15th IEEE International Conference on Control System, Computing and Engineering 2025 |
|---|---|
| Abbreviated title | ICCSCE 2025 |
| Country/Territory | Malaysia |
| City | Batu Ferringhi |
| Period | 22/08/25 → 23/08/25 |
Keywords
- text-to-speech
- visually impaired
- navigation assistance
- OCR
- YOLOv8
- object detection