Advancing Diabetic Retinopathy Detection: Fine-Tuning VIT Models
This work focuses on advancing the detection of diabetic retinopathy through the implementation of Vision Transformer (ViT) models, a state-of-the-art deep learning approach to computer vision. The system automates the identification of levels of diabetic retinopathy by classifying retinal images into three categories. The ViT model is meticulously trained on a large dataset of retinal images from Kaggle (available at Kaggle).
To ensure accurate and reliable results, the study incorporates extensive data mining and pre-processing techniques. By using data normalisation and optimisation algorithms, the model acquires the ability to effectively understand the complex patterns associated with diabetic retinopathy.
In addition to the Vision Transformer model, traditional machine learning techniques have been explored to further enhance overall detection performance. The ultimate aim of this mission is to develop an effective and scalable solution that will enable healthcare professionals to make timely diagnoses, thereby significantly improving patient outcomes in the management of diabetic retinopathy. [code] [slides]