Object Detection task in Visual Question Answering

dc.contributor.authorBENAICHE, Abla
dc.contributor.authorSuperviser: DEBBI, Hicham
dc.date.accessioned2023-07-02T09:15:04Z
dc.date.available2023-07-02T09:15:04Z
dc.date.issued2023-06-10
dc.description.abstractThis research proposes a novel approach for Visual Question Answering (VQA) by incorporating object detection features into the model as image features instead of traditional CNN features. The aim is to leverage specific information about objects present in the image to improve the VQA task. The experiments yielded accuracy values of 76% for Yes/No questions, 43% for counting questions, and 47% for other questions. Overall, this research enhances the understanding and processing of visual information by incorporating object detection features, leading to improved accuracy and performance in answering questions based on images.en_US
dc.identifier.urihttps://repository.univ-msila.dz/handle/123456789/39840
dc.language.isoenen_US
dc.publisherUniversity of M'silaen_US
dc.subjectVisual Question Answering (VQA), object detection, image features, CNN, VQA v2 dataset.en_US
dc.titleObject Detection task in Visual Question Answeringen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
29_Object Detection task in Visual Question Answering.pdf
Size:
7.61 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections