Generative Image-to-Text Transformer: Difference between revisions
Jump to navigation
Jump to search
(Created page with "Generative Image-to-Text (GIT), first discussed in [https://arxiv.org/abs/2205.14100 this paper], was trained on 20 million image-text pairs, and further fine-tuned on TextCaps. A robust image-to-text processor.") |
(No difference)
|
Revision as of 07:24, 11 October 2023
Generative Image-to-Text (GIT), first discussed in this paper, was trained on 20 million image-text pairs, and further fine-tuned on TextCaps. A robust image-to-text processor.