Generative Image-to-Text Transformer: Difference between revisions

From Civitai Wiki
Jump to navigation Jump to search
(Created page with "Generative Image-to-Text (GIT), first discussed in [https://arxiv.org/abs/2205.14100 this paper], was trained on 20 million image-text pairs, and further fine-tuned on TextCaps. A robust image-to-text processor.")
(No difference)

Revision as of 07:24, 11 October 2023

Generative Image-to-Text (GIT), first discussed in this paper, was trained on 20 million image-text pairs, and further fine-tuned on TextCaps. A robust image-to-text processor.