Generative Image-to-Text Transformer

From Civitai Wiki
Revision as of 07:24, 11 October 2023 by Civitai (created page; cites https://arxiv.org/abs/2205.14100)

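Generative Image-to-Text (GIT) is an image-to-text transformer model, first described in the paper at https://arxiv.org/abs/2205.14100. The version covered here was trained on 20 million image-text pairs and then fine-tuned on TextCaps. It is a robust image-to-text processor.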