Generative Image-to-Text Transformer

From Civitai Wiki

Latest revision as of 13:59, 2 February 2024

Generative Image-to-Text (GIT), first described in the paper linked under External Links, is a transformer model that generates text from images. It was trained on 20 million image-text pairs and further fine-tuned on TextCaps.


External Links

Please note that the content of external links is not endorsed or verified by us and can change without notice. Use at your own risk.

https://arxiv.org/abs/2205.14100