Generative Image-to-Text Transformer

Latest revision as of 13:59, 2 February 2024

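Generative Image-to-Text Transformer (GIT), first described in the paper "GIT: A Generative Image-to-text Transformer for Vision and Language" (https://arxiv.org/abs/2205.14100), is a robust image-to-text model. It was trained on 20 million image-text pairs and further fine-tuned on TextCaps, a captioning dataset that requires reading text in images.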
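This page does not say how to run the model. As a minimal sketch, assuming the Hugging Face `transformers` library and the publicly hosted `microsoft/git-large-textcaps` checkpoint (both are assumptions, not stated on this page), generating a caption for an image could look roughly like this. The helper names `load_git` and `caption_image` are hypothetical and exist only for illustration.

```python
# Assumed checkpoint name on the Hugging Face Hub; not taken from this page.
DEFAULT_MODEL_ID = "microsoft/git-large-textcaps"


def load_git(model_id=DEFAULT_MODEL_ID):
    """Load the processor and model (downloads weights on first use)."""
    # Imported lazily so caption_image() can be used without these packages.
    from transformers import AutoModelForCausalLM, AutoProcessor

    processor = AutoProcessor.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    return processor, model


def caption_image(image, processor, model, max_length=50):
    """Return a generated caption for one image (for example a PIL.Image)."""
    # Preprocess the image into the pixel tensor the model expects.
    pixel_values = processor(images=image, return_tensors="pt").pixel_values
    # Generate caption token ids, capped at max_length tokens.
    generated_ids = model.generate(pixel_values=pixel_values, max_length=max_length)
    # Decode the first sequence back to text, dropping special tokens.
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0].strip()
```

Usage might look like `processor, model = load_git()` followed by `caption_image(Image.open("photo.jpg"), processor, model)`, where `Image` comes from Pillow and `photo.jpg` is a placeholder path. Passing the processor and model in as arguments means the weights are loaded once and reused across images.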

External Links

Please note that the content of external links is not endorsed or verified by us and can change without notice. Use at your own risk.

https://arxiv.org/abs/2205.14100
