Generative Image-to-Text Transformer
Latest revision as of 13:59, 2 February 2024
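Generative Image-to-Text (GIT), first described in this paper (https://arxiv.org/abs/2205.14100), was trained on 20 million image-text pairs and further fine-tuned on TextCaps. It is an image-to-text model: given an input image, it generates a text description such as a caption.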
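As an illustration of how a model like this is typically used, here is a minimal captioning sketch built on the Hugging Face transformers library. The checkpoint name `microsoft/git-large-textcaps`, the helper name `caption_image`, and the `max_length` value are assumptions made for this example, not details taken from this page; adjust them to whichever GIT checkpoint you actually use.

```python
# Sketch only: caption an image with a GIT checkpoint via Hugging Face transformers.
# ASSUMPTION: the checkpoint name below is not stated on this page.
MODEL_ID = "microsoft/git-large-textcaps"


def caption_image(image_path, model_id=MODEL_ID, max_length=50):
    """Return a generated caption for the image stored at image_path.

    caption_image is a hypothetical helper written for this example.
    """
    # Imported lazily so that defining the function does not require
    # the heavy dependencies (Pillow, torch, transformers) to be installed.
    from PIL import Image
    from transformers import AutoModelForCausalLM, AutoProcessor

    # The processor handles image preprocessing and token decoding.
    processor = AutoProcessor.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Load the image and convert it to the pixel tensor the model expects.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # Generate caption tokens from the image, then decode them to text.
    generated_ids = model.generate(pixel_values=pixel_values, max_length=max_length)
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

Calling `caption_image("photo.jpg")` (with a hypothetical local file) downloads the checkpoint on first use and returns a single caption string. Loading the model inside the function keeps the sketch short; for repeated calls you would likely load the processor and model once and reuse them.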
External Links
Please note that the content of external links is not endorsed or verified by us and can change without notice. Use at your own risk.
https://arxiv.org/abs/2205.14100