Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

From Civitai Wiki
Revision as of 00:33, 25 June 2024 by Aishavingfun (talk | contribs) (Added links)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation (BLIP), created by Salesforce, is a solid, simple image-to-text processor. Used to caption images for training.