Image and video platform startup, Cloudinary, announced the release of several new Generative AI, large language models (LLM) and GPT-based capabilities to its Programmable Media image and video APIs. The features include Generative Fill, Generative Remove, Generative Replace, AI-powered Image Captioning, and a ChatGPT-backed natural language interface.
The latest developments allow users to produce personalized assets rapidly and enable technical teams to scale by intelligently automating workflows, eliminating repetitive and time-consuming image manipulation tasks.
The Generative AI capabilities provide users the possibility to create, edit, and deliver dynamic visual experiences at an unprecedented scale. For example, developers and digital marketers can now remove unwanted objects and create images at scale, while AI-powered image captioning provides intelligent captions for images instantly to enhance accessibility, asset searchability and SEO, and increase productivity.
The new features improve the efficiency and automation of visual media workflows. Generative Fill expands an image intelligently, Generative Remove enables users to delete unwanted elements from images, and Generative Replace allows users to detect, change, and replace unwanted elements and colors via natural-language prompts. The AI-powered Image Captioning intelligently creates image captions, and the Conversational Transformations Builder provides a natural language interface through ChatGPT.
Cloudinary has leveraged AI capabilities, including OpenAI, Google Vision, and Amazon Rekognition, alongside its own machine learning models. The startup reached unicorn status with a $2 billion valuation last year.



