Cloudinary Introduces Generative AI Features to Programmable Media APIs, Enhancing Image and Video Handling

Cloudinary, an image and video platform startup, has announced several new generative AI, large language model (LLM), and GPT-based capabilities for its Programmable Media image and video APIs. The features include Generative Fill, Generative Remove, Generative Replace, AI-powered Image Captioning, and a ChatGPT-backed natural language interface.

The latest developments allow users to produce personalized assets rapidly and enable technical teams to scale by intelligently automating workflows, eliminating repetitive and time-consuming image manipulation tasks.

The generative AI capabilities let users create, edit, and deliver dynamic visual experiences at scale. For example, developers and digital marketers can now remove unwanted objects from images and generate image variations in bulk, while AI-powered image captioning produces captions instantly, improving accessibility, asset searchability, SEO, and productivity.

The new features improve the efficiency and automation of visual media workflows. Generative Fill expands an image intelligently, Generative Remove enables users to delete unwanted elements from images, and Generative Replace allows users to detect, change, and replace unwanted elements and colors via natural-language prompts. The AI-powered Image Captioning intelligently creates image captions, and the Conversational Transformations Builder provides a natural language interface through ChatGPT.
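As a rough illustration of how such features are typically invoked, Cloudinary transformations are expressed as parameters inside the image delivery URL. The sketch below builds those URLs with plain string formatting rather than an SDK. The cloud name `demo`, the asset `sample.jpg`, and the helper function names are hypothetical, and the parameter names (`e_gen_remove`, `e_gen_replace`, `b_gen_fill`) are assumed from Cloudinary's published URL syntax, so they should be checked against the current transformation reference before use.

```python
from urllib.parse import quote

# Assumed delivery host for Cloudinary-hosted assets.
BASE = "https://res.cloudinary.com"


def delivery_url(cloud_name: str, transformation: str, public_id: str) -> str:
    """Assemble an image delivery URL from a transformation string."""
    return f"{BASE}/{cloud_name}/image/upload/{transformation}/{public_id}"


def gen_remove(prompt: str) -> str:
    """Generative Remove: delete the element described by a text prompt."""
    return f"e_gen_remove:prompt_{quote(prompt)}"


def gen_replace(old: str, new: str) -> str:
    """Generative Replace: swap one described element for another."""
    return f"e_gen_replace:from_{quote(old)};to_{quote(new)}"


def gen_fill(aspect_ratio: str) -> str:
    """Generative Fill: pad to a new aspect ratio and fill the added area."""
    return f"ar_{aspect_ratio},b_gen_fill,c_pad"


if __name__ == "__main__":
    # Hypothetical cloud name and asset, for illustration only.
    print(delivery_url("demo", gen_remove("fork"), "sample.jpg"))
    print(delivery_url("demo", gen_replace("sweater", "leather jacket"), "sample.jpg"))
    print(delivery_url("demo", gen_fill("16:9"), "sample.jpg"))
```

Because each transformation is just a URL segment, the same asset can be requested in many variants without re-uploading it, which is what makes this kind of bulk, automated editing practical.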

Cloudinary draws on third-party AI services, including OpenAI, Google Vision, and Amazon Rekognition, alongside its own machine learning models. The startup reached unicorn status with a $2 billion valuation last year.

© 2023 StartupHub.ai. All rights reserved. Do not enter, scrape, copy, reproduce, or republish this article in whole or in part. Use as input to AI training, fine-tuning, retrieval-augmented generation, or any machine-learning system is prohibited without written license. Substantially-similar derivative works will be pursued to the fullest extent of applicable copyright, database, and computer-misuse laws. See our terms.