Fashion Florence model extracts structured clothing attributes

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

Researchers have developed Fashion Florence, a vision-language model based on Florence-2, specifically fine-tuned for extracting structured fashion attributes from images. This model can generate a JSON object detailing category, color, material, style, and occasion tags, which is directly usable by recommendation and retrieval systems. In evaluations, Fashion Florence outperformed GPT-4o-mini and Gemini 2.5 Flash in category and style tag accuracy, while also demonstrating high JSON output validity and efficiency with its 0.77B parameters. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Enables direct programmatic use of fashion attributes for recommendation and retrieval systems, improving e-commerce operations.

RANK_REASON The cluster describes a fine-tuned model release based on an existing architecture, with performance benchmarks and deployment details. [lever_c_demoted from research: ic=1 ai=1.0]

Read on Hugging Face Daily Papers →

COVERAGE [1]

Hugging Face Daily Papers TIER_1 · 2026-05-11 00:04

Fashion Florence: Fine-Tuning Florence-2 for Structured Fashion Attribute Extraction

We present Fashion Florence, a Florence-2 vision-language model fine-tuned with LoRA to extract structured fashion attributes from clothing images. Given a single photograph, the model generates a JSON object containing category, color, material, style tags, and occasion tags, st…

COVERAGE [1]

Fashion Florence: Fine-Tuning Florence-2 for Structured Fashion Attribute Extraction

RELATED ENTITIES

RELATED TOPICS