florence-2

Here are 26 public repositories matching this topic...

roboflow / maestro

streamline the fine-tuning process for multimodal models: PaliGemma, Florence-2, and Qwen2-VL

transformers vqa objectdetection captioning fine-tuning multimodal vision-and-language phi-3-vision paligemma florence-2

Updated Dec 23, 2024
Python

jhc13 / taggui

Star

Tag manager and captioner for image datasets

image-captioning image-tagging tag-manager pyside6 stable-diffusion llava cogvlm florence-2

Updated Dec 13, 2024
Python

autodistill / autodistill-grounded-sam-2

Star

Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.

grounded-sam autodistill florence-2 segment-anything-2

Updated Aug 7, 2024
Python

Ravi-Teja-konda / Surveillance_Video_Summarizer

Star

VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.

video ai summarization gradio vlm vision-and-language huggingface surviellance gpt-4 chatgpt gradio-python-llm florence-2

Updated Sep 17, 2024
Python

autodistill / autodistill-florence-2

Star

Use Florence 2 to auto-label data for use in training fine-tuned object detection models.

object-detection zero-shot-object-detection autodistill florence-2

Updated Aug 15, 2024
Python

D-Ogi / WatermarkRemover-AI

Star

AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to effectively remove watermarks from images with a user-friendly PyQt6 interface.

dataset-creation inpainting watermark-remover lama-cleaner florence-2

Updated Oct 30, 2024
Python

retkowsky / florence-2

Star

Florence-2

azure florence-2

Updated Jun 21, 2024
Jupyter Notebook

Damarcreative / rem-wm

Sponsor

Star

Rem-WM, a powerful watermark remover tool that leverages the capabilities of Microsoft Florence and Lama Cleaner models.

watermark lama-cleaner florence-2

Updated Oct 10, 2024
Python

fireicewolf / wd-llm-caption-cli

Star

A Python base cli tool for caption images with WD series, Joy-caption-pre-alpha,meta Llama 3.2 Vision Instruct and Qwen2 VL Instruct models.

image-caption wd14 llama3-vision florence-2 qwen2-vl joy-caption

Updated Nov 10, 2024
Python

ANYANTUDRE / Florence-2-Vision-Language-Model

Star

Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.

computer-vision deep-learning huggingface vision-language vision-transformer vision-transformer-models vision-language-model florence-2

Updated Jul 3, 2024
Jupyter Notebook

sayedmohamedscu / Vision-language-models-VLM

Star

vision language models finetuning notebooks & use cases (paligemma - florence .....)

computer-vision vlm florence finetuning multimodal colab-notebook finetune-llms paligemma florence-2 visionlanguage florence-finetuning

Updated Sep 26, 2024
Jupyter Notebook

jacobmarks / fiftyone_florence2_plugin

Star

Run SOTA Vision-Language Model Florence-2 on your data!

computer-vision ml transformer datacentric fiftyone-datasets vision-language-model florence-2

Updated Jun 29, 2024
Python

mithunparab / text2segment_video

Star

Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes advanced AI models, specifically Florence2 and SAM2, to detect and segment specific objects or activities in a video based on textual descriptions.

raft video-summarization optical-flow segment-anything florence-2 sam2

Updated Dec 10, 2024
Python

nguyennpa412 / simple-multimodal-ai

Star

Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features

docker text-to-speech computer-vision gradio vlm visual-question-answering llm mllm vision-foundation-model image-text-to-text florence-2 xtts-v2 mini-internvl

Updated Aug 16, 2024
Python

Ambruk-chan / DiscordBot

Star

The Ultimate Local LLM Discord Bot!!!

ai discord-bot roleplay llm koboldcpp gbnf florence-2

Updated Dec 6, 2024
Python

regiellis / ecko-cli

Star

ecko-cli is a simple CLI tool that streamlines the process of processing images in a directory, generating captions, and saving them as text files. Additionally, it provides functionalities to create a JSONL file from images in the directory you specify. Images will be captioned using the Microsoft Florence-2-large model and ONNX

cli ai image-processing image-classification onnxruntime huggingface-transformers generative-ai ecko florence-2 ecko-cli