flystem-usls

History

Jamjamjon 371a08011f Add MODNet model (#11 ) * Add MODNet for portrait matting * Minor fixes * Move assets to home directory * Add colormap * ci * Update README.md		2024-04-30 15:26:53 +08:00
..
README.md	Add YOLOv8-OBB and some bug fixes (#9 )	2024-04-21 17:06:58 +08:00
main.rs	Add MODNet model (#11 )	2024-04-30 15:26:53 +08:00

README.md

This demo shows how to use BLIP to do conditional or unconditional image captioning.

Quick Start

cargo run -r --example blip

BLIP ONNX Model

blip-visual-base
blip-textual-base

Results

[Unconditional image captioning]: a group of people walking around a bus
[Conditional image captioning]: three man walking in front of a bus
Some(["three man walking in front of a bus"])

TODO

Multi-batch inference for image caption
VQA
Retrival
TensorRT support for textual model