Caio Cesar Iglesias

caioiglesias

AI & ML interests

Reinforcement Learning

Recent Activity

liked a model 3 months ago

tarn59/book_flatten_and_crop_qwen_image_edit_2509

Image-to-Image • Updated Nov 18, 2025 • 38 • 39

liked 2 Spaces 5 months ago

BRIA RMBG 2.0

🐢

885

remove background from any image

ReconViaGen

🖥

166

High-fidelity 3D Geometry Generation from multi-view images

upvoted a collection 5 months ago

VibeVoice

Collection

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 9 items • Updated 20 days ago • 207

liked a Space 8 months ago

Sparc3D

🏃

1.59k

Next-Gen High-Resolution 3D Model Generation

upvoted 2 papers 9 months ago

Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Paper • 2505.23747 • Published May 29, 2025 • 69

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23, 2025 • 81

liked a Space 9 months ago

TRELLIS - Multiple Imagen a 3D

🚀

Scalable and Versatile 3D Generation from images

liked a model 9 months ago

Comfy-Org/HiDream-I1_ComfyUI

Updated Aug 5, 2025 • 208k • 205

liked a Space 9 months ago

Qwen3 WebGPU

🚀

100

A hybrid reasoning model that runs locally in your browser.

liked 2 models 11 months ago

sesame/csm-1b

Text-to-Speech • Updated Dec 1, 2025 • 94k • 2.33k

docling-project/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated Sep 17, 2025 • 40.5k • 1.61k

liked a Space 11 months ago

Gemini Image Edit

📚

276

Edit images with text prompts using Gemini AI

upvoted an article 12 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024 • 191

upvoted an article about 1 year ago

Article

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

Jan 23, 2025 • 191

liked a Space about 1 year ago

Stable Point-Aware 3D

⚡

468

Generate 3D models from images

liked a model over 1 year ago

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • 11B • Updated Dec 4, 2024 • 181k • 1.56k

upvoted a collection over 1 year ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 655

liked a Space over 1 year ago

Stable Fast 3D

🎮

1.16k

Generate a 3D mesh model from an image

liked a Space almost 2 years ago

OpenVoice

🤗

1.12k

Generate speech in a chosen voice from a short audio sample