Yume-1.5: A Text-Controlled Interactive World Generation Model
Paper
•
2512.22096
•
Published
•
55
Computer Vision
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
ViCO: A Training Strategy towards Semantic Aware Dynamic High-Resolution
Agent_Skills tool provides full support for Agent Skills (Claude Skills but open-source)./Skills under the same File_System root, and any bundled scripts run through Shell_Command, no new infrastructure required.Agent_Skills(action="discover") # List all available skills
Agent_Skills(action="info", skill_name="music-downloader") # Full SKILL.md
Agent_Skills(action="resources", skill_name="music-downloader") # Scripts, refs, assetsmusic-downloader skill as a working demo, it wraps yt-dlp for YouTube/SoundCloud audio extraction.Shell_Command works for most tasks, but some operations (like YouTube downloads) are restricted due to the container environment. For full functionality, run the server locally on your machine.Agent_Terminal, a single "master tool" that cuts token usage by over 90%!Agent_Terminal wraps all 12 tools (Web_Search, Web_Fetch, File_System, Generate_Image, Generate_Speech, Generate_Video, Deep_Research, Memory_Manager, Obsidian_Vault, Shell_Command, Code_Interpreter) into a single Python code execution gateway.# Search for Bitcoin price
result = Web_Search("current price of bitcoin", max_results=3)
print(result)print(search_tools('image')) # Find tools by keyword
print(usage('Generate_Image')) # Get full docs for a specific toolAgent_Terminal. Try it now - https://www.nymbo.net/nymbot