Agent Capability Analysis
The multimodal-io skill by Trongdepzai-dev is an open-source community AI agent skill for Claude Code and other IDE workflows, helping agents execute tasks with better context, repeatability, and domain-specific guidance.
Ideal Agent Persona
Perfect for AI Agents needing unified multimodal content processing and generation via the Gemini 3 API.
Core Value
Empowers agents to process and generate multimodal content using the Gemini 3 API, leveraging libraries like google-genai and pillow, and supporting environment variable configuration through .env files.
↓ Capabilities Granted for multimodal-io
! Prerequisites & Limits
- Requires Gemini API Key
- Dependent on google-genai and pillow libraries
- Limited to Gemini 3 API compatibility
Browser Sandbox Environment
⚡️ Ready to unleash?
Experience this Agent in a zero-setup browser environment powered by WebContainers. No installation required.
multimodal-io
Install multimodal-io, an AI agent skill for AI agent workflows and automation. Works with Claude Code, Cursor, and Windsurf with one-command setup.
FAQ & Installation Steps
These questions and steps mirror the structured data on this page for better search understanding.
? Frequently Asked Questions
What is multimodal-io?
Perfect for AI Agents needing unified multimodal content processing and generation via the Gemini 3 API. An Xorium Stealer Pulsar that I stole from the darknet and modified.
How do I install multimodal-io?
Run the command: npx killer-skills add Trongdepzai-dev/Xorium-Stealer-Pulsar/multimodal-io. It works with Cursor, Windsurf, VS Code, Claude Code, and 19+ other IDEs.
What are the use cases for multimodal-io?
Key use cases include: Generating multimodal content for diverse applications, Processing multimodal inputs via the Gemini 3 API, Automating content creation with unified interface.
Which IDEs are compatible with multimodal-io?
This skill is compatible with Cursor, Windsurf, VS Code, Trae, Claude Code, OpenClaw, Aider, Codex, OpenCode, Goose, Cline, Roo Code, Kiro, Augment Code, Continue, GitHub Copilot, Sourcegraph Cody, and Amazon Q Developer. Use the Killer-Skills CLI for universal one-command installation.
Are there any limitations for multimodal-io?
Requires Gemini API Key. Dependent on google-genai and pillow libraries. Limited to Gemini 3 API compatibility.
↓ How To Install
-
1. Open your terminal
Open the terminal or command line in your project directory.
-
2. Run the install command
Run: npx killer-skills add Trongdepzai-dev/Xorium-Stealer-Pulsar/multimodal-io. The CLI will automatically detect your IDE or AI agent and configure the skill.
-
3. Start using the skill
The skill is now active. Your AI agent can use multimodal-io immediately in the current project.