nano-pdf — community nano-pdf, openclaw-graph, community, ide skills, Claude Code, Cursor, Windsurf

v1.0.0
GitHub

About this Skill

Ideal for Document Analysis Agents requiring advanced PDF processing capabilities, including text extraction, form filling, and OCR integration. Graph-native workspace backend for OpenClaw — 312 skills, 27 clusters, embedded Cypher via LadybugDB. Zero flat files, zero daemons.

alphaonedev alphaonedev
[0]
[0]
Updated: 3/5/2026

Agent Capability Analysis

The nano-pdf skill by alphaonedev is an open-source community AI agent skill for Claude Code and other IDE workflows, helping agents execute tasks with better context, repeatability, and domain-specific guidance.

Ideal Agent Persona

Ideal for Document Analysis Agents requiring advanced PDF processing capabilities, including text extraction, form filling, and OCR integration.

Core Value

Empowers agents to efficiently handle document workflows by providing tools for PDF text extraction, mining, form manipulation, and OCR integration via embedded Cypher and LadybugDB, all without relying on flat files or daemons.

Capabilities Granted for nano-pdf

Automating form submissions in PDF documents
Extracting text from scanned documents using OCR integration
Merging and splitting PDF files for data pipeline optimization

! Prerequisites & Limits

  • Requires graph-native workspace backend for OpenClaw
  • Zero flat files and daemons may limit compatibility with certain legacy systems
Labs Demo

Browser Sandbox Environment

⚡️ Ready to unleash?

Experience this Agent in a zero-setup browser environment powered by WebContainers. No installation required.

Boot Container Sandbox

nano-pdf

Install nano-pdf, an AI agent skill for AI agent workflows and automation. Works with Claude Code, Cursor, and Windsurf with one-command setup.

SKILL.md
Readonly

nano-pdf

Purpose

This skill provides tools for PDF processing, including text extraction, mining, form filling, manipulation, and OCR integration, to handle document workflows efficiently.

When to Use

Use this skill for tasks involving PDF data extraction (e.g., from scanned documents), text analysis in reports, automating form submissions, merging/splitting files, or applying OCR to non-text PDFs. Apply it in data pipelines, document automation scripts, or when integrating with OCR services for unstructured data.

Key Capabilities

  • Text extraction: Pulls plain text or structured data from PDFs, supporting encrypted files with passwords; uses OCR via Tesseract integration for image-based PDFs.
  • Text mining: Analyzes extracted text for keywords, sentiment, or patterns; e.g., counts occurrences of phrases in a document.
  • Form filling: Populates interactive PDF forms with JSON data; supports flattening forms to static PDFs.
  • Manipulation: Merges, splits, rotates, or watermarks PDFs; handles up to 500-page documents efficiently.
  • OCR integration: Converts scanned PDFs to searchable text using external APIs; requires Tesseract or similar engine configuration.

Usage Patterns

Invoke via CLI for quick scripts or API for server-side integration. For batch processing, chain commands in a shell script; for web apps, use API calls in loops. Always specify input/output paths explicitly. Pattern: Extract text first, then mine or manipulate as needed. For OCR-heavy tasks, preprocess images before PDF operations.

Common Commands/API

CLI commands use nano-pdf binary; API endpoints are under https://api.opencclaw.com/nano-pdf/. Authentication requires $NANO_PDF_API_KEY environment variable.

  • Extract text: nano-pdf extract --file input.pdf --output text.txt --ocr true (adds OCR if text is not selectable).
  • Mine text: nano-pdf mine --input text.txt --keywords "AI,robot" --output results.json (outputs keyword frequencies).
  • Fill form: nano-pdf fill --template form.pdf --data '{"field1": "value"}' --output filled.pdf.
  • Manipulate PDF: nano-pdf merge --files file1.pdf file2.pdf --output combined.pdf.
  • API endpoint for extraction: POST /extract with body {"file": "base64encoded_content", "ocr": true} and header Authorization: Bearer $NANO_PDF_API_KEY.
  • Code snippet (Python):
    import requests
    response = requests.post('https://api.opencclaw.com/nano-pdf/extract', headers={'Authorization': f'Bearer {os.environ["NANO_PDF_API_KEY"]}'}, json={'file': 'base64data'})
    print(response.json()['text'])
    
  • Config format: JSON for API bodies, e.g., {"file": "path", "options": {"ocr_engine": "tesseract", "language": "en"}}; CLI uses flag-based configs like --config config.json.

Integration Notes

Integrate by setting $NANO_PDF_API_KEY for authenticated requests; for local use, install via pip install nano-pdf and import as a module. Combine with other tools: pipe CLI output to NLP libraries for mining, or use in Node.js via HTTP requests. For OCR, ensure Tesseract is installed and configured in your environment path. Test integrations in a sandbox to verify API rate limits (e.g., 100 requests/min).

Error Handling

Check for common errors like file not found (exit code 404), invalid API keys (401), or OCR failures (e.g., no Tesseract installed). Use try-except in code:

try:
    result = nano_pdf.extract('input.pdf')
except FileNotFoundError:
    print("Error: File does not exist.")
except Exception as e:
    print(f"API Error: {e} - Check $NANO_PDF_API_KEY.")

For CLI, parse stderr output; retry transient errors (e.g., network issues) with exponential backoff. Always validate inputs, like ensuring PDFs are not corrupted before processing.

Example 1: Extract and Mine Text from a PDF

To extract text from a scanned invoice PDF and mine for product names:

  1. Run: nano-pdf extract --file invoice.pdf --output invoice_text.txt --ocr true
  2. Then: nano-pdf mine --input invoice_text.txt --keywords "product" --output analysis.json This produces a JSON with keyword occurrences for further processing.

Example 2: Fill and Manipulate a Form PDF

To fill a job application form and merge it with a cover letter:

  1. Prepare data in JSON: {"name": "John Doe", "position": "Engineer"}
  2. Execute: nano-pdf fill --template application.pdf --data application_data.json --output filled_app.pdf
  3. Merge: nano-pdf merge --files filled_app.pdf cover_letter.pdf --output final_packet.pdf Output is a single PDF ready for submission.

Graph Relationships

  • Related to: "ocr-tool" (for enhanced OCR capabilities), "document-parser" (for broader file type support), "text-analyzer" (for advanced mining integrations).
  • Clusters: Connected via "community" cluster to skills like "data-extraction" and "automation-utils".

FAQ & Installation Steps

These questions and steps mirror the structured data on this page for better search understanding.

? Frequently Asked Questions

What is nano-pdf?

Ideal for Document Analysis Agents requiring advanced PDF processing capabilities, including text extraction, form filling, and OCR integration. Graph-native workspace backend for OpenClaw — 312 skills, 27 clusters, embedded Cypher via LadybugDB. Zero flat files, zero daemons.

How do I install nano-pdf?

Run the command: npx killer-skills add alphaonedev/openclaw-graph/nano-pdf. It works with Cursor, Windsurf, VS Code, Claude Code, and 19+ other IDEs.

What are the use cases for nano-pdf?

Key use cases include: Automating form submissions in PDF documents, Extracting text from scanned documents using OCR integration, Merging and splitting PDF files for data pipeline optimization.

Which IDEs are compatible with nano-pdf?

This skill is compatible with Cursor, Windsurf, VS Code, Trae, Claude Code, OpenClaw, Aider, Codex, OpenCode, Goose, Cline, Roo Code, Kiro, Augment Code, Continue, GitHub Copilot, Sourcegraph Cody, and Amazon Q Developer. Use the Killer-Skills CLI for universal one-command installation.

Are there any limitations for nano-pdf?

Requires graph-native workspace backend for OpenClaw. Zero flat files and daemons may limit compatibility with certain legacy systems.

How To Install

  1. 1. Open your terminal

    Open the terminal or command line in your project directory.

  2. 2. Run the install command

    Run: npx killer-skills add alphaonedev/openclaw-graph/nano-pdf. The CLI will automatically detect your IDE or AI agent and configure the skill.

  3. 3. Start using the skill

    The skill is now active. Your AI agent can use nano-pdf immediately in the current project.

Related Skills

Looking for an alternative to nano-pdf or another community skill for your workflow? Explore these related open-source skills.

View All

widget-generator

Logo of f
f

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

149.6k
0
AI

flags

Logo of vercel
vercel

flags is a Next.js feature management skill that enables developers to efficiently add or modify framework feature flags, streamlining React application development.

138.4k
0
Browser

zustand

Logo of lobehub
lobehub

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.

72.8k
0
AI

data-fetching

Logo of lobehub
lobehub

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.

72.8k
0
AI