reference-indexer — installing reference-indexer reference-indexer, smart_alarm, community, installing reference-indexer, ide skills, reference-indexer for AI agents, automated reference management, RAG reference indexing, PDF and Word document indexing, Claude Code, Cursor

v1.0.0
GitHub

About this Skill

Ideal for AI Agents like Cursor, Windsurf, and Claude Code requiring automated document management and analysis capabilities. reference-indexer is a skill that indexes and manages external reference documents for AI agents, enabling efficient search and retrieval of relevant information.

Features

Validates files using /ref-add command
Extracts text from PDF and Word documents
Generates automatic summaries for indexed documents
Updates the RAG corpus with new reference documents
Supports search functionality using /ref-search command

# Core Topics

arbgjr arbgjr
[0]
[0]
Updated: 3/8/2026

Agent Capability Analysis

The reference-indexer skill by arbgjr is an open-source community AI agent skill for Claude Code and other IDE workflows, helping agents execute tasks with better context, repeatability, and domain-specific guidance. Optimized for installing reference-indexer, reference-indexer for AI agents, automated reference management.

Ideal Agent Persona

Ideal for AI Agents like Cursor, Windsurf, and Claude Code requiring automated document management and analysis capabilities.

Core Value

Empowers agents to efficiently manage external reference documents by validating files, extracting text from PDF and Word documents, generating automatic summaries, and updating the RAG corpus index, leveraging file formats like PDF and protocols for seamless integration.

Capabilities Granted for reference-indexer

Automating document validation and indexing
Generating summaries for large reference documents
Searching reference documents with specific queries

! Prerequisites & Limits

  • Requires filesystem access to read and index documents
  • Limited to PDF and Word document formats for text extraction
Labs Demo

Browser Sandbox Environment

⚡️ Ready to unleash?

Experience this Agent in a zero-setup browser environment powered by WebContainers. No installation required.

Boot Container Sandbox

reference-indexer

Install reference-indexer, an AI agent skill for AI agent workflows and automation. Works with Claude Code, Cursor, and Windsurf with one-command setup.

SKILL.md
Readonly

Reference Indexer Skill

Proposito

Esta skill gerencia documentos de referencia externa, indexando-os para uso no RAG.

Comandos

/ref-add {path}

Adiciona documento ao indice de referencias:

bash
1/ref-add .agentic_sdlc/references/legal/lei-13775-2018.pdf

Acoes:

  1. Valida o arquivo
  2. Extrai texto (se PDF/Word)
  3. Cria resumo automatico
  4. Adiciona ao corpus RAG
  5. Atualiza indice

/ref-search {query}

Busca nos documentos de referencia:

bash
1/ref-search "prazo de aceite duplicata"

Retorna:

  • Documentos relevantes
  • Trechos com contexto
  • Score de relevancia

/ref-list

Lista todos os documentos indexados:

bash
1/ref-list

Mostra:

  • Documentos por categoria
  • Status de indexacao
  • Data de adicao

/ref-remove {path}

Remove documento do indice:

bash
1/ref-remove .agentic_sdlc/references/legal/documento-antigo.pdf

Formatos Suportados

FormatoExtensaoMetodo de Extracao
PDF.pdfpdftotext / PyPDF2
Word.docxpython-docx
Markdown.mdDireto
Texto.txtDireto
HTML.htmlBeautifulSoup

Estrutura de Referencias

.agentic_sdlc/references/
├── legal/              # Leis, regulamentos, normas
├── technical/          # RFCs, especificacoes tecnicas
├── business/           # Regras de negocio, manuais
├── internal/           # Documentos internos
└── _index.yml          # Indice de documentos

Indice de Documentos

Arquivo _index.yml:

yaml
1index: 2 version: 1 3 updated_at: "2026-01-12T..." 4 5documents: 6 - id: "ref-001" 7 path: "legal/lei-13775-2018.pdf" 8 title: "Lei 13.775/2018 - Duplicatas Eletrônicas" 9 category: legal 10 added_at: "2026-01-12T..." 11 indexed: true 12 summary: "Lei que regulamenta as duplicatas escriturais..." 13 keywords: 14 - duplicata 15 - escritural 16 - eletronica 17 page_count: 5 18 19 - id: "ref-002" 20 path: "technical/icp-brasil.pdf" 21 title: "Padrões ICP-Brasil" 22 category: technical 23 added_at: "2026-01-12T..." 24 indexed: true

Extracao de Texto

PDF

bash
1# Usando pdftotext (poppler-utils) 2pdftotext -layout input.pdf output.txt 3 4# Usando Python 5python3 << 'EOF' 6import PyPDF2 7 8with open('input.pdf', 'rb') as f: 9 reader = PyPDF2.PdfReader(f) 10 text = '' 11 for page in reader.pages: 12 text += page.extract_text() + '\n' 13 print(text) 14EOF

Word (docx)

python
1from docx import Document 2 3doc = Document('input.docx') 4text = '\n'.join([p.text for p in doc.paragraphs]) 5print(text)

Integracao com RAG

Documentos indexados sao adicionados ao corpus RAG:

yaml
1corpus_entry: 2 id: "ref-001" 3 source: "references/legal/lei-13775-2018.pdf" 4 type: "reference" 5 category: "legal" 6 content: "{texto extraido}" 7 embeddings: [...] # Gerado pelo RAG 8 metadata: 9 title: "Lei 13.775/2018" 10 page: 1 11 section: "Art. 1"

Workflow de Indexacao

yaml
1indexing_workflow: 2 1_validate: 3 - Verificar formato suportado 4 - Verificar tamanho (max 50MB) 5 - Verificar permissoes 6 7 2_extract: 8 - Extrair texto do documento 9 - Limpar formatacao 10 - Dividir em chunks 11 12 3_analyze: 13 - Gerar resumo automatico 14 - Extrair keywords 15 - Classificar categoria 16 17 4_index: 18 - Adicionar ao corpus RAG 19 - Gerar embeddings 20 - Atualizar indice 21 22 5_verify: 23 - Testar busca 24 - Verificar qualidade

Configuracao

No settings.json:

json
1{ 2 "memory": { 3 "rag_corpus": ".agentic_sdlc/corpus", 4 "max_document_size_mb": 50, 5 "chunk_size": 1000, 6 "chunk_overlap": 200 7 } 8}

Boas Praticas

  1. Nomeie arquivos descritivamente: lei-13775-2018-duplicatas.pdf
  2. Organize por categoria: legal, technical, business
  3. Mantenha versoes: Nao sobrescreva, versione
  4. Documente a fonte: Adicione de onde veio
  5. Resuma docs longos: Crie resumos para PDFs grandes

Troubleshooting

PDF nao extrai texto

Alguns PDFs sao imagens escaneadas. Use OCR:

bash
1ocrmypdf input.pdf output.pdf 2pdftotext output.pdf -

Documento muito grande

Divida em partes menores ou aumente max_document_size_mb.

Encoding incorreto

Force UTF-8 na extracao:

bash
1pdftotext -enc UTF-8 input.pdf output.txt

FAQ & Installation Steps

These questions and steps mirror the structured data on this page for better search understanding.

? Frequently Asked Questions

What is reference-indexer?

Ideal for AI Agents like Cursor, Windsurf, and Claude Code requiring automated document management and analysis capabilities. reference-indexer is a skill that indexes and manages external reference documents for AI agents, enabling efficient search and retrieval of relevant information.

How do I install reference-indexer?

Run the command: npx killer-skills add arbgjr/smart_alarm. It works with Cursor, Windsurf, VS Code, Claude Code, and 19+ other IDEs.

What are the use cases for reference-indexer?

Key use cases include: Automating document validation and indexing, Generating summaries for large reference documents, Searching reference documents with specific queries.

Which IDEs are compatible with reference-indexer?

This skill is compatible with Cursor, Windsurf, VS Code, Trae, Claude Code, OpenClaw, Aider, Codex, OpenCode, Goose, Cline, Roo Code, Kiro, Augment Code, Continue, GitHub Copilot, Sourcegraph Cody, and Amazon Q Developer. Use the Killer-Skills CLI for universal one-command installation.

Are there any limitations for reference-indexer?

Requires filesystem access to read and index documents. Limited to PDF and Word document formats for text extraction.

How To Install

  1. 1. Open your terminal

    Open the terminal or command line in your project directory.

  2. 2. Run the install command

    Run: npx killer-skills add arbgjr/smart_alarm. The CLI will automatically detect your IDE or AI agent and configure the skill.

  3. 3. Start using the skill

    The skill is now active. Your AI agent can use reference-indexer immediately in the current project.

Related Skills

Looking for an alternative to reference-indexer or another community skill for your workflow? Explore these related open-source skills.

View All

widget-generator

Logo of f
f

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

149.6k
0
AI

flags

Logo of vercel
vercel

flags is a Next.js feature management skill that enables developers to efficiently add or modify framework feature flags, streamlining React application development.

138.4k
0
Browser

zustand

Logo of lobehub
lobehub

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.

72.8k
0
AI

data-fetching

Logo of lobehub
lobehub

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.

72.8k
0
AI