mirror of https://github.com/Frooodle/Stirling-PDF.git synced 2026-05-01 23:16:31 +02:00

James Brunton 5541dd666c Flesh out RAG system (#6197 )

# Description of Changes
Flesh out the RAG system and connect it to the PDF Question Agent so it
can answer questions about extremely large PDFs.

I expect the RAG system will need a lot more work before it is really
what we need, but this should be a reasonable start: it lets us connect
RAG to tools and have ingestion handled mostly automatically. I'm
leaving file deletion and proper file ID management for a future PR. We
also need to consider whether all tools should retrieve content
exclusively via RAG, or whether it's beneficial for tools to sometimes
fetch the content directly and other times fetch it from RAG.

A diagram of the expected interaction is as follows:

```mermaid
sequenceDiagram
    autonumber
    actor U as User
    participant FE as Frontend<br/>(ChatPanel)
    participant J as Java<br/>(AiWorkflowService)
    participant O as Engine:<br/>OrchestratorAgent
    participant QA as Engine:<br/>PdfQuestionAgent
    participant RAG as Engine:<br/>RagService + SqliteVecStore
    participant V as VoyageAI<br/>(embeddings)
    participant L as LLM<br/>(Claude / etc.)

    U->>FE: types "Summarise this PDF"<br/>(PDF already uploaded)
    FE->>J: POST /api/v1/ai/orchestrate/stream<br/>multipart: fileInputs[], userMessage
    Note over J: ByteHashFileIdStrategy<br/>id = sha256(bytes)[:16]
    J->>O: POST /api/v1/orchestrator<br/>{ files:[{id,name}], userMessage }

    O->>L: route via fast model
    L-->>O: delegate_pdf_question
    O->>QA: PdfQuestionRequest

    loop for each file
        QA->>RAG: has_collection(file.id)
        RAG-->>QA: false
    end
    QA-->>O: NeedIngestResponse(files_to_ingest)
    O-->>J: { outcome:"need_ingest", filesToIngest:[...] }

    Note over J: onNeedIngest
    loop per file
        J->>J: PDFBox: extract page text
        J->>O: POST /api/v1/rag/documents<br/>(long-running timeout)
        O->>RAG: chunk + stage documents
        O->>V: embed_documents (batches of 256)
        V-->>O: embeddings
        O->>RAG: add_documents
        O-->>J: { chunks_indexed: N }
    end

    Note over J: retry with resumeWith=pdf_question
    J->>O: POST /api/v1/orchestrator
    Note over O: fast-path to PdfQuestionAgent

    O->>QA: PdfQuestionRequest
    Note over QA: build RagCapability<br/>pinned to file IDs
    QA->>L: run(prompt) with search_knowledge tool

    loop up to max_searches
        L->>QA: search_knowledge(query)
        QA->>V: embed_query
        V-->>QA: query vector
        QA->>RAG: search(vector, collections=[file.id])
        RAG-->>QA: top-k chunks
        QA-->>L: formatted chunks
    end

    Note over QA: once budget spent,<br/>prepare() hides the tool
    L-->>QA: PdfQuestionAnswerResponse
    QA-->>O: answer
    O-->>J: { outcome:"answer", answer, evidence }
    J-->>FE: SSE "result"
    FE->>U: assistant bubble
```
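The ingest handshake in the diagram (steps through `need_ingest`, ingestion, and the retry) can be sketched roughly as below. This is a minimal illustration, not the code from this PR: `InMemoryStore`, `fake_embed`, `ask`, `ingest`, and `orchestrate` are hypothetical stand-ins for the vector store, the embeddings provider, the question agent, and the Java-side retry logic. It also assumes that `sha256(bytes)[:16]` means the first 16 hex characters of the digest, which the diagram does not state.

```python
import hashlib
from dataclasses import dataclass, field

EMBED_BATCH_SIZE = 256  # batch size shown in the diagram


def file_id(data: bytes) -> str:
    """Content-derived ID. Assumption: first 16 hex chars of the SHA-256 digest."""
    return hashlib.sha256(data).hexdigest()[:16]


def fake_embed(texts: list[str]) -> list[list[float]]:
    """Hypothetical stand-in for the embeddings provider: one toy vector per text."""
    return [[float(len(t))] for t in texts]


@dataclass
class InMemoryStore:
    """Hypothetical stand-in for the vector store, one collection per file ID."""
    collections: dict[str, list[tuple[str, list[float]]]] = field(default_factory=dict)

    def has_collection(self, name: str) -> bool:
        return name in self.collections

    def add_documents(self, name: str, chunks: list[str], vectors: list[list[float]]) -> None:
        self.collections.setdefault(name, []).extend(zip(chunks, vectors))


def ingest(store: InMemoryStore, fid: str, chunks: list[str]) -> int:
    """Embed chunks in batches and index them; returns the number of chunks indexed."""
    store.collections.setdefault(fid, [])  # register the collection even if it is empty
    for start in range(0, len(chunks), EMBED_BATCH_SIZE):
        batch = chunks[start:start + EMBED_BATCH_SIZE]
        store.add_documents(fid, batch, fake_embed(batch))
    return len(chunks)


def ask(store: InMemoryStore, file_ids: list[str]) -> dict:
    """Agent side: report files that still need ingesting, otherwise answer."""
    missing = [fid for fid in file_ids if not store.has_collection(fid)]
    if missing:
        return {"outcome": "need_ingest", "filesToIngest": missing}
    return {"outcome": "answer"}


def orchestrate(store: InMemoryStore, files: dict[str, list[str]]) -> dict:
    """Caller side: ask once, ingest whatever is missing, then retry once."""
    response = ask(store, list(files))
    if response["outcome"] == "need_ingest":
        for fid in response["filesToIngest"]:
            ingest(store, fid, files[fid])
        response = ask(store, list(files))
    return response
```

Because the ID is derived from the file's bytes, re-uploading identical content maps to the same collection, so the second request in the sketch skips ingestion and goes straight to answering.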

2026-05-01 14:11:54 +01:00

.devcontainer

…

.github

Add Dependabot groups for frontend npm + cargo deps (#6287 )

2026-05-01 11:37:07 +01:00

.taskfiles

Pdf comment agent (#6196 )

2026-05-01 10:19:38 +01:00

.vscode

…

app

Flesh out RAG system (#6197 )

2026-05-01 14:11:54 +01:00

devGuide

Add Taskfile for unified dev workflow across all components (#6080 )

2026-04-15 14:16:57 +00:00

devTools

build(deps-dev): bump @stylistic/stylelint-plugin from 4.0.0 to 5.1.0 in /devTools (#6177 )

2026-04-28 21:42:02 +01:00

docker

build(docker): pin base container images to immutable digests (#6173 )

2026-04-23 13:31:21 +01:00

docs

enable AppImage and rpm distrobutions (#6127 )

2026-04-17 22:19:16 +01:00

engine

Flesh out RAG system (#6197 )

2026-05-01 14:11:54 +01:00

frontend

Pdf comment agent (#6196 )

2026-05-01 10:19:38 +01:00

gradle/wrapper

fix(gradle): bump gradle jar version to 9.3.1-bin (#5938 )

2026-03-20 12:00:01 +00:00

images

…

scripts

Adjust zh-TW translation ignore list (#6062 )

2026-04-29 10:55:51 +01:00

testing

playwright (#6025 )

2026-04-27 11:35:50 +01:00

.dockerignore

fix file sharing bug (#6161 )

2026-04-23 14:52:25 +01:00

.editorconfig

Change AI engine to execute tools in Java instead of on frontend (#6116 )

2026-04-20 15:57:11 +01:00

.git-blame-ignore-revs

…

.gitattributes

…

.gitignore

playwright (#6025 )

2026-04-27 11:35:50 +01:00

.pre-commit-config.yaml

chore(pre-commit): bump linting and formatting tool versions and ignore Windows DLL artifact (#6165 )

2026-04-23 13:30:35 +01:00

ADDING_TOOLS.md

…

AGENTS.md

Have Task choose free ports for dev servers (#6145 )

2026-04-28 17:26:04 +01:00

build.gradle

build(deps): bump org.springframework.boot from 4.0.5 to 4.0.6 (#6225 )

2026-04-28 17:35:32 +01:00

CLAUDE.md

Move AI advice to AGENTS.md and add symlink from CLAUDE.md (#5914 )

2026-03-11 13:43:30 +00:00

CONTRIBUTING.md

Add Taskfile for unified dev workflow across all components (#6080 )

2026-04-15 14:16:57 +00:00

DATABASE.md

…

DeveloperGuide.md

Add Taskfile for unified dev workflow across all components (#6080 )

2026-04-15 14:16:57 +00:00

FILE_SHARING.md

fileshare (#5414 )

2026-03-25 11:00:40 +00:00

gradle.properties

…

gradlew

fix(gradle): bump gradle jar version to 9.3.1-bin (#5938 )

2026-03-20 12:00:01 +00:00

gradlew.bat

fix(gradle): bump gradle jar version to 9.3.1-bin (#5938 )

2026-03-20 12:00:01 +00:00

HowToUseOCR.md

…

launch4jConfig.xml

…

LICENSE

Add prototypes folder to test new functionality in (#6081 )

2026-04-09 08:21:07 +00:00

README.md

Add Taskfile for unified dev workflow across all components (#6080 )

2026-04-15 14:16:57 +00:00

SECURITY.md

…

settings.gradle

…

SHARED_SIGNING.md

fileshare (#5414 )

2026-03-25 11:00:40 +00:00

Taskfile.yml

Have Task choose free ports for dev servers (#6145 )

2026-04-28 17:26:04 +01:00

test_globalsign.pdf

…

test_irs_signed.pdf

…

WINDOWS_SIGNING.md

…

README.md

Stirling PDF - The Open-Source PDF Platform

Stirling PDF is a powerful, open-source PDF editing platform. Run it as a personal desktop app, in the browser, or deploy it on your own servers with a private API. Edit, sign, redact, convert, and automate PDFs without sending documents to external services.

Key Capabilities

Everywhere you work - Desktop client, browser UI, and self-hosted server with a private API.
50+ PDF tools - Edit, merge, split, sign, redact, convert, OCR, compress, and more.
Automation & workflows - No-code pipelines directly in the UI, with APIs to process millions of PDFs.
Enterprise‑grade - SSO, auditing, and flexible on‑prem deployments.
Developer platform - REST APIs available for nearly all tools to integrate into your existing systems.
Global UI - Interface available in 40+ languages.

For a full feature list, see the docs: https://docs.stirlingpdf.com

Quick Start

docker run -p 8080:8080 docker.stirlingpdf.com/stirlingtools/stirling-pdf

Then open: http://localhost:8080

For full installation options (including desktop and Kubernetes), see our Documentation Guide.

Resources

Support

Community Discord
Bug Reports: GitHub issues

Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines.

This project uses Task as a unified command runner for all build, dev, and test commands. Run task install to get started, or see the Developer Guide for full details.

For adding translations, see the Translation Guide.

License

Stirling PDF is open-core. See LICENSE for details.

Languages

TypeScript 47.7%

Java 42.3%

Python 4.4%

CSS 2%

Shell 1%

Other 2.5%
