Hi folks!

I’m the creator of BentoPDF. It is an open source PDF toolkit that runs entirely in your browser. Your documents stay private, by design.

BentoPDF started as a small side project, but over time it has grown into something much bigger. With our latest major update, BentoPDF now includes 100+ tools, all running fully client-side.

You can do the basics like merge PDFs(while preserving bookmarks), split documents, extract or delete pages, reorder files, rotate pages, and compress PDFs. Thee are also some advanced tools.

You can edit and annotate PDFs directly in the browser: highlight text, add comments, draw shapes, insert images, fill(including XFA) and create forms, manage bookmarks, generate tables of contents, redact, add headers, footers, watermarks, and page numbers.

BentoPDF also supports an extensive range of file conversions. You can convert Word, Excel, PowerPoint, OpenOffice, Pages, CSV, RTF, EPUB, MOBI, comic book formats, and many more into PDFs, and also convert PDFs back into Word, Excel, images, Markdown, CSV, JSON, and plain text.

For images, BentoPDF supports a massive variety of formats, including HEIC, WebP, SVG, PSD, JP2, and and aalso other formats such as EPUB, CBR/CBZ. You can convert images to PDFs, extract images from PDFs in their original format, or rasterize PDFs with full DPI control.

There are also organization and optimization tools: OCR, PDF/A conversion, booklet creation, N-up layouts, page division, attachment management, layer (OCG) editing, metadata inspection and editing, repair tools, and advanced compression algorithms that rival commercial solutions.

The latest update also includes AI ready extraction tools to export PDFs to structured JSON, extract tables as CSV/Markdown/JSON, and prepare PDFs for RAG and LLM workflows.

All of this works entirely in the browser, without accounts, uploads, or tracking.

This is my first post here and I hope you like it. Any feedback or feature requests are appreciated. Thank you.

Github Link: https://github.com/alam00000/bentopdf

  • stephaaaaan@feddit.org
    link
    fedilink
    English
    arrow-up
    7
    ·
    3 hours ago

    What I would love to see is batch processing of mapped form fields from a PDF template, e.g. to fill out training certificate template pdfs with name, date, company, and instructor from a given CSV file, add a signature and print it. Is something like that possible? 🙂

    We currently use nodered, python and reportlab and I‘m looking to somewht simplify the process :)

  • Phoenixz@lemmy.ca
    link
    fedilink
    English
    arrow-up
    1
    ·
    2 hours ago

    Interesting!

    Can this work server side as well? I’d love a good PDF toolkit to integrate as a backend into my open source system

    Most important missing detail though: reliable conversion from HTML to PDF

    I’m currently using wkhtnltopdf and it gets unreliable results at best, especially with layout (CSS)

    Any suggestions on what tool could do that best?

  • tburkhol@lemmy.world
    link
    fedilink
    English
    arrow-up
    38
    ·
    7 hours ago

    Great project. I like the 1-star reviews complaining about the lack of advertising and tracking.

  • Blackmist@feddit.uk
    link
    fedilink
    English
    arrow-up
    120
    ·
    11 hours ago

    Can it also redact text from documents without allowing you to just copy and paste it back out again?

    Asking for a friend.

  • ToxicWaste@lemmy.cafe
    link
    fedilink
    English
    arrow-up
    6
    arrow-down
    1
    ·
    6 hours ago

    what is the reason to put that tool into a browser? if i use the thing on my private computer, it increases complexity compared to a local installation (not an issue for many ppl here, but for my grandma surely). if i use it on a corporate environment, wouldn’t more employees use it if it was the default PDF viewer on their managed device?

    what did i miss?

    • alam@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      6
      ·
      5 hours ago

      Yes, as arthor mentioned it is supposed to be a self hosted tool. But since it’s client side and just a bunch of static files in the end, I will soon be porting it for all major platforms as an installable using Tauri

    • arthor@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      5 hours ago

      this is meant to be self hosted on a server… in my case its a home server, so my wife and i can access/use this at home and on the road, having the documents synced in one place, which is also self-hosted

  • Clot@lemmy.zip
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 hours ago

    Thank you so much! This could be a great replacement to ilovepdf, that website has so much data and no one knows what they are dong with it.

  • Lemminary@lemmy.world
    link
    fedilink
    English
    arrow-up
    14
    ·
    edit-2
    8 hours ago

    I’ve used it before for a job application! I needed to send them sensitive data. Tysm!

    Great intuitive UI, does what it says, and it’s fast. 5/5

  • Meldrik@lemmy.wtf
    link
    fedilink
    English
    arrow-up
    7
    ·
    7 hours ago

    I use this at home and I’m thinking of setting this up at work, to prevent my colleagues from using shady PDF-sites, for merging or splitting PDF-files.

    How does the license work for internal use at a company, by it employees?

    • alam@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      5
      ·
      7 hours ago

      Hello and thank you. There is a one time life time commercial licence that comes for $49 and can be used by unlimited number of users (:

  • carrylex@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 hours ago

    8k Stars in 2 months. Wild…

    PS: Your git is misconfigured and doesn’t line up with your GH account…