Hi folks!

I’m the creator of BentoPDF. It is an open source PDF toolkit that runs entirely in your browser. Your documents stay private, by design.

BentoPDF started as a small side project, but over time it has grown into something much bigger. With our latest major update, BentoPDF now includes 100+ tools, all running fully client-side.

You can do the basics like merge PDFs(while preserving bookmarks), split documents, extract or delete pages, reorder files, rotate pages, and compress PDFs. Thee are also some advanced tools.

You can edit and annotate PDFs directly in the browser: highlight text, add comments, draw shapes, insert images, fill(including XFA) and create forms, manage bookmarks, generate tables of contents, redact, add headers, footers, watermarks, and page numbers.

BentoPDF also supports an extensive range of file conversions. You can convert Word, Excel, PowerPoint, OpenOffice, Pages, CSV, RTF, EPUB, MOBI, comic book formats, and many more into PDFs, and also convert PDFs back into Word, Excel, images, Markdown, CSV, JSON, and plain text.

For images, BentoPDF supports a massive variety of formats, including HEIC, WebP, SVG, PSD, JP2, and and aalso other formats such as EPUB, CBR/CBZ. You can convert images to PDFs, extract images from PDFs in their original format, or rasterize PDFs with full DPI control.

There are also organization and optimization tools: OCR, PDF/A conversion, booklet creation, N-up layouts, page division, attachment management, layer (OCG) editing, metadata inspection and editing, repair tools, and advanced compression algorithms that rival commercial solutions.

The latest update also includes AI ready extraction tools to export PDFs to structured JSON, extract tables as CSV/Markdown/JSON, and prepare PDFs for RAG and LLM workflows.

All of this works entirely in the browser, without accounts, uploads, or tracking.

This is my first post here and I hope you like it. Any feedback or feature requests are appreciated. Thank you.

Github Link: https://github.com/alam00000/bentopdf

  • stephaaaaan@feddit.org
    link
    fedilink
    English
    arrow-up
    10
    ·
    7 hours ago

    What I would love to see is batch processing of mapped form fields from a PDF template, e.g. to fill out training certificate template pdfs with name, date, company, and instructor from a given CSV file, add a signature and print it. Is something like that possible? 🙂

    We currently use nodered, python and reportlab and I‘m looking to somewht simplify the process :)

  • tburkhol@lemmy.world
    link
    fedilink
    English
    arrow-up
    43
    ·
    11 hours ago

    Great project. I like the 1-star reviews complaining about the lack of advertising and tracking.

  • Blackmist@feddit.uk
    link
    fedilink
    English
    arrow-up
    136
    arrow-down
    1
    ·
    14 hours ago

    Can it also redact text from documents without allowing you to just copy and paste it back out again?

    Asking for a friend.

  • Phoenixz@lemmy.ca
    link
    fedilink
    English
    arrow-up
    1
    ·
    5 hours ago

    Interesting!

    Can this work server side as well? I’d love a good PDF toolkit to integrate as a backend into my open source system

    Most important missing detail though: reliable conversion from HTML to PDF

    I’m currently using wkhtnltopdf and it gets unreliable results at best, especially with layout (CSS)

    Any suggestions on what tool could do that best?

  • Lemminary@lemmy.world
    link
    fedilink
    English
    arrow-up
    16
    ·
    edit-2
    12 hours ago

    I’ve used it before for a job application! I needed to send them sensitive data. Tysm!

    Great intuitive UI, does what it says, and it’s fast. 5/5

  • ToxicWaste@lemmy.cafe
    link
    fedilink
    English
    arrow-up
    6
    arrow-down
    1
    ·
    9 hours ago

    what is the reason to put that tool into a browser? if i use the thing on my private computer, it increases complexity compared to a local installation (not an issue for many ppl here, but for my grandma surely). if i use it on a corporate environment, wouldn’t more employees use it if it was the default PDF viewer on their managed device?

    what did i miss?

    • alam@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      7
      ·
      9 hours ago

      Yes, as arthor mentioned it is supposed to be a self hosted tool. But since it’s client side and just a bunch of static files in the end, I will soon be porting it for all major platforms as an installable using Tauri

    • arthor@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      9 hours ago

      this is meant to be self hosted on a server… in my case its a home server, so my wife and i can access/use this at home and on the road, having the documents synced in one place, which is also self-hosted

  • Meldrik@lemmy.wtf
    link
    fedilink
    English
    arrow-up
    8
    ·
    10 hours ago

    I use this at home and I’m thinking of setting this up at work, to prevent my colleagues from using shady PDF-sites, for merging or splitting PDF-files.

    How does the license work for internal use at a company, by it employees?

    • alam@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      8
      ·
      10 hours ago

      Hello and thank you. There is a one time life time commercial licence that comes for $49 and can be used by unlimited number of users (:

  • carrylex@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    9 hours ago

    8k Stars in 2 months. Wild…

    PS: Your git is misconfigured and doesn’t line up with your GH account…

    • krash@lemmy.ml
      link
      fedilink
      English
      arrow-up
      1
      ·
      5 hours ago

      As someone who have been using both, you don’t need an account to use bentopdf. All the data is processed locally, making it excellent for a single user scenario. I drink Sterling has a very handy omni-tool, but I dare say it’s a matter of preference.

      I go with bento where I can, and use sterling as a fallback.

    • Voroxpete@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      5
      ·
      11 hours ago

      Honestly, I think this is just one where you try it for yourself. The compose file is about 4 lines long, I had the whole thing up and running in about 30 seconds (OK, 45; I forgot a port was already in use and had to redeploy).

      So far my one big complaint would be that the self-hosted version replicates the entire website, including all of the “Why choose Bento PDF” and “Try now” and so on. It’d be nice to just have the tools right there when I load it up. Other than that, well, it looks cool, I’ll know more once I actually try out the available options.

    • alam@lemmy.worldOP
      link
      fedilink
      English
      arrow-up
      9
      ·
      13 hours ago

      Not sure, as I haven’t used Stirling and at the same time I didn’t make it to compete with other tools. Hence I never mention its better than xyz tool either on our github or website. Users would have to do their own due diligence in this case. However it does have the best bookmark tool in the market(yes, better than adobe acrobat) and also a form creator tool, among others, which you can’t find in other OS tools.