Posted by Rambler in Linux

I've been messing with https://linux.die.net/man/1/pdftohtml and adjusting the settings some to try to get the results I desire, and am not getting what I want. There is still some overlapping text or broken URLs due to random, special characters being inserted in file names.

I have a collection of ebooks that I'd like to make available and share, but figured converting them to html with inline images would be the best way to do it after reading this discussion earlier.

Thoughts?

1

Comments

You must log in or register to comment.

CyberKat wrote

First time posting so take it easy on me please... I use Linux Mint and have found a program back when I was a windows user called Calibre which is an ebook management program. It is a Linux program that normally is in your Software Manager. Among it's features is the ability to convert ebooks to different formats. HTML is one format, I personally use it to convert them to .txt and then use text2wav and Lame to convert them to mp3 as I am an OLD computer user (53) and am slowly going blind. Hope this helps! if you want me to send you my .sh I wrote to handle the .txt to .mp3 process let me know via email fu.killme @ gmail.com

2

Rambler OP wrote

Thanks for the resposne. I actually ended up using Calibre to convert to html with some limited success. I think I need to adjust the settings more, as some of the pages had some weird formatting issues with the embedded images and text overlapping them, or in some cases lines not wrapping to the page causing horizontal scroll in the browser.

It's still something I want to get working as I have some great ebooks on alternative construction, gardening, and a lot of homestead and off-grid living stuff that I want to share. Maybe some day soon...

1