Probably also want to use the -j option to pdfimages, because a Them, because it gets you the raw data at its original size. Series of bitmaps, pdfimages will do a much better job of extracting It simply ignores any text or vectorĪs a result, if what you have is a PDF that's just a wrapper around a Pdfimages looks through the PDF for embedded bitmap images andĮxports each one to a file. pdfimagesĭoes not do the same thing that convert does when given a PDF asĬonvert takes the PDF, renders it at some resolution, and uses the Update: As you pointed out, gscan2pdf (the way you're using it) is just a wrapper for pdfimages (from poppler). (You can prepend -units PixelsPerInch or -units Perhaps you need to use -density to do the conversion at a higherĭpi: convert -density 300 file.pdf page_%04d.jpg Versions (as a PNG to avoid further quality loss). Perhaps cut the same section out of the poor quality and good quality Could you post some samples to illustrate? It's not clear what you mean by "quality loss". TLDR - Use pdfimages : pdfimages -j input.pdf output The method in the answer given here results in an output which is comparable in size to the input and doesn't suffer from quality loss. The currently accepted answer does the job but results in an output which is larger in size and suffers from quality loss. For example: pdftoppm input.pdf outputname -png -rx 300 -ry 300 Converting a single page or a range of pages of the PDF pdftoppm input.pdf outputname -png -f. This will output each page in the PDF using the format outputname-01.png, with 01 being the index of the page. You can use pdftoppm from the poppler-utils package to convert a PDF to a PNG: pdftoppm input.pdf outputname -png
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |