Is it possible to partial rendering of pdf file?

mukesh.9.bhalla · October 30, 2020, 4:42am

Hi , can i do partial rendering of pdf file.
Suppose pdf size is 60-100 mb then it will take time to load on viewer so is it possible to render only first 4 pages and as user scroll down then remaining pages will load.

Logan · October 30, 2020, 2:21pm

Hi there,

PDF.js Express already does this out of the box (if possible). This kind of loading only works if the PDF is linearized, however.

Thanks!
Logan

fuse · December 16, 2020, 11:46pm

Hi Logan.

I am trying to get range requests working and I believe I have set my server up to support this, however the web viewer always downloads the documents in full.

To test this I have done the following:

This is a 2GB linearized document https://s3.amazonaws.com/pdftron/downloads/pl/2gb-sample-file.pdf

Using PDF.js:
https://mozilla.github.io/pdf.js/web/viewer.html?file=https%3A%2F%2Fs3.amazonaws.com%2Fpdftron%2Fdownloads%2Fpl%2F2gb-sample-file.pdf

You can see the partial requests (status 206) flowing in. So this works as expects

Using PDFTron:
https://www.pdftron.com/webviewer/demo/
Choose File > paste in link

Again, the range requests are working as expected

Using PDF.js Express:
https://pdfjs.express/demo
Try Your Own File > paste link in

This time, no range requests are made and the viewer tries to download the whole file. And its the same on my instance of PDFjs Express

I am wondering if I need to do something special to turn this feature on - or if this is a bug?

Thanks, Luke

Logan · December 17, 2020, 4:29pm

Hey there! You shouldn’t have to do anything to enable this functionality.

We can reproduce this issue and will investigate.

Thanks!
Logan

nigel · January 19, 2021, 11:31pm

Hi.

I’m wondering whether you have any further on this issue. I’m facing the same issue and seeking to debug it in the dev console but with limited success.

It would be good to confirm my assumptions about what the viewer is expecting in order to use byte range support. It looks to me like the requirement is for the site to return the following headers:

Content-Encoding: “identity” or not supplied
Accept-Ranges: “bytes”
Content-Length: Must return an value parsable by parseInt that is a number.

Plus the content length header’s value must be > 2 x the chunk size and you must be using http/https as the URL protocol.

There also seems to be support for disabling the use of ranges and setting the range size, but I haven’t spent the time to figure out whether/how they can be configured.

fuse · February 16, 2021, 12:16am

Hi Logan,

Just checking in to see if you have any updates about this issue.

Cheers, Luke

Logan · February 17, 2021, 6:30pm

Hi there,

There hasn’t really been any progress on this yet as some higher priority things have came up. We should have some resources freed up soon and then we will start investigating.

I will keep you updated!

Logan · March 1, 2021, 8:11pm

Hey everyone,

Just a heads up that we found the issue and it will be fixed in the next patch, hopefully some time this week.

Thank you for your patience on this one.

Logan

Logan · March 3, 2021, 6:19pm

This has been fixed!

nigel · March 4, 2021, 1:18am

Hi Logan.

I’m sorry to report this but it seems that the work is incomplete. I’m seeing a slew of “Bad end offset” messages and what appear to be multiple retrievals of the entire document:

I’m also attaching a dev console screenshot at the time the error is thrown that will hopefully be very helpful.

Logan · March 4, 2021, 3:15pm

Hey there!

I did a bit of research on this one and it seems like it may be due to an issue with Chrome. Do you get this error in FireFox? Also can you try clearing your cache and see if it fixes it?

Also if you could give me the URL of the document you’re trying to load, that would be great!

Thanks,
Logan

nigel · March 4, 2021, 9:44pm

Thanks for the quick reply, Logan.

The URL of the PDF is on a staging server for the site we’re developing. Can I PM you the details?

Logan · March 4, 2021, 9:56pm

Yup, absolutely!

Logan

fuse · March 9, 2021, 3:04am

Hi Logan,

I can confirm that range requests are now working for me.

However, it seems that the initial request (with response status 200) seems to continue downloading alongside the range requests. Is this the intended behaviour? My understanding is that the first request should be canceled if range requests are initiated.

Here is a screenshot demonstrating the current behaviour.

Whereas this is a screenshot of how PDFTron handles it, which is more like what I would expect.

Also I noticed that PDFTron’s inital request recieves a 206 response instead of 200. Likely due to the presence of the range header on that initial request.

PDFTron inital request

PDFjs Express inital request

Cheers, Luke.

Logan · March 9, 2021, 3:03pm

Hey there,

This is the expected behaviour of PDF.js. It downloads range requests on demand/when needed, and in parallel downloads the entire document. I am not sure why they implemented it this way, but this is intended.

That being said, I will be adding an option in the future to disable the extra downloading. Keep an eye out in a future release.

Thanks!

fuse · March 9, 2021, 10:34pm

Thanks for clarifying that Logan

tanshiqi · November 23, 2022, 6:29am

Is this issue solved? It seems range request not work on official demo or my project(8.7.0)
PDF: https://s3.amazonaws.com/pdftron/downloads/pl/2gb-sample-file.pdf
demo: PDF.js Viewer Demo | PDF.js Express

Topic		Replies	Views
How to enable range requests on the WebViewer? Right now the viewer waits for the whole file to download Technical Support pdfjs-express	2	1377	March 1, 2021
Linearized pdf load incorrectly Technical Support	1	45	October 7, 2024
Server side range request sample code in javascript/angular Technical Support pdfjs-express	5	47	November 25, 2024
Viewing linearized (fast web view) of password protected pdf Technical Support pdfjs-express	8	467	August 22, 2022
loadDocument does not load linearlized PDF correctly Technical Support pdfjs-express	5	41	March 19, 2025

Is it possible to partial rendering of pdf file?

Related topics