Detailed description of issue
{I am trying to build a tool where a search result provides PDFs but the user might have a few scanned pdfs uploaded, now as a feature I want to highlight the keyword the user searched. Is there any way i can highlight the words in a scanned PDF }
Expected behaviour
{Provide a screenshot or description of the expected behaviour}
Does your issue happen with every document, or just one?
{Answer here}
Link to document
{Provide a link to the document in question if possible}
This is a process called OCR which is a pretty advanced machine learning technique that converts an image of text to machine readable text.
PDF.js Express does not support OCR (no viewer does) - however, our parent company PDFTron has some OCR tools you can use - here are some guides and here is a sample using Node. Please note that this has to be done server side.