Google now also indexes scanned PDF documents

Thanks to OCR (optical character recognition) technology, Google now indexes scanned PDF documents and makes them available to text searches. In the past, such documents were effectively images, with their text locked inside the image; Google now converts them to HTML text and indexes the content. This is great news for anyone searching for articles in old scanned journals.
See: Official Google Blog: A picture of a thousand words?

