Google now also indexes scanned pdf-documents

Thanks to OCR (optical character recognition) technologies Google now indexes and makes available to text searches pdf-documents, that have originally been scanned. While these documents in the past appeared to be images with their content (text) hidden in the image, Google now converts them to html text and indexes their content. These are great news for everyone searching for articles in old scanned journals.
See: Official Google Blog: A picture of a thousand words?

Blogged with the Flock Browser

Tags:

Language:

Add new comment

Creative Commons License
All content on the site authored by Ulrich Schrader is licensed under a Creative Commons-License. Other licenses may apply for other authors.
Creative Commons explained

User login