OCR-and-Paste
I have actually taken it one step further and used the technology to automate the extraction of data from web pages that are scroll heavy. Instead of scrolling forever for information on a web page I can use the tool to take a screenshot of the entire web page and convert it to text for me. You can imaging how the technology could be used maliciously but in this case it's just to get information.
The ability of OCR to read screen-shots is quite impressive. Though screen-shots usually come out in low 72 or 96 DPI resolution which is traditionally not optimal for OCR, the text and text in image is what is called pixel perfect so it provides an excellent candidate for conversion. Also leveraging document analysis technologies built into OCR I can grab a table and have it export a table versus having to copy and paste text and manipulate back to original form later.
When you become and expert in OCR you find yourself using the technology in the oddest places, but this is one case where my productivity has increased because of the tool, and I think it's worth sharing. I suspect that OCR of screen-shots is only going to increase in the future because of this, malicious reasons, and counter mal-ware technologies. As well as a very easy way to convert data from one locked down legacy system to a new one.
Labels: copy and paste, efficency, OCR, screenshot

0 Comments:
Post a Comment
<< Home