Document Conversion

October 1996


Document Conversion

Scanners are becoming smaller, cheaper, faster and more functional

The first step in most imaging applications is to capture text, graphics and photos so that they can be stored for easy reference. Electronic scanners do this by using light to "read" images off paper documents, microfilm or photographic film. Ordinary scanners take pictures of images for archival applications in which images are stored for reference only. Optical Character Recognition (OCR) devices are used to recognize shapes and characters from predefined fields and convert them into digital computer code, so that text can be manipulated once it is scanned.

Until recently, many government imaging projects involved high-end color scanners capable of processing more than 100 pages a minute and costing hundreds of thousands of dollars. These large flatbed units, from companies such as Sharp and Xerox, are used in sophisticated defense and intelligence applications, or to tackle high-volume paperwork such as tax forms or medical records.

But now agencies are turning to scanners for smaller jobs as well. Inexpensive desktop and handheld models are being used to store correspondence and other simple tasks. Sheet-fed units, from companies such as Hewlett-Packard and Microtek, can be as small as a PC mouse and cost as little as $200. Data capture has become so popular that Compaq and Hewlett-Packard recently introduced computers with scanners built into the keyboards.

Increased competition in the scanners market has resulted in less expensive and more sophisticated machines. Time- and money-saving features once considered optional are now standard on many models. Duplex scanning, which enables images on both sides of documents to be captured at the same time, is rapidly replacing simplex scanning. Other features such as automatic feeders, color dropout options and super-high resolutions also are turning up on medium-range and low-end machines. Higher resolutions, however, require more scanning time per page and more storage capacity.

Scanning speeds on lower-resolution units are up to about 150 pages a minute, with an average recognition accuracy of 95 percent. Some models have built-in spell checkers and word-analysis programs to ease the cleanup job when characters are misread. Many OCR devices use "fuzzy logic" to decipher mispelled words.

Stay up-to-date with federal news alerts and analysis — Sign up for GovExec's email newsletters.
Close [ x ] More from GovExec

Thank you for subscribing to newsletters from
We think these reports might interest you:

  • Sponsored by G Suite

    Cross-Agency Teamwork, Anytime and Anywhere

    Dan McCrae, director of IT service delivery division, National Oceanic and Atmospheric Administration (NOAA)

  • Data-Centric Security vs. Database-Level Security

    Database-level encryption had its origins in the 1990s and early 2000s in response to very basic risks which largely revolved around the theft of servers, backup tapes and other physical-layer assets. As noted in Verizon’s 2014, Data Breach Investigations Report (DBIR)1, threats today are far more advanced and dangerous.

  • Federal IT Applications: Assessing Government's Core Drivers

    In order to better understand the current state of external and internal-facing agency workplace applications, Government Business Council (GBC) and Riverbed undertook an in-depth research study of federal employees. Overall, survey findings indicate that federal IT applications still face a gamut of challenges with regard to quality, reliability, and performance management.

  • PIV- I And Multifactor Authentication: The Best Defense for Federal Government Contractors

    This white paper explores NIST SP 800-171 and why compliance is critical to federal government contractors, especially those that work with the Department of Defense, as well as how leveraging PIV-I credentialing with multifactor authentication can be used as a defense against cyberattacks

  • Toward A More Innovative Government

    This research study aims to understand how state and local leaders regard their agency’s innovation efforts and what they are doing to overcome the challenges they face in successfully implementing these efforts.

  • From Volume to Value: UK’s NHS Digital Provides U.S. Healthcare Agencies A Roadmap For Value-Based Payment Models

    The U.S. healthcare industry is rapidly moving away from traditional fee-for-service models and towards value-based purchasing that reimburses physicians for quality of care in place of frequency of care.

  • GBC Flash Poll: Is Your Agency Safe?

    Federal leaders weigh in on the state of information security


When you download a report, your information may be shared with the underwriters of that document.