Retrieval Software

New products help find needles in imaging-system haystacks

Keyword searches are not considered a big deal when done on small text files, but plowing through millions of electronic documents full of typographical errors is another matter. Search and retrieval of particular information is difficult if not impossible without the help of special software that scans database documents for key words or phrases. Such programs run anywhere from $100 to $100,000, depending on the level of sophistication.

Search and retrieval software is capable of indexing both structured and unstructured data. Structured data such as Census lists is organized into predetermined fields containing names, addresses, Social Security numbers and other information. Searches then can be done, for instance, to find all the people named Smith living within a particular ZIP code.
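A fielded search of this kind can be sketched in a few lines of Python. This is only an illustration: the `Person` record, its field names and the sample entries are invented for the example and do not come from any actual Census file or product.

```python
from dataclasses import dataclass


@dataclass
class Person:
    """One structured record with predetermined fields (hypothetical layout)."""
    first_name: str
    last_name: str
    zip_code: str


def find_by_name_and_zip(records, last_name, zip_code):
    """Return every record whose last name and ZIP code both match."""
    wanted = last_name.lower()
    return [
        r for r in records
        if r.last_name.lower() == wanted and r.zip_code == zip_code
    ]


# Made-up sample data, for illustration only.
SAMPLE = [
    Person("Ann", "Smith", "20001"),
    Person("Bob", "Smith", "20002"),
    Person("Cal", "Jones", "20001"),
    Person("Dee", "Smith", "20001"),
]

if __name__ == "__main__":
    for person in find_by_name_and_zip(SAMPLE, "Smith", "20001"):
        print(person.first_name, person.last_name, person.zip_code)
```

Because every record has the same fields, the search reduces to comparing two field values per record.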

Unstructured data, such as maps or regulatory documents, is considerably more difficult to catalog. Relational database management systems from companies such as Informix and Oracle can be used to break down data into various tables that are cross-indexed. Matrixes developed from the Navy's aircraft maintenance records, for instance, highlight information such as plane identification numbers, engine overhaul dates and flying weather. The tables can be linked to answer questions such as "What is the average fuel consumption for helicopter landings in windy versus calm weather?"
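The idea of linking cross-indexed tables to answer such a question can be sketched with Python's built-in `sqlite3` module. The table names, columns and figures below are assumptions made up for the example; they are not the Navy's actual records or any vendor's schema.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with two linked tables (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE aircraft (plane_id TEXT PRIMARY KEY, kind TEXT)")
    conn.execute(
        "CREATE TABLE landings (plane_id TEXT, weather TEXT, fuel_used REAL)"
    )
    # Invented sample rows, for illustration only.
    conn.executemany(
        "INSERT INTO aircraft VALUES (?, ?)",
        [("H-1", "helicopter"), ("H-2", "helicopter"), ("J-1", "jet")],
    )
    conn.executemany(
        "INSERT INTO landings VALUES (?, ?, ?)",
        [
            ("H-1", "windy", 12.0),
            ("H-2", "windy", 14.0),
            ("H-1", "calm", 9.0),
            ("H-2", "calm", 11.0),
            ("J-1", "windy", 50.0),
        ],
    )
    return conn


def average_fuel_by_weather(conn, kind):
    """Join the two tables on plane_id and average fuel use per weather type."""
    rows = conn.execute(
        """
        SELECT l.weather, AVG(l.fuel_used)
        FROM landings AS l
        JOIN aircraft AS a ON a.plane_id = l.plane_id
        WHERE a.kind = ?
        GROUP BY l.weather
        """,
        (kind,),
    ).fetchall()
    return dict(rows)


if __name__ == "__main__":
    print(average_fuel_by_weather(build_demo_db(), "helicopter"))
```

The plane identification number is the shared key: it lets the landing records be filtered by aircraft type even though that type is stored in a different table.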

High-end search and retrieval packages from companies such as Excalibur Technologies Corp. and Future Tech Systems employ a type of artificial intelligence known as fuzzy logic, which enables users to search for words even if they are misspelled. Most large-volume imaging applications use optical character recognition devices that convert documents into digital formats. But OCR scanning has about a 5 percent error rate, resulting in lots of misread characters in big jobs.
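A rough calculation shows why that error rate matters for keyword searches. The sketch below assumes the 5 percent figure applies to each character independently, which the article does not specify, so the numbers are indicative only.

```python
def expected_misreads(total_chars, error_rate=0.05):
    """Expected number of misread characters, assuming a per-character rate."""
    return total_chars * error_rate


def word_intact_probability(word_length, error_rate=0.05):
    """Chance that every character of a word is read correctly,
    assuming each character is misread independently."""
    return (1.0 - error_rate) ** word_length


if __name__ == "__main__":
    print(expected_misreads(1_000_000))
    print(round(word_intact_probability(len("inventory")), 2))
```

Under these assumptions a nine-letter word such as "inventory" survives scanning intact only about 63 percent of the time, so an exact-match search would miss a large share of its occurrences.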

Fuzzy logic programs use techniques such as adaptive pattern recognition processing to search for patterns in digital data instead of searching for specific words. If similar characters (i's and l's, for instance) are misread during OCR scanning, the software can deduce that "lmuemtery" is really "inventory" and retrieve the relevant database data.
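The commercial pattern-recognition methods are proprietary and are not described here, but the general idea of tolerating look-alike characters can be sketched with a much simpler stand-in technique. In the Python example below, the `CONFUSABLE` table is a hypothetical set of look-alike pairs inferred from the "lmuemtery" example above, and the similarity score comes from the standard library's `difflib.SequenceMatcher`; neither is claimed to be what any vendor actually uses.

```python
from difflib import SequenceMatcher

# Hypothetical look-alike classes: each character is mapped to one
# representative of its visual group (l/1 -> i, m -> n, u -> v, e -> o).
CONFUSABLE = str.maketrans({"l": "i", "1": "i", "m": "n", "u": "v", "e": "o"})


def shape_key(word):
    """Collapse visually similar characters so misreads compare as equal."""
    return word.lower().translate(CONFUSABLE)


def fuzzy_lookup(query, vocabulary, cutoff=0.8):
    """Return the vocabulary word whose shape key best matches the query,
    or None if no candidate reaches the similarity cutoff."""
    key = shape_key(query)
    best_word, best_score = None, 0.0
    for word in vocabulary:
        score = SequenceMatcher(None, key, shape_key(word)).ratio()
        if score > best_score:
            best_word, best_score = word, score
    return best_word if best_score >= cutoff else None


if __name__ == "__main__":
    words = ["invention", "inventory", "monetary"]
    print(fuzzy_lookup("lmuemtery", words))
```

Both "lmuemtery" and "inventory" collapse to the same shape key, so the garbled string is matched to the intended word, while unrelated strings fall below the cutoff and return nothing.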
