TMSSequoia proposes to develop a software system that converts paper-based legacy engineering drawings, technical manuals, and reports to a digital format. The process will include automatic extraction of data from the scanned images; the extracted data will be entered into a Configuration Management Information System (CMIS) for storage, query, and retrieval. TMSSequoia will identify current software tools and applications that support this effort, and then define how those tools can be used to minimize new software development. Any additional development needed to bring current technologies together into a complete system will also be identified, covering document scanning, identification of document type, extraction of index data from the image, storage in the CMIS, and ultimate retrieval and viewing. Much of the research in this project will focus on identifying and evaluating the possible input data to ensure that proper extraction and indexing take place.
Keywords: DIGITAL AUTOMATION FORMS PROCESSING DOCUMENT RECOGNITION