Organisations are increasingly information intensive; hence providing access to data that is trapped in various proprietary forms including catalogues, databases, human resource systems and internally generated documents is now becoming a significant and challenging task. The authors have undertaken research into approaches to capture relevant knowledge from legacy documents. This is achieved by converting the legacy documents to XML, (eXtensible Markup Language), documents where the output is semantically tagged. Once in an XML form, the data can be easily transformed. This paper describes the development of tools to automate the process of converting legacy documents to XML documents. The purpose of this work is improve the efficiency and reliability of Expertise Finder suitable for use within an engineering design environment. We will also show that by querying the resultant XML versions of legacy documents provides better results than a basic text search over the identical documents when applied used within an Expertise Finder.
Skip Nav Destination
ASME 2004 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference
September 28–October 2, 2004
Salt Lake City, Utah, USA
Conference Sponsors:
- Design Engineering Division and Computers and Information in Engineering Division
ISBN:
0-7918-4697-0
PROCEEDINGS PAPER
An Approach to Extracting Knowledge From Legacy Documents
Richard Crowder,
Richard Crowder
University of Southampton, Southampton, UK
Search for other works by this author on:
Yee-Wie Sim
Yee-Wie Sim
University of Southampton, Southampton, UK
Search for other works by this author on:
Richard Crowder
University of Southampton, Southampton, UK
Yee-Wie Sim
University of Southampton, Southampton, UK
Paper No:
DETC2004-57677, pp. 253-259; 7 pages
Published Online:
June 27, 2008
Citation
Crowder, R, & Sim, Y. "An Approach to Extracting Knowledge From Legacy Documents." Proceedings of the ASME 2004 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference. Volume 4: 24th Computers and Information in Engineering Conference. Salt Lake City, Utah, USA. September 28–October 2, 2004. pp. 253-259. ASME. https://doi.org/10.1115/DETC2004-57677
Download citation file:
7
Views
Related Proceedings Papers
A Design Alternatives Assessment and Management Approach
IDETC-CIE2003
Related Articles
Materials Databases and Knowledge Management for Advanced Nuclear Technologies
J. Pressure Vessel Technol (February,2011)
Methodology and Tools to Support Knowledge Management in Topology Optimization
J. Comput. Inf. Sci. Eng (December,2010)
Articulating a Learning Objective
J. Mech. Des (July,2007)
Related Chapters
Engineering Design about Electro-Hydraulic Intelligent Control System of Multi Axle Vehicle Suspension
International Conference on Instrumentation, Measurement, Circuits and Systems (ICIMCS 2011)
Reliability Analysis and Evaluation of Gas Supply System
International Conference on Mechanical and Electrical Technology 2009 (ICMET 2009)
Quality and Reliability Issues in Materials Databases: ASTM Committee E 49.05
Computerization and Networking of Materials Databases: Third Volume