The aim of the Data Extraction Research Group is the extraction and structuring of data from unstructured and semi-structured electronic documents, such as those found on the Web and in many different data warehouses.
The Data Extraction Group includes faculty from Brigham Young University's Computer Science Department, Information Systems Department, and Linguistics Department.
Here is a statement of our objectives (28K PostScript, 30K PDF, 3K ASCII).
A closely related research project is Data Revitalization in Historical Documents.
Research efforts of the Data Extraction Group are partially funded by: the National Science Foundation, Novell, Faneuil Research, Ancestry.com, and found.com. (Here's a link to our NSF award on the NSF Award Abstracts Web site .)
|Comments are welcome.||Updated Thu Mar 20 11:43:24 MST 2003|