.
logo

The PEKING project

developing new technology

for document processing



The PEKING project (People and Knowledge Information Gathering) is a 5th framework project (IST-25338, January 2001 - December 2002) addressing the problems of supervised and unsupervised classification and (cross-lingual) matching of documents in organizations.

The consortium consisted of the following partners:

The project started in Januari 2001 and was successfully completed in December 2002.



In the course of the PEKING project, KUN and Edmond have addressed the real-life situation of a Dutch User (Fiscaal) which is typical for many firms and institutions which are providing access to a large amount of systematically collected documents. The documents are presently manually classified according to a hierarchical thesaurus, which is hard to keep up to date and to modify. Furthermore, certain index terms have been added to the documents manually, and a conventional keyword-based search facility is available. Since the manual classification and index term assignment is expensive, inflexible and rather subjective, there is a pressing need for an automatic disclosure mechanism to replace or at least support the manual classification process.

The following technical problems were addressed:

KUN has extended the LCS (Linguistic Classification System), developed as a prototype in the course of the earlier DORO project, into an industrial quality system capable of classifying large streams of documents in many languages.


Publicly available documentation and publications:


Requests for information can be directed to

Cornelis H.A. Koster
Department of Computing Science
University of Nijmegen
6525ED Nijmegen, The Netherlands
tel: +30.24.3653411
fax: +30.24.3553450
email: kees@cs.kun.nl