Jan W. Amtrup

Text classification and information extraction
Since I started at Kofax, the main part of my work shifted to general document processing. We are mainly involved in the classification of documents and the extraction of information from them. Interestingly, some aspects combine very well with finite state approaches.

Natural Speech Processing
The goal of Natural Speech Processing is to employ speech recognition and processing systems that facilitate capabilities close to those of humans. Some main properties of such systems are

  • Incrementality. Only if a speech system begins processing its input before the speaker has finished her utterance, a natural, fluid reaction is possible; for example, a machine is only able to interrupt a dialog partner if it understands while it is listening.
  • Interactivity. The use of top-down interactions, the process of influencing understanding phases using knowledge from higher levels, is crucial in several ways for natural speech understanding. Ambiguity that arises in certain components can be reduced based on evidence from other modules; the search spaces within a system can be modified, leading to an expectation-based understanding as is often used by people.
  • Parallelism. The efficiency of computers is steadily growing. However, computers never seem fast enough to cope with the extremely high demands of full speech and language processing. Parallelism (both on an intra-modular and an inter-modular level) can alleviate some of the deficiencies. Moreover, only a modular, incremental, parallel system can truly exploit interaction, and thus be a part of a Natural Speech Understanding system

Some of my work has been in the vicinity of some of these properties. My master's thesis dealt with parallel parsing on a transputer network (Available here).

My dissertation describes a fully incremental, parallel system for translating spontaneous speech, showing that incremental techniques are indeed ready to be employed in next-generation speech processing systems. An English version is published by Springer in the LNAI series: Jan W. Amtrup (1999). Incremental Speech Translation. Springer Verlag, Berlin, Heidelberg, New York. 1999. Number 1735 in Lecture Notes in Artificial Intelligence. Springer offers online access to the book at http://www.springerlink.com/content/f1m9jc9135jx/.

Persian Morphology and Machine Translation
At CRL, I worked in the Shiraz Project, which created a machine translation system from Persian to English. We used finite state morphology with feature structures, feeding into a typed unification-based syntactic parser. Over time, I developed the morphology into a two-level system using finite state transducers with typed feature structures as weights on the edges and proved that having feature structures does not come with a price in efficiency. A short description of the architecture is here.

Architecture of NLP systems
Starting with Verbmobil, and continuing through my time at CRL and Bowne Global, I was interested in the architecture of natural language processing systems, in particular of machine translation systems. This extended both to the makeup of components and their interaction and to the general software architecture.