Abstract
Discovering the interactions between genes and proteins is
seen as one of the core tasks in molecular biology. The
quantity of research results in this area is growing at such arate that it is very dicult for individual researchers to keep track of them. As such results appear mainly in the form of scientic articles, it is necessary to process them in an ecient manner in order to be able to extract the relevant results.
Many databases exist that aim at consolidating the newly
gained knowledge in a format that is easily accessible and
searchable, however the creators of such databases normally
make use of human readers who manually curate the rel-
evant papers. This is an expensive and time consuming
process, besides, there might be a signicant time lag be-
tween the publication of a result and its introduction into
such databases.
In this paper we propose a method for discovery of inter-
actions between genes and proteins from the scientic liter-
ature, based on a complete syntactic analysis of the corpus.
We report on preliminary results.