The Beginnings at Microsoft Research
However, IBM OfficeVision effort buckled under
the
company's stifling bureaucracy, and its collapse threatened
to end Heidorn, Jensen, and Richardson's NLP Project. In a bold move,
the trio jumped ship and moved the project to Microsoft's fledgling
research lab in 1991, becoming the first researchers to join Microsoft
Research (MSR) and the first to leave the prestigious halls of IBM
Watson for the relatively small software company, mainly known for
its DOS program.
To add insult to injury, this move took place at
time
when IBM and Microsoft were going through a very public
"divorce" over the OS/2 personal computer operating system.
Microsoft had decided to abandon the joint OS/2 development effort
in favor a Windows-only strategy. Microsoft's decision was widely
viewed within IBM as nothing less than base treachery, and losing
three top researchers to Microsoft during this period made matters
all the more infuriating.
Hence, Heidorn, Jensen, and Richardson's transition
to MSR was fraught with nasty episodes, such as IBM
locking them out of their offices. The rights to the group's intellectual
property were a real point of contention between the two companies.
Fortunately for the trio, they had placed most of the work on the
NLP Project in the public domain in various scientific journals and
publications. Nonetheless, the trio had to coauthor a book to document
this fact before IBM relinquished its claims to the concepts.
Once at MSR, the trio immediately began recruiting
theoretical and computational linguists, starting with
the team
|
|
members they had originally assembled
at IBM. Bill Dolan, Lucy Vanderwende, and Joseph Pentheroudakis, the
first three recruits, were deemed critical in moving the NLP Project
along - and they possessed sufficient computer skills to pass muster
with Microsoft's development managers. Dolan and Vanderwende had worked
with the technology at IBM, while Pentheroudakis had been at Executive
Communications Systems (ECS), a leading developer of natural language
software.
The NLPWin System
The NLP Group at MSR, growing steadily in number,
strove to make their NLP Project's conceptual modules
a reality in the NLPWin system. This systems conceptual framework
consists of the components shown in Figure 1, with each module in
this series designed to successively abstract the structure and meaning
of the words and sentences within the language.
The NLP Group's bottom up approach to natural
language processing contrasts sharply with the approach
typically taken by researchers in the Artificial Intelligence (AI)
field. AI scientists primarily focus on creating a machine that reasons
similar to a human and on reach this ability to thinking, assume it
is a relatively trivial task to generate a natural language dialogue
with humans. However, after forty plus years of AI research, this
top down approach has shown little success. The mapping between the
abstract concepts found in machine reasoning and the highly rich,
complex nature of natural language is much more difficult than first
imagined. |