
Formal board meeting minutes from 2010 through present. Please Note: The board typically approves minutes from one meeting during the next board meeting, so minutes will be published roughly one month later than the scheduled date. Other corporate records are published, as is an alternate categorized view of all board meeting minutes.


OpenNLP

15 Mar 2017 [Jörn Kottmann / Brett]

Report was filed, but display is awaiting the approval of the Board minutes.

21 Dec 2016 [Jörn Kottmann / Rich]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The development team stayed active over the last three months
and the activity in commits increased slightly while working
on the next release.

Community
---------------
The community stayed active with the usual amount of traffic on the user
mailing list and contributed a couple of patches to fix bugs and
to improve our code.

Rodrigo Agerri was added to the PMC on Jul 09 2015

Anastasija Mensikova was added as a committer on Jul 20 2016.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

21 Sep 2016 [Jörn Kottmann / Jim]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The development team stayed active over the last two months
and the activity in commits decreased slightly due to our Git
migration.

The OpenNLP GSOC 2016 project will be merged soon into opennlp-tools
and afterwards the process to draft the next release will be started.

Community
---------------
The community stayed active with the usual amount of traffic on the user
mailing list and contributed a couple of patches to fix bugs.

Rodrigo Agerri was added to the PMC on Jul 09 2015

Anastasija Mensikova was added as a committer on Jul 20 2016.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

20 Jul 2016 [Jörn Kottmann / Marvin]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The development team stayed active over the last three months
and the activity increased.

Anastasija Mensikova was accepted in GSOC 2016 and is working on
a new sentiment analysis component for OpenNLP. Her work will
soon be merged into opennlp-tools.

Community
---------------
The community stayed active with the usual amount of traffic on the user
mailing list and contributed a couple of patches to fix bugs.

Rodrigo Agerri was added to the PMC on Jul 09 2015

Chris Mattmann was added as a committer on Jul 07 2016.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

15 Jun 2016 [Jörn Kottmann / Marvin]

No report was submitted.

16 Mar 2016 [Jörn Kottmann / Sam]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The development team stayed active but with less activity. A language
model component was added and a few smaller bugs were fixed.
At the current pace we will probably finish the next minor
release before the next board report.

Community
---------------
The community stayed active with the usual amount of traffic on the user
mailing list and contributed a couple of patches to fix bugs.

Rodrigo Agerri was added to the PMC on Jul 09 2015

Mondher Bouazizi and Anthony Beylerian were added as committers on
Fri Sep 04 2015.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

16 Dec 2015 [Jörn Kottmann / Chris]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The development team stayed active over the last three months and
participated in smaller development efforts. A couple of bugs were
fixed. At the current pace we will probably finish the next minor release in
the first few months of 2016.

Community
---------------
The community stayed active with the usual amount of traffic on the user
mailing list.

Rodrigo Agerri was added to the PMC on Jul 09 2015

Mondher Bouazizi and Anthony Beylerian were added as committers on
Fri Sep 04 2015.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

16 Sep 2015 [Joern Kottmann / Brett]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The two GSOC students both passed and we will start working to incorporate
the WSD component they built into the core OpenNLP Tools package.

The development team stayed active over the last three months and
participated in smaller development efforts.

Community
---------------

We received a Naive Bayes classifier contribution from
Cohan Sujay Carlos and integrated it into OpenNLP.

The community stayed active with the usual amount of traffic on the user
mailing list.

Rodrigo Agerri was added to the PMC on Jul 09 2015

The two GSOC 2015 students were voted in as committers.
Mondher Bouazizi and Anthony Beylerian were added as committers on
Fri Sep 04 2015.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

17 Jun 2015 [Joern Kottmann / Brett]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The team is still working on the next release. All major issues
have been solved and the last release candidates fixed only
some details. A release vote for RC 6 is up and is expected
to pass within the next few days.

The testing process has been optimized and a few very time-consuming
manual tests are now automated.

The development team stayed active over the last three months and
participated in the testing and bug fixing effort.

Community
---------------

The community stayed active with the usual amount of traffic on the user
mailing list.

Two students were accepted and are now working on
a Word Sense Disambiguation component as part of GSOC 2015.

There are no new PMC members.

Mark Giaconia was added as a committer in October 2013 and has been
active since then.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

18 Mar 2015 [Joern Kottmann / Greg]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The team is still working on the next release.
In the last three months two release candidates were tested.
A couple of bugs were fixed after each of the two iterations. A third
release candidate will be prepared soon.

The development team stayed active over the last three months and
participated in the testing and bug fixing effort.

Community
---------------

The community stayed active with the usual amount of traffic on the user
mailing list.

OpenNLP received a summarization component as a contribution
from Ramakrishna Soma and incorporated it into the sandbox.

Two students approached the project and would like to develop a
Word Sense Disambiguation system as part of GSOC 2015.

There are no new PMC members.

Mark Giaconia was added as a committer in October 2013 and has been
active since then.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

17 Dec 2014 [Joern Kottmann / Chris]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The development stayed active over the last 3 months and the release is finally
being prepared. William Colen was elected as Release Manager and published the first
release candidate for testing.

Community
---------------

The community stayed active with the usual amount of traffic on the user mailing list.

There are no new PMC members.

Mark Giaconia was added as a committer in October 2013 and has been active since then.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

@Chris: Discuss leaving out individual names from reports

17 Sep 2014 [Joern Kottmann / Ross]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The development team remains active, but the activity decreased over
the summer months. The 1.6.0 release is still not out but will hopefully
be finished this year. Most open issues for it are solved.

Community
---------------
The community remains active and there is good traffic on the lists.

There are no new PMC members and there have been no PMC/PPMC additions
since the project moved to Apache.

Vinh Khuc (May), Tommaso Teofili (April) and Rodrigo Agerri (March) have
become committers in the first half of 2014.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

18 Jun 2014 [Joern Kottmann / Sam]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging, named
entity extraction, chunking, parsing, and coreference resolution. These tasks
are usually required to build more advanced text processing services.

Development
------------------
The development team remains active. In the last three months many
contributions from new committers were integrated into OpenNLP. The work for
the next release paused and will hopefully continue soon.

Community
---------------
The community remained stable during the last three months; a few patches were
contributed and three new committers were voted in.

There are no new PMC members.

Vinh Khuc (May), Tommaso Teofili (April) and Rodrigo Agerri (March) have
become committers since the last report.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

19 Mar 2014 [Joern Kottmann / Greg]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The team is currently working on the last issues for the 1.6.x release branch
and will soon begin testing. The next release will probably be out in a couple of
weeks.

Community
---------------
The community remained stable over the last three months; a few patches
were contributed and a new component to build language models was
added to the sandbox.

There are no new PMC members and there have been no PMC/PPMC additions
since the project moved to Apache.

Mark Giaconia was added as a committer in October 2013 and has been active since then.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

18 Dec 2013 [Joern Kottmann / Sam]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The team is currently working on the new features for the 1.6.x release branch
and will need a few more months for the next release.

Community
---------------
The community activity increased in the last three months; a few patches
were contributed and a new component was added to rapidly create training
data for the name finder.

There are no new PMC members.
Mark Giaconia was added as a committer in October and has been active since then.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

18 Sep 2013 [Joern Kottmann / Greg]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The team is currently working on the new features for the 1.6.x release branch
and it is expected to take quite a bit more time until the next release.

Community
---------------
The community activity decreased a bit over the summer months; a few patches
were committed for the recently contributed entity linker component.

No new committers have been voted in and there are no new PMC members.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

19 Jun 2013 [Joern Kottmann / Sam]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
After the last release of the 1.5.x branch the next release can contain bigger
changes and we are actively working on new features. Currently these are pluggable
machine learning support, refactoring of the machine learning code, and support
for the Brat (annotation tool) data format.

Community
---------------
The community remains active and there is good traffic on the lists.
The project received a contribution to resolve named entities to an
entry in a database, and adding a lemmatizer component was discussed.

No new committers have been voted in and there are no new PMC members.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

20 Mar 2013 [Joern Kottmann / Ross]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The team is still working on getting the next release, 1.5.3, out.
William Colen was elected as a release manager and already produced
the first RC. With William as our release manager we are finally able
to spread the knowledge of how to make a release further in the team.
It will take some time until all the manual tests are run and 1.5.3
can finally be released.

Community
---------------
The community remains active and there is good traffic on the lists.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

19 Dec 2012 [Joern Kottmann / Greg]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
We are now preparing the trunk for the upcoming release and have fixed almost
all outstanding issues. Additionally the opennlp-similarity component in
the sandbox is being prepared for its first release.

We expect to have increased development activity again after the 1.5.3
release is out.

Community
---------------
The community remains active and there is good traffic on the lists.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

19 Sep 2012 [Joern Kottmann / Ross]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The development activity remained slow but constant. A few smaller
fixes and improvements have been contributed by the community,
along with one bigger patch for L-BFGS maxent training support. There was
still no work done on the outstanding release and the next release
is at least two or three months away.

Community
---------------
The community remains active and there is good traffic on the lists.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

20 Jun 2012 [Joern Kottmann / Ross]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The development activity slowed down a bit compared to last month.
Bug fixes and smaller improvements are actively being worked on.
No work was done on the outstanding release and it will still
take two or three months until it is finished.

Community
---------------
The community remains active and there is good traffic on the lists.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

16 May 2012 [Joern Kottmann / Greg]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
Last month the development stayed active and we had commits
almost every day from different committers. Most of the changes
were related to bug fixes and smaller improvements.

A discussion about the first OpenNLP release as a TLP was started
on the dev list, but it will likely take 2 or 3 months until the
release is finished.

Community
---------------
The community remains active and there is good traffic on the user list.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

18 Apr 2012 [Joern Kottmann / Shane]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
All the graduation tasks are done and the team is now focused on the
development of OpenNLP again. There have been a few commits almost
every day from different committers.

Much work and time went into bug fixing and smaller improvements.

There is now integrated training support for the coreference component
which is very important for others to be able to work on the code.
The training is needed to ensure that code changes don't break anything.

Community
---------------
The community is active and the project received a few smaller patches.
We will likely soon receive a Clojure interface layer contribution
and will hopefully be able to integrate the Clojure-minded OpenNLP
users better into the community at Apache.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

21 Mar 2012 [Joern Kottmann / Jim]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The project graduated from the incubator last month, which distracted everyone
a bit from the normal development activity. The team is active and works on
bug fixes and features for our next release.

Community
---------------
The community is active and we have good traffic on the developer
and user mailing lists. Users frequently report issues which usually get fixed
quickly.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

15 Feb 2012

Establish the Apache OpenNLP Project

 WHEREAS, the Board of Directors deems it to be in the best interests
 of the Foundation and consistent with the Foundation's purpose to
 establish a Project Management Committee charged with the creation
 and maintenance of open-source software related to the processing of
 natural language text supported by machine learning for distribution
 at no charge to the public.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management
 Committee (PMC), to be known as the "Apache OpenNLP Project",
 be and hereby is established pursuant to Bylaws of the
 Foundation; and be it further

 RESOLVED, that the Apache OpenNLP Project be and hereby is
 responsible for the creation and maintenance of software
 related to the processing of natural language text
 supported by machine learning; and be it further

 RESOLVED, that the office of "Vice President, Apache OpenNLP" be
 and hereby is created, the person holding such office to
 serve at the direction of the Board of Directors as the chair
 of the Apache OpenNLP Project, and to have primary responsibility
 for management of the projects within the scope of
 responsibility of the Apache OpenNLP Project; and be it further

 RESOLVED, that the persons listed immediately below be and
 hereby are appointed to serve as the initial members of the
 Apache OpenNLP Project:

    * William Silva <colen@apache.org>
    * Thomas Morton <tsmorton@apache.org>
    * Jason Baldridge <jbaldrid@apache.org>
    * James Kosin <jkosin@apache.org>
    * Jörn Kottmann <joern@apache.org>
    * Aliaksandr Autayeu <autayeu@apache.org>
    * Boris Galitsky <bgalitsky@apache.org>
    * Grant Ingersoll <gsingers@apache.org>
    * Benson Margulies <bimargulies@apache.org>
    * Isabel Drost <isabel@apache.org>

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that Jörn Kottmann
 be appointed to the office of Vice President, Apache OpenNLP, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification,
 or until a successor is appointed; and be it further

 RESOLVED, that the initial Apache OpenNLP PMC be and hereby is
 tasked with the creation of a set of bylaws intended to
 encourage open development and increased participation in the
 Apache OpenNLP Project; and be it further

 RESOLVED, that the Apache OpenNLP Project be and hereby
 is tasked with the migration and rationalization of the Apache
 Incubator OpenNLP podling; and be it further

 RESOLVED, that all responsibilities pertaining to the Apache
 Incubator OpenNLP podling encumbered upon the Apache Incubator
 Project are hereafter discharged.

 Special Order 7B, Establish the Apache OpenNLP Project, was
 approved by Unanimous Vote of the directors present.

15 Feb 2012

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. Incubating since November 2010.

It supports the most common NLP tasks, such as tokenization, sentence
segmentation, part-of-speech tagging, named entity extraction, chunking,
parsing, and coreference resolution. These tasks are usually required to
build more advanced text processing services. OpenNLP also includes maximum
entropy and perceptron based machine learning.

The team was extended by two new committers, Boris Galitsky and Aliaksandr
Autayeu. We worked towards our graduation and had a positive community and
recommendation vote.

Our community became more active and we saw a couple of new faces
on the user and development mailing lists.

Signed off by mentor:

21 Dec 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

Our second release at Apache is now out for vote and will hopefully be
released soon. After some delay due to legal issues we have finally
accepted the syntactic generalization contribution from Boris Galitsky and
he sent in a couple of patches to improve it. The development
team is active and the development of the next release will now start.

A list of the most important issues to address in the move towards
graduation:
 * Establish open regression tests for the parser and coreference component
 * Identify and encourage new contributors on the path to committership

16 Nov 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

Our second release at Apache is now out for vote and will hopefully be
released soon. After some delay due to legal issues we have finally
accepted the syntactic generalization contribution from Boris Galitsky and
he sent in a couple of patches to improve it. The development
team is active and the development of the next release will now start.

A list of the most important issues to address in the move towards
graduation:
 * Establish open regression tests for the parser and coreference component
 * Identify and encourage new contributors on the path to committership

17 Aug 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

We have almost finished the development for our next release (1.5.2), our second
release in the Incubator, and will start testing it soon.
The development team is still active and most of the changes have been done
by four independent committers.
This week Boris Galitsky proposed on the mailing list to contribute a
component for syntactic generalization which we will hopefully be able to
accept.
Furthermore we started to work on tooling for an annotation project in our
sandbox and hope to be able to attract new contributors through this effort.

A list of the most important issues to address in the move towards
graduation:
* Establish open regression tests for the parser and coreference component
* Identify and encourage new contributors on the path to committership

20 Jul 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

A list of the most important issues to address in the move towards
graduation:
* Establish open regression tests for the parser and coreference component
* Identify and encourage new contributors on the path to committership

15 Jun 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

We started to integrate a couple of new features and will soon start testing
for our second release at Apache.
To solve our training data problem, we created a proposal for an annotation
project that suggests labeling AL 2.0 licensable text documents with
semantic annotations.

There is still good traffic on both the user and dev mailing lists.

A list of the most important issues to address in the move towards
graduation:
* Resolve potential IP issues around releasing training models
* Establish open regression tests for the parser and coreference component
* Identify and encourage new contributors on the path to committership

19 May 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

We made our first Apache release on the 2nd of May and are now planning
which features and work will go into the next release. For instance, we
started to discuss and plan a big refactoring of our machine learning code,
custom feature generation for the name finder, dictionary support for the
name finder, etc.

There is still good traffic on both the user and dev mailing lists. A couple
of users asked questions about how OpenNLP can be trained for new languages.

A list of the most important issues to address in the move towards
graduation:
 * Resolve potential IP issues around releasing training models
 * Establish open regression tests for the parser and coreference component

20 Apr 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

We are still working on our first release. All the testing planned in our
test plan is now done. We will prepare Release Candidate 6 in the next few
days and start the vote to release it.

A list of the three most important issues to address in the move towards
graduation:

* Resolve potential IP issues around releasing training models
* Do a release
* Establish open regression tests

16 Mar 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

We are currently working on getting our first release out. We have already
created two release candidates and are busy with testing and bug fixing. The
release will hopefully be out in late March or early April. For the testing
we are now using all supported public data sets and distributing the task
among the committers, which brings us closer to our third goal of having
open regression tests.
There is still good activity on our user and dev mailing lists.

A list of the three most important issues to address in the move towards
graduation:

 * Resolve potential IP issues around releasing training models
 * Do a release
 * Establish open regression tests

16 Feb 2011

OpenNLP is a machine learning based toolkit for the processing of
natural language text. It supports the most common NLP tasks, such as
tokenization, sentence segmentation, part-of-speech tagging, named
entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text
processing services.

OpenNLP entered incubation on 11/23 2010.

Since we last reported in January, we continued to fill up Jira with
more issues, rewrote our Maven-based build to comply with general
Apache rules and to be ready to create our first release, fixed a few
minor bugs and refactored parts of the chunker, migrated the
SourceForge wiki documentation into a DocBook for inclusion in future
releases, and added build instructions to the website.
We decided to focus on our first release, which will hopefully be out
in March. The release will contain only OpenNLP, without any
statistical models, to avoid any legal issues which might delay the
release.
Regression testing will mostly be done on private data.
There has been daily activity on the dev mailing list and a little less
on the user mailing list.

A list of the three most important issues to address in the move
towards graduation:

 * Resolve potential IP issues around releasing training models
 * Do a release
 * Establish open regression tests

19 Jan 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

Since we last reported in December, we have completed the code import.
There has been a lot of activity on the dev list, and we're filling up Jira
with issues to resolve before we can release.  Our users also seem to have
found their way from SourceForge to Apache, as there is also activity on the
user list.

A list of the three most important issues to address in the move towards
graduation:

 * Resolve potential IP issues
 * Do a release
 * Establish open regression tests

15 Dec 2010

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.  Progress since then:

 * SVN and Jira set up complete
 * All committers now have ICLAs on file
 * Accounts have been created
 * Status page created
 * Initial web site created based on ASF CRM

A list of the three most important issues to address in the move towards
graduation:

 * Do the code import
 * Resolve potential IP issues
 * Do a release