
This was extracted (@ 2024-04-17 21:10) from a list of minutes which have been approved by the Board.
Please note: the Board typically approves the minutes of the previous meeting at the beginning of every Board meeting; therefore, the list below does not normally contain details from the minutes of the most recent Board meeting.

WARNING: these pages may omit some original contents of the minutes.
This is due to changes in the layout of the source minutes over the years. Fixes are being worked on.

Meeting times vary; the exact schedule is available to ASF Members and Officers. Search for "calendar" in the Foundation's private index page (svn:foundation/private-index.html).

OpenNLP

20 Mar 2024 [Jeff Zemerick / Rich]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Project Status:
Current project status: Ongoing
Issues for the board: None

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (12 years ago)
There are currently 25 committers and 17 PMC members in this project.
The Committer-to-PMC ratio is roughly 7:5.

Community changes, past quarter:
- No new PMC members. Last addition was Martin Wiesner on 2023-06-24.
- No new committers. Last addition was Atita Arora on 2023-02-28.

## Project Activity:
Apache OpenNLP 2.3.2 was a maintenance release with minor improvements and was
released on 2024-02-04. A pull request to Apache Lucene to upgrade its OpenNLP
dependency to 2.3.2 was merged last month. Lucene was previously using OpenNLP
1.9.4 so it was a bit behind. Now that Lucene is using the newest version,
there is work being done in Apache Solr to use the new features. This may help
make the community more active.

## Community Health:
The community activity was slower this period compared to previous periods,
but the community remains healthy.

20 Dec 2023 [Jeff Zemerick / Justin]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Project Status:
Current project status: Ongoing
Issues for the board: None

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (12 years ago)
There are currently 25 committers and 17 PMC members in this project.
The Committer-to-PMC ratio is roughly 7:5.

Community changes, past quarter:
- No new PMC members. Last addition was Martin Wiesner on 2023-06-24.
- No new committers. Last addition was Atita Arora on 2023-02-28.

## Project Activity:
The project released version 2.3.1 on November 22, 2023. This was largely a
maintenance release that also supports an effort in Apache Solr. OpenNLP was
represented at Community over Code.

## Community Health:
The project has had contributions from a few new contributors this quarter
which is very exciting. The community is healthy. We are very happy to have
had Martin Wiesner be the release manager for the first time for the 2.3.1
release.

20 Sep 2023 [Jeff Zemerick / Craig]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Project Status:
Current project status: Ongoing
Issues for the board: None

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (12 years ago)
There are currently 25 committers and 17 PMC members in this project.
The Committer-to-PMC ratio is roughly 7:5.

Community changes, past quarter:
- Martin Wiesner was added to the PMC on 2023-06-24
- Richard Zowalla was added to the PMC on 2023-06-24
- No new committers. Last addition was Atita Arora on 2023-02-28.

## Project Activity:
OpenNLP 2.3.0 was released on July 31, 2023. We have been able to release more
often this year, with releases happening about every 3 months. The project now
has integration tests running on the ASF Jenkins and that has helped with the
release process. A lot of work has been done by the community to address tech
debt and make a lot of usability improvements.

## Community Health:
Mailing list traffic remains low but the project is healthy. The project added
two new PMC members since the last report.

21 Jun 2023 [Jeff Zemerick / Shane]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Project Status:
Current project status: Ongoing
Issues for the board: There is a low-priority email thread from the
2023-03-22 board meeting regarding how to release NLP models. This email
thread is pending a response from the OpenNLP PMC, but if the board has any
additional thoughts, please also reply.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (11 years ago)
There are currently 25 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is 5:3.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Atita Arora on 2023-02-28.

## Project Activity:
The project is healthy. OpenNLP 2.2.0 was released on April 22, 2023. The
project has been trying to have more frequent releases.

## Community Health:
The community remains healthy even with a decline in activity on the mailing
lists since the last board report.

22 Mar 2023 [Jeff Zemerick / Shane]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
The project would like guidance from the board on how to release NLP models,
specifically around the voting process. Are there any other Apache projects
releasing trained model artifacts?

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (11 years ago)
There are currently 25 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is 5:3.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- Atita Arora was added as committer on 2023-02-28
- Martin Wiesner was added as committer on 2022-12-08

## Project Activity:
- OpenNLP 2.1.1 was released on February 23, 2023.

## Community Health:
The community has been more active than at this time last year. The project
has added 3 new committers in the past few months and development activity
in terms of Jira issues and pull requests has increased. The OpenNLP
community is healthy.

@Shane: follow up with OpenNLP around issue raised in report

21 Dec 2022 [Jeff Zemerick / Bertrand]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (11 years ago)
There are currently 23 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- Richard Zowalla was added as committer on 2022-11-28
- Martin Wiesner was added as committer on 2022-12-08

## Project Activity:
OpenNLP 2.1.0 was released on November 23, 2022. Being able to offer more
frequent releases is a goal if community involvement keeps increasing.

## Community Health:
The project had an increase in activity this period largely thanks to new
contributors. The best metric is 48 issues closed in JIRA, past quarter (700%
increase). The project saw many more commits and merged pull requests. We were
able to add two new committers, the first in about 2 years. The project was
able to address some older pull requests and get them resolved, either through
merging or closing as not needed anymore.

21 Sep 2022 [Jeff Zemerick / Sander]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (11 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Tim Allison on 2020-01-28.

## Project Activity:
We released version 2.0.0. This version introduces support for document
classification and named-entity recognition deep learning models via ONNX
Runtime along with various bug fixes and improvements. These details and
future plans will be presented in a talk at ApacheCon in October. A minor
release is expected in the coming quarter.

## Community Health:
The community remains healthy even though it is slow. Developer activity was
slightly higher in the past quarter in terms of pull requests and mailing list
traffic. Project outreach through activities like a presentation at ApacheCon
next month should help to grow the community and hopefully lead to additional
committers.

15 Jun 2022 [Jeff Zemerick / Bertrand]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (10 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Tim Allison on 2020-01-28.

## Project Activity:
The project recently completed a successful release vote for a 2.0 release.
This will be the project's second major release in its 10+ year history.
Version 2.0 brings new features around ONNX models while maintaining
compatibility with 1.x models.

## Community Health:
The 2.0 release preparation brought increases in mailing list
traffic along with JIRA and GitHub activity. The community activity remains
low but healthy. The project is hoping that the 2.0 release will help attract
new contributors.

16 Mar 2022 [Jeff Zemerick / Bertrand]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (10 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Tim Allison on 2020-01-28.

## Project Activity:
We are working toward a version 2.0 release. A pull request to add support for
ONNX NLP models was merged and once that functionality is documented the team
can consider a 2.0 release. If this takes too long we will consider a 1.9.5
release.

## Community Health:
The community activity remains low but healthy. Activity on the mailing lists
remains fairly constant with previous periods. The hope is that with the
addition of support for newer NLP models the community activity will increase.

19 Jan 2022 [Jeff Zemerick / Roman]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (10 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Tim Allison on 2020-01-28.

## Project Activity:
OpenNLP had a 1.9.4 release in November. This was a minor release that
addressed mostly code refactoring and minor improvements. There are a few
other pull requests that can be merged soon for another minor release. Work
on integrating OpenNLP with newer NLP architectures has progressed, and we are
targeting it for a 2.0 release.

## Community Health:
Project activity remains relatively low but healthy. There have been recent
pull requests from new contributors and activity on the mailing lists remains
low but consistent.

15 Dec 2021 [Jeff Zemerick / Sander]

No report was submitted.

20 Oct 2021 [Jeff Zemerick / Sharan]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (10 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Tim Allison on 2020-01-28.

## Project Activity:
Our last release was the pre-trained models in May 2021. There have been
several minor issues resolved that may result in a 1.9.4 release. Work
continues toward a 2.0 release to integrate deep learning capabilities into
OpenNLP.

## Community Health:
The project activity remains slow but the community seems healthy. We had a
few more JIRA issues opened and closed this quarter than in the previous
quarter. If we can reach the goal of expanding OpenNLP to deep learning the
community activity may pick up.

15 Sep 2021 [Jeff Zemerick / Roy]

No report was submitted.

16 Jun 2021 [Jeffrey T. Zemerick / Bertrand]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (9 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Tim Allison on 2020-01-28.

## Project Activity:
The project successfully voted on and released its first set of models
(sentence, part-of-speech, and token), available under the Apache 2.0 license.
Work is under way to provide users an automated means of using these models
with the goal of reducing the steps required to use OpenNLP.

## Community Health:
Community activity this quarter was low. The work that was done to train the
models is not reflected in the GitHub activity stats. Code activity and pull
requests will likely increase as work to integrate the newly trained models is
started.

17 Mar 2021 [Jeffrey T. Zemerick / Sam]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (9 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Tim Allison on 2020-01-28.

## Project Activity:
The last release was version 1.9.3 on July 31, 2020. The project will likely
see a 1.9.4 release in the first half of 2021. Version 1.9.4 will largely be a
maintenance release with code refactor improvements. A vote to release
pre-trained OpenNLP models will be started soon and will likely coincide with
the next release. The pre-trained models aim to lower the entry curve for
usage.

## Community Health:
The project saw a small increase in community activity having more pull
requests, issues closed, and commits. Mailing list traffic remains low. The
project is not facing any issues and otherwise remains healthy.

16 Dec 2020 [Jeffrey T. Zemerick / Patricia]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text.

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (9 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Tim Allison on 2020-01-28.

## Project Activity:
Several refactoring tasks have been completed and we are working toward
releasing Apache license-compatible OpenNLP models. The goal is to lower the
barrier to entry for new users of OpenNLP. We will likely target January 2021
for the next release.

Recent releases:
1.9.3 was released on 2020-07-31.
1.9.2 was released on 2019-12-30.
1.9.1 was released on 2018-12-31.

## Community Health:
The community has been more active this quarter. The project had several pull
requests open and closed this quarter along with an increase in JIRA issue
creation and resolution. Some of these pull requests and issues were by new
contributors. Overall the community remains healthy.

16 Sep 2020 [Jeffrey T. Zemerick / Justin]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (9 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Tim Allison on 2020-01-28.

## Project Activity:
Version 1.9.3 was released in July. This version resolved some important
discrepancies that were hindering development across JDK versions. This
release makes OpenNLP's results consistent across JDKs 11/8 and AMD/Intel
architectures. We aim for another release before year end.

- 1.9.3 was released on 2020-07-31.
- 1.9.2 was released on 2019-12-30.
- 1.9.1 was released on 2018-12-31.

## Community Health:
We had a small uptick in the dev mailing list traffic likely due to the
release vote thread. Overall, while the community activity remains low in
terms of contributions and mailing list activity, the community is healthy as
observed during the 1.9.3 release.

17 Jun 2020 [Jeffrey T. Zemerick / Patricia]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
machine learning-based toolkit for the processing of natural language text.

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (8 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- No new committers. Last addition was Tim Allison on 2020-01-28.

## Project Activity:

We are working toward a 1.9.3 release. An issue affecting test consistency
across Java 8 and Java 11 was discovered that we are working through. We would
like to get this issue resolved prior to a 1.9.3 release.

Recent releases:
1.9.2 was released on 2019-12-30.
1.9.1 was released on 2018-12-31.
1.9.0 was released on 2018-07-02.

## Community Health:
We currently have 11 JIRA issues resolved for a 1.9.3 release with 16 pending
pull requests. We have made progress resolving pull requests but we still have
some outstanding pull requests to resolve. We will likely be working to merge
or close the backlog of pull requests at a higher priority than adding new
features for the near future. (We closed 7 in the last quarter while only 3
new ones were opened.)

18 Mar 2020 [Jeffrey T. Zemerick / Shane]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (8 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- Tim Allison was added as committer on 2020-01-28

## Project Activity:
The project has 7 JIRA issues closed and marked as fix for the next release
(1.9.3) and is in the planning process for a 1.9.3 release with the goal of
having the release done by the end of March 2020.

## Community Health:
Commit activity and developer mailing list activity increased, likely due
to work on pending pull requests. We were able to successfully merge
and close many pull requests and are continuing to work on the remaining ones.
Faster resolution of pull requests may encourage more contributions from the
community.

19 Feb 2020 [Jeffrey T. Zemerick / Dave]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache OpenNLP was founded 2012-02-14 (8 years ago)
There are currently 22 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is roughly 3:2.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-09.
- Tim Allison was added as committer on 2020-01-28

## Project Activity:
Recent releases:
- 1.9.2 was released on 2019-12-30.

New features recently merged include improvements to the language detector,
documentation, and general clean up of code. As chair I would like to see
OpenNLP have more frequent releases and we will work toward that.

## Community Health:
The OpenNLP community remains healthy even though it is not extremely active.
The project added a new committer, Tim Allison, who provided valuable
contributions to OpenNLP last year. We have quite a few pending pull requests
that we are working on merging or asking the authors to update. These short
conversations will likely mostly take place directly on the GitHub pull
requests and will not be reflected in the OpenNLP mailing list statistics.

15 Jan 2020

Change the Apache OpenNLP Project Chair

 WHEREAS, the Board of Directors heretofore appointed Jörn Kottmann
 (joern) to the office of Vice President, Apache OpenNLP, and

 WHEREAS, the Board of Directors is in receipt of the resignation of
 Jörn Kottmann from the office of Vice President, Apache OpenNLP, and

 WHEREAS, the Project Management Committee of the Apache OpenNLP project
 has chosen by vote to recommend Jeffrey T. Zemerick (jzemerick) as the
 successor to the post;

 NOW, THEREFORE, BE IT RESOLVED, that Jörn Kottmann is relieved and
 discharged from the duties and responsibilities of the office of Vice
 President, Apache OpenNLP, and

 BE IT FURTHER RESOLVED, that Jeffrey T. Zemerick be and hereby is
 appointed to the office of Vice President, Apache OpenNLP, to serve in
 accordance with and subject to the direction of the Board of Directors
 and the Bylaws of the Foundation until death, resignation, retirement,
 removal or disqualification, or until a successor is appointed.

 Special Order 7A, Change the Apache OpenNLP Project Chair, was
 approved by Unanimous Vote of the directors present.

15 Jan 2020 [Jörn Kottmann / Shane]

No report was submitted.

@Matt: ensure new PMC Chair knows to report next month

18 Dec 2019 [Jörn Kottmann / Daniel]

## Description:
The mission of OpenNLP is the creation and maintenance of software related to
Machine learning based toolkit for the processing of natural language text.

## Issues:
 No issues to report.

## Membership Data:
Apache OpenNLP was founded 2012-02-15 (8 years ago)
There are currently 21 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is 7:5.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-10.
- No new committers. Last addition was Jeffrey  T. Zemerick on 2017-04-26.

## Project Activity:
The project will prepare a release by the end of the year to include
the few pull requests we received from our community.

## Community Health:
We see the usual activity on the dev and user list.

@Daniel: pursue a better report for next month

16 Oct 2019 [Jörn Kottmann / Craig]

## Description:
Apache OpenNLP is a machine learning based toolkit for the
processing of natural language text.

## Issues:
There are no board level issues

## Membership Data:
Apache OpenNLP was founded 2012-02-15 (8 years ago)
There are currently 21 committers and 15 PMC members in this project.
The Committer-to-PMC ratio is 7:5.

Community changes, past quarter:
- No new PMC members. Last addition was Koji Sekiguchi on 2017-10-10.
- No new committers. Last addition was Jeffrey  T. Zemerick on 2017-04-26.

## Project Activity:
- 1.9.1 was released on 2018-12-31.

There are plans to make one release this year to fix some issues and include
PRs sent by the community.


## Community Health:
4 commits in the past quarter (33% increase)
2 code contributors in the past quarter (100% increase)
13 PRs opened on GitHub, past quarter (18% decrease)
8 PRs closed on GitHub, past quarter (166% increase)

18 Sep 2019 [Jörn Kottmann / Craig]

No report was submitted.

@Craig: pursue a report for OpenNLP

19 Jun 2019 [Jörn Kottmann / Daniel]

## Description:
 - Apache OpenNLP is a machine learning based toolkit for the  processing of
   natural language text.

## Issues:
 - None

## Activity:
 - Working with Apache Tika project to leverage Apache OpenNLP langDetect
   functionality in Tika.
 - Team is presently working on OpenNLP-DeepLearning - bringing Deep Learning
   capabilities to OpenNLP
 - User/dev activity is not very high but steady

## Health report:
 - Project has a decent user community.  Activity has been low in the last
   quarter.

## PMC changes:

 - Currently 15 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Koji Sekiguchi on Tue Oct 10 2017

## Committer base changes:

 - Currently 21 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Jeffrey T. Zemerick at Wed Apr 26 2017

## Releases:

 - Last release was 1.9.1 on Mon Dec 31 2018

## JIRA activity:

 - 14 JIRA tickets created in the last 3 months
 - 3 JIRA tickets closed/resolved in the last 3 months

20 Mar 2019 [Jörn Kottmann / Isabel]

## Description:
 -  Apache OpenNLP is a machine learning based toolkit for the processing of
    natural language text.

## Issues:
 -  No issues to report.

## Activity:
 - Suneel Marthi and Jörn Kottmann gave a talk at FOSDEM 2019 that used
   OpenNLP, "Streaming Pipelines for Neural Machine Translation"
 - Suneel Marthi and Jörn Kottmann gave a talk at Big Data Warsaw 2019 that
   used OpenNLP, "Streaming topic model training and inference with Apache
   Flink"

## Health report:
 - The project has an active committer base and there’s healthy activity on
   mailing lists.
 - The PMC is not tracking any prospects right now, all active committers are
   already in the PMC.

## PMC changes:
 - Currently 15 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Koji Sekiguchi on Tue Oct 10 2017

## Committer base changes:
 - Currently 21 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Jeffrey  T. Zemerick at Wed Apr 26 2017

## Releases:
 - 1.9.1 was released on Mon Dec 31 2018

## Mailing list activity:
 - We received the usual amount of traffic on the dev and users lists.

 - users@opennlp.apache.org:
    - 455 subscribers (up 7 in the last 3 months):
    - 11 emails sent to list (24 in previous quarter)

 - dev@opennlp.apache.org:
    - 239 subscribers (up 2 in the last 3 months):
    - 14 emails sent to list (8 in previous quarter)

 - issues@opennlp.apache.org:
    - 52 subscribers (up 1 in the last 3 months):
    - 62 emails sent to list (82 in previous quarter)


## JIRA activity:
 - 11 JIRA tickets created in the last 3 months
 - 5 JIRA tickets closed/resolved in the last 3 months

19 Dec 2018 [Jörn Kottmann / Ted]

## Description:
 - Apache OpenNLP is a machine learning based toolkit for the processing of
   natural language text.

## Issues:
 - There are no issues

## Activity:
 - Activity on the 1.x branch slowed down a bit and more time was spent
   developing the future version 2 of OpenNLP based on Deep Learning concepts.
   There are now proof-of-concepts for three NLP components.
 - Suneel Marthi and Joern Kottmann will present at FOSDEM 2019 and BigData
   Warsaw 2019 about Streaming pipelines for Neural Machine Translation
   leveraging Apache OpenNLP for Language Detection, Tokenization, and Sentence
   Detection from an Apache Flink streaming pipeline.

## Health report:
 - The project has a very active committer base and there’s healthy activity
   on mailing lists.

## PMC changes:

 - Currently 15 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Koji Sekiguchi on Tue Oct 10 2017

## Committer base changes:

 - Currently 21 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Jeffrey  T. Zemerick at Wed Apr 26 2017

## Releases:

 - Last release was 1.9.0 on Mon Jul 02 2018

## Mailing list activity:

 - The dev and user lists have the usual amount of activity.

 - users@opennlp.apache.org:
    - 448 subscribers (down 2 in the last 3 months):
    - 23 emails sent to list (16 in previous quarter)

 - dev@opennlp.apache.org:
    - 237 subscribers (up 1 in the last 3 months):
    - 7 emails sent to list (25 in previous quarter)

 - issues@opennlp.apache.org:
    - 51 subscribers (up 1 in the last 3 months):
    - 82 emails sent to list (123 in previous quarter)


## JIRA activity:

 - 8 JIRA tickets created in the last 3 months
 - 6 JIRA tickets closed/resolved in the last 3 months

19 Sep 2018 [Jörn Kottmann / Shane]

## Description:
 - Apache OpenNLP is a machine learning based toolkit for the
processing of natural language text.

## Issues:
 - No issues to report.

## Activity:
- Suneel Marthi and Joey Frazee presented at FlinkForward, Berlin
on 5th September 2018 about "Streaming topic model training and
inference with Apache Flink" leveraging Apache OpenNLP with Apache Flink.
- OpenNLP is used by https://github.com/tteofili/jtm for jira issue tracking
- Various bug fixes and refactorings
- The team is presently working on the next OpenNLP release

## Health report:
 - The project has a very active committer base and there’s healthy
activity on mailing lists.

## PMC changes:

 - Currently 15 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Koji Sekiguchi on Tue Oct 10 2017

## Committer base changes:

 - Currently 21 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Jeffrey  T. Zemerick at Wed Apr 26 2017

## Releases:

 - 1.9.0 was released on Mon Jul 02 2018

## JIRA activity:

 - 16 JIRA tickets created in the last 3 months
 - 17 JIRA tickets closed/resolved in the last 3 months

20 Jun 2018 [Jörn Kottmann / Brett]

## Description:
 - Apache OpenNLP is a machine learning based toolkit for the processing of
   natural language text.

## Issues:
 - No issues to report.

## Activity:
 - Team working towards next 1.8.5 release and OpenNLP 2.0
 - Work on a proof-of-concept Deep Learning NER component started and is
   currently being evaluated
 - Suneel Marthi presented at Dataworks Summit, Berlin on April 19 2018 about
   streaming pipelines for Neural Machine Translation, leveraging Apache
   OpenNLP for Sentence Detection, Tokenization, and Language Detection from
   an Apache Flink streaming pipeline.



## Health report:
 - The project has a very active committer base and there’s healthy activity
   on mailing lists.

## PMC changes:

 - Currently 15 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Koji Sekiguchi on Tue Oct 10 2017

## Committer base changes:

 - Currently 21 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Jeffrey T. Zemerick on Wed Apr 26 2017

## Releases:

 - Last release was 1.8.4 on Mon Dec 25 2017

## JIRA activity:

 - 14 JIRA tickets created in the last 3 months
 - 19 JIRA tickets closed/resolved in the last 3 months

21 Mar 2018 [Jörn Kottmann / Chris]

## Description:
 - Apache OpenNLP library is a machine learning based toolkit for the
   processing of natural language text.

## Issues:
 - There are no issues requiring board attention at this time

## Activity:
 - Peter Thygesen and Jörn Kottmann presented ‘Deriving Actionable Insights
   from High Volume Media Streams’ at Big Data Tech Warsaw, Warsaw, Poland on
   Feb 22, 2018.
 - Apache OpenNLP 1.8.4 was released on Dec 24, 2017 with Jeff Zemerick as the
   Release Manager.
 - Jeff Zemerick published a 2017 OpenNLP retrospective here -
   https://blogs.apache.org/opennlp/entry/apache-opennlp-2017-year-in
 - The project has sustained activity levels and a strong user community.
 - Work in progress on Deep Learning OpenNLP leveraging TensorFlow.

## Health report:
 - The project has sustained activity levels and a strong user community.

## PMC changes:
 - Currently 15 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Koji Sekiguchi on Mon Oct 09 2017

## Committer base changes:
 - Currently 21 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Jeff Zemerick on Wed Apr 26 2017

## Releases:
 - 1.8.4 was released on Sun Dec 24 2017

## JIRA activity:
 - 23 JIRA tickets created in the last 3 months
 - 21 JIRA tickets closed/resolved in the last 3 months

20 Dec 2017 [Jörn Kottmann / Rich]

## Description:
- Apache OpenNLP is a machine learning based toolkit for the processing of
 natural language text.


## Issues:
-  No issues to report.


## Activity:
- Joern Kottmann and Peter Thygesen presented OpenNLP for processing large
 volume realtime streams at Big Data Spain, Madrid on Nov 15 2017[1].

- Suneel Marthi presented OpenNLP for processing large volume streams at Big
 Data Ignite, Grand Rapids, Michigan on Sep 29 2017

- Jorn Kottmann and Peter Thygesen will be presenting the Madrid talk again at
 Big Data Technology Conference, Warsaw Poland on Feb 22 2018

- Apache OpenNLP 1.8.3 was released on Oct 26 2017

- A new module for Language Detection was released as langdetect-1.8.3 on Nov
 1 2017

- Apache OpenNLP has been integrated into Apache Lucene

- The team is presently working on the next OpenNLP release

- The project has been seeing plenty of traction and the committers have been
 invited to present at upcoming Big Data conferences like DataWorks Summit
 Berlin, Big Data Tech Warsaw.


## Health report:
- The project has a very active committer base and there’s healthy activity on
 mailing lists.


## PMC changes:

- Currently 15 PMC members.
- Koji Sekiguchi was added to the PMC on Mon Oct 09 2017

## Committer base changes:

- Currently 21 committers.
- No new committers added in the last 3 months

## Releases:

- 1.8.3 was released on Thu Oct 26 2017
- langdetect-1.8.3 was released on Wed Nov 01 2017

## JIRA activity:
- 36 JIRA tickets created in the last 3 months
- 28 JIRA tickets closed/resolved in the last 3 months

20 Sep 2017 [Jörn Kottmann / Brett]

## Description:
- The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text.

## Issues:
- Need help from infra to set up a repository to store large training data
files

## Activity:
 1. Project had a minor 1.8.2 release this month.
 2. The team is presently working on the 1.8.3 release cycle and on
    releasing pre-trained statistical models
 3. A model for the new language detector component will be released soon.
 4. Daniel Russ presented “It Takes a Village to Solve a Problem in Data
    Science” at Data Science Maryland Meetup, June 19 2017, Laurel Maryland -
    https://www.slideshare.net/DataScienceMD/it-takes-a-village-to-solve-a-problem-in-data-science
 5. Suneel Marthi will be presenting “Deriving Actionable Insights from High
    Volume Media Streams” at Machine Learning Conference, San Francisco on Nov 10 2017 -
    http://mlconf.com/mlconf-2017-san-francisco/#Suneel
 6. Suneel Marthi and Jorn Kottmann will be presenting “Deriving Actionable
    Insights from High Volume Media Streams” at Big Data Spain, Madrid in Nov 2017.
 7. Apache OpenNLP was cited in the paper “An Automatic Approach for Discovering
    and Geocoding Locations in Domain-Specific Web Data” by Chris Mattmann, Madhav
    Sharan - https://memex.jpl.nasa.gov/IRI16-Gazetteer.pdf

## Health report:
- Project has healthy activity levels and dedicated committers.

## PMC changes:

- Currently 14 PMC members.
- New PMC members:
   - Jeffrey T. Zemerick was added to the PMC on Wed Jul 26 2017
   - Bruno P. Kinoshita was added to the PMC on Wed Jul 05 2017
   - Peter Thygesen was added to the PMC on Mon Aug 07 2017

## Committer base changes:

- Currently 21 committers.
- No new committers added in the last 3 months
- Last committer addition was Jeffrey T. Zemerick on Wed Apr 26 2017

## Releases:

- 1.8.1 was released on Sat Jul 08 2017
- 1.8.2 was released on Fri Sep 15 2017

## Mailing list activity:

- users@opennlp.apache.org:
   - 441 subscribers (up 10 in the last 3 months):
   - 75 emails sent to list (38 in previous quarter)

- dev@opennlp.apache.org:
   - 225 subscribers (up 3 in the last 3 months):
   - 331 emails sent to list (364 in previous quarter)

- issues@opennlp.apache.org:
   - 48 subscribers (up 1 in the last 3 months):
   - 340 emails sent to list (552 in previous quarter)


## JIRA activity:

- 37 JIRA tickets created in the last 3 months
- 32 JIRA tickets closed/resolved in the last 3 months

21 Jun 2017 [Jörn Kottmann / Chris]

## Description:
- The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text.

## Issues:
- Need guidance from Infra for performing daily updates to the website
via Jenkins CI

## Activity:
1. Project had a major 1.8.0 release in May 2017.
2. The team is presently working on the 1.8.1 release planned for June 2017.
3. The project now has a new logo - courtesy of Bruno Kinoshita with
contributions by Koji Sekiguchi.
4. The project website was completely redone with JBake by
Bruno Kinoshita and William Colen.
5. An Irish Sentence Detector component was contributed by Jim Regan.
6. New Language Detector Module from William Colen will be part of
the next release.

### Conferences/Public Speaking:

Tommaso Teofili and Suneel Marthi presented a talk using Apache OpenNLP
Language detector titled ‘Embracing Diversity: Searching over Multiple
Languages’ at Berlin Buzzwords, June 12, 2017, Berlin, Germany

Suneel Marthi presented a talk ‘Large Scale Analysis of
Structured Text’ using Apache OpenNLP and Apache Flink at
Dataworks Summit, San Jose on June 15, 2017.

Daniel Russ will be presenting some of his work using Apache OpenNLP
at Data Science Maryland meetup on June 19, 2017 -
https://www.meetup.com/Data-Science-MD/events/240470935/?gj=co2&rv=co2

3 of the PMC members - Jorn Kottmann, Tommaso Teofili, Suneel Marthi were
at Berlin Buzzwords 2017, Berlin, Germany.

## Health report:
- Project has healthy activity levels and dedicated committers.
Four new committers were added in the past quarter and one
committer was promoted to PMC.

## PMC changes:

- Currently 11 PMC members.
- Daniel Russ was added to the PMC on Tue Apr 18 2017

## Committer base changes:

- Currently 21 committers.
- New committers:
- Jeffrey T. Zemerick was added as a committer on Wed Apr 26 2017
- Bruno P. Kinoshita was added as a committer on Fri Apr 21 2017
- Koji Sekiguchi was added as a committer on Wed Apr 12 2017
- Peter Thygesen was added as a committer on Sat Mar 25 2017

## Releases:

- 1.8.0 was released on Wed May 17 2017

## Mailing list activity:

- users@opennlp.apache.org:
- 431 subscribers (up 10 in the last 3 months):
- 37 emails sent to list (123 in previous quarter)

- dev@opennlp.apache.org:
- 223 subscribers (up 1 in the last 3 months):
- 391 emails sent to list (466 in previous quarter)


## JIRA activity:

 - 84 JIRA tickets created in the last 3 months
 - 80 JIRA tickets closed/resolved in the last 3 months

15 Mar 2017 [Jörn Kottmann / Brett]

Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text.

## Issues:
 - There are no issues requiring board attention at this time

## Activity:
 -  There’s been a big surge in project activity since Dec 2016
 -  Project had a major 1.7.0 release on Dec 30 2016
 -  There have been 2 minor releases - 1.7.1 and 1.7.2 on
    Jan 23, 2017 and Feb 4, 2017 respectively.
 -  All of the legacy code has been ported over to Java 8
 -  The team is presently working towards the next major 1.8.0
    release targeted for late March 2017
 -  The project added 2 new committers and 2 new PMC
    members since Dec 2016
 -  An Apache OpenNLP talk titled - ‘Large Scale Processing of
    Unstructured Text’ accepted for Apache Big Data North
    America 2017, Miami
 -  A few abstracts about the project have been submitted for
    Berlin Buzzwords 2017.
 -  Project is presently working on a new project logo,
    see https://issues.apache.org/jira/browse/OPENNLP-6

## Health report:
The overall project state significantly improved with a strong increase
in activity and a series of releases in the last three months.
Two new committers joined and existing committers became much more active.

## PMC changes:

 - Currently 10 PMC members.
 - New PMC members:
    - Suneel Marthi was added to the PMC on Thu Jan 19 2017
    - Tommaso Teofili was added to the PMC on Thu Jan 19 2017

## Committer base changes:

 - Currently 17 committers.
 - New committers:
    - Daniel Russ was added as a committer on Wed Jan 11 2017
    - Suneel Marthi was added as a committer on Sun Dec 25 2016

## Releases:

 - 1.7.0 was released on Fri Dec 30 2016
 - 1.7.1 was released on Mon Jan 23 2017
 - 1.7.2 was released on Sat Feb 04 2017

## Mailing list activity:

The volume of mails on the dev list increased mainly because GitHub
Pull Request status updates are sent there; otherwise the volume stayed
at a similar level as before.

 - users@opennlp.apache.org:
    - 421 subscribers (down -7 in the last 3 months):
    - 123 emails sent to list (18 in previous quarter)

 - dev@opennlp.apache.org:
    - 222 subscribers (down -9 in the last 3 months):
    - 506 emails sent to list (114 in previous quarter)

 - issues@opennlp.apache.org:
    - 44 subscribers (down -3 in the last 3 months):
    - 859 emails sent to list (155 in previous quarter)


## JIRA activity:

 - 110 JIRA tickets created in the last 3 months
 - 196 JIRA tickets closed/resolved in the last 3 months

21 Dec 2016 [Jörn Kottmann / Rich]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The development team stayed active over the last three months
and the activity in commits increased slightly while working
on the next release.

Community
---------------
The community stayed active with the usual amount of traffic on the user
mailing list and contributed a couple of patches to fix bugs and
to improve our code.

Rodrigo Agerri was added to the PMC on Jul 09 2015

Anastasija Mensikova was added as a committer on Jul 20 2016.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

21 Sep 2016 [Jörn Kottmann / Jim]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The development team stayed active over the last two months
and the activity in commits decreased slightly due to our git
migration.

The OpenNLP GSOC 2016 project will be merged soon into opennlp-tools
and afterwards the process to draft the next release will be started.

Community
---------------
The community stayed active with the usual amount of traffic on the user
mailing list and contributed a couple of patches to fix bugs.

Rodrigo Agerri was added to the PMC on Jul 09 2015

Anastasija Mensikova was added as a committer on Jul 20 2016.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

20 Jul 2016 [Jörn Kottmann / Marvin]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The development team stayed active over the last three months
and the activity increased.

Anastasija Mensikova was accepted in GSOC 2016 and is working on
a new sentiment analysis component for OpenNLP. Her work will
soon be merged into opennlp-tools.

Community
---------------
The community stayed active with the usual amount of traffic on the user
mailing list and contributed a couple of patches to fix bugs.

Rodrigo Agerri was added to the PMC on Jul 09 2015

Chris Mattmann was added as a committer on Jul 07 2016.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

15 Jun 2016 [Jörn Kottmann / Marvin]

No report was submitted.

16 Mar 2016 [Jörn Kottmann / Sam]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The development team stayed active but with less activity. A language
model component was added and a few smaller bugs were fixed.
At the current pace we will probably finish the next minor
release before the next board report.

Community
---------------
The community stayed active with the usual amount of traffic on the user
mailing list and contributed a couple of patches to fix bugs.

Rodrigo Agerri was added to the PMC on Jul 09 2015

Mondher Bouazizi and Anthony Beylerian were added as committers on
Fri Sep 04 2015.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

16 Dec 2015 [Jörn Kottmann / Chris]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The development team stayed active over the last three months and
participated in smaller development efforts. A couple of bugs were
fixed. At the current pace we will probably finish the next minor release in
the first few months of 2016.

Community
---------------
The community stayed active with the usual amount of traffic on the user
mailing list.

Rodrigo Agerri was added to the PMC on Jul 09 2015

Mondher Bouazizi and Anthony Beylerian were added as committers on
Fri Sep 04 2015.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

16 Sep 2015 [Joern Kottmann / Brett]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The two GSOC students both passed and we will start working to incorporate
the WSD component they built into the core OpenNLP Tools package.

The development team stayed active over the last three months and
participated in smaller development efforts.

Community
---------------

We received a Naive Bayes classifier contribution from
Cohan Sujay Carlos and integrated it into OpenNLP.

The community stayed active with the usual amount of traffic on the user
mailing list.

Rodrigo Agerri was added to the PMC on Jul 09 2015

The two GSOC 2015 students were voted in as committers.
Mondher Bouazizi and Anthony Beylerian were added as committers on
Fri Sep 04 2015.

Releases
------------
The last release OpenNLP 1.6.0 was released on Jul 09 2015.

Issues
--------
There are no board-level issues at this time.

17 Jun 2015 [Joern Kottmann / Brett]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The team is still working on the next release. All major issues
have been solved and the last release candidates fixed only
some details. A release vote for RC 6 is up and is expected
to pass within the next few days.

The testing process has been optimized and a few very time consuming
manual tests are now automated.

The development team stayed active over the last three months and
participated in the testing and bug fixing effort.

Community
---------------

The community stayed active with the usual amount of traffic on the user
mailing list.

Two students were accepted and are now working on a
Word Sense Disambiguation component as part of GSOC 2015.

There are no new PMC members.

Mark Giaconia was added as a committer in October 2013 and has been
active since then.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

18 Mar 2015 [Joern Kottmann / Greg]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech
tagging, named entity extraction, chunking, parsing, and coreference
resolution. These tasks are usually required to build more advanced text
processing services.

Development
------------------
The team is still working on the next release.
In the last three months two release candidates were tested.
A couple of bugs were fixed after each of the two iterations. A third
release candidate will be prepared soon.

The development team stayed active over the last three months and
participated in the testing and bug fixing effort.

Community
---------------

The community stayed active with the usual amount of traffic on the user
mailing list.

OpenNLP received a summarization component as a contribution
from Ramakrishna Soma and incorporated it into the sandbox.

Two students approached the project and would like to develop a
Word Sense Disambiguation system as part of GSOC 2015.

There are no new PMC members.

Mark Giaconia was added as a committer in October 2013 and has been
active since then.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

17 Dec 2014 [Joern Kottmann / Chris]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
Development stayed active over the last 3 months and the release is finally
prepared. William Colen was elected as Release Manager and published the first
release candidate for testing.

Community
---------------

The community stayed active with the usual amount of traffic on the user mailing list.

There are no new PMC members.

Mark Giaconia was added as a committer in October 2013 and has been active since then.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

@Chris: Discuss leaving out individual names from reports

17 Sep 2014 [Joern Kottmann / Ross]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP
tasks, such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The development team remains active, but the activity decreased over
the summer months. The 1.6.0 release is still not out but will hopefully
be finished this year. Most open issues for it are solved.

Community
---------------
The community remains active and there is good traffic on the lists.

There are no new PMC members and there have been no PMC/PPMC additions
since the project moved to Apache.

Vinh Khuc (May), Tommaso Teofili (April) and Rodrigo Agerri (March) have
become committers in the first half of 2014.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

18 Jun 2014 [Joern Kottmann / Sam]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging, named
entity extraction, chunking, parsing, and coreference resolution. These tasks
are usually required to build more advanced text processing services.

Development
------------------
The development team remains active. In the last three months many
contributions from new committers were integrated into OpenNLP. The work for
the next release paused and will hopefully continue soon.

Community
---------------
The community remained stable during the last three months, a few patches were
contributed and three new committers were voted in.

There are no new PMC members.

Vinh Khuc (May), Tommaso Teofili (April) and Rodrigo Agerri (March) have
become committers since the last report.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

19 Mar 2014 [Joern Kottmann / Greg]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The team is currently working on the last issues for the 1.6.x release branch
and will soon begin testing. The next release will probably be out in a couple of
weeks.

Community
---------------
The community remained stable the last three months, a few patches
were contributed and a new component to build language models was
added to the sandbox.

There are no new PMC members and there have been no PMC/PPMC additions
since the project moved to Apache.

Mark Giaconia was added as a committer in October 2013 and has been active since then.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

18 Dec 2013 [Joern Kottmann / Sam]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The team is currently working on the new features for the 1.6.x release branch
and will need a few more months for the next release.

Community
---------------
The community activity increased in the last three months, a few patches
were contributed and a new component was added to rapidly create training
data for the name finder.

There are no new PMC members.
Mark Giaconia was added as a committer in October and has been active since then.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

18 Sep 2013 [Joern Kottmann / Greg]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The team is currently working on the new features for the 1.6.x release branch
and it is expected to take quite a bit more time until the next release.

Community
---------------
The community activity reduced a bit over the summer months, a few patches
were committed for the recently contributed entity linker component.

No new committers have been voted in and no new PMC members.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

19 Jun 2013 [Joern Kottmann / Sam]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
After the last release of the 1.5.x branch the next release can contain bigger
changes, and we are actively working on new features: currently pluggable
machine learning support, refactoring of the machine learning code, and support
for the Brat (annotation tool) data format.

Community
---------------
The community remains active and there is good traffic on the lists.
The project received a contribution to resolve named entities to an
entry in a database, and adding a lemmatizer component was discussed.

No new committers have been voted in and no new PMC members.

Releases
------------
The last release OpenNLP 1.5.3 was released on 15.4.2013.

Issues
--------
There are no board-level issues at this time.

20 Mar 2013 [Joern Kottmann / Ross]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The team is still working on getting the next release, 1.5.3, out.
William Colen was elected as release manager and has already produced
the first RC. With William as our release manager we are finally able
to spread the knowledge of how to make a release further within the team.
It will take some time until all the manual tests are run and 1.5.3
can finally be released.

Community
---------------
The community remains active and there is good traffic on the lists.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

19 Dec 2012 [Joern Kottmann / Greg]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
We are preparing the trunk now for the upcoming release and fixed almost
all outstanding issues. Additionally the opennlp-similarity component in
the sandbox is prepared for its first release.

We expect to have an increased development activity again after the 1.5.3
release is out.

Community
---------------
The community remains active and there is good traffic on the lists.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

19 Sep 2012 [Joern Kottmann / Ross]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The development activity remained slow but constant, a few smaller
fixes and improvements have been contributed by the community
and one bigger patch for L-BFGS maxent training support. There was
still no work done on the outstanding release and the next release
is at least two or three months away.

Community
---------------
The community remains active and there is good traffic on the lists.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

20 Jun 2012 [Joern Kottmann / Ross]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
Development activity slowed down a bit compared to last month.
Bug fixes and smaller improvements are actively being worked on.
No work was done on the outstanding release, and it will still
take two or three months until it is finished.

Community
---------------
The community remains active and there is good traffic on the lists.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

16 May 2012 [Joern Kottmann / Greg]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
Last month the development stayed active and we had commits
almost every day from different committers. Most of the changes
were related to bug fixes and smaller improvements.

A discussion about the first OpenNLP release as a TLP was started
on the dev list, but it will likely take 2 or 3 months until the
release is finished.

Community
---------------
The community remains active and there is good traffic on the user list.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

18 Apr 2012 [Joern Kottmann / Shane]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
All the graduation tasks are done and the team is now focused on the
development of OpenNLP again; there have been a few commits almost
every day from different committers.

Much work and time went into bug fixing and smaller improvements.

There is now integrated training support for the coreference component
which is very important for others to be able to work on the code.
The training is needed to ensure that code changes don't break anything.

Community
---------------
The community is active and the project received a few smaller patches.
We will likely soon receive a Clojure interface layer contribution
and will hopefully be able to integrate the Clojure-minded OpenNLP
users better into the community at Apache.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

21 Mar 2012 [Joern Kottmann / Jim]

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. It supports the most common NLP tasks,
such as tokenization, sentence segmentation, part-of-speech tagging,
named entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text processing
services.

Development
------------------
The project graduated last month from the incubator which distracted everyone
a bit from the normal development activity. The team is active and works on
bug fixes and features for our next release.

Community
---------------
The community is active and we have good traffic on the developer
and user mailing lists. Users frequently report issues, which usually get fixed
quickly.

No new committers have been voted in.

Releases
------------
No releases since we graduated from the incubator.

Issues
--------
There are no board-level issues at this time.

15 Feb 2012

Establish the Apache OpenNLP Project

 WHEREAS, the Board of Directors deems it to be in the best interests
 of the Foundation and consistent with the Foundation's purpose to
 establish a Project Management Committee charged with the creation
 and maintenance of open-source software related to the processing of
 natural language text supported by machine learning for distribution
 at no charge to the public.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management
 Committee (PMC), to be known as the "Apache OpenNLP Project",
 be and hereby is established pursuant to Bylaws of the
 Foundation; and be it further

 RESOLVED, that the Apache OpenNLP Project be and hereby is
 responsible for the creation and maintenance of software
 related to the processing of natural language text
 supported by machine learning; and be it further

 RESOLVED, that the office of "Vice President, Apache OpenNLP" be
 and hereby is created, the person holding such office to
 serve at the direction of the Board of Directors as the chair
 of the Apache OpenNLP Project, and to have primary responsibility
 for management of the projects within the scope of
 responsibility of the Apache OpenNLP Project; and be it further

 RESOLVED, that the persons listed immediately below be and
 hereby are appointed to serve as the initial members of the
 Apache OpenNLP Project:

    * William Silva <colen@apache.org>
    * Thomas Morton <tsmorton@apache.org>
    * Jason Baldridge <jbaldrid@apache.org>
    * James Kosin <jkosin@apache.org>
    * Jörn Kottmann <joern@apache.org>
    * Aliaksandr Autayeu <autayeu@apache.org>
    * Boris Galitsky <bgalitsky@apache.org>
    * Grant Ingersoll <gsingers@apache.org>
    * Benson Margulies <bimargulies@apache.org>
    * Isabel Drost <isabel@apache.org>

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that Jörn Kottmann
 be appointed to the office of Vice President, Apache OpenNLP, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification,
 or until a successor is appointed; and be it further

 RESOLVED, that the initial Apache OpenNLP PMC be and hereby is
 tasked with the creation of a set of bylaws intended to
 encourage open development and increased participation in the
 Apache OpenNLP Project; and be it further

 RESOLVED, that the Apache OpenNLP Project be and hereby
 is tasked with the migration and rationalization of the Apache
 Incubator OpenNLP podling; and be it further

 RESOLVED, that all responsibilities pertaining to the Apache
 Incubator OpenNLP podling encumbered upon the Apache Incubator
 Project are hereafter discharged.

 Special Order 7B, Establish the Apache OpenNLP Project, was
 approved by Unanimous Vote of the directors present.

15 Feb 2012

The Apache OpenNLP library is a machine learning based toolkit for the
processing of natural language text. Incubating since November 2010.

It supports the most common NLP tasks, such as tokenization, sentence
segmentation, part-of-speech tagging, named entity extraction, chunking,
parsing, and coreference resolution. These tasks are usually required to
build more advanced text processing services. OpenNLP also includes maximum
entropy and perceptron based machine learning.

The team was extended by two new committers, Boris Galitsky and Aliaksandr
Autayeu. We worked towards our graduation and had a positive community and
recommendation vote.

Our community became more active and we saw a couple of new faces
on the user and development mailing lists.

Signed off by mentor:

21 Dec 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

Our second release at Apache is now out for vote and will hopefully be
released soon. After some delay due to legal issues, we have finally
accepted the syntactic generalization contribution from Boris Galitsky, and
he has sent in a couple of patches to improve it. The development
team is active and the development of the next release will now start.

A list of the most important issues to address in the move towards
graduation:
 * Establish open regression tests for the parser and coreference component
 * Identify and encourage new contributors on the path to committership

16 Nov 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

Our second release at Apache is now out for vote and will hopefully be
released soon. After some delay due to legal issues, we have finally
accepted the syntactic generalization contribution from Boris Galitsky, and
he has sent in a couple of patches to improve it. The development
team is active and the development of the next release will now start.

A list of the most important issues to address in the move towards
graduation:
 * Establish open regression tests for the parser and coreference component
 * Identify and encourage new contributors on the path to committership

17 Aug 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

We have almost finished development of our next release (1.5.2), our second
release in the Incubator, and will start testing it soon.
The development team is still active, and most of the changes have been made
by four independent committers.
This week Boris Galitsky proposed on the mailing list to contribute a
component for syntactic generalization, which we will hopefully be able to
accept.
Furthermore, we started to work on tooling for an annotation project in our
sandbox and hope to be able to attract new contributors through this effort.

A list of the most important issues to address in the move towards
graduation:
* Establish open regression tests for the parser and coreference component
* Identify and encourage new contributors on the path to committership

20 Jul 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

A list of the most important issues to address in the move towards
graduation:
* Establish open regression tests for the parser and coreference component
* Identify and encourage new contributors on the path to committership

15 Jun 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

We started to integrate a couple of new features and will soon start testing
for our second release at Apache.
To solve our training data problem, we created a proposal for an annotation
project which suggests labeling AL 2.0 licensable text documents with
semantic annotations.

There is still good traffic on both the user and dev mailing lists.

A list of the most important issues to address in the move towards
graduation:
* Resolve potential IP issues around releasing training models
* Establish open regression tests for the parser and coreference component
* Identify and encourage new contributors on the path to committership

19 May 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

We just did our first Apache release on 2nd of May and are now planning
which features/work will go into the next release. For instance we started
to discuss/plan a big refactoring of our machine learning code, custom
feature generation for the name finder, dictionary support for the name
finder, etc.

There is still good traffic on both the user and dev mailing lists. A couple
of users asked questions about how OpenNLP can be trained for new languages.

A list of the most important issues to address in the move towards
graduation:
 * Resolve potential IP issues around releasing training models
 * Establish open regression tests for the parser and coreference component

20 Apr 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

We are still working on our first release. All the planned testing in our
test plan is now done. We will prepare Release Candidate 6 in the next few
days and start the vote to release it.

A list of the three most important issues to address in the move towards
graduation:

* Resolve potential IP issues around releasing training models
* Do a release
* Establish open regression tests

16 Mar 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

We are currently working on getting our first release out. We have already
created two release candidates and are busy with testing and bug fixing. The
release will hopefully be out in late March or early April. For the testing
we are now using all supported public data sets and distributing the task
among the committers, which brings us closer to our third goal of having
open regression tests.
There is still good activity on our user and dev mailing lists.

A list of the three most important issues to address in the move towards
graduation:

 * Resolve potential IP issues around releasing training models
 * Do a release
 * Establish open regression tests

16 Feb 2011

OpenNLP is a machine learning based toolkit for the processing of
natural language text. It supports the most common NLP tasks, such as
tokenization, sentence segmentation, part-of-speech tagging, named
entity extraction, chunking, parsing, and coreference resolution.
These tasks are usually required to build more advanced text
processing services.

OpenNLP entered incubation on 11/23 2010.

Since we last reported in January, we continued to fill up Jira with
more issues, rewrote our Maven-based build to comply with general Apache
rules and to be ready to create our first release, fixed a few minor
bugs and refactored parts of the chunker, migrated the SourceForge
wiki documentation into DocBook for inclusion in future releases,
and added build instructions to the website.
We decided to focus on our first release, which will hopefully be out
in March. The release will just contain OpenNLP without any
statistical models to avoid any legal issues which might delay the
release.
Regression testing will mostly be done on private data.
There has been daily activity on the dev mailing list and a little less on
the user mailing list.

A list of the three most important issues to address in the move
towards graduation:

 * Resolve potential IP issues around releasing training models
 * Do a release
 * Establish open regression tests

19 Jan 2011

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.

Since we last reported in December, we have completed the code import.
There has been a lot of activity on the dev list, and we're filling up Jira
with issues to resolve before we can release.  Our users also seem to have
found the way from SourceForge to Apache, as there is also activity on the
user list.

A list of the three most important issues to address in the move towards
graduation:

 * Resolve potential IP issues
 * Do a release
 * Establish open regression tests

15 Dec 2010

OpenNLP is a machine learning based toolkit for the processing of natural
language text. It supports the most common NLP tasks, such as tokenization,
sentence segmentation, part-of-speech tagging, named entity extraction,
chunking, parsing, and coreference resolution. These tasks are usually
required to build more advanced text processing services.

OpenNLP entered incubation on 11/23 2010.  Progress since then:

 * SVN and Jira set up complete
 * All committers now have ICLAs on file
 * Accounts have been created
 * Status page created
 * Initial web site created based on ASF CRM

A list of the three most important issues to address in the move towards
graduation:

 * Do the code import
 * Resolve potential IP issues
 * Do a release