Skip to Main Content
The Apache Software Foundation
Apache 20th Anniversary Logo

This was extracted (@ 2024-11-19 16:10) from a list of minutes which have been approved by the Board.
Please Note The Board typically approves the minutes of the previous meeting at the beginning of every Board meeting; therefore, the list below does not normally contain details from the minutes of the most recent Board meeting.

WARNING: these pages may omit some original contents of the minutes.
This is due to changes in the layout of the source minutes over the years. Fixes are being worked on.

Meeting times vary, the exact schedule is available to ASF Members and Officers, search for "calendar" in the Foundation's private index page (svn:foundation/private-index.html).

SAMOA

16 Dec 2020

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs). It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

### Three most important unfinished issues to address before graduating:

 1. Revitalize the project by resuming development
 2. Enlarge the user base and contributing community

### Are there any issues that the IPMC or ASF Board need to be aware of?

 None.

### How has the community developed since the last report?

 - Mailing list activity:
 - @dev: 4 messages

### How has the project developed since the last report?

 We are working in an implementation of adaptive random forests.

### How would you assess the podling's maturity?
 Please feel free to add your own commentary.

 - [ ] Initial setup
 - [ ] Working towards first release
 - [X] Community building
 - [ ] Nearing graduation
 - [ ] Other:

### Date of last release:

 2020-04-23

### When were the last committers or PPMC members elected?

 August 2019

### Have your mentors been helpful and responsive?

 There are no mentors since June.

### Is the PPMC managing the podling's brand / trademarks?

 Yes. There are no 3rd parties incorrectly using the podling‘s name and
 brand for now.

### Signed-off-by:

### IPMC/Shepherd notes:
 Justin Mclean: I suggest you reach out the the IPMC general list and
 ask for more mentors. Given the low activity it may be hard to attract
 one. Can the project point to me to where this development on adaptive
 random forests is? There doesn't seem to be any commits for almost a year.

16 Sep 2020

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs). It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

### Three most important unfinished issues to address before graduating:

 1. Revitalize the project by resuming development
 2. Enlarge the user base and contributing community

### Are there any issues that the IPMC or ASF Board need to be aware of?

 None.

### How has the community developed since the last report?

 - Mailing list activity:
 - @dev: 14 messages

### How has the project developed since the last report?

 We are working in an implementation of adaptive random forests.

### How would you assess the podling's maturity?
 Please feel free to add your own commentary.

 - [ ] Initial setup
 - [ ] Working towards first release
 - [X] Community building
 - [ ] Nearing graduation
 - [ ] Other:

### Date of last release:

 2020-04-23

### When were the last committers or PPMC members elected?

 August 2019

### Have your mentors been helpful and responsive?

 Yes, the mentors have been helpful and responsive.

### Is the PPMC managing the podling's brand / trademarks?

 Yes. There are no 3rd parties incorrectly using the podling‘s name and
 brand for now.

### Signed-off-by:



### IPMC/Shepherd notes:

17 Jun 2020

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs). It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

### Three most important unfinished issues to address before graduating:

 1. Revitalize the project by resuming development
 2. Enlarge the user base and contributing community

### Are there any issues that the IPMC or ASF Board need to be aware of?

 None.

### How has the community developed since the last report?

 - Mailing list activity:
 - @dev: 11 messages

### How has the project developed since the last report?

 With the help of the new commiter Corey Sterling, we prepared a new
 release of Apache SAMOA.

### How would you assess the podling's maturity?
 Please feel free to add your own commentary.

 - [ ] Initial setup
 - [ ] Working towards first release
 - [X] Community building
 - [ ] Nearing graduation
 - [ ] Other:

### Date of last release:

 2020-04-23

### When were the last committers or PPMC members elected?

 August 2019

### Have your mentors been helpful and responsive?

 Yes, the mentors have been helpful and responsive.

### Is the PPMC managing the podling's brand / trademarks?

 Yes. There are no 3rd parties incorrectly using the podling‘s name and
 brand for now.

### Signed-off-by:

 - [X] (samoa) Alan Gates
    Comments:  Good to see an initial release.

### IPMC/Shepherd notes:

18 Mar 2020

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs). It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

### Three most important unfinished issues to address before graduating:

 1. Revitalize the project by resuming development
 2. Enlarge the user base and contributing community

### Are there any issues that the IPMC or ASF Board need to be aware of?

 None.

### How has the community developed since the last report?

 - Mailing list activity:
 - @dev: 31 messages

### How has the project developed since the last report?

 With the help of the new commiter Corey Sterling, we prepared a new
 release of Apache SAMOA, that is already being discussed and voted (RC3).

### How would you assess the podling's maturity?
Please feel free to add your own commentary.

 - [ ] Initial setup
 - [ ] Working towards first release
 - [X] Community building
 - [ ] Nearing graduation
 - [ ] Other:

### Date of last release:

 2016-09-30

### When were the last committers or PPMC members elected?

 August 2019

### Have your mentors been helpful and responsive?

 Yes, the mentors have been helpful and responsive.

### Is the PPMC managing the podling's brand / trademarks?

 Yes. There are no 3rd parties incorrectly using the podling‘s name and
 brand for now.

### Signed-off-by:

 - [X] (samoa) Alan Gates
    Comments:  With the election of the new committer it looks like the
    project is slowly making some progress.  I'm happy to see it start
    moving forward.

### IPMC/Shepherd notes:

18 Dec 2019

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs). It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

### Three most important unfinished issues to address before graduating:

 1. Revitalize the project by resuming development
 2. Enlarge the user base and contributing community

### Are there any issues that the IPMC or ASF Board need to be aware of?

 Nothing important. It took more than 2 months to create the account
 of Corey Sterling, as it seems that some things are still done manually,
 and there was a mistake with his email address. Finally, we could solve
 it out.

### How has the community developed since the last report?
 *  Mailing list activity:
  * @dev: 33 messages

### How has the project developed since the last report?

 * Retirement has been suggested due to its very low activity
 * To restart engagement, Corey Sterling has been elected as commiter in
 late August. With his help, we prepared a new release of Apache
 SAMOA, that is already being discussed and voted (RC).

### How would you assess the podling's maturity?
Please feel free to add your own commentary.

 - [ ] Initial setup
 - [ ] Working towards first release
 - [X] Community building
 - [ ] Nearing graduation
 - [ ] Other:

### Date of last release:

 2016-09-30

### When were the last committers or PPMC members elected?

 August 2019

### Have your mentors been helpful and responsive?

 Yes, the mentors have been helpful and responsive.

### Is the PPMC managing the podling's brand / trademarks?

 Yes. There are no 3rd parties incorrectly using the podling‘s name and
 brand for now.

### Signed-off-by:

 - [X] (samoa) Alan Gates
    Comments:
 - [ ] (samoa) Ashutosh Chauhan
    Comments:

### IPMC/Shepherd notes:

18 Sep 2019

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs). It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

### Three most important unfinished issues to address before graduating:

 1. Revitalize the project by resuming development
 2. Enlarge the user base and contributing community

### Are there any issues that the IPMC or ASF Board need to be aware of?

 None

### How has the community developed since the last report?
 *  Mailing list activity:
   * @dev: 12 messages

### How has the project developed since the last report?

 * Retirement has been suggested due to its very low activity
 * To restart engagement, Corey Sterling has been elected as commiter in
 late August. With his help, we are preparing a new release of Apache
 SAMOA in early September. Corey Sterling has experience on open source
 software,  as an example, he did the last release of MOA, an open source
 software non-distributed for data streams very related to SAMOA. He works
 at University of Waikato.

### How would you assess the podling's maturity?
Please feel free to add your own commentary.

 - [ ] Initial setup
 - [ ] Working towards first release
 - [X] Community building
 - [ ] Nearing graduation
 - [ ] Other:

### Date of last release:

 2016-09-30

### When were the last committers or PPMC members elected?

 August 2019

### Have your mentors been helpful and responsive?

 Yes, the mentors have been helpful and responsive.

### Signed-off-by:

 - [X] (samoa) Alan Gates
    Comments: Happy to see a new committer, hopefully this will help
    kickstart the project.
 - [ ] (samoa) Ashutosh Chauhan
    Comments:

### IPMC/Shepherd notes:

 Dave Fisher: Let's see if bringing on a new committer revitalizes the
 podling.

19 Jun 2019

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

### Three most important unfinished issues to address before graduating:

 1. Revitalize the project by resuming development
 2. Enlarge the user base and contributing community

### Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

### How has the community developed since the last report?
 *  Mailing list activity:
   * @dev: 8 messages

### How has the project developed since the last report?
 * 1 new PRs created
 * Working on code for supporting Apache Heron

### How would you assess the podling's maturity?
Please feel free to add your own commentary.

 - [ ] Initial setup
 - [ ] Working towards first release
 - [X] Community building
 - [ ] Nearing graduation
 - [ ] Other:

### Date of last release:

 2016-09-30

### When were the last committers or PPMC members elected?

 January 2018

### Have your mentors been helpful?

 Yes, the mentors have been helpful and responsive.

### Signed-off-by:

 - [X] (samoa) Alan Gates
    Comments:  Dave Fisher (shepherd for this reporting cycle) recently
    suggested that this podling consider retirement as there is very low
    activity and it has been incubator for quite some time.  Whether and
    how to attempt to restart engagement is being discussed.
 - [ ] (samoa) Ashutosh Chauhan
    Comments:

### IPMC/Shepherd notes:

20 Mar 2019

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).

It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Revitalize the project by resuming development
 2. Enlarge the user base and contributing community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

  None

How has the community developed since the last report?

  *  Mailing list activity (September 2018 - November 2018):
    * @dev: 8 messages

How has the project developed since the last report?
  * 1 new PRs created
  * Working on code for supporting Apache Heron

How would you assess the podling's maturity?
Please feel free to add your own commentary.

  [ ] Initial setup
  [ ] Working towards first release
  [x] Community building
  [ ] Nearing graduation
  [ ] Other:

Date of last release:
  2016-09-30

When were the last committers or PPMC members elected?

  January 2018

Have your mentors been helpful and responsive or are things falling
through the cracks? In the latter case, please list any open issues
that need to be addressed.

  Yes, the mentors have been helpful and responsive.

Signed-off-by:

  [X](samoa) Alan Gates
     Comments:
  [ ](samoa) Ashutosh Chauhan
     Comments:

IPMC/Shepherd notes:

19 Dec 2018

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Revitalize the project by resuming development
 2. Enlarge the user base and contributing community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

  None

How has the community developed since the last report?

 *  Mailing list activity (September 2018 - November 2018):
    * @dev: 30 messages

How has the project developed since the last report?
  * 2 new PRs created
  * Working on code for supporting Apache Heron

How would you assess the podling's maturity?
Please feel free to add your own commentary.

  [ ] Initial setup
  [ ] Working towards first release
  [x] Community building
  [ ] Nearing graduation
  [ ] Other:

Date of last release:

  2016-09-30

When were the last committers or PPMC members elected?

  January 2018

Have your mentors been helpful and responsive or are things falling
through the cracks? In the latter case, please list any open issues
that need to be addressed.

  Yes, the mentors have been helpful and responsive.

Signed-off-by:

  [X](samoa) Alan Gates
     Comments:
  [ ](samoa) Ashutosh Chauhan
     Comments:

IPMC/Shepherd notes:

19 Sep 2018

SAMOA provides a collection of distributed streaming algorithms for the
most common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Revitalize the project by resuming development
 2. Enlarge the user base and contributing community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware
of?

 None

How has the community developed since the last report?
 *  Mailing list activity (June 2018 - August 2018):
   * @dev: 19 messages

How has the project developed since the last report?
 * 2 new PRs created
 * Presentation at KDD 2018 (MUD3 workshop) on urban data mining using
 Apache SAMOA

How would you assess the podling's maturity?
Please feel free to add your own commentary.

 [ ] Initial setup
 [ ] Working towards first release
 [X] Community building
 [ ] Nearing graduation
 [ ] Other:

Date of last release:

 2016-09-30

When were the last committers or PPMC members elected?

 January 2018

Signed-off-by:

 [X](samoa) Alan Gates
    Comments:  I agree with Ted, I remain concerned about the lack of
    activity in this podling.
 [ ](samoa) Ashutosh Chauhan
    Comments:
 [ ](samoa) Enis Soztutar
    Comments:
 [x](samoa) Ted Dunning
    Comments:

IPMC/Shepherd notes:

 ted: Almost no mailing list activity since April. March looked promising,
 but after voting in a new committer, nothing much else happened.

20 Jun 2018

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new
algorithms that run on top of distributed stream processing engines (DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Revitalise the project by resuming development
 2. Enlarge the user base and contributing community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?
 *  Mailing list activity (March 2018 - May 2018):
   * @dev: 25 messages

How has the project developed since the last report?
 * Work on the project has slowed down, no new development since the last
report

How would you assess the podling's maturity?
Please feel free to add your own commentary.

 [ ] Initial setup
 [ ] Working towards first release
 [X] Community building
 [ ] Nearing graduation
 [ ] Other:

Date of last release:

 2016-09-30

When were the last committers or PPMC members elected?

 January 2018

Signed-off-by:

 [X](samoa) Alan Gates
    Comments: I am glad to see the project is aware it needs to increase
the pace of development.  Hopefully this will lead to more contributions
from the core team.
 [ ](samoa) Ashutosh Chauhan
    Comments:
 [ ](samoa) Enis Soztutar
    Comments:
 [ ](samoa) Ted Dunning
    Comments:

IPMC/Shepherd notes:

21 Mar 2018

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop new
algorithms that run on top of distributed stream processing engines (DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache Flink, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

  1. Enlarge the contributing community
  2. Have new companies and organizations using the Apache Samoa technology

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

None

How has the community developed since the last report?


 Mailing list activity (December 2017 - February 2018):
  * @dev: 23 messages

 Jira issues backlog (December 2017 - February 2018):
  * Created:  2
  * Resolved: 1

How has the project developed since the last report?

 * Just elected two new members to the Apache Samoa PPMC
 * Planning a new release in the next month
 * Worked on consolidating input and output API, to work better with multiple types of data

How would you assess the podling's maturity?
Please feel free to add your own commentary.

  [ ] Initial setup
  [ ] Working towards first release
  [X] Community building
  [ ] Nearing graduation
  [ ] Other:

Date of last release:

  2016-09-30

When were the last committers or PPMC members elected?

January 2018


Signed-off-by:

  [ ](samoa) Alan Gates
     Comments:
  [ ](samoa) Ashutosh Chauhan
     Comments:
  [ ](samoa) Enis Soztutar
     Comments:
  [X](samoa) Ted Dunning
     Comments: Is this community really generating enough momentum to graduate?

20 Dec 2017

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop new
algorithms that run on top of distributed stream processing engines (DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache Flink, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

  1. Elect new PPMC members
  2. Enlarge the community
  3.

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

None

How has the community developed since the last report?


 Mailing list activity (September 2017 - November 2017):
  * @dev: 49 messages

 Jira issues backlog (September 2017 - November 2017):
  * Created:  5
  * Resolved: 1

How has the project developed since the last report?

 * Planning a new release in the next month.

 * Presentation at the Flink Forward Conference from Orange's team

 * Integration of a new Boosting algorithm

 * Improved Apache Kafka support and support for JSON and Avro.

 * Improved SAMOA instances.

How would you assess the podling's maturity?
Please feel free to add your own commentary.

  [ ] Initial setup
  [ ] Working towards first release
  [X] Community building
  [ ] Nearing graduation
  [ ] Other:

Date of last release:

  2016-09-30

When were the last committers or PPMC members elected?

None


Signed-off-by:

  [X](samoa) Alan Gates
     Comments: In response to shepherd's comments (see below) community started a vote on a couple of contributors becoming committers.
  [ ](samoa) Ashutosh Chauhan
     Comments:
  [ ](samoa) Enis Soztutar
     Comments:
  [ ](samoa) Ted Dunning
     Comments:

IPMC/Shepherd notes:
Concerned about the lack of community growth. Very few commits this year yet there are pull requests by contributors.
Sent an email to private@samoa

20 Sep 2017

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop new
algorithms that run on top of distributed stream processing engines (DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache Flink, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

  1. Elect new PPMC members
  2. Enlarge the community
  3.

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (June 2017 - August 2017):
  * @dev: 120 messages

 Jira issues backlog (June 2017 - August 2017):
  * Created:  4
  * Resolved: 5

 We saw the engagement of new contributors, who are plausible candidates for the PPMC.

How has the project developed since the last report?

 * Planning a new release in the next month. Work on the release was delayed due to summer holidays for most of the contributors.

 * Integrated Apache Kafka support and support for JSON and Avro.

 * Developed a new Boosting algorithm that will be integrated shortly.

 * Integrated new version of the SAMOA instances.

 * Integrated ability to store predictions on disk.


How would you assess the podling's maturity?
Please feel free to add your own commentary.

  [ ] Initial setup
  [ ] Working towards first release
  [X] Community building
  [ ] Nearing graduation
  [ ] Other:

Date of last release:

 2016-09-30

When were the last committers or PPMC members elected?

 None


Signed-off-by:

  [X](samoa) Alan Gates
     Comments:
  [ ](samoa) Ashutosh Chauhan
     Comments:
  [ ](samoa) Enis Soztutar
     Comments:
  [ ](samoa) Ted Dunning
     Comments:

21 Jun 2017

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop new
algorithms that run on top of distributed stream processing engines (DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Flink, Apache Storm, Apache Apex and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the developer base
 2. Grow the user base
 3. Add some more ML techniques

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware
of?

 None

How has the community developed since the last report?

Mailing list activity (March 2017 - May 2017):
 * @dev: 38 messages

 Jira issues backlog (March 2017 - May 2017):
 * Created:  6
 * Resolved: 1

 - We organized a 2-day internal workshop on SAMOA at Telefonica I+D with
   researchers of Orange Labs, Telecom Paris and QCRI. Various have been
   discussed to improve SAMOA's ML and other interfaces with systems like
   Kafka, etc.

 - We submitted a proposal for funding which will help the development of new
   features on Samoa.

 - We have been working on a new release 0.5.0, probably coming up in one
   month. This release will have support for Kafka, a more well-rounded
   support for Avro and Json formats, as well as support to store predictions
   outputted from the model.

 - There was also significant work done to integrate correctly the instances
   between MOA and SAMOA, as there was a deviation of how they were defined
   and hindered the portability of new methods from MOA to SAMOA.

 - Early discussions to increase the committers / developers team by inviting
   new contributors to the panel.

How has the project developed since the last report?

 - We are looking into new ML techniques for development.

 - Worked on the integration of Samoa-MOA instances

 - Engaging interactions with new parties (Orange Labs) and potential
   collaborations

How would you assess the podling's maturity? Please feel free to add your own
commentary.

 [ ] Initial setup
 [ ] Working towards first release
 [X] Community building
 [ ] Nearing graduation
 [ ] Other:

Date of last release:

 2016-09-30

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
    Comments:
 [ ](samoa) Ashutosh Chauhan
    Comments:
 [ ](samoa) Enis Soztutar
    Comments:
 [ ](samoa) Ted Dunning
    Comments:

IPMC/Shepherd notes:

   Drew Farris (shepherd): One mentor active. Light activity on the project in
     general.

15 Mar 2017

SAMOA provides a collection of distributed streaming algorithms for the
most common data mining and machine learning tasks such as
classification, clustering, and regression, as well as programming
abstractions to develop new algorithms that run on top of distributed
stream processing engines (DSPEs).  It features a pluggable architecture
that allows it to run on several DSPEs such as Apache Flink, Apache
Storm, Apache Apex and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the developer base
 2. Grow the user base
 3. Add some more ML techniques

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (December 2016 - February 2017):
 * @dev: 24 messages

 Jira issues backlog (December 2016 - February 2017):
 * Created:  2
 * Resolved: 0

 - We are organizing an internal workshop on SAMOA at Telefonica
   I+D with researchers of Orange Labs

 - We had a presentation of Apache Samoa at Paris Machine Learning
   Meetup and Hamburg Machine Learning Meetup

 - Bhupesh Chawda has presented Apache SAMOA in different venues
   with his presentation "Machine Learning Support in Apache Apex
   (Next Gen Hadoop) with Apache SAMOA"

 - We invited edi_bice who made several contributions last
   spring to become a committer.

 - We published a scientific paper using Apache Samoa and one of
   its machine learning techniques (Vertical Hoeffding Tree)
   presented in IEEE BigData Conference last December 2016.

 - We submitted a paper on SAMOA to a post-proceedings book of the
   workshops MUSE/MSM 2015/2016 in the Springer LNCS/LNAI series.

How has the project developed since the last report?

 - We are looking into new ML techniques for development.

How would you assess the podling's maturity?
Please feel free to add your own commentary.

 [ ] Initial setup
 [ ] Working towards first release
 [X] Community building
 [ ] Nearing graduation
 [ ] Other:

Date of last release:

 2016-09-30

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
    Comments:

      Activity on this podling remains low.  It is good to see a new
      committer being elected.  Shouldn't that be noted in the final section
      on when new committers and PMC members were elected?

 [ ](samoa) Ashutosh Chauhan
    Comments:
 [ ](samoa) Enis Soztutar
    Comments:
 [ ](samoa) Ted Dunning
    Comments:

--------------------
Singa

Singa is a distributed deep learning platform.

Singa has been incubating since 2015-03-17.

Three most important issues to address in the move towards graduation:

 1. Improve distributed training in SINGA V1.2
 2. Improve the documentation and add more examples
 3. Attract more contributors

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of?

 No

How has the community developed since the last report?

 There were  66, 60, and 43 emails from dev@ list in December 2016, January
 2017 and February 2017 respectively.

 There are 67 new commits since the last report.

 One new committer (Li Boon Tan) was added.

How has the project developed since the last report?

 We released the V1.1 version after the last report.  The following features
 were added after last report

 + Ease the installation process via Docker images, debian packages, conda
   packages and Amazon AMI  (CPU version)
 + Integrate with Jenkins for automatically generating convenient packages
   and updating the website.
 + Improve the model classes: adding debug mode, adding the Concat and Slice
   layers, and supporting model loading and saving via the Snapshot API
 + Add image_tool.py for image augmentation and rafiki sub-package for
   providing RESTFul APIs.
 + Enable Java binding (basic) for SINGA
 + Add examples pre-trained from Caffe, e.g., GoogleNet, and examples
   pre-trained from torch, e.g. ResNet

How would you assess the podling's maturity?
Please feel free to add your own commentary.

  [ ] Initial setup
  [ ] Working towards first release
  [X] Community building
  [ ] Nearing graduation
  [ ] Other:

Date of last release:

 2017-02-12

When were the last committers or PMC members elected?

 2017-02-26

Signed-off-by:

   [ ](singa) Daniel Dai
     Comments:
   [X](singa) Alan Gates
     Comments:

       On the maturity assessment I would say the community is in the
       "Community building" phase but not far from the "Nearing graduation"
       phase.

   [ ](singa) Ted Dunning
     Comments:
   [ ](singa) Thejas Nair
     Comments:

21 Dec 2016

SAMOA provides a collection of distributed streaming algorithms for the
most common data mining and machine learning tasks such as
classification, clustering, and regression, as well as programming
abstractions to develop new algorithms that run on top of distributed
stream processing engines (DSPEs).  It features a pluggable architecture
that allows it to run on several DSPEs such as Apache Flink, Apache
Storm, Apache Apex and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the developer base
 2. Grow the user base
 3. Add some more ML techniques

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (September 2016 - November 2016):
 * @dev: 87 messages

 Jira issues backlog (September 2016 - November 2016):
 * Created:  0
 * Resolved: 1

 - We manage to resolve the pending issue regarding the integration of
   Apache Apex with Samoa.
 - Also, we made some testing progress with Apache Gearpump.
 - We also had a presentation of Apache Samoa and its integration with
   Apache Apex in Apache BigData Europe 2016
 - We also published a scientific paper using Apache Samoa and one of
   its machine learning techniques (Vertical Hoeffding Tree) (to be
   presented in IEEE BigData Conference in December 2016)

How has the project developed since the last report?

 - We have performed a new release with several bug fixes and new
   features (0.4.0)
 - We are looking into new ML techniques for development.

Date of last release:

 2016-09-30

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 Alan Gates:

   Very low activity on this project over the last few months, with
   last commit showing as Sept 22 and very few commits since last
   March.  A big need here beyond activity is to grow the community.
   There is a one individual who made several contributions last
   spring; I sent an email to the mailing list asking if it made sense
   to invite him to become a committer.

21 Sep 2016

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Perform another release
 2. Grow the developer base
 3. Grow the user base

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (June 2016 - August 2016):
 * @dev: 77 messages

 Jira issues backlog (June 2016 - August 2016):
 * Created:  1
 * Resolved: 0

 We have had interest from both the Apex and Gearpump communities.

How has the project developed since the last report?

 We have been preparing a second release of the project, which was delayed
 due to some minor issues in the RC and then the summer break for most
 developers.

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [ ](samoa) Ted Dunning

15 Jun 2016

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache Flink, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the community
 2. Elect new PMC members
 3. Release more often

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (March 2016 - May 2016):
 * @dev: 202 messages

 Jira issues backlog (March 2016 - May 2016):
 * Created:  3
 * Resolved: 9

 We have a number of new users showing up on the mailing list, which is
 encouraging.

How has the project developed since the last report?

 After the decision taken by the community to drop support for S4, we
 removed the support for S4. We fixed automatic build issues, we simplified
 input from HDFS, and we worked on adding support for Apache Gearpump.  Two
 talks were given to advertise Apache SAMOA, one at ApacheCon NA Big Data
 by Nicolas Kourtellis, and another one at the "J on the beach" conference
 by Albert Bifet.

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 Alan Gates:
   I sent a mail to the community poking on the long wait since
   the last release and whether they think there will be a release
   soon.  I also expressed my concern on not having added any
   committers and asked if there were any contributors who looked
   like they might be a good candidate for committership.

16 Mar 2016

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the community
 2. Elect new PMC members
 3. Release more often

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (December 2015 - February 2016):
 * @dev: 114 messages

 Jira issues backlog (December 2015 - February 2016):
 * Created:  7
 * Resolved: 4

 We have a number of new users showing up on the mailing list, which is
 encouraging.

How has the project developed since the last report?

 The community has decided to drop support for S4, which is now inactive.
 There is also some continued interest by Apache Apex about a possible
 integration.  Internally, we have been working on strengthening our main
 classifier (VHT).  There has also been work on improving the connection
 with Avro and Kafka.  The adapted for Flink has been updated to the latest
 stable version (0.10).

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

16 Dec 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Flink, Apache Storm, Apache S4, and Apache
Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the community
 2. Elect new PMC members
 3. Following up our first release with further releases

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None.

How has the community developed since the last report?

 Mailing list activity (September-October-November 2015):
 * @dev: 120 messages

 Jira issues backlog (September-October-November 2015):
 * Created:  8
 * Resolved: 6

 Our main goal is to grow the community, which is still pretty small.  We
 are doing a large amount of dissemination work in conferences and events
 to promote SAMOA.  For example, we had two talks at technical conferences:

 1) A talk at the Apache Conference BigData, in Budapest, Hungary, during
    the week of September 28-30, 2015.
 2) A talk at the Flink Forward, a conference for the Apache Flink DSPE, in
    Berlin, Germany, October 12-13, 2015

 We have had contributions from outside the PPMC. In particular, we have
 had contributions for:

 1) integrating Apache Avro input with Apache Samoa,
 2) a proposal for integrating Apache APEX as a new DSPE,
 3) continued the collaboration with the Apache Flink community.

How has the project developed since the last report?

 Main developments:

 * Performed development and testing of the VHT module.
 * Fixed various bugs and improved ensemble and bagging methods.
 * Worked on the integration of new data sources like Apache Avro.
 * Improved the website to include more material for new contributors.

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [x](samoa) Ted Dunning

Shepherd/Mentor notes:

16 Sep 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Flink, Apache Storm, Apache S4, and Apache
Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

  1. Grow the community
  2. Elect new PMC members
  3. Following up our first release with further releases

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

  None

How has the community developed since the last report?

  Mailing list activity (June-July-August 2015):

  * @dev     225 messages

  Jira issues backlog (June-July-August 2015):

  * Created:  10
  * Resolved: 5

  Our main goal is still to grow the community, which is still pretty small.
  We are doing a large amount of dissemination work in conferences and
  events to promote SAMOA.

  We have had contributions from outside the PMC.  In particular, we have
  been collaborating with the Apache Flink community.

How has the project developed since the last report?

  Main developments:

  * First release 0.3.0 in July.
  * Cleanup of codebase to ease adoption by new contributors.

  We should increase the rate of technical contribution to the project and
  move to less incremental ones.

Date of last release:

  2015-07-21

When were the last committers or PMC members elected?

  None

Signed-off-by:

  [X](samoa) Alan Gates
  [ ](samoa) Ashutosh Chauhan
  [ ](samoa) Enis Soztutar
  [ ](samoa) Ted Dunning

--------------------
Sentry

Sentry is a highly modular system for providing fine grained role
based authorization to both data and metadata stored on an Apache Hadoop
cluster.

Sentry has been incubating since 2013-08-08.

Three most important issues to address in the move towards graduation:

 1. Encourage more feature and direction discussions on the dev list rather
    than jira.
 2. Continue reporting on time
 3. Continue making periodic releases following the Apache guidelines.

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

  Community has made been making good progress on all of the above mentioned
  items.

  1. There have been multiple discussions on the dev list about features/
     design decisions/ roadmap/ dev practices.
  2. Community has been doing timely monthly reporting.
  3. Community is actively working on bug fixes and working to release 1.6.0
     next month

How has the community developed since the last report?

  We had 160 messages on dev list last month. (Got number from
  http://markmail.org/list/org.apache.sentry.dev)

How has the project developed since the last report?

  About 42 issues were created and about 42 resolved(Numbers from jira).

Date of last release:

  2015-07-14

When were the last committers or PMC members elected?

 Colin Ma, Dapeng Sun, Guoquan Shen and Xiaomeng Huang were added as
 committers on 12/24/2014.  No new PPMC members have been added since the
 project has entered the incubator.

Signed-off-by:

  [X](sentry) Arvind Prabhakar
  [ ](sentry) Joe Brockmeier
  [X](sentry) David Nalley
  [ ](sentry) Olivier Lamy
  [X](sentry) Patrick Hunt
  [ ](sentry) Thomas White

17 Jun 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop new
algorithms that run on top of distributed stream processing engines (DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Roll an Apache release
 2. Elect new PMC members
 3. Grow the community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

Two of our current PMC members (Olivier and Matthieu) have new jobs
and have declared that they will not be able to actively contribute to
the project for the moment being. This leaves 4 active PMC members.


How has the community developed since the last report?

 Mailing list activity (March-April-May 2015):
 * @dev     375 messages

 Jira issues backlog (March-April-May 2015):

 * Created:  14
 * Resolved: 19

We have had contributions from outside the PMC.
In particular, we have been collaborating with the Apache Flink community.

Our main goal is still to grow the community, which is still pretty
small.  To achieve this goal, we are doing a large amount of
dissemination work in conferences and events to promote SAMOA.


How has the project developed since the last report?

We are fully operational on the new infrastructure:
* Completed migration of issues from old GitHub repository.
* Completed migration of documentation from old wiki.

* Created new Confluence wiki with roadmap and contribution
 instruction:
 https://cwiki.apache.org/confluence/display/SAMOA/Samoa+Home

Main developments:
* New adaptor for Flink integrated.
* Simplified the website and solved several bugs.
* Preparing for our first release in June.

We should increase the rate of technical contribution to the project
and move to less incremental ones.  Codebase still need cleanup to
ease adoption by new contributors.


Date of last release:

None, plan to have one in June.

When were the last committers or PMC members elected?

None.


Signed-off-by:

 [ ](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [x](samoa) Ted Dunning

Shepherd/Mentor notes:

18 Mar 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Start committing patches
 2. Discuss roadmap next release
 3. Grow the community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 The community is growing, and as an example, there are new people working
 on an adapter for Apache Flink-Streaming. Also, we have been giving talks
 at University of Waikato, and University of Auckland to try to attract
 more people to the community.

 Mailing list activity (since February 2015):

 * @dev     106 messages

 Jira issues backlog (since February 2015):

 * Created:  6
 * Resolved: 4

How has the project developed since the last report?

 The project setup is going on nicely.  We created bylaws for the project
 and we migrated documentation from the old website.

Date of last release:

 None

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 P. Taylor Goetz (ptgoetz):

   The SAMOA poddling is still ramping up and there are no apparent issues
   requiring mentor guidance.

18 Feb 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Create bylaws for the project
 2. Migrate documentation from the old website
 3. Start committing patches

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 We have started using the official Apache channels to discuss about SAMOA.
 A couple of new people have showed up on the mailing lists.  We have one
 new contributor submitting patches.

 Mailing list activity (since January 2015):

 * @dev      101 messages

 Jira issues backlog (since January 2015):

 * Created:  13
 * Resolved: 2

How has the project developed since the last report?

 The project setup is going on nicely.  We got the SGA from Yahoo and
 migrated the code to the Apache git repository, enabled GitHub integration
 (https://github.com/apache/incubator-samoa), migrated the website to the
 Apache infrastructure and added the Incubator branding
 (http://samoa.incubator.apache.org), enabled testing via Travis CI.

Date of last release:

 None

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [ ](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 Justin Mclean (jmclean):

   Just starting, mentors active.

21 Jan 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15 and is not fully functioning as a
project yet.

Three most important issues to address in the move towards graduation:

 1. Get IP clearance (SGA) from Yahoo
 2. Move the current code into ASF's git repository
 3. Start working as an Apache project

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 This is the first report.

How has the project developed since the last report?

 This is the first report.

Date of last release:

 No incubator release yet.

When were the last committers or PMC members elected?

 We just established the initial PPMC.

Signed-off-by:

 [ ](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning