Apache Logo
The Apache Way Contribute ASF Sponsors

This was extracted (@ 2017-05-22 18:10) from a list of minutes which have been approved by the Board.
Please Note The Board typically approves the minutes of the previous meeting at the beginning of every Board meeting; therefore, the list below does not normally contain details from the minutes of the most recent Board meeting.

2017 | 2016 | 2015 | 2014 | 2013 | 2012 | 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 | 1999 | Pre-organization meetings

SAMOA

15 Mar 2017

SAMOA provides a collection of distributed streaming algorithms for the
most common data mining and machine learning tasks such as
classification, clustering, and regression, as well as programming
abstractions to develop new algorithms that run on top of distributed
stream processing engines (DSPEs).  It features a pluggable architecture
that allows it to run on several DSPEs such as Apache Flink, Apache
Storm, Apache Apex and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the developer base
 2. Grow the user base
 3. Add some more ML techniques

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (December 2016 - February 2017):
 * @dev: 24 messages

 Jira issues backlog (December 2016 - February 2017):
 * Created:  2
 * Resolved: 0

 - We are organizing an internal workshop on SAMOA at Telefonica
   I+D with researchers of Orange Labs

 - We had a presentation of Apache Samoa at Paris Machine Learning
   Meetup and Hamburg Machine Learning Meetup

 - Bhupesh Chawda has presented Apache SAMOA in different venues
   with his presentation "Machine Learning Support in Apache Apex
   (Next Gen Hadoop) with Apache SAMOA"

 - We invited edi_bice who made several contributions last
   spring to become a committer.

 - We published a scientific paper using Apache Samoa and one of
   its machine learning techniques (Vertical Hoeffding Tree)
   presented in IEEE BigData Conference last December 2016.

 - We submitted a paper on SAMOA to a post-proceedings book of the
   workshops MUSE/MSM 2015/2016 in the Springer LNCS/LNAI series.

How has the project developed since the last report?

 - We are looking into new ML techniques for development.

How would you assess the podling's maturity?
Please feel free to add your own commentary.

 [ ] Initial setup
 [ ] Working towards first release
 [X] Community building
 [ ] Nearing graduation
 [ ] Other:

Date of last release:

 2016-09-30

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
    Comments:

      Activity on this podling remains low.  It is good to see a new
      committer being elected.  Shouldn't that be noted in the final section
      on when new committers and PMC members were elected?

 [ ](samoa) Ashutosh Chauhan
    Comments:
 [ ](samoa) Enis Soztutar
    Comments:
 [ ](samoa) Ted Dunning
    Comments:

--------------------
Singa

Singa is a distributed deep learning platform.

Singa has been incubating since 2015-03-17.

Three most important issues to address in the move towards graduation:

 1. Improve distributed training in SINGA V1.2
 2. Improve the documentation and add more examples
 3. Attract more contributors

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of?

 No

How has the community developed since the last report?

 There were  66, 60, and 43 emails from dev@ list in December 2016, January
 2017 and February 2017 respectively.

 There are 67 new commits since the last report.

 One new committer (Li Boon Tan) was added.

How has the project developed since the last report?

 We released the V1.1 version after the last report.  The following features
 were added after last report

 + Ease the installation process via Docker images, debian packages, conda
   packages and Amazon AMI  (CPU version)
 + Integrate with Jenkins for automatically generating convenient packages
   and updating the website.
 + Improve the model classes: adding debug mode, adding the Concat and Slice
   layers, and supporting model loading and saving via the Snapshot API
 + Add image_tool.py for image augmentation and rafiki sub-package for
   providing RESTFul APIs.
 + Enable Java binding (basic) for SINGA
 + Add examples pre-trained from Caffe, e.g., GoogleNet, and examples
   pre-trained from torch, e.g. ResNet

How would you assess the podling's maturity?
Please feel free to add your own commentary.

  [ ] Initial setup
  [ ] Working towards first release
  [X] Community building
  [ ] Nearing graduation
  [ ] Other:

Date of last release:

 2017-02-12

When were the last committers or PMC members elected?

 2017-02-26

Signed-off-by:

   [ ](singa) Daniel Dai
     Comments:
   [X](singa) Alan Gates
     Comments:

       On the maturity assessment I would say the community is in the
       "Community building" phase but not far from the "Nearing graduation"
       phase.

   [ ](singa) Ted Dunning
     Comments:
   [ ](singa) Thejas Nair
     Comments:

21 Dec 2016

SAMOA provides a collection of distributed streaming algorithms for the
most common data mining and machine learning tasks such as
classification, clustering, and regression, as well as programming
abstractions to develop new algorithms that run on top of distributed
stream processing engines (DSPEs).  It features a pluggable architecture
that allows it to run on several DSPEs such as Apache Flink, Apache
Storm, Apache Apex and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the developer base
 2. Grow the user base
 3. Add some more ML techniques

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (September 2016 - November 2016):
 * @dev: 87 messages

 Jira issues backlog (September 2016 - November 2016):
 * Created:  0
 * Resolved: 1

 - We manage to resolve the pending issue regarding the integration of
   Apache Apex with Samoa.
 - Also, we made some testing progress with Apache Gearpump.
 - We also had a presentation of Apache Samoa and its integration with
   Apache Apex in Apache BigData Europe 2016
 - We also published a scientific paper using Apache Samoa and one of
   its machine learning techniques (Vertical Hoeffding Tree) (to be
   presented in IEEE BigData Conference in December 2016)

How has the project developed since the last report?

 - We have performed a new release with several bug fixes and new
   features (0.4.0)
 - We are looking into new ML techniques for development.

Date of last release:

 2016-09-30

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 Alan Gates:

   Very low activity on this project over the last few months, with
   last commit showing as Sept 22 and very few commits since last
   March.  A big need here beyond activity is to grow the community.
   There is a one individual who made several contributions last
   spring; I sent an email to the mailing list asking if it made sense
   to invite him to become a committer.

21 Sep 2016

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Perform another release
 2. Grow the developer base
 3. Grow the user base

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (June 2016 - August 2016):
 * @dev: 77 messages

 Jira issues backlog (June 2016 - August 2016):
 * Created:  1
 * Resolved: 0

 We have had interest from both the Apex and Gearpump communities.

How has the project developed since the last report?

 We have been preparing a second release of the project, which was delayed
 due to some minor issues in the RC and then the summer break for most
 developers.

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [ ](samoa) Ted Dunning

15 Jun 2016

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache Flink, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the community
 2. Elect new PMC members
 3. Release more often

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (March 2016 - May 2016):
 * @dev: 202 messages

 Jira issues backlog (March 2016 - May 2016):
 * Created:  3
 * Resolved: 9

 We have a number of new users showing up on the mailing list, which is
 encouraging.

How has the project developed since the last report?

 After the decision taken by the community to drop support for S4, we
 removed the support for S4. We fixed automatic build issues, we simplified
 input from HDFS, and we worked on adding support for Apache Gearpump.  Two
 talks were given to advertise Apache SAMOA, one at ApacheCon NA Big Data
 by Nicolas Kourtellis, and another one at the "J on the beach" conference
 by Albert Bifet.

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 Alan Gates:
   I sent a mail to the community poking on the long wait since
   the last release and whether they think there will be a release
   soon.  I also expressed my concern on not having added any
   committers and asked if there were any contributors who looked
   like they might be a good candidate for committership.

16 Mar 2016

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the community
 2. Elect new PMC members
 3. Release more often

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (December 2015 - February 2016):
 * @dev: 114 messages

 Jira issues backlog (December 2015 - February 2016):
 * Created:  7
 * Resolved: 4

 We have a number of new users showing up on the mailing list, which is
 encouraging.

How has the project developed since the last report?

 The community has decided to drop support for S4, which is now inactive.
 There is also some continued interest by Apache Apex about a possible
 integration.  Internally, we have been working on strengthening our main
 classifier (VHT).  There has also been work on improving the connection
 with Avro and Kafka.  The adapted for Flink has been updated to the latest
 stable version (0.10).

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

16 Dec 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Flink, Apache Storm, Apache S4, and Apache
Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the community
 2. Elect new PMC members
 3. Following up our first release with further releases

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None.

How has the community developed since the last report?

 Mailing list activity (September-October-November 2015):
 * @dev: 120 messages

 Jira issues backlog (September-October-November 2015):
 * Created:  8
 * Resolved: 6

 Our main goal is to grow the community, which is still pretty small.  We
 are doing a large amount of dissemination work in conferences and events
 to promote SAMOA.  For example, we had two talks at technical conferences:

 1) A talk at the Apache Conference BigData, in Budapest, Hungary, during
    the week of September 28-30, 2015.
 2) A talk at the Flink Forward, a conference for the Apache Flink DSPE, in
    Berlin, Germany, October 12-13, 2015

 We have had contributions from outside the PPMC. In particular, we have
 had contributions for:

 1) integrating Apache Avro input with Apache Samoa,
 2) a proposal for integrating Apache APEX as a new DSPE,
 3) continued the collaboration with the Apache Flink community.

How has the project developed since the last report?

 Main developments:

 * Performed development and testing of the VHT module.
 * Fixed various bugs and improved ensemble and bagging methods.
 * Worked on the integration of new data sources like Apache Avro.
 * Improved the website to include more material for new contributors.

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [x](samoa) Ted Dunning

Shepherd/Mentor notes:

16 Sep 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Flink, Apache Storm, Apache S4, and Apache
Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

  1. Grow the community
  2. Elect new PMC members
  3. Following up our first release with further releases

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

  None

How has the community developed since the last report?

  Mailing list activity (June-July-August 2015):

  * @dev     225 messages

  Jira issues backlog (June-July-August 2015):

  * Created:  10
  * Resolved: 5

  Our main goal is still to grow the community, which is still pretty small.
  We are doing a large amount of dissemination work in conferences and
  events to promote SAMOA.

  We have had contributions from outside the PMC.  In particular, we have
  been collaborating with the Apache Flink community.

How has the project developed since the last report?

  Main developments:

  * First release 0.3.0 in July.
  * Cleanup of codebase to ease adoption by new contributors.

  We should increase the rate of technical contribution to the project and
  move to less incremental ones.

Date of last release:

  2015-07-21

When were the last committers or PMC members elected?

  None

Signed-off-by:

  [X](samoa) Alan Gates
  [ ](samoa) Ashutosh Chauhan
  [ ](samoa) Enis Soztutar
  [ ](samoa) Ted Dunning

--------------------
Sentry

Sentry is a highly modular system for providing fine grained role
based authorization to both data and metadata stored on an Apache Hadoop
cluster.

Sentry has been incubating since 2013-08-08.

Three most important issues to address in the move towards graduation:

 1. Encourage more feature and direction discussions on the dev list rather
    than jira.
 2. Continue reporting on time
 3. Continue making periodic releases following the Apache guidelines.

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

  Community has made been making good progress on all of the above mentioned
  items.

  1. There have been multiple discussions on the dev list about features/
     design decisions/ roadmap/ dev practices.
  2. Community has been doing timely monthly reporting.
  3. Community is actively working on bug fixes and working to release 1.6.0
     next month

How has the community developed since the last report?

  We had 160 messages on dev list last month. (Got number from
  http://markmail.org/list/org.apache.sentry.dev)

How has the project developed since the last report?

  About 42 issues were created and about 42 resolved(Numbers from jira).

Date of last release:

  2015-07-14

When were the last committers or PMC members elected?

 Colin Ma, Dapeng Sun, Guoquan Shen and Xiaomeng Huang were added as
 committers on 12/24/2014.  No new PPMC members have been added since the
 project has entered the incubator.

Signed-off-by:

  [X](sentry) Arvind Prabhakar
  [ ](sentry) Joe Brockmeier
  [X](sentry) David Nalley
  [ ](sentry) Olivier Lamy
  [X](sentry) Patrick Hunt
  [ ](sentry) Thomas White

17 Jun 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop new
algorithms that run on top of distributed stream processing engines (DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Roll an Apache release
 2. Elect new PMC members
 3. Grow the community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

Two of our current PMC members (Olivier and Matthieu) have new jobs
and have declared that they will not be able to actively contribute to
the project for the moment being. This leaves 4 active PMC members.


How has the community developed since the last report?

 Mailing list activity (March-April-May 2015):
 * @dev     375 messages

 Jira issues backlog (March-April-May 2015):

 * Created:  14
 * Resolved: 19

We have had contributions from outside the PMC.
In particular, we have been collaborating with the Apache Flink community.

Our main goal is still to grow the community, which is still pretty
small.  To achieve this goal, we are doing a large amount of
dissemination work in conferences and events to promote SAMOA.


How has the project developed since the last report?

We are fully operational on the new infrastructure:
* Completed migration of issues from old GitHub repository.
* Completed migration of documentation from old wiki.

* Created new Confluence wiki with roadmap and contribution
 instruction:
 https://cwiki.apache.org/confluence/display/SAMOA/Samoa+Home

Main developments:
* New adaptor for Flink integrated.
* Simplified the website and solved several bugs.
* Preparing for our first release in June.

We should increase the rate of technical contribution to the project
and move to less incremental ones.  Codebase still need cleanup to
ease adoption by new contributors.


Date of last release:

None, plan to have one in June.

When were the last committers or PMC members elected?

None.


Signed-off-by:

 [ ](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [x](samoa) Ted Dunning

Shepherd/Mentor notes:

18 Mar 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Start committing patches
 2. Discuss roadmap next release
 3. Grow the community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 The community is growing, and as an example, there are new people working
 on an adapter for Apache Flink-Streaming. Also, we have been giving talks
 at University of Waikato, and University of Auckland to try to attract
 more people to the community.

 Mailing list activity (since February 2015):

 * @dev     106 messages

 Jira issues backlog (since February 2015):

 * Created:  6
 * Resolved: 4

How has the project developed since the last report?

 The project setup is going on nicely.  We created bylaws for the project
 and we migrated documentation from the old website.

Date of last release:

 None

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 P. Taylor Goetz (ptgoetz):

   The SAMOA poddling is still ramping up and there are no apparent issues
   requiring mentor guidance.

18 Feb 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Create bylaws for the project
 2. Migrate documentation from the old website
 3. Start committing patches

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 We have started using the official Apache channels to discuss about SAMOA.
 A couple of new people have showed up on the mailing lists.  We have one
 new contributor submitting patches.

 Mailing list activity (since January 2015):

 * @dev      101 messages

 Jira issues backlog (since January 2015):

 * Created:  13
 * Resolved: 2

How has the project developed since the last report?

 The project setup is going on nicely.  We got the SGA from Yahoo and
 migrated the code to the Apache git repository, enabled GitHub integration
 (https://github.com/apache/incubator-samoa), migrated the website to the
 Apache infrastructure and added the Incubator branding
 (http://samoa.incubator.apache.org), enabled testing via Travis CI.

Date of last release:

 None

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [ ](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 Justin Mclean (jmclean):

   Just starting, mentors active.

21 Jan 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15 and is not fully functioning as a
project yet.

Three most important issues to address in the move towards graduation:

 1. Get IP clearance (SGA) from Yahoo
 2. Move the current code into ASF's git repository
 3. Start working as an Apache project

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 This is the first report.

How has the project developed since the last report?

 This is the first report.

Date of last release:

 No incubator release yet.

When were the last committers or PMC members elected?

 We just established the initial PPMC.

Signed-off-by:

 [ ](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning