Apache Logo
The Apache Way Contribute ASF Sponsors

Formal board meeting minutes from 2010 through present. Please Note: The board typically approves minutes from one meeting during the next board meeting, so minutes will be published roughly one month later than the scheduled date. Other corporate records are published, as is an alternate categorized view of all board meeting minutes.

2017 | 2016 | 2015 | 2014 | 2013 | 2012 | 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 | 1999 | Pre-organization meetings

SAMOA

15 Mar 2017

Report was filed, but display is awaiting the approval of the Board minutes.

21 Dec 2016

SAMOA provides a collection of distributed streaming algorithms for the
most common data mining and machine learning tasks such as
classification, clustering, and regression, as well as programming
abstractions to develop new algorithms that run on top of distributed
stream processing engines (DSPEs).  It features a pluggable architecture
that allows it to run on several DSPEs such as Apache Flink, Apache
Storm, Apache Apex and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the developer base
 2. Grow the user base
 3. Add some more ML techniques

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (September 2016 - November 2016):
 * @dev: 87 messages

 Jira issues backlog (September 2016 - November 2016):
 * Created:  0
 * Resolved: 1

 - We manage to resolve the pending issue regarding the integration of
   Apache Apex with Samoa.
 - Also, we made some testing progress with Apache Gearpump.
 - We also had a presentation of Apache Samoa and its integration with
   Apache Apex in Apache BigData Europe 2016
 - We also published a scientific paper using Apache Samoa and one of
   its machine learning techniques (Vertical Hoeffding Tree) (to be
   presented in IEEE BigData Conference in December 2016)

How has the project developed since the last report?

 - We have performed a new release with several bug fixes and new
   features (0.4.0)
 - We are looking into new ML techniques for development.

Date of last release:

 2016-09-30

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 Alan Gates:

   Very low activity on this project over the last few months, with
   last commit showing as Sept 22 and very few commits since last
   March.  A big need here beyond activity is to grow the community.
   There is a one individual who made several contributions last
   spring; I sent an email to the mailing list asking if it made sense
   to invite him to become a committer.

21 Sep 2016

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Perform another release
 2. Grow the developer base
 3. Grow the user base

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (June 2016 - August 2016):
 * @dev: 77 messages

 Jira issues backlog (June 2016 - August 2016):
 * Created:  1
 * Resolved: 0

 We have had interest from both the Apex and Gearpump communities.

How has the project developed since the last report?

 We have been preparing a second release of the project, which was delayed
 due to some minor issues in the RC and then the summer break for most
 developers.

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [ ](samoa) Ted Dunning

15 Jun 2016

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache Flink, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the community
 2. Elect new PMC members
 3. Release more often

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (March 2016 - May 2016):
 * @dev: 202 messages

 Jira issues backlog (March 2016 - May 2016):
 * Created:  3
 * Resolved: 9

 We have a number of new users showing up on the mailing list, which is
 encouraging.

How has the project developed since the last report?

 After the decision taken by the community to drop support for S4, we
 removed the support for S4. We fixed automatic build issues, we simplified
 input from HDFS, and we worked on adding support for Apache Gearpump.  Two
 talks were given to advertise Apache SAMOA, one at ApacheCon NA Big Data
 by Nicolas Kourtellis, and another one at the "J on the beach" conference
 by Albert Bifet.

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 Alan Gates:
   I sent a mail to the community poking on the long wait since
   the last release and whether they think there will be a release
   soon.  I also expressed my concern on not having added any
   committers and asked if there were any contributors who looked
   like they might be a good candidate for committership.

16 Mar 2016

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the community
 2. Elect new PMC members
 3. Release more often

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Mailing list activity (December 2015 - February 2016):
 * @dev: 114 messages

 Jira issues backlog (December 2015 - February 2016):
 * Created:  7
 * Resolved: 4

 We have a number of new users showing up on the mailing list, which is
 encouraging.

How has the project developed since the last report?

 The community has decided to drop support for S4, which is now inactive.
 There is also some continued interest by Apache Apex about a possible
 integration.  Internally, we have been working on strengthening our main
 classifier (VHT).  There has also been work on improving the connection
 with Avro and Kafka.  The adapted for Flink has been updated to the latest
 stable version (0.10).

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

16 Dec 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Flink, Apache Storm, Apache S4, and Apache
Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Grow the community
 2. Elect new PMC members
 3. Following up our first release with further releases

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None.

How has the community developed since the last report?

 Mailing list activity (September-October-November 2015):
 * @dev: 120 messages

 Jira issues backlog (September-October-November 2015):
 * Created:  8
 * Resolved: 6

 Our main goal is to grow the community, which is still pretty small.  We
 are doing a large amount of dissemination work in conferences and events
 to promote SAMOA.  For example, we had two talks at technical conferences:

 1) A talk at the Apache Conference BigData, in Budapest, Hungary, during
    the week of September 28-30, 2015.
 2) A talk at the Flink Forward, a conference for the Apache Flink DSPE, in
    Berlin, Germany, October 12-13, 2015

 We have had contributions from outside the PPMC. In particular, we have
 had contributions for:

 1) integrating Apache Avro input with Apache Samoa,
 2) a proposal for integrating Apache APEX as a new DSPE,
 3) continued the collaboration with the Apache Flink community.

How has the project developed since the last report?

 Main developments:

 * Performed development and testing of the VHT module.
 * Fixed various bugs and improved ensemble and bagging methods.
 * Worked on the integration of new data sources like Apache Avro.
 * Improved the website to include more material for new contributors.

Date of last release:

 2015-07-21

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [x](samoa) Ted Dunning

Shepherd/Mentor notes:

16 Sep 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Flink, Apache Storm, Apache S4, and Apache
Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

  1. Grow the community
  2. Elect new PMC members
  3. Following up our first release with further releases

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

  None

How has the community developed since the last report?

  Mailing list activity (June-July-August 2015):

  * @dev     225 messages

  Jira issues backlog (June-July-August 2015):

  * Created:  10
  * Resolved: 5

  Our main goal is still to grow the community, which is still pretty small.
  We are doing a large amount of dissemination work in conferences and
  events to promote SAMOA.

  We have had contributions from outside the PMC.  In particular, we have
  been collaborating with the Apache Flink community.

How has the project developed since the last report?

  Main developments:

  * First release 0.3.0 in July.
  * Cleanup of codebase to ease adoption by new contributors.

  We should increase the rate of technical contribution to the project and
  move to less incremental ones.

Date of last release:

  2015-07-21

When were the last committers or PMC members elected?

  None

Signed-off-by:

  [X](samoa) Alan Gates
  [ ](samoa) Ashutosh Chauhan
  [ ](samoa) Enis Soztutar
  [ ](samoa) Ted Dunning

--------------------
Sentry

Sentry is a highly modular system for providing fine grained role
based authorization to both data and metadata stored on an Apache Hadoop
cluster.

Sentry has been incubating since 2013-08-08.

Three most important issues to address in the move towards graduation:

 1. Encourage more feature and direction discussions on the dev list rather
    than jira.
 2. Continue reporting on time
 3. Continue making periodic releases following the Apache guidelines.

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

  Community has made been making good progress on all of the above mentioned
  items.

  1. There have been multiple discussions on the dev list about features/
     design decisions/ roadmap/ dev practices.
  2. Community has been doing timely monthly reporting.
  3. Community is actively working on bug fixes and working to release 1.6.0
     next month

How has the community developed since the last report?

  We had 160 messages on dev list last month. (Got number from
  http://markmail.org/list/org.apache.sentry.dev)

How has the project developed since the last report?

  About 42 issues were created and about 42 resolved(Numbers from jira).

Date of last release:

  2015-07-14

When were the last committers or PMC members elected?

 Colin Ma, Dapeng Sun, Guoquan Shen and Xiaomeng Huang were added as
 committers on 12/24/2014.  No new PPMC members have been added since the
 project has entered the incubator.

Signed-off-by:

  [X](sentry) Arvind Prabhakar
  [ ](sentry) Joe Brockmeier
  [X](sentry) David Nalley
  [ ](sentry) Olivier Lamy
  [X](sentry) Patrick Hunt
  [ ](sentry) Thomas White

17 Jun 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop new
algorithms that run on top of distributed stream processing engines (DSPEs).
It features a pluggable architecture that allows it to run on several DSPEs
such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Roll an Apache release
 2. Elect new PMC members
 3. Grow the community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

Two of our current PMC members (Olivier and Matthieu) have new jobs
and have declared that they will not be able to actively contribute to
the project for the moment being. This leaves 4 active PMC members.


How has the community developed since the last report?

 Mailing list activity (March-April-May 2015):
 * @dev     375 messages

 Jira issues backlog (March-April-May 2015):

 * Created:  14
 * Resolved: 19

We have had contributions from outside the PMC.
In particular, we have been collaborating with the Apache Flink community.

Our main goal is still to grow the community, which is still pretty
small.  To achieve this goal, we are doing a large amount of
dissemination work in conferences and events to promote SAMOA.


How has the project developed since the last report?

We are fully operational on the new infrastructure:
* Completed migration of issues from old GitHub repository.
* Completed migration of documentation from old wiki.

* Created new Confluence wiki with roadmap and contribution
 instruction:
 https://cwiki.apache.org/confluence/display/SAMOA/Samoa+Home

Main developments:
* New adaptor for Flink integrated.
* Simplified the website and solved several bugs.
* Preparing for our first release in June.

We should increase the rate of technical contribution to the project
and move to less incremental ones.  Codebase still need cleanup to
ease adoption by new contributors.


Date of last release:

None, plan to have one in June.

When were the last committers or PMC members elected?

None.


Signed-off-by:

 [ ](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [x](samoa) Ted Dunning

Shepherd/Mentor notes:

18 Mar 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Start committing patches
 2. Discuss roadmap next release
 3. Grow the community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 The community is growing, and as an example, there are new people working
 on an adapter for Apache Flink-Streaming. Also, we have been giving talks
 at University of Waikato, and University of Auckland to try to attract
 more people to the community.

 Mailing list activity (since February 2015):

 * @dev     106 messages

 Jira issues backlog (since February 2015):

 * Created:  6
 * Resolved: 4

How has the project developed since the last report?

 The project setup is going on nicely.  We created bylaws for the project
 and we migrated documentation from the old website.

Date of last release:

 None

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [X](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 P. Taylor Goetz (ptgoetz):

   The SAMOA poddling is still ramping up and there are no apparent issues
   requiring mentor guidance.

18 Feb 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15.

Three most important issues to address in the move towards graduation:

 1. Create bylaws for the project
 2. Migrate documentation from the old website
 3. Start committing patches

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 We have started using the official Apache channels to discuss about SAMOA.
 A couple of new people have showed up on the mailing lists.  We have one
 new contributor submitting patches.

 Mailing list activity (since January 2015):

 * @dev      101 messages

 Jira issues backlog (since January 2015):

 * Created:  13
 * Resolved: 2

How has the project developed since the last report?

 The project setup is going on nicely.  We got the SGA from Yahoo and
 migrated the code to the Apache git repository, enabled GitHub integration
 (https://github.com/apache/incubator-samoa), migrated the website to the
 Apache infrastructure and added the Incubator branding
 (http://samoa.incubator.apache.org), enabled testing via Travis CI.

Date of last release:

 None

When were the last committers or PMC members elected?

 None

Signed-off-by:

 [ ](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning

Shepherd/Mentor notes:

 Justin Mclean (jmclean):

   Just starting, mentors active.

21 Jan 2015

SAMOA provides a collection of distributed streaming algorithms for the most
common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs).  It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

SAMOA has been incubating since 2014-12-15 and is not fully functioning as a
project yet.

Three most important issues to address in the move towards graduation:

 1. Get IP clearance (SGA) from Yahoo
 2. Move the current code into ASF's git repository
 3. Start working as an Apache project

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 This is the first report.

How has the project developed since the last report?

 This is the first report.

Date of last release:

 No incubator release yet.

When were the last committers or PMC members elected?

 We just established the initial PPMC.

Signed-off-by:

 [ ](samoa) Alan Gates
 [ ](samoa) Ashutosh Chauhan
 [ ](samoa) Enis Soztutar
 [X](samoa) Ted Dunning