
Formal board meeting minutes from 2010 through present. Please Note: The board typically approves minutes from one meeting during the next board meeting, so minutes will be published roughly one month later than the scheduled date. Other corporate records are published, as is an alternate categorized view of all board meeting minutes.


DataFu

27 Feb 2017

DataFu provides a collection of Hadoop MapReduce jobs and functions in higher
level languages based on it to perform data analysis. It provides functions
for common statistics tasks (e.g. quantiles, sampling), PageRank, stream
sessionization, and set and bag operations. DataFu also provides Hadoop jobs
for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Resolve NOTICE and LICENSE issues for binary distributions
 2. Continued releases
 3. Increased committer activity

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 Received a patch from a new contributor.

How has the project developed since the last report?

 No updates

Date of last release:

 2016-08-10

When were the last committers or PPMC members elected?

 July 2016 (Eyal Allweil)

Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [X](datafu) Roman Shaposhnik
 [ ](datafu) Ted Dunning

Shepherd/Mentor notes:

 Roman Shaposhnik:

   I really think we need to do one final push and either graduate or retire.

16 Nov 2016

DataFu provides a collection of Hadoop MapReduce jobs and functions in higher
level languages based on it to perform data analysis. It provides functions
for common statistics tasks (e.g. quantiles, sampling), PageRank, stream
sessionization, and set and bag operations. DataFu also provides Hadoop jobs
for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Resolve NOTICE and LICENSE issues for binary distributions
 2. Continued releases
 3. Increased committer activity

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 * No updates

How has the project developed since the last report?

 * Released 1.3.1.  Now using ASF-associated signing key.  Feedback from
   previous release addressed.
 * Website updated alongside 1.3.1 release.
 * Cleaned up release instructions.

Date of last release:

 2016-08-10

When were the last committers or PMC members elected?

 July 2016 (Eyal Allweil)

Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [X](datafu) Roman Shaposhnik
 [ ](datafu) Ted Dunning

Shepherd/Mentor notes:

 Roman Shaposhnik:

   Pushing this community towards graduation is pretty high on my TODO list.
   I think they are as ready as they are ever going to be.

19 Oct 2016

 johndament:

   Discussions on this podling seem to have stopped completely.  There was a
   graduation discussion back in August, which seems to have dropped
   completely after some release content issues were identified.

20 Jul 2016

DataFu provides a collection of Hadoop MapReduce jobs and functions in
higher level languages based on it to perform data analysis. It provides
functions for common statistics tasks (e.g. quantiles, sampling), PageRank,
stream sessionization, and set and bag operations. DataFu also provides
Hadoop jobs for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Grow user and contributor base
 2. Increased committer activity
 3. Continued releases

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 * Eyal Allweil was voted in as the newest committer and member of the
   PPMC.

How has the project developed since the last report?

 * A new UDF provided by Eyal was committed and another was submitted for
   review.
 * ASF-associated signing key committed in prep for next release,
   addressing feedback from previous release.

Date of last release:

 2015-11-14

When were the last committers or PMC members elected?

 July 2016 (Eyal Allweil)

Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [x](datafu) Roman Shaposhnik
 [x](datafu) Ted Dunning

20 Apr 2016

DataFu provides a collection of Hadoop MapReduce jobs and functions in higher
level languages based on it to perform data analysis. It provides functions
for common statistics tasks (e.g. quantiles, sampling), PageRank, stream
sessionization, and set and bag operations. DataFu also provides Hadoop jobs
for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Grow user and contributor base
 2. Increased committer activity
 3. Continued releases

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None

How has the community developed since the last report?

 * A new contributor opened several JIRAs regarding improvements and
   contributed patches.  Two have been committed so far.

How has the project developed since the last report?

 * Improved instructions on loading projects in Eclipse based on discussion
   in JIRA.
 * Added checks in build system to catch issues using wrong JDK version.
 * Some UDFs were improved to be more efficient.
 * A new UDF is pending review.

Date of last release:

 2015-11-14

When were the last committers or PMC members elected?

 November 2014

Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [X](datafu) Roman Shaposhnik
 [X](datafu) Ted Dunning

20 Jan 2016

DataFu provides a collection of Hadoop MapReduce jobs and functions in
higher level languages based on it to perform data analysis. It provides
functions for common statistics tasks (e.g. quantiles, sampling), PageRank,
stream sessionization, and set and bag operations. DataFu also provides
Hadoop jobs for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Grow user and contributor base
 2. Increased committer activity
 3. Continued releases

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 * None

How has the community developed since the last report?

 * No new activity in the community since the last report.

How has the project developed since the last report?

 * Apache DataFu 1.3.0 source release completed, which is the first release
   since entering the Incubator.  DataFu 1.3.0 was also released to Maven.
 * Website (http://datafu.incubator.apache.org/) has been updated with
   instructions on how to use the source release or artifacts from Maven.

Date of last release:

 * 2015-11-14

When were the last committers or PMC members elected?



Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [X](datafu) Roman Shaposhnik
 [x](datafu) Ted Dunning

Shepherd/Mentor notes:

 Roman Shaposhnik (rvs):

   The community appears to be in the final stretch before graduation;
   hopefully there's enough critical mass for it to happen.

18 Nov 2015

DataFu provides a collection of Hadoop MapReduce jobs and functions in
higher level languages based on it to perform data analysis. It provides
functions for common statistics tasks (e.g. quantiles, sampling), PageRank,
stream sessionization, and set and bag operations. DataFu also provides
Hadoop jobs for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Do first release
 2. Grow user and contributor base
 3. Increased committer activity

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 * Performing the initial release remains the most important milestone.

How has the community developed since the last report?

 * No new activity in the community since the last report.

How has the project developed since the last report?

 * The website documentation (http://datafu.incubator.apache.org/) has been
   updated and brought up to date with the current state of the project and
   build system, making it easier for newcomers to get started.  This was
   the last major task blocking release.
 * All the release tasks filed for our first release have now been
   completed.  A discussion has been opened in the dev mailing list on the
   topic of doing our first release.  A vote will likely be held in the
   next few days.

Date of last release:

 * Not yet released.  First release will likely happen within the coming
   weeks.

When were the last committers or PMC members elected?

 * November 2014

Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [X](datafu) Roman Shaposhnik
 [X](datafu) Ted Dunning

Shepherd/Mentor notes:

21 Jan 2015

DataFu provides a collection of Hadoop MapReduce jobs and functions in higher
level languages based on it to perform data analysis. It provides functions
for common statistics tasks (e.g. quantiles, sampling), PageRank, stream
sessionization, and set and bag operations. DataFu also provides Hadoop jobs
for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Grow user and contributor base.
 2. Make first release.
 3. Increase activity for initial committers.

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 1. Has not yet made a release, but is in the process of preparing its first.
 2. Need to dramatically grow the contributor base.

How has the community developed since the last report?

 New committer and PMC member.  Several JIRAs filed by new users.

How has the project developed since the last report?

 1. 16 issues created, several from new contributors.
 2. 8 issues closed.
 3. Reasonable amount of mailing list traffic.

Date of last release:

 None yet. Currently preparing release: DATAFU-53.

When were the last committers or PMC members elected?

 Nov 2014, Russell Jurney, both committer and PPMC.

Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [X](datafu) Roman Shaposhnik
 [X](datafu) Ted Dunning

15 Oct 2014

DataFu provides a collection of Hadoop MapReduce jobs and functions in higher
level languages based on it to perform data analysis. It provides functions
for common statistics tasks (e.g. quantiles, sampling), PageRank, stream
sessionization, and set and bag operations. DataFu also provides Hadoop jobs
for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Building an ASF-based community.
 2. Release.
 3. Adding support for Hadoop 2.x

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None.

How has the community developed since the last report?

 Three new users have contributed code since the last report.

How has the project developed since the last report?

 A couple more UDFs have been committed.  One bug fix was committed.  All
 JARs have been removed from the repo (a blocker for source release).  A
 build task has been added for creating a source release.  No open blockers
 for release left at this point.  Several more UDFs have been contributed but
 are still under review.

Date of last release:

 No release yet.

When were the last committers or PMC members elected?

 2014-02-22

Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [X](datafu) Roman Shaposhnik
 [ ](datafu) Ted Dunning

Shepherd/Mentor notes:

(jmclean): Did not report on time. Low level of mentor activity, but no
obvious issues other than the missing release. (A release was mentioned in
the last report, and DATAFU-53, which was blocking it, has been resolved.)

16 Jul 2014

DataFu provides a collection of Hadoop MapReduce jobs and functions in
higher level languages based on it to perform data analysis. It provides
functions for common statistics tasks (e.g. quantiles, sampling), PageRank,
stream sessionization, and set and bag operations. DataFu also provides
Hadoop jobs for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

  1. Building an ASF-based community.
  2. Release.
  3. Decide on the future home of the project.

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

  None.

How has the community developed since the last report?

  Will Vaughan gave a talk on DataFu at ApacheCon in April, and
  Casey Stella gave a talk on Pig and DataFu at the Hadoop Summit in
  June.

How has the project developed since the last report?

 Lots of JIRAs on bug fixes and new features, especially in April and May.
 Work slowed significantly in June, which probably means it's time for a
 release to mark our progress thus far.

Date of last release:

  None. Six months of incubation.

When were the last committers or PMC members elected?

  2014-02-22

Signed-off-by:

  [ ](datafu) Ashutosh Chauhan
  [X](datafu) Roman Shaposhnik
  [ ](datafu) Ted Dunning

Shepherd/Mentor notes:

(jmclean): Mentor active, no obvious issues.

16 Apr 2014

DataFu provides a collection of Hadoop MapReduce jobs and functions in
higher level languages based on it to perform data analysis. It provides
functions for common statistics tasks (e.g. quantiles, sampling), PageRank,
stream sessionization, and set and bag operations. DataFu also provides
Hadoop jobs for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Building ASF community
 2. Release
 3. Remaining incubator paperwork

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None.

How has the community developed since the last report?

 A talk was given at an Apache Pig meetup held on March 14th.  A talk
 is scheduled to be given at ApacheCon in Denver on April 7th.  Jian
 Wang accepted the invitation to become a committer.

How has the project developed since the last report?

 Two new JIRAs have been filed and received patches.

Date of last release:

 None. Third month of incubation.

When were the last committers or PMC members elected?

 2014-02-22

Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [X](datafu) Roman Shaposhnik
 [ ](datafu) Ted Dunning

Shepherd/Mentor notes:

 Justin Mclean (jmclean):

   Relatively new podling, yet to make a release. One mentor is active on
   the public mailing list; no obvious issues that need attention.

19 Mar 2014

DataFu provides a collection of Hadoop MapReduce jobs and functions in
higher level languages based on it to perform data analysis. It provides
functions for common statistics tasks (e.g. quantiles, sampling), PageRank,
stream sessionization, and set and bag operations. DataFu also provides
Hadoop jobs for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Building ASF community
 2. Release
 3. Remaining incubator paperwork

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None.

How has the community developed since the last report?

 More contributions have been received from Jian Wang, who has also
 been voted in as the newest committer and PPMC member.  A talk
 is planned at the Apache Pig meetup to be held on March 14th.

How has the project developed since the last report?

 Three JIRAs have been opened, four have been closed.  The project has
 migrated from Ant to the Gradle build system, which will make it easier
 to add libraries for Hive, Crunch, etc.

Date of last release:

 None. Second month of incubation.

When were the last committers or PMC members elected?

 2014-02-22

Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [X](datafu) Roman Shaposhnik
 [x](datafu) Ted Dunning

19 Feb 2014

DataFu provides a collection of Hadoop MapReduce jobs and functions in
higher level languages based on it to perform data analysis. It provides
functions for common statistics tasks (e.g. quantiles, sampling), PageRank,
stream sessionization, and set and bag operations. DataFu also provides
Hadoop jobs for incremental data processing in MapReduce.

DataFu has been incubating since 2014-01-05.

Three most important issues to address in the move towards graduation:

 1. Building ASF community
 2. Release
 3. Remaining incubator paperwork

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None.

How has the community developed since the last report?

 Since initial incubation, have received contributions from two new
 contributors.

How has the project developed since the last report?

 First report.  Have obtained all the necessary infra (git, JIRA, wiki, etc.).
 Thirty JIRAs have been opened, 14 have been closed.  Active discussion on
 mailing list as to community development, etc.

Date of last release:

 None. First month of incubation.

When were the last committers or PMC members elected?

 None. First month of incubation.

Signed-off-by:

 [ ](datafu) Ashutosh Chauhan
 [X](datafu) Roman Shaposhnik
 [ ](datafu) Ted Dunning

Shepherd/Mentor notes:

 Dave Fisher (wave):

   New community to the incubator just getting started. Good guidance from
   Mentors. Needs Apache trademark attribution on site. Should have links
   to Mailing lists on the site.