This was extracted (@ 2017-08-16 19:10) from a list of minutes which have been approved by the Board.
Please note: the Board typically approves the minutes of the previous meeting at the beginning of every Board meeting; therefore, the list below does not normally contain details from the minutes of the most recent Board meeting.

Pig

19 Jul 2017 [Daniel Dai / Ted]

Apache Pig is a platform for analyzing large data sets on Hadoop.  It
provides a high-level language for expressing data analysis programs,
coupled with infrastructure for evaluating these programs.

## Issues:
- There are no issues requiring board attention at this time

## Activity:
- Pig 0.17.0, which adds support for running Pig on Spark, was released
- Voted Koji Noguchi as new PMC chair
- Rohini gave a talk “Yahoo - Moving beyond running 100% of
 Apache Pig jobs on Apache Tez” at DataWorks Summit (San Jose, CA)

## Health report:
- Project activity in the community (mails, commits, JIRA tickets, etc.) is good
and is at the same level as in the previous few quarters.

## PMC changes:

- Currently 17 PMC members.
- No new PMC members added in the last 3 months
- Last PMC addition was Koji Noguchi on Thu Aug 04 2016

## Committer base changes:

- Currently 30 committers.
- Ádám Szita was added as a committer on Fri May 19 2017

## Releases:

- 0.17.0 was released on Thu Jun 15 2017

## Mailing list activity:

- dev@pig.apache.org:
    - 400 subscribers (down -1 in the last 3 months):
    - 1167 emails sent to list (1361 in previous quarter)

- user@pig.apache.org:
    - 1139 subscribers (down -8 in the last 3 months):
    - 24 emails sent to list (18 in previous quarter)

## JIRA activity:

- 53 JIRA tickets created in the last 3 months
- 68 JIRA tickets closed/resolved in the last 3 months

21 Jun 2017 [Daniel Dai / Chris]

No report was submitted.

15 Mar 2017 [Daniel Dai / Shane]

Apache Pig is a platform for analyzing large data sets on Hadoop.  It provides
a high-level language for expressing data analysis programs, coupled with
infrastructure for evaluating these programs.

## Issues:

- There are no issues requiring board attention at this time

## Activity:

- We are weeks away from merging Pig on Spark to trunk
- Preparing Pig 0.16.1 release

## Health report:

- Project activity in the community (mails, commits, JIRA tickets, etc.) is good
 and is at the same level as in the previous few quarters.

## PMC changes:

- Currently 17 PMC members.
- No new PMC members added in the last 3 months
- Last PMC addition was Koji Noguchi on Thu Aug 04 2016

## Committer base changes:

- Currently 29 committers.
- Liyun Zhang was added as a committer on Thu Dec 15 2016
- Jeff Zhang is reinstated from emeritus to active committer

## Releases:

- Last release was 0.16.0 on Tue Jun 07 2016

## Mailing list activity:

- dev@pig.apache.org:
 - 402 subscribers (down -1 in the last 3 months):
 - 1252 emails sent to list (1118 in previous quarter)

- user@pig.apache.org:
 - 1151 subscribers (down -6 in the last 3 months):
 - 37 emails sent to list (36 in previous quarter)

## JIRA activity:

- 104 JIRA tickets created in the last 3 months
- 72 JIRA tickets closed/resolved in the last 3 months

21 Dec 2016 [Daniel Dai / Bertrand]

## Description:

 Apache Pig is a platform for analyzing large data sets on Hadoop.  It
 provides a high-level language for expressing data analysis programs,
 coupled with infrastructure for evaluating these programs.

## Issues:

- There are no issues requiring board attention at this time

## Activity:

- Pig on Spark development is still under way in the spark branch; the merge
 to trunk is near and the target remains 0.17.0
- Preparing Pig 0.16.1 release

## Health report:

- Project activity in the community (mails, commits, JIRA tickets, etc.) is good
 and is at the same level as in the previous few quarters.

## PMC changes:

- Currently 17 PMC members.
- No new PMC members added in the last 3 months
- Last PMC addition was Koji Noguchi on Thu Aug 04 2016

## Committer base changes:

- Currently 28 committers.
- Liyun Zhang was added on Thu Dec 15 2016

## Releases:

- Last release was 0.16.0 on Tue Jun 07 2016

## Mailing list activity:

- dev@pig.apache.org:
  - 403 subscribers (up 2 in the last 3 months):
  - 1122 emails sent to list (876 in previous quarter)

- user@pig.apache.org:
  - 1157 subscribers (down -10 in the last 3 months):
  - 36 emails sent to list (41 in previous quarter)

## JIRA activity:

- 44 JIRA tickets created in the last 3 months
- 51 JIRA tickets closed/resolved in the last 3 months

21 Sep 2016 [Daniel Dai / Greg]

Apache Pig is a platform for analyzing large data sets on Hadoop. It provides
a high-level language for expressing data analysis programs, coupled with
infrastructure for evaluating these programs.

## Issues:

- There are no issues requiring board attention at this time

## Activity:

- Rohini Palaniswamy & Jon Eagles gave the talk "Yahoo’s Experience Running
 Pig on Tez at Scale" at Hadoop Summit San Jose
- Co-hosted an Apache Hive & Apache Pig birds-of-a-feather session during
 Hadoop Summit San Jose on 6/30/2016
- Pig on Spark development is still under way in the spark branch; the merge
 to trunk is near and is targeted for 0.17.0

## Health report:

- Project activity in the community (mails, commits, JIRA tickets, etc.) is good
 and is at the same level as in the previous few quarters.

## PMC changes:

- Currently 17 PMC members.
- Koji Noguchi was added to the PMC on Thu Aug 04 2016

## Committer base changes:

- Currently 28 committers.
- No new committers added in the last 3 months
- Last committer addition was Praveen Rachabattuni on Mon Sep 08 2014

## Releases:

- Last release was 0.16.0 on Wed Jun 08 2016

## Mailing list activity:

- dev@pig.apache.org:
  - 404 subscribers (down -7 in the last 3 months):
  - 956 emails sent to list (1184 in previous quarter)

- user@pig.apache.org:
  - 1167 subscribers (down -8 in the last 3 months):
  - 43 emails sent to list (56 in previous quarter)

## JIRA activity:

- 58 JIRA tickets created in the last 3 months
- 54 JIRA tickets closed/resolved in the last 3 months

@Marvin: give feedback to PMC regarding bringing on new committers

15 Jun 2016 [Daniel Dai / Mark]

Apache Pig is a platform for analyzing large data sets on Hadoop.  It provides
a high-level language for expressing data analysis programs, coupled with
infrastructure for evaluating these programs.

## Issues:
- There are no issues requiring board attention at this time

## Activity:
- Pig 0.16.0 is released
- Pig on Spark development is still under way in the spark branch; the merge
to trunk is near and is targeted for 0.17.0

## Health report:
- Project activity in the community (mails, commits, JIRA tickets, etc.) is good
and is at the same level as in the previous few quarters.

## PMC changes:

- Currently 16 PMC members.
- No new PMC members added in the last 3 months
- Last PMC addition was Xuefu Zhang on Tue Feb 23 2016

## Committer base changes:

- Currently 28 committers.
- No new committers added in the last 3 months
- Last committer addition was Praveen Rachabattuni on Mon Sep 08 2014

## Releases:

- 0.16.0 was released on Wed Jun 08 2016

## Mailing list activity:

- dev@pig.apache.org:
  - 411 subscribers (down -10 in the last 3 months):
  - 1148 emails sent to list (891 in previous quarter)

- user@pig.apache.org:
  - 1175 subscribers (up 1 in the last 3 months):
  - 53 emails sent to list (103 in previous quarter)


## JIRA activity:

- 83 JIRA tickets created in the last 3 months
- 74 JIRA tickets closed/resolved in the last 3 months

16 Mar 2016

Change the Apache Pig Project Chair

 WHEREAS, the Board of Directors heretofore appointed Rohini Palaniswamy
 (rohini) to the office of Vice President, Apache Pig, and

 WHEREAS, the Board of Directors is in receipt of the resignation of
 Rohini Palaniswamy from the office of Vice President, Apache Pig, and

 WHEREAS, the Project Management Committee of the Apache Pig project has
 chosen by vote to recommend Daniel Dai (daijy) as the successor to the
 post;

 NOW, THEREFORE, BE IT RESOLVED, that Rohini Palaniswamy is relieved and
 discharged from the duties and responsibilities of the office of Vice
 President, Apache Pig, and

 BE IT FURTHER RESOLVED, that Daniel Dai be and hereby is appointed to
 the office of Vice President, Apache Pig, to serve in accordance with
 and subject to the direction of the Board of Directors and the Bylaws
 of the Foundation until death, resignation, retirement, removal or
 disqualification, or until a successor is appointed.

 Special Order 7A, Change the Apache Pig Project Chair, was
 approved by Unanimous Vote of the directors present.

16 Mar 2016 [Rohini Palaniswamy / Rich]

## Description:
   Apache Pig is a platform for analyzing large data sets on Hadoop.  It
   provides a high-level language for expressing data analysis programs,
   coupled with infrastructure for evaluating these programs.

## Issues:
 - There are no issues requiring board attention at this time

## Activity:
  - Pig 0.16, which is a stabilization release for Pig on Tez, will be
    targeted for the end of May as there are still bugs being reported and
    fixed. A Pig 0.15.1 patch release is planned for early April to address
    critical issues.
  - Pig on Spark development is still under way in the spark branch

## Health report:
 - Project activity in the community (mails, commits, JIRA tickets, etc.) is
   good and is at the same level as in the previous few quarters.

## PMC changes:

 - The Pig PMC has voted for a new PMC chair candidate - Daniel Dai
 - Currently 16 PMC members.
 - Xuefu Zhang was added to the PMC on Tue Feb 23 2016

## Committer base changes:

 - Currently 28 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Praveen Rachabattuni on Mon Sep 08 2014

## Releases:

 - Last release was 0.15.0 on Fri Jun 05 2015

## Mailing list activity:

 - dev@pig.apache.org:
    - 421 subscribers (up 0 in the last 3 months):
    - 861 emails sent to list (870 in previous quarter)

 - user@pig.apache.org:
    - 1173 subscribers (down -9 in the last 3 months):
    - 112 emails sent to list (95 in previous quarter)


## JIRA activity:

 - 67 JIRA tickets created in the last 3 months
 - 65 JIRA tickets closed/resolved in the last 3 months

16 Dec 2015 [Rohini Palaniswamy / Chris]

## Description:
   Apache Pig is a platform for analyzing large data sets on Hadoop.  It
   provides a high-level language for expressing data analysis programs,
   coupled with infrastructure for evaluating these programs.

## Issues:
 - There are no issues requiring board attention at this time

## Activity:
  - Pig 0.16, which is a stabilization release for Pig on Tez with many bug
    fixes, is actively being worked on as more bugs get reported. Initially
    planned for December, it is now targeted for a January release due to
    more issues that require fixing.
  - Pig on Spark development is under way in the spark branch.

## Health report:
  - Project activity in the community is good.  In terms of focused
    development, there are a lot of bug fixes and performance enhancements
    going on for Pig on Tez and feature development patches going in for Pig
    on Spark. In terms of users, there is still the same number of questions
    on the user mailing list, and JIRA tickets are being filed by different
    and new users. We had 8 new contributors this quarter who contributed
    patches on small issues. This is also in line with the trend of new folks
    who do not contribute on a dedicated basis, but come and go, contributing
    new UDFs, enhancing existing ones or fixing small issues they face.
  - There has been no new committer or PMC member for more than a year, which
    keeps affecting the project health score a lot. But when the Pig on Spark
    project is ready to merge into trunk in another quarter, a couple of
    folks would be good candidates to be voted in as committers.
  - We have not given talks, written blogs or organized user meetups in the
    past couple of quarters. That is something that we will have to work on
    in the next quarter.

## PMC changes:

 - Currently 15 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Aniket Mokashi on Tue Jan 07 2014

## Committer base changes:

 - Currently 28 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Praveen Rachabattuni on Mon Sep 08 2014

## Releases:

 - Last release was 0.15.0 on Fri Jun 05 2015

## Mailing list activity:

 - dev@pig.apache.org:
    - 423 subscribers (down -10 in the last 3 months):
    - 898 emails sent to list (915 in previous quarter)

 - user@pig.apache.org:
    - 1182 subscribers (down -2 in the last 3 months):
    - 100 emails sent to list (119 in previous quarter)


## JIRA activity:

 - 86 JIRA tickets created in the last 3 months
 - 61 JIRA tickets closed/resolved in the last 3 months

16 Sep 2015 [Rohini Palaniswamy / Bertrand]

## Description:
   Apache Pig is a platform for analyzing large data sets on Hadoop.  It
   provides a high-level language for expressing data analysis programs,
   coupled with infrastructure for evaluating these programs.

## Activity:
  - Development is under way for the Pig 0.15.1 patch release and the Pig
    0.16 release, with a focus on performance optimizations, minor
    features/enhancements and stabilization of Pig on Tez.
  - Pig on Spark was running out of steam and had made very little progress
    for the past couple of months, with only one person working on it and
    most of the interested parties backing out to focus on other projects.
    It is back on track now, with new contributors from another organization
    who are heavily invested in Pig scripts and looking to migrate from
    MapReduce to Spark. They are ready to drive it and aim to make it
    production ready by the end of the year for deployment in their clusters.
  - Dilip Ramesh, mentored by Daniel Dai on PIG-2597 for GSoC 2015,
    successfully completed the program. He is continuing to work on it after
    GSoC 2015, on the rest of the pieces required to complete the full
    feature.


## Health report:
 - Project activity in the community (mails, commits, bugs, etc.) is good.
   Users have also started using and reporting issues on Pig on Tez, a new
   execution engine released in Pig 0.14.
 - There has been no new committer for a year. Patches are being contributed
   by a lot of new and different users, but there are not enough complex
   patches, volume of patches or long-term involvement per person to
   nominate anyone for committership at the moment. A couple of folks are
   inching towards that with regular contributions, and hopefully we will
   have some of them in the next board report.

## Issues:
 - There are no issues requiring board attention at this time

## LDAP committee group/Committership changes:

 - Currently 28 committers and 15 LDAP committee group members.
 - No new LDAP committee group members added in the last 3 months
 - No new committers added in the last 3 months
 - Last committer addition was Praveen Rachabattuni on Mon Sep 08 2014

## Releases:

 - Last release was 0.15.0 on Fri Jun 05 2015

## Mailing list activity:

 - dev@pig.apache.org:
    - 431 subscribers (up 5 in the last 3 months):
    - 903 emails sent to list (1634 in previous quarter)

 - user@pig.apache.org:
    - 1184 subscribers (up 13 in the last 3 months):
    - 118 emails sent to list (106 in previous quarter)


## JIRA activity:

 - 74 JIRA tickets created in the last 3 months
 - 58 JIRA tickets closed/resolved in the last 3 months

17 Jun 2015 [Rohini Palaniswamy / Jim]

## Description:
   Apache Pig is a platform for analyzing large data sets on Hadoop.  It
   provides a high-level language for expressing data analysis programs,
   coupled with infrastructure for evaluating these programs.

## Activity:
 - Pig 0.15 was released on Jun 05 2015. It includes Pig on Tez
   stabilization, improved Tez auto-parallelism and support for invoking Hive
   UDFs from Pig.
 - Pig on Spark development is underway in the spark branch.
 - PIG-2597 - Move grunt from javacc to ANTLR has been accepted as a GSoC
   project with Daniel Dai as mentor.

## Issues:
 - There are no issues requiring board attention at this time

## PMC/Committership changes:
 - Currently 28 committers and 15 PMC members in the project.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Aniket Namadeo Mokashi on Mon Feb 24 2014
 - No new committers added in the last 3 months
 - Last committer addition was Praveen Rachabattuni on Mon Sep 08 2014

## Releases:
 - 0.15.0 was released on Fri Jun 05 2015.

## Mailing list activity:
 - dev@pig.apache.org:
    - 424 subscribers (down -5 in the last 3 months):
    - 1638 emails sent to list (1148 in previous quarter)

 - user@pig.apache.org:
    - 1172 subscribers (down -5 in the last 3 months):
    - 106 emails sent to list (145 in previous quarter)

## JIRA activity:
 - 137 JIRA tickets created in the last 3 months
 - 120 JIRA tickets closed/resolved in the last 3 months

18 Mar 2015

Change the Apache Pig Project Chair

 WHEREAS, the Board of Directors heretofore appointed Cheolsoo Park
 to the office of Vice President, Apache Pig, and

 WHEREAS, the Board of Directors is in receipt of the resignation
 of Cheolsoo Park from the office of Vice President, Apache Pig,
 and

 WHEREAS, the Project Management Committee of the Apache Pig project has
 chosen by vote to recommend Rohini Palaniswamy as the successor to the
 post;

 NOW, THEREFORE, BE IT RESOLVED, that Cheolsoo Park is relieved and
 discharged from the duties and responsibilities of the office
 of Vice President, Apache Pig, and

 BE IT FURTHER RESOLVED, that Rohini Palaniswamy be and hereby is
 appointed to the office of Vice President, Apache Pig, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification, or
 until a successor is appointed.

 Special Order 7C, Resolution to Change the Apache Pig Project
 Chair, was approved by Unanimous Vote of the directors
 present.

18 Mar 2015 [Cheolsoo Park / Sam]

## Description:
Apache Pig is a platform for analyzing large data sets on Hadoop. It provides a
high-level language for expressing data analysis programs, coupled with
infrastructure for evaluating these programs.

## Activity:
- Pig on Spark development is underway.

## Issues:
- There are no issues requiring board attention at this time.

## PMC/Committership changes:
- The Pig PMC has voted for a new PMC chair candidate - Rohini Palaniswamy.
- Currently 28 committers and 15 PMC members in the project.
- No new PMC members added in the last 3 months
- No new committers added in the last 3 months
- Last PMC addition was Aniket Namadeo Mokashi on Mon Feb 24 2014
- Last committer addition was Praveen Rachabattuni on Mon Sep 08 2014

## Releases:
- There was no release in the last 3 months.

## Mailing list activity:
- dev@pig.apache.org: 431 subscribers (up 6 in the last 3 months):
 1117 emails sent to list (1875 in previous quarter)
- user@pig.apache.org: 1175 subscribers (up 16 in the last 3 months):
 156 emails sent to list (182 in previous quarter)

## JIRA activity:
- 100 JIRA tickets created in the last 3 months
- 82 JIRA tickets closed/resolved in the last 3 months

17 Dec 2014 [Cheolsoo Park / Sam]

Description:
Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle
very large data sets.

Releases:
* 0.14.0: released in Nov 2014, with several new features:
** Pig on Tez
** OrcStorage
** Loader predicate push down
** Constant calculation optimization

Community:
* 424 subscribers to the dev mailing list (420 in the last report)
* 1159 subscribers to the user mailing list (1163 in the last report)
* The PMC has 15 members and the project counts 6 additional committers
** No new committer

17 Sep 2014 [Cheolsoo Park / Brett]

Description:
Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle
very large data sets.

Releases:
* 0.13.0: released in July 2014, with several new features:
** Pluggable execution engines
** Auto-local mode optimization
** Direct-fetch optimization
** Blacklisting and whitelisting operators
** Accumulo storage

Community:
* 420 subscribers to the dev mailing list (417 in the last report)
* 1163 subscribers to the user mailing list (1153 in the last report)
* The PMC now has 15 members and the project counts 9 additional committers
** New committer: Lorand Bendig on 06/14/2014
** New pig-on-spark branch committers: Mayur Rustagi and Praveen Rachabattuni
 on 08/29/2014

Status of branding checklist:
Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: DONE
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

18 Jun 2014 [Cheolsoo Park / Sam]

Description

Pig is a platform for analyzing large data sets that consists of a high-level
language for expressing data analysis programs, coupled with infrastructure for
evaluating these programs. The salient property of Pig programs is that their
structure is amenable to substantial parallelization, which in turn enables
them to handle very large data sets.

Releases

- 0.12.1: maintenance release that contains several bug fixes (April 14, 2014)

Community

- 417 subscribers to the dev mailing list (413 in the last report)
- 1153 subscribers to the user mailing list (1142 in the last report)
- We have no new PMC members or committers this quarter
- The PMC now has 15 members and the project counts 8 additional committers
- Pig participated in the GSoC 2014 program; we accepted 2 proposals from students

Status of branding checklist

- Project Naming and Description: DONE
- Website Navigation Links: DONE
- Trademark Attributions: DONE
- Logos and Graphics: NOT STARTED
- Project Metadata: NOT STARTED

19 Mar 2014

Change the Apache Pig Project Chair

 WHEREAS, the Board of Directors heretofore appointed Julien Le Dem
 to the office of Vice President, Apache Pig, and

 WHEREAS, the Board of Directors is in receipt of the resignation
 of Julien Le Dem from the office of Vice President, Apache
 Pig, and

 WHEREAS, the Project Management Committee of the Apache Pig
 project has chosen by vote to recommend Cheolsoo Park as the
 Successor to the post;

 NOW, THEREFORE, BE IT RESOLVED, that Julien Le Dem is
 relieved and discharged from the duties and responsibilities of
 the office of Vice President, Apache Pig, and

 BE IT FURTHER RESOLVED, that Cheolsoo Park be and hereby is
 appointed to the office of Vice President, Apache Pig, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification, or
 until a successor is appointed.

 Special Order 7C, Change the Apache Pig Project Chair, was
 approved by Unanimous Vote of the directors present.

19 Mar 2014 [Julien Le Dem / Jim]

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle
very large data sets.

Releases: None this quarter

Community:
* 413 subscribers to the dev mailing list (400 in the last report)
* 1142 subscribers to the user mailing list (1096 in the last report)
* We have a new Committer: Mark Wagner
* We have a new PMC member: Aniket Mokashi
* The Pig PMC has voted for a new PMC chair candidate - Cheolsoo Park.
* The PMC now has 15 members and the project counts 8 additional
committers
* A Pig meetup is coming up on Fri Mar 14 at LinkedIn:
http://www.meetup.com/PigUser/events/160604192/


Status of branding checklist:
Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: DONE
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

18 Dec 2013 [Julien Le Dem / Doug]

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle
very large data sets.

Releases:
* 0.12.0 : released on Oct. 7 2013:
 ASSERT operator,
 Streaming UDF,
 new AvroStorage,
 IN/CASE operator,
 BigInteger/BigDecimal data type,
 support for Windows.

Community:
* 400 subscribers to the dev mailing list (387 in the last report)
* 1096 subscribers to the user mailing list (1079 in the last report)
* We have no new PMC members or committers this quarter
* The PMC now has 14 members and the project counts 7 additional
committers
* A Pig meetup was held by Mortar Data during Hadoop World
http://www.meetup.com/NYC-Pig-User-Group/events/139876962/

Status of branding checklist:
Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: DONE
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

18 Sep 2013 [Julien Le Dem / Bertrand]

Pig is a platform for analyzing large data sets that consists of a high-level
language for expressing data analysis programs, coupled with infrastructure for
evaluating these programs. The salient property of Pig programs is that their
structure is amenable to substantial parallelization, which in turn enables
them to handle very large data sets.

Releases:  None

Community:
* 387 subscribers to the dev mailing list (349 in the last report)
* 1079 subscribers to the user mailing list (944 in the last report)
* We have one new Committer: Koji Noguchi
* We have two new PMC members:  Rohini Palaniswamy and Cheolsoo Park
* The PMC has 14 active members and the project counts 7 additional active committers
* We're going to have a Pig Meetup on October 29th in New York as part of Hadoop World.

19 Jun 2013 [Julien Le Dem / Greg]

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle
very large data sets.

Releases:
* 0.11.1: released on 03/28/2013 (bug fixes)

Community:
* 361 subscribers to the dev mailing list (349 in the last report)
* 1003 subscribers to the user mailing list (944 in the last report)
* We have one new Committer:  Prashant Kommireddi
* The PMC has 12 members and the project counts 6 additional
committers
* We're going to have a Pig Meetup on June 25, 2013, the day before
Hadoop Summit
* We have one project accepted for GSoC:
"A better plan/data flow visualizer"

20 Mar 2013 [Julien Le Dem / Sam]

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle
very large data sets.

Releases:
* 0.10.1: released on 01/06/2013. Bug fixes.
* 0.11.0: released on 02/21/2013. This release includes hundreds of
bug fixes and many new features including the DateTime datatype,
RANK, CUBE and ROLLUP operators, Groovy UDFs, pluggable
reducer estimation logic, additional UDF features, schema-based
tuples and HCatalog DDL integration.
https://blogs.apache.org/pig/entry/apache_pig_it_goes_to

Community:
* 349 subscribers to the dev mailing list (330 in the last report)
* 944 subscribers to the user mailing list (918 in the last report)
* We have one new PMC member: Bill Graham
* We have a new PMC chair: Julien Le Dem replacing Daniel Dai
* A few committers and PMC members have been moved to emeritus
status: Corinne Chandel, Nigel Daley, Pradeep Kamath, Jeff Zhang,
Ben Reed and Yan Zhou
* The PMC now has 12 members and the project counts 5 additional
committers
* We had a Pig Meetup on February 7, 2013 in Palo Alto at
Hortonworks headquarters
* Pig has been listed among 15 high-impact Apache projects by
Computerworld
http://www.computerworld.com.au/slideshow/455802/pictures_15_high-impact_apache_projects/?image=1

20 Feb 2013

Change the Apache Pig Chair

 WHEREAS, the Board of Directors heretofore appointed Daniel Dai
 to the office of Vice President, Apache Pig, and

 WHEREAS, the Board of Directors is in receipt of the resignation
 of Daniel Dai from the office of Vice President, Apache
 Pig, and

 WHEREAS, the Project Management Committee of the Apache Pig
 project has chosen by vote to recommend Julien Le Dem as the
 Successor to the post;

 NOW, THEREFORE, BE IT RESOLVED, that Daniel Dai is
 relieved and discharged from the duties and responsibilities of
 the office of Vice President, Apache Pig, and

 BE IT FURTHER RESOLVED, that Julien Le Dem be and hereby is
 appointed to the office of Vice President, Apache Pig, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification, or
 until a successor is appointed.

 Special Order 7E, Change the Apache Pig Chair, was approved by
 Unanimous Vote of the directors present.

19 Dec 2012 [Daniel Dai / Bertrand]

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle
very large data sets.

Releases:
* No new release

Community:
* 330 subscribers to the dev mailing list (308 in the last report)
* 918 subscribers to the user mailing list (842 in the last report)
* We have two new Pig committers: Cheolsoo Park and Rohini
Palaniswamy. We also have a new PMC member: Jonathan Coveney
* We had a Pig Meetup during Hadoop World on Nov 8th in New York City

Status of branding checklist:
Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: DONE
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

19 Sep 2012 [Daniel Dai / Roy]

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle
very large data sets.

Releases:
* No new release

Community:
* 308 subscribers to the dev mailing list (286 in the last report)
* 842 subscribers to the user mailing list (819 in the last report)
* We have successfully completed four Google Summer of Code projects
 - PIG: Move Grunt from Javacc to ANTLR (Boski Shah)
 - Adding the Datetime Type as a Primitive for Pig (Zhijie Shen)
 - Statistical Inference Operator on Pig (List Operators) (Allan Avendaño)
 - CUBE operation in Pig (Prasanth Jayachandran)
* We had a Hadoop Summit Pig Meetup on Jun 12th, and a Pig hackathon on
Aug 24th. Both events were in the SF Bay Area.

Status of branding checklist:
Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: DONE
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

20 Jun 2012 [Daniel Dai / Sam]

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle
very large data sets.

Releases:
* Pig 0.10.0 released on 04/25/12.

Community:
* We have two new committers: Bill Graham and Jonathan Coveney. We have one
 new PMC member: Julien Le Dem
* 286 subscribers to the dev mailing list (264 in the last report)
* 819 subscribers to the user mailing list (741 in the last report)
* Pig participated in the GSoC 2012 program; we received 6 proposals, and 5 of
 them were accepted

Status of branding checklist:
Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: DONE
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

21 Mar 2012 [Daniel Dai / Brett]

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle
very large data sets.

Releases:
* Pig 0.9.2 released on 01/22/12.

Community:
* No new committers or PMC members
* 264 subscribers to the dev mailing list (247 in the last report)
* 741 subscribers to the user mailing list (681 in the last report)

Status of branding checklist:
Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: DONE
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

21 Dec 2011 [Daniel Dai / Brett]

Pig status report for December 2011.

Pig is a platform for analyzing large data sets that consists of a high-level
language for expressing data analysis programs, coupled with infrastructure
for evaluating these programs. The salient property of Pig programs is that
their structure is amenable to substantial parallelization, which in turn
enables them to handle very large data sets.

Releases:
* Pig 0.9.1 released on 10/5/11.

Community:
* PMC Chair is changing from Olga Natkovich to Daniel Dai
* No new committers or PMC members
* 247 subscribers to the dev mailing list (248 in the last report)
* 681 subscribers to the user mailing list (642 in the last report)

Status of branding checklist:
Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: DONE
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

26 Oct 2011

Change the Apache Pig Chair

   WHEREAS, the Board of Directors heretofore appointed Olga Natkovich
   to the office of Vice President, Apache Pig, and

   WHEREAS, the Board of Directors is in receipt of the resignation
   of Olga Natkovich from the office of Vice President, Apache
   Pig, and

   WHEREAS, the Project Management Committee of the Apache Pig
   project has chosen by vote to recommend Daniel Dai as the
   Successor to the post;

   NOW, THEREFORE, BE IT RESOLVED, that Olga Natkovich is
   relieved and discharged from the duties and responsibilities of
   the office of Vice President, Apache Pig, and

   BE IT FURTHER RESOLVED, that Daniel Dai be and hereby is
   appointed to the office of Vice President, Apache Pig, to
   serve in accordance with and subject to the direction of the
   Board of Directors and the Bylaws of the Foundation until
   death, resignation, retirement, removal or disqualification, or
   until a successor is appointed.

 Resolution 7F was approved by unanimous roll call vote

21 Sep 2011 [Olga Natkovich / Greg]

Pig status report for September 2011.

Pig is a platform for analyzing large data sets that consists of a high-level
language for expressing data analysis programs, coupled with infrastructure for
evaluating these programs. The salient property of Pig programs is that their
structure is amenable to substantial parallelization, which in turn enables them
to handle very large data sets.

As part of Hadoop Summit we conducted a user meetup that covered topics including
interactive discussions on embedding Pig in Python, Elephant Bird, Pig and
Cassandra, and cube operations in Pig. The meeting was attended by 20+ users and
developers.

We have successfully completed three Google Summer of Code projects:
* Nested foreach statement (https://issues.apache.org/jira/browse/PIG-1631)
* Nested cross statement (https://issues.apache.org/jira/browse/PIG-1916)
* Syntax sugar (https://issues.apache.org/jira/browse/PIG-1904,
 https://issues.apache.org/jira/browse/PIG-1387, and
 https://issues.apache.org/jira/browse/PIG-1926)

Releases:
* Pig 0.9.0 released on 7/29/11.

Committers:
* Gianmarco De Francisci Morales became a new Pig committer

Community:

* 248 subscribers to the dev mailing list (231 in the last report)

* 642 subscribers to the user mailing list (592 in the last report)

Status of branding checklist:
Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: DONE
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

15 Jun 2011 [Olga Natkovich / Sam]

Pig is a platform for analyzing large data sets that consists of a high-level
language for expressing data analysis programs, coupled with infrastructure
for evaluating these programs. The salient property of Pig programs is that
their structure is amenable to substantial parallelization, which in turn
enables them to handle very large data sets.

Pig held its 5th Contributor Workshop. We discussed issues related to the
release process as well as how to improve our testing infrastructure.

Three Google Summer of Code projects have been approved for Pig:
* Nested foreach statement (https://issues.apache.org/jira/browse/PIG-1631)
* Nested cross statement (https://issues.apache.org/jira/browse/PIG-1916)
* Syntax sugar (https://issues.apache.org/jira/browse/PIG-1904,
 https://issues.apache.org/jira/browse/PIG-1387,
 https://issues.apache.org/jira/browse/PIG-1926)

Releases:
* Pig 0.8.1 released on 4/24/11.

Committers:
* Aniket Mokashi became a Pig committer
* Xuefu Zhang became a Pig committer

Community:
* 231 subscribers to dev mailing list (211 in the last report)
* 592 subscribers to user mailing list (531 in the last report)

Status of branding checklist:

Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: IN PROGRESS
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

16 Mar 2011 [Olga Natkovich / Bertrand]

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled with
infrastructure for evaluating these programs. The salient property of
Pig programs is that their structure is amenable to substantial
parallelization, which in turn enables them to handle very large data sets.

Pig held its 4th Contributor Workshop, attended by 10+ developers. We
discussed an error handling proposal for Pig as well as project plans
for Pig 0.9.0 and beyond.

Pig now has an active IRC room on freenode, #hadoop-pig, which has
between 18 and 24 people in it daily.

Releases:
* Pig 0.8.0 released on 12/17/10.

Committers:
* Julien Le Dem became a Pig committer

Community:
* 531 subscribers to user mailing list (513 in the last report)
* 211 subscribers to dev mailing list (204 in the last report)

Status of branding checklist:

Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: IN PROGRESS
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

15 Dec 2010 [Olga Natkovich / Sam]

Pig is a platform for analyzing large data sets that consists of a high-level
language for expressing data analysis programs, coupled with infrastructure for
evaluating these programs. The salient property of Pig programs is that their
structure is amenable to substantial parallelization, which in turn enables
them to handle very large data sets.

Releases:
* Release of Pig 0.8.0 is in progress.

Community:
* 513 subscribers to user mailing list (485 in the last report)
* 204 subscribers to dev mailing list (197 in the last report)

Status of branding checklist:

Project Naming and Description: DONE
Website Navigation Links: DONE
Trademark Attributions: IN PROGRESS
Logos and Graphics: NOT STARTED
Project Metadata: NOT STARTED

17 Nov 2010 [Olga Natkovich / Doug]

Pig status report for October 2010.

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled with
infrastructure for evaluating these programs. The salient property of
Pig programs is that their structure is amenable to substantial
parallelization, which in turn enables them to handle very large
data sets.

As of September 22, 2010 Pig became a TLP. Prior to this move Pig was
a subproject of Hadoop. Project by-laws have been approved by the PMC:
http://wiki.apache.org/pig/ProposedByLaws.

Pig held its Third Contributors Workshop, attended by 10+ developers,
some of them remotely via Skype. We discussed plans for the Pig 0.9
release as well as issues around user function management. Please see
the notes here: http://s.apache.org/S4I

Releases:
* In the process of releasing Pig 0.8

Committers:
* Corinne Chandel became a Pig committer in the documentation area.

Community:
* 492 subscribers to user mailing list (485 in the last report)
* 196 subscribers to dev mailing list (197 in the last report)

20 Oct 2010 [Olga Natkovich / Shane]

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled
with infrastructure for evaluating these programs. The salient
property of Pig programs is that their structure is amenable to
substantial parallelization, which in turn enables them to handle very
large data sets.

As of September 22, 2010 Pig became a TLP. Prior to this move Pig was
a subproject of Hadoop. The vote is in progress for the proposed
by-laws: http://wiki.apache.org/pig/ProposedByLaws.

In August, Pig held its Second Contributors Workshop attended by 10+
developers. We discussed a wide range of issues including use
cases, new features, upcoming releases, and changes to the process.

Releases:
* Branched for Pig 0.8.0 release. Expected to release in early November.

Community:
* 485 subscribers to user mailing list (452 in the last report)
* 197 subscribers to dev mailing list (191 in the last report)

22 Sep 2010

Establish the Apache Pig project

 WHEREAS, the Board of Directors deems it to be in the best
 interests of the Foundation and consistent with the
 Foundation's purpose to establish a Project Management
 Committee charged with the creation and maintenance of
 open-source software related to parallel analysis of large
 data sets for distribution at no charge to the public.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management
 Committee (PMC), to be known as the "Apache Pig Project",
 be and hereby is established pursuant to Bylaws of the
 Foundation; and be it further

 RESOLVED, that the Apache Pig Project be and hereby is
 responsible for the creation and maintenance of software
 related to parallel analysis of large data sets; and be
 it further

 RESOLVED, that the office of "Vice President, Apache Pig" be
 and hereby is created, the person holding such office to
 serve at the direction of the Board of Directors as the chair
 of the Apache Pig Project, and to have primary responsibility
 for management of the projects within the scope of
 responsibility of the Apache Pig Project; and be it further

 RESOLVED, that the persons listed immediately below be and
 hereby are appointed to serve as the initial members of the
 Apache Pig Project:

   * Benjamin Reed          <breed@apache.org>
   * Daniel Dai             <daijy@apache.org>
   * Alan Gates             <gates@apache.org>
   * Giridharen Kesavan     <gkesavan@apache.org>
   * Olga Natkovich         <olga@apache.org>
   * Pradeep Kamath         <pradeepkth@apache.org>
   * Santhosh Srinivasan    <sms@apache.org>
   * Yan Zhou               <yanz@apache.org>
   * Jeff Zhang             <zjffdu@apache.org>
   * Ashutosh Chauhan       <hashutosh@apache.org>
   * Richard Ding           <rding@apache.org>
   * Dmitriy Ryaboy         <dvryaboy@apache.org>
   * Thejas Nair            <thejas@apache.org>

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that Olga Natkovich
 be appointed to the office of Vice President, Apache Pig, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification,
 or until a successor is appointed; and be it further

 RESOLVED, that the initial Apache Pig PMC be and hereby is
 tasked with the creation of a set of bylaws intended to
 encourage open development and increased participation in the
 Apache Pig Project; and be it further

 RESOLVED, that the Apache Pig Project be and hereby
 is tasked with the migration and rationalization of the Apache
 Hadoop Pig sub-project; and be it further

 RESOLVED, that all responsibilities pertaining to the Apache
 Hadoop Pig sub-project encumbered upon the
 Apache Hadoop Project are hereafter discharged.

 This resolution was passed unanimously by roll call vote.

 Jim volunteers to help the new project by monitoring the private
 mailing list for the first few months.

17 Sep 2008

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled with
infrastructure for evaluating these programs. The salient property of Pig
programs is that their structure is amenable to substantial parallelization,
which in turn enables them to handle very large data sets.

* Pig's infrastructure is in good shape. We just announced our first
release from the Incubator, Pig 0.1.0! In addition, a major system redesign
is underway that introduces a type system, improves performance, and provides
a better platform for future work. The rework is expected to be complete in
October 2008.

* The development community is growing with the addition of Daniel Dai as a new
committer. More work is needed to attract developers to the project.

* The user community is also growing, with more activity on the user mailing
list. In addition, a tutorial and a user function repository were added to
help users come up to speed on the product. Work on the user documentation
is also ongoing.

Incubating since: October 2007

25 Jun 2008

Pig is a platform for analyzing large data sets that consists of a
high-level language for expressing data analysis programs, coupled with
infrastructure for evaluating these programs. The salient property of Pig
programs is that their structure is amenable to substantial parallelization,
which in turn enables them to handle very large data sets.

Pig's infrastructure is in good shape. Several new features have been added
since Pig joined the Incubator, including streaming, parameter
substitution, custom splits, better configuration management, and improved
memory management. 147 issues have been resolved since then. Work in progress
includes the addition of types and a rework of the execution pipeline.

The community is growing with the addition of Pi Song as a new committer.

There is increased activity on the pig-dev mailing list with a number of
contributors.

The project needs to improve its compliance with some standard Incubator
tasks, like updating the status file on a regular basis.

Incubating since: October 2007