Apache Logo
The Apache Way Contribute ASF Sponsors

Formal board meeting minutes from 2010 through present. Please Note: The board typically approves minutes from one meeting during the next board meeting, so minutes will be published roughly one month later than the scheduled date. Other corporate records are published, as is an alternate categorized view of all board meeting minutes.

2017 | 2016 | 2015 | 2014 | 2013 | 2012 | 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 | 1999 | Pre-organization meetings

Parquet

19 Apr 2017 [Julien Le Dem / Brett]

Report was filed, but display is awaiting the approval of the Board minutes.

18 Jan 2017 [Julien Le Dem / Marvin]

## Description:
Parquet is a standard and interoperable columnar file format
for efficient analytics.

## Issues:
there are no issues requiring board attention at this time

## Activity:
- parquet-arrow integration has been added in parquet-cpp
- We're preparing a 1.8.2 patch release for the Apache Spark project
- We're preparing parquet-cpp 0.1: its first release (PARQUET-713)

## Health report:
Discussion is happening on the mailing list, JIRA and
regular hangout sync up. Notes are sent to the mailing list.

## PMC changes:
- Currently 22 PMC members.
- No new PMC members added in the last 3 months
- Last PMC addition was Wes McKinney on Thu Sep 01 2016

## Committer base changes:
- Currently 25 committers.
- No new committers added in the last 3 months
- Last committer addition was Uwe Korn at Sun Sep 04 2016

## Releases:
- 1.9.0 was released on Sun Oct 23 2016

## Mailing list activity:
- Activity on the mailing list is still relatively the same
- JIRAS are resolved about at the same pace they are opened.

- dev@parquet.apache.org:
- 176 subscribers (up 3 in the last 3 months):
- 452 emails sent to list (436 in previous quarter)

## JIRA activity:
- 81 JIRA tickets created in the last 3 months
- 67 JIRA tickets closed/resolved in the last 3 months

19 Oct 2016 [Julien Le Dem / Sam]

Report from the Apache Parquet committee [Julien Le Dem]

## Description:
Parquet is a standard and interoperable columnar file format for
efficient analytics.

## Issues:
there are no issues requiring board attention at this time

## Activity:
The community has been converging toward a 1.9 release. The vote will start in
the coming days. Discussion about better encoding and vectorization apis are
ongoing.  The parquet-cpp repo has reached a stable state and should release
soon. Integration with arrow-cpp is now in the parquet-cpp repo.

## Health report:
The PMC and committer list are growing. Discussion is happening on the mailing
list, JIRA and regular hangout sync up. Notes are sent to the mailing list.

## PMC changes:
 - Currently 22 PMC members.
 - Wes McKinney was added to the PMC on Thu Sep 01 2016

## Committer base changes:
 - Currently 25 committers.
 - Uwe Korn was added as a committer on Sun Sep 04 2016

## Releases:
 - Last release was Format 2.3.1 on Thu Dec 17 2015
 - parquet-mr 1.9.0 vote ongoing

## Mailing list activity:
 - Activity on the mailing list is still relatively the same
 - JIRAS are resolved about at the same pace they are opened.

 - dev@parquet.apache.org:
    - 172 subscribers (up 9 in the last 3 months):
    - 486 emails sent to list (394 in previous quarter)

## JIRA activity:
 - 85 JIRA tickets created in the last 3 months
 - 74 JIRA tickets closed/resolved in the last 3 months

20 Jul 2016 [Julien Le Dem / Isabel]

## Description:
Parquet is a standard and interoperable columnar file format for
efficient analytics.

## Issues:
there are no issues requiring board attention at this time

 ## Activity:
- Work on stabilizing master preparing for a release of parquet-mr (ByteBuffer)
- encoding strategy experiments
- Bytebuffer stabilization.
- Brotli compression experiments
- parquet-cpp development
- discussion about vectorized reads and Apache Arrow integration

 ## Health report:
- JIRAs opened and closed at the same rate
- email activity was more important last quarter due to parquet-cpp kickoff and discussions.

 ## PMC changes:
 - Currently 21 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Alex Levenson on Tue Apr 21 2015

 ## Committer base changes:
 - Currently 24 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Wes McKinney at Thu Mar 03 2016

 ## Releases:
 - Last release was Format 2.3.1 on Thu Dec 17 2015

 ## Mailing list activity:
Last quarter had more email activity dues to the kickoff of parquet-cpp
- dev@parquet.apache.org:
 - 163 subscribers (up 5 in the last 3 months):
 - 427 emails sent to list (901 in previous quarter)

 ## JIRA activity:
 - 81 JIRA tickets created in the last 3 months
 - 80 JIRA tickets closed/resolved in the last 3 months

20 Apr 2016 [Julien Le Dem / Bertrand]

## Description:
 Parquet is a standard and interoperable columnar file format for
efficient analytics.

## Issues:
 there are no issues requiring board attention at this time

## Activity:
There is a surge of activity related to the development of the Parquet-cpp
library.
Initially Parquet had a java implementation as well as reference
implementations for some encodings in C++. The C++ version is now being
fully implemented. A new committer has been recently invited based on that
work.

## Health report:
 The project is healthy. We have new contributors. Communication happens
on the mailing list and on regular public hangout sync ups for which
notes are published on the mailing list.

## PMC changes:
 - Currently 21 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Alex Levenson on Tue Apr 21 2015

## Committer base changes:
 - Currently 24 committers.
 - Wes McKinney was added as a committer on Thu Mar 03 2016

## Releases:
 - Format 2.3.1 was released on Thu Dec 17 2015

## Mailing list activity:
   A surge of emails related to the development of parquet-cpp

 - dev@parquet.apache.org:
    - 152 subscribers (up 14 in the last 3 months):
    - 940 emails sent to list (361 in previous quarter)

## JIRA activity:
 - 158 JIRA tickets created in the last 3 months
 - 109 JIRA tickets closed/resolved in the last 3 months

20 Jan 2016 [Julien Le Dem / David]

## Description:
   Apache Parquet is a general-purpose columnar storage format.

## Issues:
  there are no issues requiring board attention at this time

## Activity:
 All changes required by Apache Drill have been merged into Apache Parquet,
 getting Drill off of its Parquet fork.  Releases are ongoing to allow Drill
 to upgrade its dependencies.  Several efforts are ongoing to improve
 vectorized reads from Java and C++ They involve collaboration of several
 organizations.  Communication is happening in JIRA

## Health report:
  We have now a rotation to have someone responsible for answering JIRAs and
  emails each week.  Level of ticket creation and resolution is about the
  same, keeping opened tickets to a reasonable amount.  Typically user
  activity shows up in the user lists of other projects depending on parquet
  (drill, impala, presto, spark, ...)

## PMC changes:

 - Currently 21 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Alex Levenson on Tue Apr 21 2015

## Committer base changes:

 - Currently 23 committers.
 - New commmitters:
    - Cheng Lian was added as a committer on Wed Dec 02 2015
    - Sergio Peña was added as a committer on Wed Dec 02 2015

## Releases:

 - Parquet-Format 2.3.1 was released on Thu Dec 17 2015
 - Parquet-mr 1.9.0 in preparation

## Mailing list activity:

 - dev@parquet.apache.org:
    - 147 subscribers (up 17 in the last 3 months):
    - 466 emails sent to list (396 in previous quarter)

## JIRA activity:

 - 40 JIRA tickets created in the last 3 months
 - 36 JIRA tickets closed/resolved in the last 3 months

21 Oct 2015 [Julien Le Dem / Rich]

## Description:
 Apache Parquet is a general-purpose columnar storage format.
## Issues:
there are no issues requiring board attention at this time
## Activity:
- Bloom filters: need to finalize the design. Have use cases to validate it
  (query execution, etc)
 - Vectorized read API: refactoring of the code based on feedback.
 - Using dict in filter push down: rework to have better code reuse.
 - ByteBuffer: close to being merged.
## Health report:
The project is fairly stable with new features and
compatibility testing underway.
## PMC changes:
- Currently 21 PMC members
. - No new PMC members added in the last 3 months
- Last PMC addition was Alex Levenson at Tue Apr 21 2015
## LDAP changes:
- Currently 21 committers and 21 committee group members.
- No new changes to the committee group or committership since last report.
## Releases:
- 1.8.1 was released on Tue Jul 21 2015
## Mailing list activity:
- dev@parquet.apache.org:
- 130 subscribers (up 13 in the last 3 months):
- 367 emails sent to list (705 in previous quarter)
## JIRA activity:
- 53 JIRA tickets created in the last 3 months
- 25 JIRA tickets closed/resolved in the last 3 months

15 Jul 2015 [Julien Le Dem / Shane]

Apache Parquet is a general-purpose columnar storage format.

## Activity:
We're working towards a 1.8.0 release and merging the ByteBuffer PR (ZeroCopy
HDFS reads) Our goal is to keep master in a releasable state and to do
releases quickly.

## Issues:
- there are no issues requiring board attention at this time

## LDAP committee group/Committership changes:
- Currently 21 committers and 21 LDAP committee group members.
- No new changes to the LDAP committee group or committership since last
 report. Two new PMC members Alex Levenson and Daniel Weeks were added on Dec
 28th 2014

## Releases:
- 1.7.0 was released on Mon May 18 2015
- 1.8.0 is being voted on.

## Mailing list activity:
- dev@parquet.apache.org:
- 116 subscribers (up 5 in the last 3 months):
- 707 emails sent to list (722 in previous quarter)

## JIRA activity:
- 79 JIRA tickets created in the last 3 months
- 64 JIRA tickets closed/resolved in the last 3 months

17 Jun 2015 [Julien Le Dem / Shane]

## Description:
   Apache Parquet is a general-purpose columnar storage format.

## Activity:
We're working towards a 1.8.0 release and merging the ByteBuffer PR (ZeroCopy
HDFS reads) Our goal is to keep master in a releasable state and to do
releases quickly.

## Issues:
 there are no issues requiring board attention at this time

## PMC/Committership changes:

 - Currently 21 committers and 21 PMC members in the project.
 - No new changes to the PMC or committership since last report. Two new PMC
   members Alex Levenson and Daniel Weeks were added on Dec 28th 2014

## Releases:

 - 1.7.0 was released on Mon May 18 2015

## Mailing list activity:

 - dev@parquet.apache.org:

    - 112 subscribers (up 12 in the last 3 months):
    - 829 emails sent to list (459 in previous quarter)


## JIRA activity:

 - 91 JIRA tickets created in the last 3 months
 - 57 JIRA tickets closed/resolved in the last 3 months

20 May 2015 [Julien Le Dem / Jim]

Parquet is a columnar file format for Hadoop.

## Project Status

The project just graduated from the incubator and is voting on its first
release as a TLP.  No issues to report.

## Community

 - Two new PMC members Alex Levenson and Daniel Weeks on Dec 28th 2014
 - No new committer or PMC member since last report in April
 - JIRA past 30 days: 30 created and 22 resolved as of May 18th
https://issues.apache.org/jira/browse/PARQUET
 - 114 subscribers to the dev mailing list as of May 18th
 - emails on the dev list: Apr: 397, Mar: 319, Feb: 135, Jan: 112
http://mail-archives.apache.org/mod_mbox/parquet-dev/
 - commits: Apr: 84, Mar: 38, Feb: 47, Jan: 9
http://mail-archives.apache.org/mod_mbox/parquet-commits/
 - regular project sync ups are held on hangout.
They are open to anyone and advertised on the dev mailing list
notes are then published on the list as well
 - several Parquet related presentations scheduled at the Hadoop summit in June
http://2015.hadoopsummit.org/san-jose/agenda/

# Community Objectives

The community main objectives (not excluding other efforts also ongoing)
 - Working towards merging the ByteBuffer access work
 - Vectorized execution improvements (and integration with Apache Drill,
   Apache Hive, Presto)
 - Improving Projection and Predicate APIs
 - Standardizing nested type representations (thrift and avro write-side)
 - Improving high-level type specs (microsecond time/timestamp)

## Releases

 - Last releases:
    - parquet-mr 1.6.0-incubating on Apr 12th:
https://dist.apache.org/repos/dist/release/parquet/parquet-mr-1.6.0-incubating/
    - parquet-mr 1.7.0 on May 18th (just voted):
https://dist.apache.org/repos/dist/release/parquet/parquet-mr-1.7.0/
 - Next release: a parquet-format release will happen soon.

22 Apr 2015

Establish the Apache Parquet Project

 WHEREAS, the Board of Directors deems it to be in the best
 interests of the Foundation and consistent with the
 Foundation's purpose to establish a Project Management
 Committee charged with the creation and maintenance of
 open-source software, for distribution at no charge to the
 public, related to a columnar storage format for Hadoop.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management
 Committee (PMC), to be known as the "Apache Parquet Project",
 be and hereby is established pursuant to Bylaws of the
 Foundation; and be it further

 RESOLVED, that the Apache Parquet Project be and hereby is
 responsible for the creation and maintenance of software
 related to a columnar storage format for Hadoop; and be it further

 RESOLVED, that the office of "Vice President, Apache Parquet" be
 and hereby is created, the person holding such office to
 serve at the direction of the Board of Directors as the chair
 of the Apache Parquet Project, and to have primary responsibility
 for management of the projects within the scope of
 responsibility of the Apache Parquet Project; and be it further

 RESOLVED, that the persons listed immediately below be and
 hereby are appointed to serve as the initial members of the
 Apache Parquet Project:

    * Chris Aniszczyk <caniszczyk@apache.org>
    * Ryan Blue <blue@apache.org>
    * Jonathan Coveney <jcoveney@apache.org>
    * Tim <tianshuo@apache.org>
    * Jake Farrell <jfarrell@apache.org>
    * Marcel Kornacker <marcel@apache.org>
    * Mickael Lacour <mlacour@apache.org>
    * Julien Le Dem <julien@apache.org>
    * Alex Levenson <alexlevenson@apache.org>
    * Nong Li <nong@apache.org>
    * Todd Lipcon <todd@apache.org>
    * Chris Mattmann <mattmann@apache.org>
    * Aniket Mokashi <aniket486@apache.org>
    * Lukas Nalezenec <lukas@apache.org>
    * Brock Noland <brock@apache.org>
    * Wesley Graham Peck <wesleypeck@apache.org>
    * Remy Pecqueur <rpecqueur@apache.org>
    * Dmitriy Ryaboy <dvryaboy@apache.org>
    * Roman Shaposhnik <rvs@apache.org>
    * Daniel Weeks <dweeks@apache.org>
    * Thomas White <tomwhite@apache.org>

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that Julien Le Dem
 be appointed to the office of Vice President, Apache Parquet, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification,
 or until a successor is appointed; and be it further

 RESOLVED, that the initial Apache Parquet PMC be and hereby is
 tasked with the creation of a set of bylaws intended to
 encourage open development and increased participation in the
 Apache Parquet Project; and be it further

 RESOLVED, that the Apache Parquet Project be and hereby
 is tasked with the migration and rationalization of the Apache
 Incubator Parquet podling; and be it further

 RESOLVED, that all responsibilities pertaining to the Apache
 Incubator Parquet podling encumbered upon the Apache Incubator
 Project are hereafter discharged.

 Special Order 7D, Establish the Apache Parquet Project, was
 approved by Unanimous Vote of the directors present.

22 Apr 2015

Parquet is a columnar storage format for Hadoop.

Parquet has been incubating since 2014-05-20.

Three most important issues

 - 1st releases toward org.apache Parquet 1.6.0 GA
 - Expanding the community and adding new committers
 - Ensuring timely code reviews by committers, developing reviewers

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 - None at this time

Latest Additions:

 * PMC addition:        None
 * Committer addition:  Dan Weeks and Alex Levenson (from last report)

Issue backlog status since last report:

 * Created:   34
 * Resolved:  50

Mailing list activity since last report:

 * dev      560 messages: 111 in Jan, 136 in Feb, and 313 in Mar

How has the project developed since the last report?

 - Preparing last commits for the first parquet-mr release candidate
 - Planned parquet-mr 1.6.0 release schedule
 - ASF required changes to parquet-mr are finished
 - Released parquet-format 2.3.0, with org.apache packages
 - Parquet presentation at Strata 2015 San Jose and the Presto meetup

Date of last release:

 - parquet-format 2.3.0 released 19 Feb
 - Not yet released: parquet-mr and parquet-cpp

Signed-off-by:

 [ ](parquet) Todd Lipcon
 [X](parquet) Jake Farrell
 [X](parquet) Chris Mattmann
 [X](parquet) Roman Shaposhnik
 [ ](parquet) Tom White

21 Jan 2015

Parquet is a columnar storage format for Hadoop.

Parquet has been incubating since 2014-05-20 .

Three most important issues

 - Expanding the community and adding new committers
 - 1st releases toward org.apache Parquet 1.6.0 GA
 - Identifying how to ensure timely code reviews by committers

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 - None at this time

Latest Additions:

 * PMC addition:          None
 * Contributor addition:  Dan Weeks and Alex Levenson

Issue backlog status since last report:

 * Created:   45
 * Resolved:  20

Mailing list activity since last report:

 * dev      310 messages: 90 in Oct, 126 in Nov, and 94 in Dec

How has the project developed since the last report?

 - Completed first release, Apache Parquet Format (incubating) 2.2.0
 - Established a by-law for adding committers
 - Added 2 new committers
 - Parquet presentation accepted for Strata San Jose

Date of last release:

 - parquet-format released 14 November 2014
 - Not yet released: parquet-mr and parquet-cpp

Signed-off-by:

 [ ](parquet) Todd Lipcon
 [X](parquet) Jake Farrell
 [X](parquet) Chris Mattmann
 [X](parquet) Roman Shaposhnik
 [ ](parquet) Tom White

Shepherd/Mentor notes:

 Mailing lists are active; most mentors are active.

15 Oct 2014

Parquet is a columnar storage format for Hadoop.

Parquet has been incubating since 2014-05-20 .

Three most important issues

 - Expanding the community and adding new committers
 - 1st releases
 - Identifying how to ensure timely code reviews by committers

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 - None at this time

Latest Additions:

 * PMC addition:          N/A
 * Contributor addition:  N/A

Issue backlog status since last report:

 * Created:   27
 * Resolved:  19

Mailing list activity since last report:

 * dev      144 messages

How has the project developed since the last report?

 - Attempted parquet-format release twice, next RC in early October.
 - Assembled tasks to complete for a parquet-mr release
 - New push-down filter API and task-side block metadata reading

Date of last release:

 - No releases as of yet.

Signed-off-by:

 [ ](parquet) Todd Lipcon
 [X](parquet) Jake Farrell
 [ ](parquet) Chris Mattmann
 [X](parquet) Roman Shaposhnik
 [X](parquet) Tom White

20 Aug 2014

Parquet is a columnar storage format for Hadoop.

Parquet has been incubating since 2014-05-20.

Three most important issues to address in the move towards graduation:

 1. Expanding the community and adding new committers
 2. 1st release
 3. Identifying how to ensure timely code reviews by committers

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None at this time

Latest Additions:

 * PMC addition:          N/A
 * Contributor addition:  N/A

Issue backlog status since last report:

 * Created:   60
 * Resolved:  17

Mailing list activity since last report:

 * dev      212 messages

How has the project developed since the last report?

 * New commit workflow has been documented and commits have been
 increasing using the commit script.
 * Project website is posted: parquet.incubator.apache.org, working on
 moving more content from github hosting
 * Moved to issues.apache.org for all new issues
 * Planning first release of parquet-format and parquet-mr. Using
 parquet-format release to identify steps needed to release the
 larger projects (e.g., parquet-mr)
 * Adding documentation on reviews and contacts for specific modules

Signed-off-by:

 [X](parquet) Jake Farrell
 [ ](parquet) Chris Mattmann
 [ ](parquet) Roman Shaposhnik
 [X](parquet) Tom White
 [X](parquet) Todd Lipcon

Shepherd/Mentor notes:
 Mailing list has a healthy traffic, mostly bug reports. Mentors are
 active and participating in the community.

16 Jul 2014

Parquet is a columnar storage format for Hadoop.

Parquet has been incubating since 2014-05-20 .

Three most important issues

 - Finish bootstrapping project(completed), IP clearance (completed),
 initial website (in progress)
 - Expanding the community and adding new committers
 - 1st release

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 - None at this time

Latest Additions:

 * PMC addition:          N/A
 * Contributor addition:  N/A

 Issue backlog status since last report:

 * Created:   8
 * Resolved:  2

 Mailing list activity since last report:

 * @dev       69 messages

How has the project developed since the last report?

 - All bootstrap tickets have been completed and status page updated
   - Mailing lists created, Jira setup, Code imported
 - Jira issues starting to be imported to issues.apache.org
 - Website in the works and will be available soon, infra for this is
 all ready setup
 - Working on documenting contributing guide and committers workflow
   - We have now setup the mechanisms to accept contributions through
   the Apache Github and have already accepted one external contribution.

Date of last release:

 - No releases as of yet.

Signed-off-by:

 [X](parquet) Todd Lipcon
 [X](parquet) Jake Farrell
 [ ](parquet) Chris Mattmann
 [X](parquet) Roman Shaposhnik
 [X](parquet) Tom White

18 Jun 2014

Parquet is a columnar storage format for Hadoop.

Parquet has been incubating since 2014-05-20 .

Three most important issues

 - Finish bootstrapping project, IP clearance, initial website
 - Expanding the community and adding new committers
 - 1st release

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 - None at this time

How has the community developed since the last report?

 - All initial committers have submitted ICLAs and the accounts have been
   created. The mailing lists have been setup and we are starting to use
   them for communication.

How has the project developed since the last report?

 - We have setup the incubator status page and are waiting on the final
   SGA to be sent in to start the code import (INFRA-7782)

Date of last release

 - No releases as of yet. Working through initial IP clearance.

When were the last committers or PMC members elected?

 - N/A, still bootstrapping the project.

Signed-off-by:

 [ ](parquet) Todd Lipcon
 [X](parquet) Jake Farrell
 [ ](parquet) Chris Mattmann
 [X](parquet) Roman Shaposhnik
 [X](parquet) Tom White