The Apache Software Foundation

This was extracted (@ 2020-09-29 22:10) from a list of minutes which have been approved by the Board.
Please note: the Board typically approves the minutes of the previous meeting at the beginning of every Board meeting; therefore, the list below does not normally contain details from the minutes of the most recent Board meeting.

Meeting times vary; the exact schedule is available to ASF Members and Officers (search for "calendar" in the Foundation's private index page, svn:foundation/private-index.html).

Hudi

16 Sep 2020 [Vinoth Chandar / Craig]

Report was filed, but display is awaiting the approval of the Board minutes.

19 Aug 2020 [Vinoth Chandar / Niclas]

## Description:
The mission of Apache Hudi is the creation and maintenance of software related
to providing atomic upserts and incremental data streams on Big Data

## Issues:
There are no issues requiring board attention.

## Membership Data:
Apache Hudi was founded 2020-05-19 (3 months ago)
There are currently 14 committers and 11 PMC members in this project.
The Committer-to-PMC ratio is roughly 7:6.

Community changes, past quarter:
- No new PMC members (project graduated recently).
- No new committers were added.
- Ongoing discussions in the PMC on committer/PMC candidates

## Project Activity:
Development
 - We made great progress towards our 0.6.0 major release and expect the first
   RC this week, as planned by the community.
 - Notably, we have merged support for seamlessly bootstrapping any parquet
   datasets into hudi tables, spark streaming/async compaction support, and
   several performance fixes.
 - The author of RFC-15 (RFCs are the design docs in Hudi) has an initial
   version working, and we made progress on several key RFCs such as record
   indexing and clustering.
 - A few large PRs could not make it into 0.6.0 due to timeline risks/expanded
   scope. The plan is to target these for the first bug fix release in 0.6.x.

Outreach
 - Hudi PMC and contributors from Amazon gave a talk on the past, present, and
   future of PrestoDB/Hudi integration. They also authored a blog post on the
   Presto site.


## Community Health:
 - We held our very first community code/design walkthrough session, attended
   by ~10 contributors across time zones. The slides and video recording were
   shared with the entire community.
 - Since the last report, we saw a good uptick (38%) in conversations on the
   dev mailing list. We also have a users mailing list now, but users still
   prefer dev@.
 - We are nearing 500 members on our Slack channel.
 - Our project JIRA/GitHub activity recorded a 34% uptick as we picked up pace
   towards 0.6.0.

15 Jul 2020 [Vinoth Chandar / Sam]

## Description:
The mission of Apache Hudi is the creation and maintenance of software related
to providing atomic upserts and incremental data streams on Big Data

## Issues:
There are no issues requiring board attention

## Membership Data:
Apache Hudi was founded 2020-05-19 (2 months ago)
There are currently 14 committers and 11 PMC members in this project.
The Committer-to-PMC ratio is roughly 7:6.

Community changes, past quarter:
- No new PMC members (project graduated recently).
- No new committers were added.
- 4 committer candidates and 1 PMC candidate in the pipeline

## Project Activity:
Apache Hudi released 0.5.3, which marks the first release since graduation.
The release contained more than 30 bug/performance fixes. The community also
used this opportunity to rework the release guide as a TLP. This sets us up
well for future releases.

We continued to make steady progress towards the 0.6.0 release, which delivers
several large features. To this end, we have merged ~25 pull requests, and
contributors have proposed ~15 new pull requests.

Press/Articles:
- Uber published an article on Apache Hudi graduation
 https://eng.uber.com/apache-hudi-graduation/
- PMC Member Nishith Agarwal presented Hudi at Berlin Buzzwords

## Community Health:
158 emails (-33%) on dev mailing list, across 38 topics, 41 participants.
1841 (-25%) interactions across Github Issues, Pull requests, JIRA issues.
~500 messages on Slack

Engagement metrics are lower month over month, even as the absolute values
remain healthy.
We do not yet see any clear pattern here (seasonal or otherwise).

17 Jun 2020 [Vinoth Chandar / Patricia]

## Description:
The mission of Apache Hudi is the creation and maintenance of software related
to providing atomic upserts and incremental data streams on Big Data

## Issues:
There are no issues requiring board attention

## Membership Data:
Apache Hudi was founded 2020-05-19 (20 days ago)
There are currently 14 committers and 11 PMC members in this project.
The Committer-to-PMC ratio is roughly 7:6.

Community changes, past quarter:
- No new PMC members (project graduated recently).
- No new committers were added (since graduation).

## Project Activity:
Apache Hudi is currently in the process of finalizing the 0.5.3 release, which
delivers large performance and usability improvements.

The community continues to work towards the next major release, 0.6.0, planned
over the next month or so. We are also adding more testing to further improve
developer velocity and quality. The community also participated in a bug bash
that ran over 10 days.

We have had 4 major design proposals (RFCs) submitted and under review,
targeting major releases beyond 0.6.0.

Blogs/Talks
- Blogs have been moved over to the hudi.apache.org site, and contributors have
 written some useful new blogs.
- Hudi PMC has authored a very popular info.cn article on the technical
 underpinnings of Hudi, which was featured as a top story.
- A couple of planned talks were cancelled due to COVID.

## Community Health:

Dev mailing list activity is a mix of user questions and technical
discussions, which have been pretty steady.

We continue to steadily ship code, with contributors driving a large chunk of
it.

We use Github Issues as the support channel, and there was a lot of growth in
users engaging with us on Github to file support issues.

We are now triaging some flaky test issues, which have slightly affected our
ability to land PRs quickly. The community is actively working on mitigating
this.

20 May 2020

Establish the Apache Hudi Project

 WHEREAS, the Board of Directors deems it to be in the best interests of
 the Foundation and consistent with the Foundation's purpose to
 establish a Project Management Committee charged with the creation and
 maintenance of open-source software, for distribution at no charge to
 the public, related to providing atomic upserts and incremental data
 streams on Big Data.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management Committee
 (PMC), to be known as the "Apache Hudi Project", be and hereby is
 established pursuant to Bylaws of the Foundation; and be it further

 RESOLVED, that the Apache Hudi be and hereby is responsible for the
 creation and maintenance of software related to providing atomic
 upserts and incremental data streams on Big Data; and be it further

 RESOLVED, that the office of "Vice President, Apache Hudi" be and
 hereby is created, the person holding such office to serve at the
 direction of the Board of Directors as the chair of the Apache Hudi
 Project, and to have primary responsibility for management of the
 projects within the scope of responsibility of the Apache Hudi Project;
 and be it further

 RESOLVED, that the persons listed immediately below be and hereby are
 appointed to serve as the initial members of the Apache Hudi Project:

 * Nishith Agarwal <nagarwal@apache.org>
 * Vinoth Chandar <vinoth@apache.org>
 * Anbu Cheeralan <anchee@apache.org>
 * Shaofeng Li <leesf@apache.org>
 * Suneel Marthi <smarthi@apache.org>
 * Prasanna Rajaperumal <prasanna@apache.org>
 * Luciano Resende <lresende@apache.org>
 * Bhavani Sudha <bhavanisudha@apache.org>
 * Balaji Varadarajan <vbalaji@apache.org>
 * Thomas Weise <thw@apache.org>
 * Vino Yang <vinoyang@apache.org>

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that Vinoth Chandar be
 appointed to the office of Vice President, Apache Hudi, to serve in
 accordance with and subject to the direction of the Board of Directors
 and the Bylaws of the Foundation until death, resignation, retirement,
 removal or disqualification, or until a successor is appointed; and
 be it further

 RESOLVED, that the Apache Hudi Project be and hereby is tasked with the
 migration and rationalization of the Apache Incubator Hudi podling; and
 be it further

 RESOLVED, that all responsibilities pertaining to the Apache Incubator
 Hudi podling encumbered upon the Apache Incubator PMC are hereafter
 discharged.

 Special Order 7H, Establish the Apache Hudi Project, was
 approved by Unanimous Vote of the directors present.

15 Jan 2020

Hudi provides atomic upserts and incremental data streams on Big Data

Hudi has been incubating since 2019-01-17.

### Three most important unfinished issues to address before graduating:

 1. Making a sufficient number of Apache releases.
 2. Continue to grow the community.
 3. Work towards graduation. Finish pending issues in the Maturity Matrix
 document:
 https://cwiki.apache.org/confluence/display/HUDI/Apache+Hudi+Maturity+Matrix

### Are there any issues that the IPMC or ASF Board need to be aware of?

 None

### How has the community developed since the last report?

 1. 630 conversations on dev ML across ~100 topics
 2. 70 participants during this period

### How has the project developed since the last report?
 1. ~130 Commits in gitbox
 2. ~200 issues opened on Jira. ~80 issues resolved in Jira
 3. Hudi 0.5.0 (first Apache Release) released. Next release 0.5.1 planned
 for January 2020.
 4. 3 new committers (vinoyang, leesf and bhavanisudha) added to project.
 5. Apache Hudi is now packaged as part of AWS EMR. Apache Hudi talk in
 AWS re:Invent was well received
 6. Project took a first pass at assessing Apache Maturity Model for the
 project.

### How would you assess the podling's maturity?
The project now has a diverse developer and user community, and is seeing
increased adoption.

 - [ ] Initial setup
 - [ ] Working towards first release
 - [X] Community building
 - [X] Nearing graduation
 - [ ] Other:

### Date of last release:

 2019-10-24

### When were the last committers or PPMC members elected?

 2019-11-08 - Bhavani Sudha Saktheeswaran, Vino Yang and Leesf.

### Have your mentors been helpful and responsive?
 Yes. Very helpful!

### Is the PPMC managing the podling's brand / trademarks?
 Yes

### Signed-off-by:

 - [x] (hudi) Thomas Weise
    Comments:
 - [x] (hudi) Luciano Resende
    Comments:
 - [ ] (hudi) Kishore Gopalakrishnan
    Comments:
 - [X] (hudi) Suneel Marthi
    Comments:

### IPMC/Shepherd notes:

16 Oct 2019

Hudi provides atomic upserts and incremental data streams on Big Data

Hudi has been incubating since 2019-01-17.

### Three most important unfinished issues to address before graduating:

 1. Making sufficient number of releases in the Apache way
 2. Growing community further by grooming contributors to committers
 3.

### Are there any issues that the IPMC or ASF Board need to be aware of?

 None

### How has the community developed since the last report?

 1. ~400 conversations on dev ML across ~50 topics
 2. 20-30 participants for each one month period
 3. ~40 support issues opened on GitHub

### How has the project developed since the last report?
 1. ~1500 gitbox activities over the three months
 2. 122 JIRA issues created, 77 resolved
 3. Community voted on two release candidates so far. RC3 underway towards
 first release
 4. ApacheCon NA talk was well received at the conference
 5. Hudi was also featured in a few industry blogs as an interesting project
 in its category.

### How would you assess the podling's maturity?
Please feel free to add your own commentary.

 - [ ] Initial setup
 - [X] Working towards first release
 - [X] Community building
 - [ ] Nearing graduation
 - [ ] Other:

### Date of last release:

 N/A

### When were the last committers or PPMC members elected?

 During inception into incubator

### Have your mentors been helpful and responsive?

 Yes. Very helpful!

### Signed-off-by:

 - [X] (hudi) Thomas Weise
    Comments: Nice open collaboration on mailing list, close to first
    incubator release.
 - [X] (hudi) Luciano Resende
    Comments:
 - [ ] (hudi) Kishore Gopalakrishnan
    Comments:
 - [X] (hudi) Suneel Marthi
    Comments:

### IPMC/Shepherd notes:

17 Jul 2019

Hudi provides atomic upserts and incremental data streams on Big Data

Hudi has been incubating since 2019-01-17.

### Three most important unfinished issues to address before graduating:

 1. Making sufficient number of releases in the Apache way
 2. Legal/IP Clearance of software artifacts (LEGAL-461)
 3. Growing community further by grooming contributors to committers

### Are there any issues that the IPMC or ASF Board need to be aware of?

 1. PODLINGNAMESEARCH-162 has been completed, but is not reflected on Whimsy
 2. The software grant has been signed by Uber, but is not reflected on Whimsy

### How has the community developed since the last report?

 1. Mailing list subscriptions have grown to >50, with 65 new mailing list
 threads
 2. Slack has about 99 signups in total (20-30 WAU); 39 total contributors on
 GitHub; ~25 support issues closed on GitHub
 3. 3 new organizations reported their usage on the Hudi site
### How has the project developed since the last report?

 1. ~66 commits from ~15 contributors/committers, across 2 releases
 2. All development now happening on ASF infrastructure, with source code
 being prepared for ASF release
 3. External talks on DataCouncil SF19 and SF BigAnalytics Meetup

### How would you assess the podling's maturity?
Please feel free to add your own commentary.

 - [ ] Initial setup
 - [X] Working towards first release
 - [ ] Community building
 - [ ] Nearing graduation
 - [ ] Other:

### Date of last release:

 N/A

### When were the last committers or PPMC members elected?

 During inception into incubator.

### Have your mentors been helpful and responsive?

 No Answer.

### Signed-off-by:

 - [X] (hudi) Thomas Weise
    Comments: Nice work on the collaboration side. Are there new
    contributors that could become committer candidates?
 - [X] (hudi) Luciano Resende
    Comments: The issues mentioned above that need board attention
    are probably just a question of updating the project page file
    with the proper done status/date. Please speak up if the community
    needs help from mentors updating the file.
 - [ ] (hudi) Kishore Gopalakrishnan
    Comments:
 - [X] (hudi) Suneel Marthi
    Comments:

### IPMC/Shepherd notes:

17 Apr 2019

Hudi provides atomic upserts and incremental data streams on Big Data

Hudi has been incubating since 2019-01-17.

Three most important unfinished issues to address before graduating:

 1. Make frequent releases as per Apache guidelines
 2. Grow community
 3. Complete SGA, transfer code to ASF infra

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 * PODLINGNAMESEARCH-162 has been completed, but is not reflected on Whimsy
 * SGA process has been delayed due to inability to quickly interact with
 ASF legal counsel. Help expediting this communication would help a lot to
 get the SGA done by Uber.

How has the community developed since the last report?

 - Project source code/docs/issue management all now hosted on apache
 infrastructure
 - HIP, a process for proposing large changes to the project, has been
 formalized by the community
 - 30+ new threads on dev ML, with ~10 non-PPMC contributors

How has the project developed since the last report?

 1. Code has been moved over to apache/incubator-hudi
 2. hudi.apache.org site has been restructured and simplified for
 community consumption
 3. Hudi Improvement Plan (based off Apache Kafka KIP) ratified and
 formalized. The first few HIPs have been written
 4. Submitted a Hudi talk abstract for Kafka Summit 2019
 5. ~20 PRs merged

How would you assess the podling's maturity?
Please feel free to add your own commentary.

 [X] Initial setup
 [ ] Working towards first release
 [ ] Community building
 [ ] Nearing graduation
 [ ] Other:

Date of last release:

 Project still being established in Incubator

When were the last committers or PPMC members elected?

 No new committers since incubation.

Have your mentors been helpful and responsive or are things falling
through the cracks? In the latter case, please list any open issues
that need to be addressed.

 Yes. Mentors are continuing to help us make things better

Signed-off-by:

 [X](hudi) Thomas Weise
    Comments:
 [X](hudi) Luciano Resende
    Comments: The podling is claiming 'Initial Setup'. What is still missing?
    Any help required from mentors ?
 [ ](hudi) Kishore Gopalakrishnan
    Comments:
 [X](hudi) Suneel Marthi
    Comments:

IPMC/Shepherd notes:

20 Mar 2019

Hudi provides atomic upserts and incremental data streams on Big Data

Hudi has been incubating since 2019-01-17.

Three most important issues to address in the move towards graduation:

 1. Make frequent releases as per Apache guidelines
 2. Grow community
 3. Complete SGA, transfer code to ASF infra

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 * PODLINGNAMESEARCH-162 has been completed, but is not reflected on Whimsy

How has the community developed since the last report?

 The project is still being set up. A few new user inquiries, 10+ signups on
 the dev@ ML

How has the project developed since the last report?

 1. Podling name search has been completed (PODLINGNAMESEARCH-162)
 2. hudi.apache.org site has been published with community guidelines
 3. Hudi Improvement Plan (based off Apache Kafka KIP) under review
 4. Agreement over code migration method, JIRA vs Github Issues
 5. Submitted a Hudi talk abstract for upcoming Berlin Buzzwords in June
 2019
 6. Issues migrated to Apache JIRA from GitHub.

How would you assess the podling's maturity?
Please feel free to add your own commentary.

 [X] Initial setup
 [ ] Working towards first release
 [ ] Community building
 [ ] Nearing graduation
 [ ] Other:

Date of last release:

 Project still being established

When were the last committers or PPMC members elected?

 No new committers since incubation.

Have your mentors been helpful and responsive or are things falling
through the cracks? In the latter case, please list any open issues
that need to be addressed.

   Yes. Mentors are actively following up on ML questions and pointing out
   gaps

Signed-off-by:

 [X](hudi) Thomas Weise
    Comments: Nice uptick on mailing list activity and collaboration
    thinking.
 [x](hudi) Luciano Resende
    Comments: A formal improvement process, such as one based on Kafka KIP,
    might be overkill for a podling that is actively looking to
    grow the community. Usually these processes are implemented
    in big communities that want to have some control over stability
    or backward compatibility of the code.
 [ ](hudi) Kishore Gopalakrishnan
    Comments:
 [X](hudi) Suneel Marthi
    Comments: Good adaption of the Apache Way by PPMC.

IPMC/Shepherd notes:

20 Feb 2019

Hudi provides atomic upserts and incremental data streams on Big Data

Hudi has been incubating since 2019-01-17.

Three most important issues to address in the move towards graduation:

 1. Make frequent releases as per Apache guidelines
 2. Grow community
 3.

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?

 None so far

How has the community developed since the last report?

 The project is still being set up

How has the project developed since the last report?

 1. Initial set of committers have filed ICLAs
 2. Jira and GitHub repo have been set up
 3. Mailing lists have been set up

How would you assess the podling's maturity?
Please feel free to add your own commentary.

 [X] Initial setup
 [ ] Working towards first release
 [ ] Community building
 [ ] Nearing graduation
 [ ] Other:

Date of last release:

 Project's still being established

When were the last committers or PPMC members elected?

 Initial set of committers added to the project

Have your mentors been helpful and responsive or are things falling
through the cracks? In the latter case, please list any open issues
that need to be addressed.

 The mentors have been very helpful in getting this podling established.
 No open issues.

Signed-off-by:

 [X](hudi) Thomas Weise
    Comments:
 [X](hudi) Luciano Resende
    Comments:
 [ ](hudi) Kishore Gopalakrishnan
    Comments:
 [X](hudi) Suneel Marthi
    Comments:

IPMC/Shepherd notes: