Apache Logo
The Apache Way Contribute ASF Sponsors

Formal board meeting minutes from 2010 through present. Please Note: The board typically approves minutes from one meeting during the next board meeting, so minutes will be published roughly one month later than the scheduled date. Other corporate records are published, as is an alternate categorized view of all board meeting minutes.

2017 | 2016 | 2015 | 2014 | 2013 | 2012 | 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 | 1999 | Pre-organization meetings

Drill

27 Feb 2017 [Parth Chandra / Bertrand]

## Description:

 - Drill is a Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage

## Issues:

 - There are no issues requiring board attention at this time

## Activity:

 - Since the last board report, Drill has released version 1.9
 - Drill has added many new features since the last report. More Parquet
   reader performance improvements, temp tables support, an improved work
   assignment algorithm,  and an httpd format plugin.
 - Work continues on improved use of statistics, and security enhancements
   (including support for Kerberos) and a sort with managed memory usage.

## Health report:

 - The project is healthy. Development activity is high and is reflected in an
   increase in the number of mails to the mailing list, many new pull requests
   and increased activity in JIRA. Two new committers were added in the last
   period.

## PMC changes:

 - Currently 18 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Sudheesh Katkam on Wed Oct 05 2016

## Committer base changes:

 - Currently 30 committers.
 - New commmitters:
   - Chris Westin was added as a committer on Wed Nov 30 2016
   - Neeraja Rentachintala was added as a committer on Wed Nov 16 2016

## Releases:

 - 1.9.0 was released on Mon Nov 28 2016

## Mailing list activity:

 - Mailing list activity is healthy.

 - dev@drill.apache.org:
    - 436 subscribers (up 2 in the last 3 months):
    - 1919 emails sent to list (1599 in previous quarter)

 - issues@drill.apache.org:
    - 20 subscribers (up 0 in the last 3 months):
    - 2618 emails sent to list (2003 in previous quarter)

 - user@drill.apache.org:
    - 577 subscribers (up 12 in the last 3 months):
    - 372 emails sent to list (430 in previous quarter)

## JIRA activity:

 - 236 JIRA tickets created in the last 3 months
 - 85 JIRA tickets closed/resolved in the last 3 months

16 Nov 2016 [Parth Chandra / Chris]

## Description:
 - Drill is a Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage

## Issues:

 - There are no issues requiring board attention at this time

## Activity:

 - Since the last board report, Drill has released version 1.8
 - Drill has added many new features reffered to in the last report. Dynamic
   UDFs, Parquet reader performance improvements, filter pushdown for Parquet,
   and improved support for Metadata in the clients has been added.
 - Improved use of statistics, and security enhancements (including support for
   Kerberos) continue to be in the works. Also in progress is an improvement to
   the data locality algorithm.

## Health report:

 - There has been a good increase in the number posts in the dev and jira
   lists. This reflects the increased activity on the development front. User
   list activity is down this period, but not a concern at the moment.

## PMC changes:

 - Currently 18 PMC members.
 - Sudheesh Katkam was added to the PMC on Wed Oct 05 2016

## Committer base changes:

 - Currently 28 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Hsuan-Yi Chu at Thu Apr 07 2016

## Releases:

 - 1.8.0 was released on Mon Aug 29 2016

## Mailing list activity:

 - dev@drill.apache.org:
    - 436 subscribers (down -10 in the last 3 months):
    - 1797 emails sent to list (1231 in previous quarter)

 - issues@drill.apache.org:
    - 20 subscribers (up 0 in the last 3 months):
    - 2188 emails sent to list (1550 in previous quarter)

 - user@drill.apache.org:
    - 567 subscribers (down -14 in the last 3 months):
    - 436 emails sent to list (824 in previous quarter)

## JIRA activity:

 - 173 JIRA tickets created in the last 3 months
 - 88 JIRA tickets closed/resolved in the last 3 months

17 Aug 2016 [Parth Chandra / Brett]

## Description:
- Drill is a Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage

## Issues:
- There are no issues requiring board attention at this time.


## Activity:

- Since the last board report, Drill has released version 1.7
- The focus of the releases continues to be on stability and performance.
- We've seen a nice trend in design discussions with developers writing
  detailed design documents that is leading to good feedback.
- Work is in progress on multiple features including dynamic loading of UDF's,
  resource management with YARN, enhanced security, improved use of
  statistics, and performance of reading parquet files. A large part of these
  are being done by new contributors.

## Health report:

- We are continuing to add new users at a steady pace with a healthy number
  of emails being posted by new users.
- The developer community has seen a small growth in the number of people
  providing new contributions both in code and in the discussions. As indicated
  above, we hope to see larger contributions from some of the new contributors.

## PMC changes:

 - Currently 17 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Hanifi Gunes on Thu Feb 11 2016

## Committer base changes:

 - Currently 28 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Hsuan-Yi Chu at Thu Apr 07 2016

## Releases:

 - 1.7.0 was released on Mon Jun 27 2016

18 May 2016

Change the Apache Drill Project Chair

 WHEREAS, the Board of Directors heretofore appointed Jacques
 Nadeau (jacques) to the office of Vice President, Apache Drill, and

 WHEREAS, the Board of Directors is in receipt of the resignation
 of Jacques Nadeau from the office of Vice President, Apache Drill,
 and

 WHEREAS, the Project Management Committee of the Apache Drill
 project has chosen by vote to recommend Parth Chandra (parthc) as
 the successor to the post;

 NOW, THEREFORE, BE IT RESOLVED, that Jacques Nadeau is relieved and
 discharged from the duties and responsibilities of the office
 of Vice President, Apache Drill, and

 BE IT FURTHER RESOLVED, that Parth Chanrda be and hereby is
 appointed to the office of Vice President, Apache Drill, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification, or
 until a successor is appointed.

 Special Order 7H, Change the Apache Drill Project Chair, was
 approved by Unanimous Vote of the directors present.

18 May 2016 [Jacques Nadeau / Brett]

## Description:
- A distributed SQL MPP for Hadoop and NoSQL

## Issues:
- There are no issues requiring board attention at this time.

## Activity:
- Since the last board report, Drill has released versions 1.5 and 1.6.
- These releases saw a continued focus on stability, reliability and
 performance.
- There have been good discussions on the mailing list around items including
 backwards compatibility, performance improvements and technical debt.
- There has been good initial discussions around key development foci for
 Drill 2.0.
- We've seen a nice uptick in discussions about implementing new workload
 management capabilities. A nice attribute of this is combined discussion
 that includes both long-time and newer contributors.


## Health report:
- New user engagement and adoption is on the rise. We've seen new
 interactions from a large number of different users across a wide range of
 use cases.
- The developer community has seen a small growth in the number of people
 providing new code contributions.
- The community continues to find new ways to make development and code
 contribution easier. Recently a powerful new unit testing framework should
 allow easier development of unit tests for new contributors.

## PMC changes:

- Currently 17 PMC members.
- Hanifi Gunes was added to the PMC on Thu Feb 11 2016

## Committer base changes:

- Currently 28 committers.
- Hsuan-Yi Chu was added as a committer on Thu Apr 07 2016

## Releases:

- 1.5.0 was released on Tue Feb 16 2016
- 1.6.0 was released on Wed Mar 16 2016

## JIRA activity:

- 275 JIRA tickets created in the last 3 months
- 122 JIRA tickets closed/resolved in the last 3 months

17 Feb 2016 [Jacques Nadeau / Chris]

## Description:
- A distributed SQL MPP for Hadoop and NoSQL

## Issues:
- there are no issues requiring board attention at this time

## Activity:
- The Drill community has released 1.3, 1.4 since the last board report. The
community vote is also underway for the 1.5 release at the time of this
writing.
- Drill has added a number of powerful capabilities around partition
optimizations
- Work is underway to leverage secondary indexes for improving query
performance
- New connectors have been contributed to work with JDBC sources and image
metadata formats
- Substantial refactoring of the memory allocation and accounting layer was
completed.
- Web and REST security features were added.
- The Drill development community is also working with the newly formed Arrow
community.


## Health report:
- Drill is in a strong phase of user and developer growth. Daily we see new
interested community members providing patches, test and documentation
feedback and general project engagement.
- A number of people are working on specific JIRAs to help improve code
approachability, testing and documentation.
- Two new PMC members have driven there first release, helping to broaden the
responsibility of release management.
- Weekly Google hangouts are well attended and continue to help welcome new
members in a personal way. Decisions continue to happen on list with any
discussions from hangouts reported to the list.

## PMC changes:

- Currently 17 PMC members.
- Hanifi Gunes was added to the PMC on Friday Feb 12 2016

## Committer base changes:

- Currently 27 committers.
- New commmitters:
 - Ellen Friedman was added as a committer on Sun Nov 22 2015
 - Kris Hahn was added as a committer on Fri Dec 04 2015

## Releases:

- 1.4.0 was released on Sat Dec 14 2015
- 1.3.0 was released on Sat Nov 21 2015

## Mailing list activity:

- dev@drill.apache.org:
 - 412 subscribers (up 6 in the last 3 months):
 - 1960 emails sent to list (2576 in previous quarter)

- issues@drill.apache.org:
 - 19 subscribers (up 2 in the last 3 months):
 - 2873 emails sent to list (4068 in previous quarter)

- user@drill.apache.org:
 - 511 subscribers (up 25 in the last 3 months):
 - 979 emails sent to list (1006 in previous quarter)


## JIRA activity:

- 310 JIRA tickets created in the last 3 months
- 156 JIRA tickets closed/resolved in the last 3 months

18 Nov 2015 [Jacques Nadeau / Sam]

## Description:
   A distributed SQL MPP for Hadoop and NoSQL

## Issues:
 - Not at this time.

## Activity:
 - The Drill development community has decided to move back to ~monthly
   releases after having a long gap between the 1.1 and 1.2 releases.
 - A lot of work is currently focused on continued stabilization of the
   codebase as we see larger and more complex user deployments.
 - The community released 1.2 in October and is in the process of releasing 1.3.
 - Some recent new features like access to JDBC sources have drawn in new users.
 - The community is working on a new ValueVector initiative that will broaden
   collaboration on a piece of the Drill codebase. This should help increase
   cross-pollination between the Drill community and other Apache projects.
 - Some recent committers have shown great Apache mentality so it seems likely
   we will add new PMC members shortly. (something that we haven't done since
   we graduated)
 - The community is actively voting on adding a couple new committers whose
   primary contributions are doc and social media related rather than code
   development.
 - We've added two new committers in the last quarter.


## Health report:
 - The community members get along well and are productive.
 - We continue to see nice growth in the user community.
 - New developer contributions have been less frequent that we would like to
   see.
 - Some new developers have found it hard to get started. As such, we continue
   to try to make efforts to ease the effort required around becoming a casual
   contributor. This includes:
   - trying to speed up the unit test suite
   - reducing the memory requirements for the unit test suite
   - switching to a pull request contribution model as opposed to a
     reviewboard/patch model
   - improving developer documentation
   - exploring options for how to ease effort executing the extended test suite
   - marking Newbie tasks on JIRA


## PMC changes:

 - Currently 16 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Parth Chandra on Tue Nov 18 2014

## Committer base changes:

 - Currently 25 committers.
 - New commmitters:
    - Sudheesh Katkam was added as a committer on Tue Nov 03 2015
    - Abdel Hakim Deneche was added as a committer on Mon Aug 31 2015
    - Three votes are new committers

## Releases:

 - 1.2.0 was released on Fri Oct 16 2015

## Mailing list activity:

 - dev@drill.apache.org:
    - 406 subscribers (up 3 in the last 3 months):
    - 2865 emails sent to list (2112 in previous quarter)

 - user@drill.apache.org:
    - 486 subscribers (up 43 in the last 3 months):
    - 1113 emails sent to list (1020 in previous quarter)

 - issues@drill.apache.org:
    - 17 subscribers (down -1 in the last 3 months):
    - 4375 emails sent to list (4667 in previous quarter)


## JIRA activity:

 - 440 JIRA tickets created in the last 3 months
 - 256 JIRA tickets closed/resolved in the last 3 months

21 Oct 2015 [Jacques Nadeau / Chris]

## Description:
   A distributed SQL MPP for Hadoop and NoSQL

## Activity:
 - Drill adoption has seen strong increases in the last few months. This is
   expected as Drill is now something useful to end users (as opposed to being
   mostly in development).
 - There have been a number of community events that have been helpful in
   continuing to drive adoption and awareness of the project. This includes
   countless meetups, talks and tutorials at a number of major conferences (such
   as NoSQL Now & Strata/Hadoop World NYC).
 - The community has been primarily focused on addressing user issues for the
   last few months. Activity continues to increase on the user list post the
   1.0 release of Drill.
 - Code contributors are starting to appear more frequently. These new
   contributions are most often focused on extending Drill (such as storage
   plugins, format plugins and udfs)
 - A key corporate contributor has created a new extended test suite for
   Drill. This will likely be considered for incorporation into the Drill
   codebase to provide additional support for product quality goals.

## Health report:
 - Drill's community diversity is increasing as we see a broadening of
   companies sponsoring engineers to work on Drill. This is due to both new
   contributors and existing contributors moving to new companies.
 - New contributors are arriving but we need to continue to ease their
   experience.
   - New contributors currently struggle with a lack of code documentation.
     The community is working on improving this to ease the newbie experience.
   - New (and existing) contributors find Drill's precommit testing
     requirement to be burdensome as the tests typically take 30-40 minutes to
     complete. As such the community is looking at ways to speed this up.

## Issues:
 - There are currently no issues that require board attention.

## PMC changes:

 - Currently 16 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Parth Chandra at Tue Nov 18 2014

## LDAP changes:

 - Currently 24 committers and 16 committee group members.
 - No new committee group members added in the last 3 months
 - Abdel Hakim Deneche was added as a committer on Mon Aug 31 2015

## Releases:

 - Last release was 1.1.0 on Sun Jul 05 2015

## Mailing list activity:

 - dev@drill.apache.org:
    - 408 subscribers (up 11 in the last 3 months):
    - 2155 emails sent to list (2859 in previous quarter)

 - user@drill.apache.org:
    - 467 subscribers (up 45 in the last 3 months):
    - 1033 emails sent to list (855 in previous quarter)

 - issues@drill.apache.org:
    - 18 subscribers (up 0 in the last 3 months):
    - 3755 emails sent to list (7519 in previous quarter)


## JIRA activity:

 - 431 JIRA tickets created in the last 3 months
 - 229 JIRA tickets closed/resolved in the last 3 months

16 Sep 2015 [Jacques Nadeau / Brett]

No report was submitted.

19 Aug 2015 [Jacques Nadeau / Chris]

No report was submitted.

20 May 2015 [Jacques Nadeau / Brett]

## Description:
   A distributed SQL MPP for Hadoop and NoSQL

## Activity:
 - The community is very active, driving towards a 1.0 release.  The last two
   releases have driven additional engagement on the user mailing list.  While
   we've had some development interest from beyond the core community, there
   is a hope among the PMC that having a 1.0 release will provide an easier
   foundation upon which new contributors can engage in the project.

## Issues:
 - there are no issues requiring board attention at this time

## PMC/Committership changes:

 - Currently 23 committers and 16 PMC members in the project.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Venki Korukanti at Wed Nov 26 2014
 - Hanifi Gunes was added as a committer on Thu Apr 16 2015

## Releases:

 - 0.9.0 was released on Sun May 03 2015
 - 0.8.0 was released on Mon Mar 30 2015

## Mailing list activity:

 - dev@drill.apache.org:
    - 394 subscribers (down -5 in the last 3 months):
    - 2899 emails sent to list (1313 in previous quarter)

 - user@drill.apache.org:
    - 403 subscribers (up 28 in the last 3 months):
    - 628 emails sent to list (662 in previous quarter)

 - issues@drill.apache.org:
    - 19 subscribers (up 1 in the last 3 months):
    - 7790 emails sent to list (3764 in previous quarter)


## JIRA activity:

 - 841 JIRA tickets created in the last 3 months
 - 602 JIRA tickets closed/resolved in the last 3 months

18 Feb 2015 [Jacques Nadeau / Brett]

Description:
Apache Drill is a distributed query layer that supports querying JSON, NoSQL
and Hadoop using SQL.

Current Activity:
There is lots of activity around Drill.  JIRA issues continue to be opened and
closed at a rapid rate.  Regular Google Hangouts also bring the community
closer together to better discuss questions and welcome new contributors to
the community.  Community is working towards the release of Drill 0.8.


Releases:
* No new releases since last report.
* The 0.7 release of Drill was released on 12/23/2014.

Community:
* 530 emails on the dev mailing list in January
* 402 subscribers to dev mailing list
* 219 emails on the user mailing list in January
* 374 subscribers to user mailing list
* 1624 emails to issues list reflecting substantial activity on JIRA and
 commits.

* The PMC has 16 members
* Newest committer: Bridget Bevens (2/2/2015)
* Newest PMC: None added since graduation

Issues:
* There are no issues requiring board attention at this time.

@Brett: Are hangouts documents so non-attendees can participate later?

21 Jan 2015 [Jacques Nadeau / Jim]

Description: Apache Drill is a distributed query layer that supports querying
JSON, NoSQL and Hadoop using SQL.

Current Activity: Drill continues to see positive energy and discussion on the
mailing lists.  One of the great things we are now seeing is a broader set of
users answering new user queries on the mailing list.  This is a positive sign
towards further diversification and health of the community. The website also
has moved to be being based on Jekyll and markdown, which has reduced the
burden for update and thus increased the number of updates and freshness of
the content.

Releases:
* The 0.7 release of Drill was released on 12/23/2014.  This was Drill's first
TLP release and included more than 230 closed JIRAs.

Community:
* 334 emails on the dev mailing list in December
* 177 emails on the user mailing list in December
* The PMC has 16 members
* No new committers or PMC members added since graduation

Issues:
* There are no issues requiring board attention at this time.

17 Dec 2014 [Jacques Nadeau / Bertrand]

Description:
Apache Drill is a distributed query layer that supports querying JSON, NoSQL
and Hadoop using SQL.

Current Activity:
Drill graduated to a TLP at the last board meeting.  Since then, Drill has
migrated mailing lists, the website and the git repository to top-level
resources. Traffic is good on the user and dev mailing lists within many new
user and contributor engagements.

Releases:
* The 0.6 release of Drill was on 10/26/2014
* The 0.7 release is targeted for a release vote in the next couple weeks.
 This will be the first release as a top-level project.

Community:
* 335 emails on the dev mailing list in November
* 204 emails on the user mailing list in November
* The PMC has 16 members
* No new committers or PMC members added since graduation

Issues:
* There are no issues requiring board attention at this time.

19 Nov 2014

Establish the Apache Drill Project

 WHEREAS, the Board of Directors deems it to be in the best interests of the
 Foundation and consistent with the Foundation's purpose to establish a Project
 Management Committee charged with the creation and maintenance of open-source
 software, for distribution at no charge to the public, related to interactive
 analysis of large-scale datasets.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management Committee (PMC), to
 be known as the "Apache Drill Project", be and hereby is established pursuant
 to Bylaws of the Foundation; and be it further

 RESOLVED, that the Apache Drill Project be and hereby is responsible for the
 creation and maintenance of software related to interactive analysis of
 large-scale datasets; and be it further

 RESOLVED, that the office of "Vice President, Apache Drill" be and hereby is
 created, the person holding such office to serve at the direction of the Board
 of Directors as the chair of the Apache Drill Project, and to have primary
 responsibility for management of the projects within the scope of
 responsibility of the Apache Drill Project; and be it further

 RESOLVED, that the persons listed immediately below be and hereby
 are appointed to serve as the initial members of the Apache Drill
 Project:

   * Jacques Nadeau <jacques@apache.org>
   * Tomer Shiran <tshiran@apache.org>
   * Ted Dunning <tdunning@apache.org>
   * Jason Frantz <jason@apache.org>
   * MC Srivas <srivas@apache.org>
   * Keys Botzum <kbotzum@apache.org>
   * Julian Hyde <jhyde@apache.org>
   * Tim Chen <tnachen@apache.org>
   * Mehant Baid <mehant@apache.org>
   * Jinfeng Ni <jni@apache.org>
   * Venki Korukanti <venki@apache.org>
   * Jason Altekruse <json@apache.org>
   * Aditya Kishore <adi@apache.org>
   * Parth Chandra <parthc@apache.org>
   * Aman Sinha <amansinha@apache.org>
   * Steven Phillips <smp@apache.org>

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that Jacques Nadeau be appointed to
 the office of Vice President, Apache Drill, to serve in accordance with and
 subject to the direction of the Board of Directors and the Bylaws of the
 Foundation until death, resignation, retirement, removal or disqualification,
 or until a successor is appointed; and be it further

 RESOLVED, that the Apache Drill Project be and hereby is tasked with the
 migration and rationalization of the Apache Incubator Drill podling; and be it
 further

 RESOLVED, that all responsibilities pertaining to the Apache Incubator Drill
 podling encumbered upon the Apache Incubator Project are hereafter discharged.

 Special Order 7C, Establish the Apache Drill Project, was
 approved by Unanimous Vote of the directors present.

17 Sep 2014

Description:

Apache Drill is a distributed system for interactive analysis of
large-scale datasets that is based on Google's Dremel. Its goal is to
efficiently process nested data, scale to 10,000 servers or more and to be
able to process petabyes of data and trillions of records in seconds.

Drill has been incubating since 2012-08-11.

In the previous reports, the following were listed as goals before
graduation

 1. Complete the feature set
 2. Continue to attract new developers/contributors with a variety of
    skills and viewpoints
 3. Continue the outreach activities to build the early user community for
    the technology

These have been achieved and the podling has made several releases with no
more than minor issues that were related to changing requirements for
notices in incubator projects. The next release (0.5) is currently being
voted on.  Subsequent to that, the podling is likely to vote to request the
board to graduate Drill to TLP status.

Issues to Call to Attention of PMC or ASF Board:

 None

How community has developed since last report:

 Community awareness and outreach were strengthened in multiple forums as
 below

 8/7/14  Big Data Analytics Melbourne         MC Srivas
 8/13/14 Chicago HUG        Chicago           Jim Scott
 8/20/14 Pittsburgh HUG     Pittsburgh        Andy Pernsteiner
 8/21/14 Heartland Big Data Omaha, NE         Neeraja Rentachintala
 8/26/14 Data Mining        San Francisco, CA Tomer Shiran

 Mailing list discussions:

 Activity summary for the user mailing list:

   http://mail-archives.apache.org/mod_mbox/incubator-drill-user/

   * June 2014: 79
   * July 2014, 12
   * August 2014, 63

 Activity summary for the dev mailing list:

   http://mail-archives.apache.org/mod_mbox/incubator-drill-dev/

   * June 2014, 374
   * July 2014, 294
   * August 2014, 247

 For details of code commits, see
 https://github.com/apache/incubator-drill/graphs/commit-activity (about
 400 commits in the past 3 months)

 31 contributors have participated in GitHUB code activity; there have
 been 152 forks.

 Community Interactions

 Monthly Drill hangout continues, conducted remotely through Google
 hangouts Tuesday mornings 10am Pacific Time to keep core developers in
 contact in realtime despite geographical separation.

 Community stays in touch through @ApacheDrill Twitter ID, and by postings
 on various blogs including Apache Drill User http://drill-user.org/ which
 has had several updates and through international presentations at
 conferences.

 Articles

 Examples of articles or reports on Apache Drill since last report
 include:

 * Self Service Data Exploration is Here by Neeraja Rentachintala

 Social Networking

 @ApacheDrill Twitter entity is active and has grown substantially by 20%,
 to 1057 followers.

How project has developed since last report

 Web-site clean slate revamp

 Significant progress has been made in performance and stability

 New functionality has been added to the product including reading and
 writing complex types in Parquet, as well as using hadoop 2 API for
 Parquet

 Nearly ~450 bugs filed and ~550 bugs resolved

 New docs have been published on Drill wiki ( Develop Custom Functions,
 Querying HBase Tables, Querying Complex Data)

 Started monthly releases. 0.4 release at end of July.
 Announcement: http://s.apache.org/t0a

 0.5 release currently up for vote.

Signed-off-by:

 [x](drill) Ted Dunning
 [x](drill) Grant Ingersoll
 [ ](drill) Isabel Drost-Fromm
 [X](drill) Sebastian Schelter

--------------------
Falcon
Falcon is a data processing and management solution for Hadoop designed for
data motion, coordination of data pipelines, lifecycle management, and data
discovery. Falcon enables end consumers to quickly onboard their data and
its associated processing and management tasks on Hadoop clusters.

Falcon has been incubating since 2013-03-27.

Three most important issues to address in the move towards graduation:

 1. Continue to build community

Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware
of?
- No

How has the community developed since the last report?
* Three more committers were invited and they have accepted to join the
 project
* More users & contributors have joined the falcon project and the
 community continues to grow

How has the project developed since the last report?
* Development activity has been very hectic more than 200 JIRAs have been
 created and about 120 of them resolved since the last report
* There are more than 100 users subscribed on the dev mailing list
* We have formaulated bi weekly sync up to coordinate with developers and
 contributors across the world
* 0.5-incubating release has been withdrawn due to LICENSE & NOTICE issues
 and same is intended to be prepared for vote shortly and will be released
 in Sep 2014.

Date of last release:

 2014-02-03 (0.4-incubating)

When were the last committers or PMC members elected?
 Aug 28, 2014

Signed-off-by:

 [ ](falcon) Arun Murthy
 [X](falcon) Chris Douglas
 [ ](falcon) Owen O'Malley
 [ ](falcon) Devaraj Das
 [X](falcon) Alan Gates

18 Jun 2014

Description:

Apache Drill is a distributed system for interactive analysis of large-scale
datasets that is based on Google's Dremel. Its goal is to efficiently
process nested data, scale to 10,000 servers or more and to be able to
process petabyes of data and trillions of records in seconds.

Drill has been incubating since 2012-08-11.

Three Issues to Address in Move to Graduation:

 1. Complete the 1.0 feature set (team targets next release and graduation
    in the month of July)
 2. Continue to attract new developers/contributors with a variety of
    skills and viewpoints
 3. Continue the outreach activities to build the early user community for
    the technology

Issues to Call to Attention of PMC or ASF Board:

 None

How community has developed since last report:

 Community awareness and outreach were strengthened in multiple forums as
 below

 * First Apache Drill Hackathon was organized on 4/24. Over 40 participants
   including members from Visa, Linkedin, Cisco, Hortonworks worked to
   harden/enhance Drill project. Several new features have been added to
   Drill product Array reference functions, enhanced Optiq support, Kafka
   storage plugin, robust testing framework etc

 * Hive big data think tank meet up on 5/14- Talk by MC Srivas, with ~200
   member participation
 * Open Source Cloud meet up on 4/23 - Talk by Keys Botzum
 * Apache Conference session on 4/8 - Talk by Neeraja Rentachintala, with
   ~100 members participation

 Apache Drill is also showcased at the Hadoop Summit 6/3-6/5

 Mailing list discussions:

 Activity summary for the user mailing list:

   http://mail-archives.apache.org/mod_mbox/incubator-drill-user/

   * June to date 6/10: 28
   * May 2014, 82
   * March 2013, 15

 Activity summary for the dev mailing list:

   http://mail-archives.apache.org/mod_mbox/incubator-drill-dev/

   * June to date 6/10: 87 (jira focussed discussions were removed from
     this thread recently)
   * May 2014, 1183(jira, focused discussions)
   * April 2014, 772 (jira; focused discussions)

 For details of code commits, see
 https://github.com/apache/incubator-drill/graphs/commit-activity (about
 300 commits in the past 3 months)

 26 contributors have participated in GitHUB code activity; there have been
 142 forks.

 Community Interactions

 Weekly Drill hangout continues, conducted remotely through Google hangouts
 Tuesday mornings 10am Pacific Time to keep core developers in contact in
 realtime despite geographical separation.

 Community stays in touch through @ApacheDrill Twitter ID, and by postings
 on various blogs including Apache Drill User http://drill-user.org/ which
 has had several updates and through international presentations at
 conferences.

 Articles

 Examples of articles or reports on Apache Drill since last report include:

 * Drill Hackathon summary blog post by Jacques Nadeau
 * Drill milestone roadmap blog post by Neeraja Rentachintala
 * Drill code samples by Nitin Bandugula

 Social Networking

 @ApacheDrill Twitter entity is active and has grown substantially by 19%,
 to 887 followers.

How project has developed since last report

 Significant progress is being made on the performance and distributed
 optimization

 C++ client API and ODBC driver leveraging the C++ API was built for Drill
 by a group led by George Chow in Vancouver. The initial drops for the
 driver are available

 New functionality has been added to the product namely distributed
 optimization, join order optimization, Table/view creation, repeated map
 support, HBase support, expanded SQL support, Text readers, new data types
 and functions, Session options for query tuning and lot more

 Nearly ~500 bugs files and ~400 bugs resolved

 Significant progress on running ANSI standard queries such as TPC-H

 Significant code drops have been checked in from a number of contributors
 and committers

 New docs have been published on Drill wiki (Apache Drill in 10 mins,
 Working with various data sources and Installing and Running Apache Drill
 on a cluster)

 Work toward a Beta milestone is progressing substantially.

Signed-off-by:

 [x](drill) Ted Dunning
 [x](drill) Grant Ingersoll
 [ ](drill) Isabel Drost-Fromm
 [x](drill) Sebastian Schelter

Shepherd/Mentor notes:

 Konstantin Boudnik (cos):

   Project's dev@ list is very active both with the JIRA traffic and
   otherwise.  June report to the board hasn't been sent on time.

19 Mar 2014

Description:

Apache Drill is a distributed system for interactive analysis of large-scale
datasets that is based on Google's Dremel. Its goal is to efficiently
process nested data, scale to 10,000 servers or more and to be able to
process petabyes of data and trillions of records in seconds.

Drill has been incubating since 2012-08-11.

Three Issues to Address in Move to Graduation:

 1. Continue to attract new developers and and early users with a variety
    of skills and viewpoints
 2. Continue to develop deeper community skills and knowledge by building
    additional releases
 3. Demonstrate community robustness by rotating project tasks among
    multiple project members

Issues to Call to Attention of PMC or ASF Board:

 None

How community has developed since last report:

 Community awareness and participation were strengthened through a meeting
 of the Bay Area Apache Drill User Group in San Jose sponsored by Yahoo!
 This event expanded participation to include many new to Drill and
 particularly those interested as potential users (analysts rather than
 developers).

 Speakers included Drill project mentor Ted Dunning from MapR, Data
 Scientist Will Ford from Alpine Data Labs, new Drill committer Julian Hyde
 from HortonWorks and Aman Sinha, MapR Drill engineer.

 Additional events include:

 • Two new Drill committers accepted appointment: Julian Hyde (HortonWorks)
   and Tim Chen (Microsoft).
 • Drill has a new project mentor, Sebastian Schelter.

 Mailing list discussions:

 Subscriptions to the Drill mailing lists have risen to 399 on dev list and
 308 on the user list and 508 uniques across both lists.  There has been
 active and increasing participation in discussions on the developer
 mailing list, including new participants and developers. Participation on
 the user list is growing although still small; mainly activity takes place
 on developer mailing list.

 Activity summary for the user mailing list:

 http://mail-archives.apache.org/mod_mbox/incubator-drill-user/

   February to date 02/26/2014: 25
   January 2014, 12
   December 2013, 62

 Topics in discussion on the user mailing list included but not limited to:

 • Feb 2014: Connecting Drill to HBase, Support for Distinct/Count
 • Jan 2014: Loading Data into Drill, Data Locality
 • December 2013: Loading Data into Drill, Setting Drill with HDFS and
   other Storage engines

 Activity summary for the dev mailing list:

   http://mail-archives.apache.org/mod_mbox/incubator-drill-dev/
   February to date 02/26/2014: 250 (jira; discussion; review requests)
   January 2014, 156 (jira; focused discussions)
   December 2013, 51 (jira; focused discussions)

 Topics in discussion on the dev mailing list included but not limited to:

 • February to date 02/26/2014: How to contribute to Drill; review requests
   for Drill 357, 346, 366, 364; status of Drill functions including Hash
   functions; support operators +,- for date and interval arithmetic
 • January: Sql Options discussions, Casting discussions, Multiplex Data
   Channel feedbacks
 • December: Guide for new comers contribution, Aggregate functions code
   gen feedback Code

 For details of code commits, see http://bit.ly/14YPXN9
 There has been continued activity in code commits

 19 contributors have participated in GitHUB code activity; there have been
 116 forks.

 February code commits include but not limited to: Support for
 Information_schema, Hive storage and metastore integration, Optiq JDBC
 thinning and refactoring, Math functions rework to use codegen, Column
 pruning for Parquet/Json, Moving Sql parsing into Drillbit server side,
 TravisCI setup

 January code commits include but not limited to: Implicit and explicit
 casting support, Broadcast Sender exchange, add TPC-H test queries,
 Refactor memory allocation to use hierarchical memory allocation and
 freeing.

 Community Interactions

 Weekly Drill hangout continues, conducted remotely through Google hangouts
 Tuesday mornings 9am Pacific Time to keep core developers in contact in
 realtime despite geographical separation.

 Community stays in touch through @ApacheDrill Twitter ID, and by postings
 on various blogs including Apache Drill User http://drill-user.org/ which
 has had several updates and through international presentations at
 conferences.

 Viability of community is also apparent through active participation in
 the Bay Area Apache Drill User group meeting in early November, which has
 grown to 440 members.

 Sample presentations:

 • “How to Use Drill” by Ted Dunning and Will Ford, Bay Area Apache Drill
   Meet-up 24 February
 • “How Drill Addresses Dynamic Typing” by Julian Hyde, Bay Area Apache
    Drill Meet-up 24 February
 • “New Features and Infrastructure Improvements” by Aman Sinha, Bay Area
    Apache Drill Meet-up 24 February

 Articles

 Examples of articles or reports on Apache Drill since last report include:

 • Drill blog post by Ellen Friedman at Apache Drill User updating
   community on how people will use Drill and inviting comments/ questions
   from remote participants as part of the Drill User Group
   http://bit.ly/1p1Qvgn
 • Drill blog post by Ellen Friedman at Apache Drill User reports on
   appointment of new Drill committers and new mentor http://bit.ly/JIcwQe

 Social Networking

 @ApacheDrill Twitter entity is active and has grown substantially by 19%,
 to 744 followers.

How project has developed since last report:

 1. Significant progress is being made on execution engine and sql front
    end to support more functionality, also more integrations with storage
    engines.
 2. Work on ODBC driver has begun with a new group led by George Chow in
    Vancouver.
 3. Significant code drops have been checked in from a number of
    contributors and committers
 4. Work toward 2nd milestone is progressing substantially.

Signed-off-by:

 [x](drill) Ted Dunning
 [x](drill) Grant Ingersoll
 [x](drill) Isabel Drost-Fromm
 [x](drill) Sebastian Schelter

Shepherd/Mentor notes:

 Isabel Drost-Fromm (isabel):

   For the next report, please include information on date of last
   release and when last committer/PMC member was elected.

18 Dec 2013

Apache Drill is a distributed system for interactive analysis of large-scale
datasets that is based on Google's Dremel. Its goal is to efficiently process
nested data, scale to 10,000 servers or more and to be able to process
petabyes of data and trillions of records in seconds.

Drill has been incubating since 2012-08-11.

Three Issues to Address in Move to Graduation:

 1. Continue to attract new developers and and early users with a variety of
    skills and viewpoints
 2. Continue to develop deeper community skills and knowledge by building
    additional releases
 3. Demonstrate community robustness by rotating project tasks among
    multiple project members.

 The community has made significant progress on items 1 and 2.

Issues to Call to Attention of PMC or ASF Board:

 none

How community has developed since last report:

 Community awareness and participation were strengthened through a meeting
 of the Bay Area Apache Drill User Group with over 100 participants locally
 in San Jose and remotely via Cisco-hosted Webex. On site speakers included
 3 Drill contributors, two from San Jose and one from Seattle
 http://www.meetup.com/Bay-Area-Apache-Drill-User-Group/

 Additional events include:

   * Code for 1st milestone release was posted and made available via
     project website; release was socialized via mailing list, Twitter,
     blogs and presentations
   * Several new full-time developers joined the project
   * Apache Drill received a Bossie award "Best open source big data tools"
     17 September 2013 http://s.apache.org/B3H (infoworld.com)

 Mailing list discussions:

   Subscriptions to the Drill mailing lists have risen to 415. There has
   been active and increasing participation in discussions on the developer
   mailing list, including new participants and developers. Participation
   on the user list is growing although still small; mainly activity takes
   place on developer mailing list.

 Activity summary for the user mailing list:

   http://mail-archives.apache.org/mod_mbox/incubator-drill-user/
   December to date 12/5/2013: 14
   November 2013, 28
   October 2013, 37
   September 2013, 13

 Topics in discussion on the user mailing list included but not limited to:

   How to load data into Drill; use of distributed mode; direction to study
   the src

 Activity summary for the dev mailing list:

   http://mail-archives.apache.org/mod_mbox/incubator-drill-dev/

   * December to date 12/6/2013: 35 (mainly jira; some discussion)
   * November 2013, 347  (jira, focused discussions)
   * October 2013, 299  (jira; focused discussions)
   * September 2013, 659 (jira, focused discussions)

 Recent topics on the dev mailing list have included:

   * Buffer allocation of cast into var length type.
   * Improve Parquet read  performance.
   * Schema discovery tool for scanning raw files and generating optiq schema.
   * Discussions on CAST functionality.
   * sqlline connect to Remote Drill
   * Limit operator end-to-end
   * many focused discussions about 1st milestone release

 Code

   Total Commits from 01/09/2013 - 06/012/2013: 106

   Detailed information regarding the commits is shown in this chart from
   GitHub: http://s.apache.org/MnI (github.com)

   Nine contributors have participated in this GitHub code activity; there
   have been 101 forks of the Apache Drill project on GitHub which is a
   good indicator of strong interest outside the group of core
   contributors.

   Code commits (during the period 01/09/2013 - 06/012/2013) include but
   not limited to following:

   * Spooling batch buffer
   * Fix over memory pre-allocation within ParquetRecordReader.
   * Implement simple metrics framework
   * prepare release drill-1.0.0-m1.
   * Implement builders for Scan, Sort, LogicalPlan and PlanProp.

 Community Interactions

   The weekly Drill hangout continues, conducted remotely through Google
   hangouts Tuesday mornings 9am Pacific Time to keep core developers in
   contact in realtime despite geographical separation. A Gdoc is being
   updated regarding the discussions during hangout.
   http://s.apache.org/4Gc (docs.google.com)

   The community stays in touch through @ApacheDrill Twitter ID, and by
   postings on various blogs including Apache Drill User
   http://drill-user.org/ which has had several updates and through
   international presentations at conferences.

   Viability of the community is also apparent as participants in the open
   source Apache Drill community came together on November 4th meet-up of
   the Bay Area Apache Drill User Group, with 391 members enrolled. The
   group looked at how Drill works now and what will be the next steps in
   the project. The event marked the recent first official release of the
   Apache Drill project.

 Presentations

   * A talk on Apache Drill by Michael Hausenblas is scheduled on Tue 10
     Dec "Query engine for heterogenous large scale datasets" at Decemberi
     Big Data Meetup in Budapest
     http://www.meetup.com/Big-Data-Meetup-Budapest/events/138089032/
   * WebEx of talks by Jacques Nadeau, Tim Chen and Steven Phillips at Bay
     Area Drill User Group; play video:
     http://s.apache.org/eN (cisco.webex.com)
   * Drill talk by Michael Hausenblas at JAX London "Large-scale,
     interactive ad-hoc queries over different data stores with Apache
     Drill" including demo, 29 October 2013
   * Podcast by Jacques Nadeau at All Things Hadoop, episode 17: "Using
     Apache Drill for Large Scale, Interactive, Real-Time Analytic Queries"
     http://s.apache.org/8ZM (allthingshadoop.com)
   * Drill talk by Michael Hausenblas at the Stockholm HUG "Interactive
     analytics for large-scale data-sets"
     https://speakerdeck.com/mhausenblas/hug-stockholm-apache-drill

 Sample Articles/presentations (out of many):

   * How to use Apache Drill (inc distributed mode) Detailed description is
     available via Drill Github sandbox.
     https://github.com/mhausenblas/apache-drill-sandbox/tree/master/M1
   * Interactive analytics: large scale data-set
     https://speakerdeck.com/mhausenblas/hug-stockholm-apache-drill
   * Lifetime of a Query in Drill by Timothy Chen, includes link to his
     slides.  http://s.apache.org/INZ (tnachen.wordpress.com)
   * Drill blog post by Ellen Friedman at Apache Drill User reports on
     meet-up of the Bay Area Apache Drill User Group
     http://s.apache.org/a8r (drill-user.org)
   * Drill article posted by Ted Dunning on MapR Technologies blog site 8
     November 2013: "Apache Drill Achieves 1st Milestone Release"
     http://www.mapr.com/blog?s=Apache+Drill
   * Blog post by Yash Sharma on "How to Contribute to Apache Drill:
     Implementing Drill Math Functions"
     http://s.apache.org/od (confusedcoders.com)

 Social Networking

   @ApacheDrill Twitter entity is active and has grown by ~44%, to 632
   followers.

How project has developed since last report:

 1. Approval of first release of Apache Drill - M1 achieved; code posted
    via project website
 2. Query now works in distributed mode
 3. Significant code drops have been checked in from a number of developers
    to achieve Milestone 2 release.
 4. New developers are contributing.

Signed-off-by:

 [x](drill) Ted Dunning
 [x](drill) Grant Ingersoll
 [x](drill) Sebastian Schelter

Shepherd notes:

 Matt Franklin (mfranklin):

   I have reviewed the community and have concluded that it is very active and
   healthy.  IMO, #3 in the issues to address before graduation is not critical
   and can be successfully mitigated by documenting the tasks that are currently
   being executed by single individuals.  My recommendation is that the Drill
   podling begin the graduation process after completion of the next release.

 Marvin Humphrey (marvin):

   The report is admirably thorough and must have taken a long time to
   prepare.  Perhaps consider that a report with a reduced level of detail
   similar to the other reports may be more in line with Board
   expectations.

   The report was filed too late to be incorporated into a review by the
   assigned shepherd.  Please file on time.

   Please wrap future reports at 77 columns, and please take more care with
   making indentation reflect logical hierarchy.  Also, use only
   s.apache.org URL shorteners in future reports.  Otherwise, someone
   downstream must clean up the report -- either someone from the IPMC (I
   took care of reformatting this month) or the Board -- before it is
   published in the official Board minutes.

18 Sep 2013

Apache Drill is a distributed system for interactive analysis of large-scale
datasets that is based on Google's Dremel. Its goal is to efficiently process
nested data, scale to 10,000 servers or more and to be able to process
petabyes of data and trillions of records in seconds.

Drill has been incubating since 2012-08-11.

Three Issues to Address in Move to Graduation:

 1. Continue to attract new developers and and early users with a variety of
    skills and viewpoints
 2. Continue to develop deeper community skills and knowledge by building
    additional releases
 3. Demonstrate community robustness by rotating project tasks among
    multiple project members

The community has made significant progress on items 1 and 2.

Issues to Call to Attention of PMC or ASF Board:

 none

How community has developed since last report:

 The most important activity is the run up to the Milestone 1 release.

 Additional events include:

   * Apache Drill project website redesigned to have a new look:
     http://incubator.apache.org/drill/

   * Interactive "How to Run Drill" demo added to the Apache Drill wiki:
     https://cwiki.apache.org/confluence/display/DRILL/Demo+HowTo

 Mailing list discussions:

   Subscriptions to the Drill mailing lists have risen to 383. There has
   been active and increasing participation in discussions on the developer
   mailing list, including new participants and developers. Participation on
   the user list is growing although still small; mainly activity takes
   place on developer mailing list.

 Activity summary for the dev mailing list:

   http://mail-archives.apache.org/mod_mbox/incubator-drill-dev/

   * September to date 05/010/2013: 397(mainly jira; some discussion)
   * August 2013, 394  (jira, focused discussions)
   * July 2013,370  (jira; focused discussions)
   * June 2013,297 (jira, focused discussions)

 Recent topics on the dev mailing list have included:

   * Usability and introductory tutorials
   * SQL semantics and extensions to type inference cases
   * Implementation of various storage engines, including Parquet and ORC.
   * Optimizer rewrites and operator implementations.

 Code

   For details of code commits, see http://bit.ly/14YPXN9

   There has been a very significant ramp up of code commits during this
   quarter, as shown in this chart from GitHub:

   Ten contributors have participated in this GitHub code activity; there
   have been 77 forks of the Apache Drill project on GitHub which is a good
   indicator of strong interest outside the group of core contributors.

   Recent code commits include but not limited to:

     * full end-to-end execution of queries
     * reorganization of the source tree to simplify initial user experience
     * a number of new operators for the execution engine
     * a pro tempore query optimizer that allows a physical plans to be
       generated
     * the entire code generation framework
     * Value Vector implementation

 Community Interactions

   The weekly Drill hangout continues, conducted remotely through Google
   hangouts Tuesday mornings 9am Pacific Time to keep core developers in
   contact in realtime despite geographical separation.

   The community stays in touch through @ApacheDrill Twitter ID, and by
   postings on various blogs including Apache Drill User
   http://drill-user.org/

   Viability of community is also apparent through interest in next meet-up
   event for the Bay Area Apache Drill User group in late September, which
   is already attracting a robust audience. Volunteers are coming forward
   from audience members of presentations, such as the Drill workshop in
   July (see following).

 Presentations

   There have been presentations and a Drill workshop from community members
   at conferences and meet-ups.

   Several Drill contributors have other talks scheduled with different
   meetups in the upcoming months.

 Sample presentations (out of many):

   * Drill talks by @mhausenblas at Hive London and in Paris in June
   * Talk on Apache Drill by @mhausenblas and @ted_dunning at Berlin
     Buzzwords
   * Apache Drill hands-on workshop by @ted_dunning and @intjesus at OSCON
     in Portland, Oregon USA in July for ~40 participants.
   * Apache Drill project featured by panelist @tshiran in Aug for the
     "Hadoop + SQL" Hive Data Think Tank event in California Bay Area.
   * Next meeting for the Bay Area Apache Drill User group is planned for
     September with talk and demo by Steve Phillips

 Slides

   Slides from Drill presentations posted online such as at slideshare get a
   large number of views.

   Example:

     OSCON Apache Drill workshop posted 1 Aug 2013 by Ted Dunning and
     Jacques Nadeau, 436 views.

 Articles

   Examples of articles on Apache Drill since last report include:

   * Article by @mhausenblas and @intjesus "Introduction to Apache Drill:
     Interactive Ad-Hoc Query for Large-scale Datasets"  Michael Hausenblas
     and Jacques Nadeau. Big Data. June 2013, 1(2): 100-104.
     doi:10.1089/big.2013.0011. http://bit.ly/15101Y7
   * A blog post by @Ellen_Friedman reports on that Drill-via-Amazon-Cloud
     event and includes links to slides: http://bit.ly/18aS3Lk
   * Drill  blog article by S. J. Vaughan-Nichols "Drilling into Big Data
     with Apache Drill" in Aug:  http://bit.ly/1309MXA
   * A blog posting on Drill by T. Shiran as a prelude to the Hadoop + SQL
     event by Hive Data Think Tank can be found here: http://bit.ly/1cvxn5D

 Social Networking

   @ApacheDrill Twitter entity is active and has grown by ~20%, to 437
   followers.

How project has developed since last report:

 1. Website homepage has a new design
 2. Wiki has been updated
 3. Significant code drops have been checked in from a number of developers
 4. Started to create release candidates for the milestone one [first]
    release of Drill
 5. New developers are contributing.
 6. Additional non-code contributors have become active and are being
    encouraged

Signed-off-by:

 [ ](drill) Ted Dunning
 [X](drill) Grant Ingersoll
 [X](drill) Isabel Drost-Fromm

19 Jun 2013

Description:

Apache Drill is a distributed system for interactive analysis of
large-scale datasets that is based on Google's Dremel. Its goal is to
efficiently process nested data, scale to 10,000 servers or more and
to be able to process petabyes of data and trillions of records in
seconds.

Drill has been incubating since 2012-08-11.

Three Issues to Address in Move to Graduation:

1. Continue to attract new developers with a variety of skills and viewpoints
2. Develop community skills and knowledge by building some releases
3. Demonstrate community robustness by rotating project tasks among multiple
  project members

Issues to Call to Attention of PMC or ASF Board:

none

How community has developed since last report:

Mailing list discussions:

There has been active participation in discussions on the developer mailing
list, including new participants and developers. A few have participated in
the users list; mainly activity takes place on developer mailing list.

Activity summary:

http://mail-archives.apache.org/mod_mbox/incubator-drill-dev/
June to date 5 June, 29 (mainly jira; some discussion)
May 2013, 135  (jira, focused discussions)
April 2013, 188  (jira; focused discussions)
March 2013 260 (jira, focused discussions)

Topics in discussion on the dev mailing list included but not limited to:

* Evolution of logical plan syntax with addition of operators including
 the Value and Union Distinct operators

* Advantages and disadvantages of Parquet versus ORC

* ValueVector construct and requirements

* The relative performance of Janino based compilation versus
 javax.tools.Javacompiler

* Initial development of execution engine environment

* Discussion of various types of large array and off heap data structure
 libraries

* RPC protocol and framework


Code

For details of code commits, see http://bit.ly/14YPXN9 and http://bit.ly/19IyID1
There has been great progress around both evolution of the reference interpreter and

In the last three months, there have been many commits including:

* Initial implementation of RPC framework

* Base client and Zookeeper based client abstraction

* SQL parser with JDBC driver

* Distributed query scheduling framework

* ValueVector implementations

* Large number of reference interpreter tests and fixes


Community Interactions

There is now a weekly Drill hangout conducted remotely through Google
hangouts Tuesday mornings 9am Pacific Time to keep core developers in
contact in realtime despite geographical separation.  Results from
these discussions are shared with the discussion list through meeting
minutes and all are welcome to attend.  This has been helpful in
speeding development and averages attendance of 8-10 developers each
week.

Presentations

There have been presentations from community members at conferences, meet-ups
and through the weekly Google hangout.

* As you can see from http://drill-user.org/ there were few more
 HUGs/BUGs where Drill was presented/discussed (in Europe) - the blog
 itself might also be considered to manifest a contribution (?)

* We have published an article on Drill in the Big Data journal
 http://www.liebertpub.com/big

Sample presentations:
* Introduction to Apache Drill, Bay Area Analytics Group 2 April 2013 by
 Tomer Shiran

* Interactive Ad hoc query at scale: talk at Hadoop User Group UK by
 @mhausenblas

* Apache Drill Technical Overview: talk at Google Hangout, May 22 by
 Jacques Nadeau available at http://slidesha.re/123mSDh

* Drill Technical update @April 16 Hangout by Jacques Nadeau available at
 http://slidesha.re/ZDBvWP

* Drill Dissection at NoSQL matters (April) @mhausenblas video available
 at http://bit.ly/13Ffk7b

* All You Need to Know About Drill, talk during Big Data Week #bdw13 by
 Michael Hausenblas on 26 April http://bit.ly/17L1rD

* Deep Dive into Drill Implementation 3 June at Berlin Buzzwords by Ted
 Dunning and Michael Hausenblas


Slides

Slides from Drill presentations posted online such as at slideshare get a
large number and increasing number of views.

Articles

An invited interview with Ted Dunning in an O’Reilly white paper by
Mike Barlow titled “Real Time Big Data Analytics: Emerging
Architecture” discussed Apache Drill; there have been a number of blog
posts.

Social Networking

@ApacheDrill Twitter entity is active and has grown to 362 followers.

How project has developed since last report:

1. Wiki has been updated regularly
2. Significant code drops have been checked in from a number of developers
3. Significant design documents have been created and discussed
4. Additional non-code contributors have become active and are being encouraged

Please check this [ ] when you have filled in the report for Drill.

Signed-off-by:
Ted Dunning: [x](drill)
Grant Ingersoll: [x](drill)
Isabel Drost-Fromm: [x](drill)

20 Mar 2013

Description:

Apache Drill is a distributed system for interactive analysis of
large-scale datasets that is based on Google's Dremel. Its goal is to
efficiently process nested data, scale to 10,000 servers or more and to
be able to process petabyes of data and trillions of records in seconds.

Drill has been incubating since 2012-08-11.

Three Issues to Address in Move to Graduation:

1. Continue to attract new developers with a variety of skills and
viewpoints
2. Develop community skills and knowledge by building some releases
3. Demonstrate community robustness by rotating project tasks among
multiple project members

Issues to Call to Attention of PMC or ASF Board:

none

How community has developed since last report:

Mailing list discussions:

There has been active participation in discussions on the developer
mailing list, including new participants and developers. A few have
participated in the users list; mainly activity takes place on developer
mailing list.

Activity summary:

http://mail-archives.apache.org/mod_mbox/incubator-drill-dev/
March 2012, 21 by 6th of March (mainly jira; some discussion)
February 2013, 227 (jira, focused discussions)
January 2013, 169 (jira; focused discussions)
Dec 2012, 51 (jira, focused discussions)

Topics in discussion on the dev mailing list included but not limited to:

* JSON scanner API
* implementation of reference interpreter
* building SQL parser
* implementation of a variety of reference operators including flatten
and WindowsPane
* Mocking Library
* Drill plus behavioral data

Presentations

There have been more than a dozen presentations from community members
at international Hadoop conferences, Strata Conference, HUGs, JUG and
an Apache Drill Users Group in at least four countries.

Slides

Slides from Drill presentations posted online such as at slideshare get
a large number of views. Examples:

Japan Hadoop Conf. 2013 Winter, 2114 views
Boulder/Denver HUG, 848 views
PJUG Portland Oregon, 404 views
HUG Munich, 475 views

Articles

An invited article on Apache Drill, “Apache Drill: Newcomer in the
Hadoop Ecosystem” appeared in the 30 January 2013 Software Developers
Journal, authored by Ted Dunning and Jacques Nadeau.

In addition there have been a variety of blog postings about Drill.

Social Networking

@ApacheDrill Twitter entity is active and has grown to 147 followers.

How project has developed since last report:

1. Wiki has been built
2. Significant code drops have been checked in from a number of new
developers
3. Added our first additional committer and PMC member, additional
candidates are developing
4. Additional non-code contributors have become active and are being
encouraged

Signed-off-by:
Ted Dunning: [x](drill)
Grant Ingersoll: [ ](drill)
Isabel Drost: [ ](drill)


Shepherd notes:

Drill appears to be healthy.  Mailing lists are seeing a ton of traffic
and work in the sandbox seems to be progressing at a reasonable pace.
Question to the community: When do you estimate that you would want to
start putting a preliminary release of some kind together?  I assume
this would require identifying at least some components that should be
moved from "sandbox".

16 Jan 2013

Project Summary:

Drill is a distributed system for interactive analysis of large-scale
datasets, inspired by Google's Dremel.

Drill has been incubating since 2012-08-11.


Issues:

Discussions on these key areas were __very focused__ and productive toward
this project's graduation goals:

Healthy discussion of target use cases from the community

New Syntax Interpreter

Continued Logical Plan Syntax discussion & development (with focus on JSON)

Leveraging existing ideas/lessons learned from Optiq, LucidDB, DynamoBI,
Eigenbase and Saffron

How has the community & project developed since the last report:

User interest has slowed due to the Holiday season but discussion on the
the above topics is healthy. Commit and list activity are consistent with
the above. Development is continuing onward.

An addition of employer supported contributor to the project.

New users continue to ask to be formally part of this project.

On going discussion with schema-less data scanners amongst the project
members.

There is some cause for concern due to a drift toward isolated
development of components with less on-list discussion than before.
We will work to encourage more public styles of work.

List Summary:
 * http://mail-archives.apache.org/mod_mbox/incubator-drill-dev/
 * Jan 2013, ? subscribers
 * Dec 2012, 264 subscribers
 * Nov 2012, 73 (jira, focused discussions)
 * Oct 2012, 214 (svn, discussions, jira)
 * Sep 2012, 413
 * Aug 2012, 85

Signed-off-by:
Ted Dunning: [x](drill)
Grant Ingersoll: [x](drill)
Isabel Drost: [ ](drill)


Shepherd notes: No report as of 1/9/13

19 Dec 2012

Issues:
Discussions on these key areas were __very focused__ and productive toward
this project's graduation goals:
1.) Logical Plan Expressions, syntax, and parser
2.) Schema-less Management
3.) Wire Protocols

User interest has increased (thanks to the media) and as the project's
source commitments increase so will user interaction within the next
quarter.

How has the community developed since the last report:
Discussion counts have gotten smaller from the month of October through
November 2012.
For all intents and purposes initial code was checked in and builds running
during the middle of October 2012 and from that time focused discussion and
development have occurred.

Many new users joined the list as well as asking formally to be part of the
community.

List Summary:
 * http://mail-archives.apache.org/mod_mbox/incubator-drill-dev/
 * Dec 2012, [current], 264 subscribers
 * Nov 2012, 73 (jira, focused discussions)
 * Oct 2012, 214 (svn, discussions, jira)
 * Sep 2012, 413
 * Aug 2012, 85

How has the project developed since the last report:
== Milestones ==
1.) During the month of October, the SVN repository was initiated and
initial source checked in.
2.) ~88% of JIRA tasks were created during the months October through
November, showing growth and healthy discussion.
3.) Post initial commit, many users have come forth asking to engage in
active development, showing healthy growth and interest amongst the
developer community at large regarding the goals of this project.

Signed-off-by: tdunning, berndf, gsingers, isabel

21 Nov 2012

Drill's goal is to build an open source clone of Dremel with appropriate
extensions to foster greater flexibility.

Drill has been incubating since September of 2012.

Since last month, we have been working on bringing in existing code assets.
We now have the following items in our source repository:

 - A web-based GUI front-end for DRILL
 - A query parser for a Dremel equivalent language
 - An early prototype of a physical plan interpreter

The web front-end and query parser still need formalized IP clearance
including ICLA's or CCLA's as appropriate.  Informal clearances have been
granted on all components.

In terms of infrastructure, the project web site has been incorporated into CMS.

Graduation is still very far away, but the community activity has been high
and the mailing list has been active with over 200 postings in October.
Numerous public presentations have been made since the last report.

Most important issues to address before we can graduate:

 Get the basics in place, build up a working code base, make releases
 (that is, everything)

Any issues the Incubator PMC or ASF board need to be aware of:

 None at this time

How has the community developed since the last report:

 The active contributors mentioned in the previous report continue to
 be active and additional contributors have been identified.  At least
 one corporate supporter of the project has hired a full-time engineer
 to focus on Drill. We are working to bind these new contributors into the
 community and several appear likely to become committers over time.

How has the project developed since the last report:

 The community has continued to make progress and substantial code assets
 are in the process of being contributed.

Signed-off-by: berndf

17 Oct 2012

Drill's goal is to build an open source clone of Dremel with appropriate
extensions to foster greater flexibility.

Drill has been incubating since September of 2012.

Since last month, we have been working on infrastructure. A prototype
web-site is ready and several code contributions are nearly ready to commit.

Graduation is still very far away, but the community activity has been high
and the mailing list has been active.  Numerous public presentations have
been made and several Drill Users' Groups have been formed and meetings held.

Most important issues to address before we can graduate:

 Get the basics in place, build up a working code base, make releases
 (that is, everything)

Any issues the Incubator PMC or ASF board need to be aware of:

 None at this time

How has the community developed since the last report:

 Several active contributors outside the current committer group
 have emerged. We are working to bind these new contributors into the
 community and several appear likely to become committers over time.

How has the project developed since the last report:

 The community has begun to gel nicely and significant code contributions
 have moved forward.

Signed-off-by: Ted Dunning acting for Grant Ingersoll

19 Sep 2012

Drill is a distributed system for interactive analysis of large-scale datasets,
inspired by Google's Dremel.

We have just started and have mailing lists and svn up.  Git has been delayed
by issues in infra.

Community development is progressing well with several companies offering paid
developers and 90 subscribers to the dev list.

A hackathon in the SF bay area is scheduled.  A lunchtime meetup is
scheduled for Boston.  Additional meetups in New York and London are in
the planning stages.  All such physical meetups will have remote access if
possible (probably not for the lunch) and all will be reported back to the
mailing list to be sure to include those in different places and time zones
can participate.

Graduation is still a distant vision since we haven't got all the basic
mechanics in place yet.  The community side of things is going well and the
development of a realistic release looks like it will be moving shortly.

Signed-off-by: tdunning