Apache Logo
The Apache Way Contribute ASF Sponsors

This was extracted (@ 2017-11-16 00:10) from a list of minutes which have been approved by the Board.
Please Note The Board typically approves the minutes of the previous meeting at the beginning of every Board meeting; therefore, the list below does not normally contain details from the minutes of the most recent Board meeting.

2017 | 2016 | 2015 | 2014 | 2013 | 2012 | 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 | 1999 | Pre-organization meetings

Mahout

18 Oct 2017 [Andrew Palumbo / Rich]

Apache Mahout is an environment for quickly creating scalable performant
machine learning applications.

## Issues:
- None

## Activity:


* 0.13.1 release in the works, though a code freeze has been temporarily
 lifted. 0.13.1 is a multi-artifact release extending 0.13.0 to all
 combinations of Spark from 1.6 - 2.x and 2.10, scala 2.11
* Current work is on building out an algorithm library and continued native
 optimizations.
* More work on a modern Website
 * A designer has been found.
 * David Miller, Creator of Start Bootstrap has agreed to do a site redesign
   pro-bono.
 * Work is ongoing to update the way the website is built and deployed.  We
   are working with the Apache infrastructure team to move from a custom
   process to a more standardized way of deploying the website using
   pre-built deployment templates.
* Google Summer of Code - We have enthusiastically accepted Aditya Sarma’s
 proposal to add the DBSCAN clustering algorithm, and additionally an
 alternate implementation of the DBSCAN algorithm which reduces complexity
 from O(n^2) to O(log(n) * n).
* GSoC experience [Aditya] - I proposed to add an distributed DBSCAN
 implementation on the lines of the paper “A new scalable parallel DBSCAN
 algorithm using the disjoint-set data structure” authored by Md. Mostofa Ali
 Patwary, Diana Palsetia, Ankit Agrawal, Wei-keng Liao, Fredrik Manne, Alok
 Choudhary of Northwestern University. But it turned out that the
 distribution strategy that they have adopted does not fit well with Mahout’s
 underlying framework. So, I contributed the Sequential algorithm and am
 working on completing the RTree module (which can be used by both the
 sequential as well as the distributed algorithm). In the meanwhile, I got in
 touch with a professor from the Barcelona Supercomputing Center and her
 group worked on an approximate dbscan algorithm that scaled well. (As an
 aside, I’m planning to work on making Mahout accessible to newcomers along
 with Trevor)
* GSoC Student Aditya Sarma passed with the mentoring of Trevor Grant.

## Health report:
-  The health of the project is good with a devoted team of committers.


## PMC changes:

- Currently 14 PMC members.
- No new PMC members added in the last 3 months
- Last PMC addition was Trevor Grant on Sat Feb 04 2017
- PMC member Benson Margulies has changed his status to PMC Emeritus

## Committer base changes:

- Currently 28 committers.
- New commmitters:
 - Holden Karau was added as a committer on Wed Jul 12 2017
 - Dustin VanStee was added as a committer on Tue Jun 20 2017

## Releases:

- Last release was 0.13.0 on Mon Apr 17 2017

## External Events


Eigenfaces for Realtime Facial Recognition Scott Cote, Trevor Grant. Lucene
Revolution. Las Vegas, NV- September 15, 2017.

Do I Know You?  Realtime Facial Recognition with an Apache Stack. Trevor
Grant. Flink Forward. Berlin, DE - September 12, 2017.

Using Open Source AI with Drones to identify humans… Friendly Cylons 1.0…
Trevor Grant, who did not have editing privileges on the title or abstract
which is why it seems so hokey. Data and Cognitive Developers Meetup. New
York, NY - September 25- 2017.

Open Source AI - Roll Your Own Cylon. Trevor Grant Chicago Hadoop Users Group
(CHUG) / Chicago Apache Flink Meetup (CHAF) Joint Meetup. Chicago, IL - August
24, 2017. Weekend Project: Real World AirBnB Data Science and Pricing Bot.
Trevor Grant, Andrew Weiner. Berlin Buzzwords 2017.
https://berlinbuzzwords.de/17/session/weekend-project-real-world-airbnb-data-s
cience-and-pricing-bot.

Introduction to Online Machine Learning Algorithms. Trevor Grant, Dataworks
Summit, San Jose, CA -
https://dataworkssummit.com/san-jose-2017/sessions/introduction-to-online-mach
ine-learning-algorithms/.

Success at Apache: All My Roads Led to Apache, Pat Ferrel:
https://blogs.apache.org/foundation/entry/success-at-apache-all-my

Apache Mahout: Distributed Matrix Math for Machine Learning. Andrew Musselman,
Seattle Data/Analytics/Machine Learning Meetup, Seattle, WA - October 17,
2017.

Distributed Evolution of Spiking Neuron Models on Apache Mahout for Time
Series Analysis.  Andrew Palumbo, Annual Symposium on Biomathematics and
Ecology: Education and Research, Illinois State University, Bloomington
Illinois, October 8, 2017.

Open Source Artificial Intelligence in a Biological/Ecological Context. Trevor
Grant, Annual Symposium on Biomathematics and Ecology: Education and Research,
Illinois State University, Bloomington Illinois, October 8, 2017.


## Question asked by board to clarify from March’s board report:


* AWS has been sending emails to private@mahout.apache.org RE: a small (~16$)
 balance.  This is due to Amazon donating 1000$ of cluster time to a project
 member, who has since taken a position with a different organization.  The
 1000$ was on a now discontinued corporate card.  We are actively working on
 getting the situation worked out (the usual large corporate SNAFU keeps this
 fix at a snail’s pace), and getting more compute time donated from AWS.

* Resolution:  Balance has been paid, and the account moved to an active
 credit card.

## Mailing list activity:

- dev@mahout.apache.org:
  - 918 subscribers (down -1 in the last 3 months):
  - 256 emails sent to list (582 in previous quarter)

- issues@mahout.apache.org:
  - 16 subscribers (up 16 in the last 3 months):
  - 84 emails sent to list (0 in previous quarter)


We’ve moved all Jira (including Github linked) comments from
dev@mahout.apache.org to  issues@mahout.apache.org, in order to reduce noise
on dev@mahout.apache.org and to facilitate discussion on the list.


This move however does not account for the full dip in dev@mahout.apache.org
emails over the summer  (582 to 256).   We will be monitoring the activity on
this list.

- user@mahout.apache.org:
  - 1783 subscribers (down -8 in the last 3 months):
  - 41 emails sent to list (155 in previous quarter)

As well we can see a dip in user@mahout.apache.org emails.  We will also
continue to monitor this.

19 Jul 2017 [Andrew Palumbo / Chris]

Apache Mahout is an environment for quickly creating scalable performant
machine learning applications.

## Issues:
 - None

## Activity:

 - 0.13.1 release in the works extends 0.13.0 to Spark 2.x and scala 2.11
Current work is on building out an algorithm library and continued native
optimizations.

 - More work on a modern Website

   - A designer has been found.

 - Google Summer of Code - We have enthusiastically accepted Aditya Sarma’s
 proposal to add the DBSCAN clustering algorithm, and additionally an
 alternate implementation of the DBSCAN algorithm which reduces complexity
 from O(n^2) to O(log(n) * n).

## Health report:
 -  The health of the project is good with a devoted team of committers.

## PMC changes:

 - Currently 15 PMC members.

## Committer base changes:

 New Committers this quarter:
 - Dustin VanStee was made committer on Jun 19, 2017

 - Holden Karu was made a committer on Jul 11, 2017

 - Currently 29 committers.

## External Events

 - Eigenfaces for Realtime Facial Recognition Scott Cote, Trevor Grant.
 Lucene Revolution. Las Vegas, NV- September 15, 2017.

 - INTRODUCTION TO ONLINE MACHINE LEARNING ALGORITHMS Trevor Grant. Dataworks
 Summit. San Jose, CA- June 15, 2007

 - Distributed and Native Hybrid optimizations for Machine Learning Workloads
 Suneel Marthi. Berlin Buzzwords. Berlin, Germany- June 12, 2017

 - Apache Mahout: Distributed Matrix Math for Machine Learning Andrew
 Musselman. MLConf. Seattle, WA- May 19, 2017

 - An Apache Based Intelligent IoT Stack for Transportation Trevor Grant, Joe
 Olsen. ApacheCon IoT. Miami, FL- May 18, 2017

 - Apache Mahout: An Extendable Machine Learning Framework for Spark and
 Flink Trevor Grant. Apache Big Data. Miami, FL- May 16, 2017

 - APACHE MAHOUT’S NEW RECOMMENDER ALGORITHM AND USING GPUS TO SPEED MODEL
 CREATION Pat Ferrel, Andy Palumbo. GPU Technology Conference. Silicon
 Valley, CA- May 11, 2017

 - EXTENDING MAHOUT-SAMSARA LINEAR ALGEBRA DSL TO SUPPORT GPU CLUSTERS Suneel
 Marthi, Trevor Grant. GPU Technology Conference. Silicon Valley, CA- May 11,
 2017

## Question asked by board to clarify from last quarter’s report:

 - AWS has been sending emails to private@mahout.apache.org RE: a small
 (~16$) balance.  This is due to Amazon donating 1000$ of cluster time to a
 project member, who has since taken a position with a different
 organization.  The 1000$ was on a now discontinued corporate card.  We are
 actively working on getting the situation worked out (the usual large
 corporate SNAFU keeps this fix at a snail’s pace), and
 getting more compute time donated from AWS.

@Rich: help resolve billing issue with AWS

17 May 2017 [Andrew Palumbo / Brett]

Apache Mahout Board Report, May 2017

Apache Mahout is an environment for quickly creating scalable performant machine learning applications.

## Issues:
 - None

## Activity:

Mahout released its benchmark 0.13.0 release with GPU and multi-threaded native
solvers using OpenCL, OpenMP (ViennaCL), and CUDA (NVIDIA) in the works.
An intuitive Algorithm Development Framework was also released in 0.13.0 based
on the sk-learn model.
Current work is on building out an algorithm library and continued native
optimizations.
New more modern Website
Google Summer of Code - We have enthusiastically accepted Aditya Sarma’s
proposal to add the DBSCAN clustering algorithm, and additionally an alternate
implementation of the DBSCAN algorithm which reduces complexity from O(n^2) to
O(log(n) * n).

## Health report:
 -  The health of the project is good with a devoted team of committers.

## PMC changes:

 - Currently 15 PMC members.

 - Last PMC addition was Trevor Grant on Feb 4 2017

## Committer base changes:

 - Nikolai Sakarnykh was added as a committer on April 21, 2017

 - Currently 27 committers.



## External Events

APACHE MAHOUT'S NEW RECOMMENDER ALGORITHM AND USING GPUS TO SPEED MODEL CREATION
Pat Ferrel, Andy Palumbo. GPU Technology Conference. Silicon Valley, CA- May 11,
2017

EXTENDING MAHOUT-SAMSARA LINEAR ALGEBRA DSL TO SUPPORT GPU CLUSTERS Suneel
Marthi, Trevor Grant. GPU Technology Conference. Silicon Valley, CA- May 11,
2017

Apache Mahout: An Extendable Machine Learning Framework for Spark and Flink
Trevor Grant. Apache Big Data. Miami, FL- May 16, 2017

An Apache Based Intelligent IoT Stack for Transportation
Trevor Grant, Joe Olsen. ApacheCon IoT. Miami, FL- May 18, 2017

Apache Mahout: Distributed Matrix Math for Machine Learning
Andrew Musselman. MLConf. Seattle, WA- May 19, 2017

19 Apr 2017 [Andrew Palumbo / Rich]

No report was submitted.

18 Jan 2017 [Andrew Palumbo / Mark]

## Description:

Apache Mahout is an environment for quickly creating scalable performant
machine learning applications.

## Issues:
None

## Activity:

 - The Team is currently in the process of putting together a milestone 0.13.0
   release.
 - Work is presently focused on adding support for Visualization, GPU and
   native optimization.
 - Sebastian Schelter presented a poster at Machine Learning Systems Workshop,
   NIPS 2016 Dec 10, 2016 “Samsara: Declarative Machine Learning on
   Distributed Dataflow Systems” - https://ssc.io/pdf/poster-mlsystems.pdf
 - Andrew Palumbo presented “Apache Mahout: Beyond MapReduce” at the Orange
   County Big Data Meetup, October, 2016.
 - Trevor Grant presented:  “Apache Mahout?! What’s Next!” At
     Chicago Hadoop Users Group, October 2016
     Seattle Data Science Meetup, December 2016
     San Diego Big Data Meetup, December 2016
     Austin Data Meetup, December 2016
     DFW Data Science Meetup, December 2016
 - Andrew Musselman presented: “Apache Mahout?! What’s Next!” at Seattle Data
   Science Meetup, December 2016
 - Suneel Marthi presented: “Native and Distributed Machine Learning with
   Apache Mahout” Apache Big Data Europe 2016, Nov 13 2016, Seville, Spain

## Health report:
 - The project has a dedicated team of voluntary committers.

## PMC changes:

 - Currently 14 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Stevo Slavić on Tue Apr 21 2015

## Committer base changes:

 - Currently 26 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Trevor Grant at Tue May 24 2016

## Releases:

 - Last release was 0.12.2 on Sun Jun 12 2016

## JIRA activity:

 - 16 JIRA tickets created in the last 3 months
 - 15 JIRA tickets closed/resolved in the last 3 months

19 Oct 2016 [Andrew Palumbo / Marvin]

Apache Mahout is an environment for quickly creating scalable performant
machine learning applications.

## Issues:
 - None

## Activity:
1. Work is presently focused on adding support for Visualization, GPU and
   native optimization
2. Suneel Marthi and Trevor Grant did a Mahout on Flink talk at Flink Forward
   2016, Berlin, Germany - September 13, 2016
3. Suneel Marthi did a Mahout talk at Department of Theoretical Physics,
   Fritz-Haber Institut der Max Planck Gessellschaft, Berlin, Germany -
   September 16, 2016
4. Suneel Marthi did a ‘Distributed Machine Learning with Apache Mahout’ talk
   at Big Data Ignite, Grand Rapids, Michigan - September 30, 2016
5. Upcoming Apache Mahout talk at Apache Big Data Europe, Seville, Spain - Nov
   2016
6. Team presently working on 0.13.0 release planned for Oct 2016.

## Health report:
 -  The health of the project is good with a devoted team of committers.

## PMC changes:

 - Currently 14 PMC members.
 - Last PMC addition was Stevo Slavić on Tue Apr 21 2015.

## Committer base changes:

 - Currently 26 committers.

## Releases:

 - Mahout 0.12.2 released on June 12, 2016

20 Jul 2016 [Andrew Palumbo / Brett]

## Description:

Apache Mahout is an environment for quickly creating scalable performant machine learning applications.

## Issues:
 - None

## Activity:
1. Work is presently focused on adding support for Visualization and Native optimization.

2. Suneel Marthi did talks on Apache Mahout at Apache Big Data 2016, Vancouver [1] and MapR BigData EveryWhere, Washington DC [2].

3. Integration of Mahout with Apache Zeppelin being worked on by Trevor Grant [3].

4.  Presently working towards 0.13.0 release that would add native optimizations.

## Health report:
 -  The health of the project is good with a devoted team of committers.

## PMC changes:

 - Currently 14 PMC members.
 - Last PMC addition was Stevo Slavić on Tue Apr 21 2015.

## Committer base changes:

 - Currently 26 committers.
 - Trevor Grant was added as a committer on Tue May 24 2016.

## Releases:

 - 0.12.1 was released on Wed May 18 2016.
 - 0.12.2 was released on Mon Jun 13 2016.

## JIRA activity:

 - 46 JIRA tickets created in the last 3 months.
 - 26 JIRA tickets closed/resolved in the last 3 months.


[1]http://events.linuxfoundation.org/events/apache-big-data-north-america/program/schedule
[2]http://www.bigdataeverywhere.com/dcarea-hadoop-conference-2016/#t0
[3]https://trevorgrant.org/2016/05/19/visualizing-apache-mahout-in-r-via-apache-zeppelin-incubating/

20 Apr 2016

Change the Apache Mahout Project Chair

 WHEREAS, the Board of Directors heretofore appointed Suneel
 Marthi (smarthi) to the office of Vice President, Apache Mahout, and

 WHEREAS, the Board of Directors is in receipt of the resignation
 of Suneel Marthi from the office of Vice President, Apache Mahout, and

 WHEREAS, the Project Management Committee of the Apache Mahout
 project has chosen by vote to recommend Andrew Palumbo (apalumbo) as
 the successor to the post;

 NOW, THEREFORE, BE IT RESOLVED, that Suneel Marthi is relieved and
 discharged from the duties and responsibilities of the office
 of Vice President, Apache Mahout, and

 BE IT FURTHER RESOLVED, that Andrew Palumbo be and hereby is
 appointed to the office of Vice President, Apache Mahout, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification, or
 until a successor is appointed.

 Special Order 7H, Change the Apache Mahout Project Chair, was
 approved by Unanimous Vote of the directors present.

20 Apr 2016 [Suneel Marthi / Shane]

The goal of Apache Mahout project is to build an environment for quickly
creating scalable performant machine learning applications.

Activity:

 - New Apache Mahout book - “Apache Mahout: Beyond MapReduce” authored by
   Mahout committers - Dmitriy Lyubimov and Andrew Palumbo, published by
   Createspace on February 18, 2016 (1)

 - Apache Mahout 0.11.2 was released on March 11, 2016, this release
   introduced major performance enhancements for linear algebra computations
   and also supports Apache Spark 1.5.2.

 - Apache Mahout 0.12.0 was released on April 11, 2016.
   This release adds Apache Flink as an execution engine to Mahout Samsara.
   With the milestone 0.12.0 release, Mahout now supports Spark, Flink and
   H2O.

 - Suneel Marthi will be doing a talk on the new Mahout Distributed Linear
   Algebra at Apache Big Data, Vancouver on May 11, 2016 (2)

PMC changes:
 - Currently 14 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Stevo Slavić on Tue Apr 21 2015

Committer base changes:
 - Currently 25 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Anand Avati on Thu Apr 23 2015

Releases:
 - Mahout 0.11.2 was released on Fri Mar 11 2016
 - Mahout 0.12.0 was released on Mon Apr 11 2016

Issues:

None

JIRA activity:
 - 34 JIRA tickets created in the last 3 months
 - 71 JIRA tickets closed/resolved in the last 3 months

Mailing list activity:

 - dev@mahout.apache.org:
    - 947 subscribers (down -6 in the last 3 months):
    - 587 emails sent to list (434 in previous quarter)

 - user@mahout.apache.org:
    - 1878 subscribers (down -16 in the last 3 months):
    - 141 emails sent to list (114 in previous quarter)

[1] http://www.amazon.com/Apache-Mahout-MapReduce-Dmitriy-Lyubimov/dp/1523775785
[2] http://events.linuxfoundation.org/events/apache-big-data-north-america/program/schedule

20 Jan 2016 [Suneel Marthi / Greg]

The goal of Apache Mahout project is to build an environment for quickly
creating scalable performant machine learning applications.

Activity:

 Apache Mahout 0.11.1 was released on Nov 6, 2015.  This release supports
 Spark 1.4+ and has major performance improvements for vector and matrix
 operations.

 Sebastian Schelter presented the new Mahout distributed linear algebra
 framework at Flink Forward, Berlin On October 12, 2015. [1]

 Present activity is restricted to finalizing the Flink - Mahout integration
 which would be Mahout 1.0 release and to bolster the performance of the
 backend linear algebra by rebasing the code with alternate native
 implementations.

PMC changes:

 - Currently 14 PMC members.
 - No new PMC members added in the last 3 months
 - Last PMC addition was Stevo Slavić on Tue Apr 21 2015

Committer base changes:

 - Currently 25 committers.
 - No new committers added in the last 3 months
 - Last committer addition was Anand Avati at Thu Apr 23 2015

Releases:

 - Mahout 0.11.1 was released on Fri Nov 06 2015

Issues:

Decline in the project user and developer base over the past 2 years, in large
part due to the availability of competing Machine Learning libraries with very
active developer teams backed by organizations.

Its hard to sustain a Machine Learning project on voluntary basis with no
dedicated resources and yet be relevant with changing times and increasing
competition.

In the past, there was some promise of dedicated resources from organizations
but nothing promising enough.

JIRA activity:

 - 22 JIRA tickets created in the last 3 months
 - 35 JIRA tickets closed/resolved in the last 3 months

[1] https://www.youtube.com/watch?v=Uh92PK0K0mA

21 Oct 2015 [Suneel Marthi / Chris]

The goal of Apache Mahout project is to build an environment for quickly
creating scalable performant machine learning applications.

ISSUES FOR BOARD'S ATTENTION

None at this time.

RELEASES

- 0.10.2 was released on Aug 6, 2015
- 0.11.0 was released on Aug 7, 2015

ACTIVITY

No new PMC members or Committers added in the last 3 months.

Last PMC addition was Stevo Slavic on April 21, 2015.

Sebastian Schelter will be presenting the new Mahout-Samsara Linear Algebra
framework at the upcoming Flink Forward conference in Berlin on October 12,
2015. [1]

0.10.2 was released on Aug 6, 2015. This release had major optimizations and
performance improvements to the new Samsara Linear Algebra backend.

0.11.0 was released on Aug 7, 2015. This release makes Mahout compatible with
Spark 1.3.1.

Mahout 0.11.0 has been integrated with Apache BigTop 1.0.1.

Integration of Apache Mahout with Apache Flink is presently in the works and
is being done in collaboration with TU Berlin and Data Artisans.

Apache Mahout has been recognized as one of the 5 Big Data Open Source
projects to watch out for in a ZDNet article dated Aug 21, 2015. [2]

STATS

25 committers
14 PMC members
19 JIRA tickets created in last 3 months
30 JIRA tickets closed/resolved in last 3 months

[1]http://www.flink-forward.org
[2]http://www.zdnet.com/article/five-open-source-big-data-projects-to-watch/

15 Jul 2015 [Suneel Marthi / Rich]

  DESCRIPTION:
  The goal of Apache Mahout project is to build an environment for quickly
  creating scalable performant machine learning applications.

ACTIVITY:
 - Apache Mahout’s next generation 0.10.0 was released on April 11, 2015.

 - Apache Mahout 0.10.1 was released on May 31, 2015. This was a minor bug fix
   release following 0.10.0.

 - Apache Mahout now supports scalable Machine Learning on Spark, H2O and
   MapReduce.

 - The project has been working closely with Apache BigTop to integrate Apache
   Mahout into BigTop following a release.

 - Integration of Apache Mahout with Apache Flink is in the works and is being
   done in collaboration with Data Artisans and TU Berlin.

 - Anand Avati was added as a new committer.

 - Stevo Slavic was added as a PMC member.

 - Team presently working on 0.10.2 release, planned for the week of July 10,
   2015.

ISSUES:
 - Lately most design and tech discussions have been happening off the dev@
   mailing lists, the PMC is well aware of the issue and working on addressing
   that.

PMC/Committership changes:

 - Currently 25 committers and 14 PMC members in the project.
 - Stevo Slavić was added to the PMC on Fri May 08 2015
 - Anand Avati was added as a committer on Thu Apr 23 2015

RELEASES:

 - 0.10.1 was released on Sun May 31 2015
 - 0.10.0 was released on Sat Apr 11 2015

MAILING LIST ACTIVITY:

 - dev@mahout.apache.org:
   - 977 subscribers (down -8 in the last 3 months):
   - 1324 emails sent to list (1419 in previous quarter)

 - user@mahout.apache.org:
   - 1933 subscribers (down -10 in the last 3 months):
   - 243 emails sent to list (252 in previous quarter)

 - general@mahout.apache.org:
   - 10 subscribers (up 0 in the last 3 months):
   - 0 emails sent to list (0 in previous quarter)

JIRA ACTIVITY:

  - 85 JIRA tickets created in the last 3 months
  - 74 JIRA tickets closed/resolved in the last 3 months

22 Apr 2015

Change the Apache Mahout Chair

 WHEREAS, the Board of Directors heretofore appointed Grant Ingersoll
 to the office of Vice President, Apache Mahout, and

 WHEREAS, the Board of Directors is in receipt of the resignation
 of Grant Ingersoll from the office of Vice President, Apache Mahout,
 and

 WHEREAS, the Project Management Committee of the Apache Mahout
 project has chosen by vote to recommend Suneel Marthi as the successor
 to the post;

 NOW, THEREFORE, BE IT RESOLVED, that Grant Ingersoll is relieved and
 discharged from the duties and responsibilities of the office
 of Vice President, Apache Mahout, and

 BE IT FURTHER RESOLVED, that Suneel Marthi be and hereby is
 appointed to the office of Vice President, Apache Mahout, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification, or
 until a successor is appointed.

 Special Order 7G, Change the Apache Mahout Chair, was approved
 by Unanimous Vote of the directors present.

22 Apr 2015 [Grant Ingersoll / Rich]

Apache Mahout has implementations of a wide range of machine learning and
data mining algorithms: clustering, classification, collaborative filtering
and frequent pattern mining.

Project Status
--------------

The project continues to have a large and active user base.  The project now has
integrations with Spark and H2O execution engines, this is in addition to the
traditional MapReduce.  Integration with Apache Flink is next on the cards with
a possibly dedicated resource available from the Flink community to work with
Mahout.

The new integrations with H2O and Spark engines extend Mahout Machine Learning
to other more popular Big Data platforms.

Community
---------

* We have added 3 new PMC members: Pat Ferrel, Andrew Musselman and
 Andrew Palumbo

There is a healthy committer base to the project that are actively working on
the project on a voluntary basis. There is no dedicated full time resource
available for the project yet as most large scale Machine Learning libraries
cannot be built and sustained on voluntary contributions.


Community Objectives
--------------------

The project has an active committer base and there’s a renewed interest in the
project with the new Scala based Engine agnostic distributed linear algebra
library with bindings for Spark, H2O and Flink in the future. The project got a
shot in the arm with backing from Apache BigTop community and we are looking to
keep that momentum going for future releases.

The project is targeting more frequent minor releases and a major release once
every quarter.  While the 0.10.0 release is targeted for the week of April 7-11
2015, a subsequent 0.10.1 release is planned in the subsequent releases.


Releases
--------

The team is working towards Mahout 0.10.0 release targeted for the week of April
7-11 in time for ApacheCon North America 2015.


Issues
------
None now.

18 Feb 2015 [Grant Ingersoll / Bertrand]

=== Apache Mahout Status Report: February (missed January) 2015 ===

-----

Apache Mahout has implementations of a wide range of machine learning and
data mining algorithms: clustering, classification, collaborative filtering
and frequent pattern mining

Project Status
--------------

The project continues to have a large and active user base.  Development
continues by a small number of dedicated individuals.  The PMC is reviewing
how we can improve contributions as well as exploring other options to make
sure the project remains viable to the user base.


Community
---------

* As per the status, the main issue is we have only 2-3 committers who are
contributing on a regular basis.  While they are doing good work, it is
concerning from a sustainment issue.  We are discussing as a PMC how
to rectify this situation.  The main issue is that developing machine learning
libraries is involved process that is hard to do on a part time basis and
we have yet to find anyone that can be dedicated full time to the project.



Community Objectives
--------------------

Identify next steps for either growing the list of active committers or
finding an appropriate home for the code that exists (attic or elsewhere).


Releases
--------

The migration to Spark is still ongoing and no new releases are planned at this
time.


Issues
------
See above.

21 Jan 2015 [Grant Ingersoll / Greg]

No report was submitted.

15 Oct 2014 [Grant Ingersoll / Bertrand]

Apache Mahout has implementations of a wide range of machine learning and
data mining algorithms: clustering, classification, collaborative filtering
and frequent pattern mining

Project Status
--------------

The project continues to have a large and active user base.  Development
continues at a steady pace.


Community
---------

* The main issue concerning the community right now is the addition
of new contributions from 0xData and the integration of Mahout with Scala/Spark.


Community Objectives
--------------------

Our goal is to build scalable machine learning libraries. See the Issues
section below for the debate in the community about our objectives.


Releases
--------

The migration to Spark is still ongoing and no new releases are planned at this
time.


Issues
------
The community is still actively working on converting the codebase to Scala and
Spark.  The number of devs contributing	is still small,	  but it is sustained.

16 Jul 2014 [Grant Ingersoll / Doug]

Apache Mahout has implementations of a wide range of machine learning and
data mining algorithms: clustering, classification, collaborative filtering
and frequent pattern mining

Project Status
--------------

The project continues to have a large and active user base.  While
the developer base has continued to grow, there is a very active
and healthy debate going on about where Mahout goes next.  We have
worked through many of these issues, but are not out of the proverbial
woods just yet.



Community
---------

* Andrew Palumbo and Pat Ferrel are new committers
* Dmitriy Lyubimov has resigned from the PMC

* The main issue concerning the community right now is the addition
of new contributions from 0xData and the integration of Mahout with Scala/Spark.



Community Objectives
--------------------

Our goal is to build scalable machine learning libraries. See the Issues
section below for the debate in the community about our objectives.


Releases
--------

In addition to an ongoing debate on Mahout's future, the community is actively
 working on integrating Mahout with Scala/Spark,  and bringing in new code and
 committers to update the core project.


Issues
------
For the most part, the community has gotten back to work by adding a couple of
new committers and pursuing the path of Scala support.  While there is still
not a huge developer base, people are contributing and working through the
issues.

16 Apr 2014 [Grant Ingersoll / Doug]

Apache Mahout has implementations of a wide range of machine learning and
data mining algorithms: clustering, classification, collaborative filtering
and frequent pattern mining

Project Status
--------------

The project continues to have a large and active user base.  While
the developer base has continued to grow, there is a very active
and healthy debate going on about where Mahout goes next.  Please
see the Issues section below for more details.



Community
---------

* Andrew Musselman was voted in as new committer.
* No changes to the PMC in the reporting period.

* The main issue concerning the community right now is the addition
of new contributions from 0xData and the integration of Mahout with Spark.



Community Objectives
--------------------

Our goal is to build scalable machine learning libraries. See the Issues
section below for the debate in the community about our objectives.


Releases
--------

In addition to an ongoing debate on Mahout's future, the community is actively
 working on integrating Mahout with Scala/Spark,  and bringing in new code and
 committers to update the core project.

A lot of work on improving documentation has been done. The project has
 finished the move from the wiki to Apache CMS, redesigned the project
 website and is in the process of updating all pages.

Issues
------
The Mahout community is at a crossroads in terms of where
to go next.  While the project has a broad number of users and interested
parties, most committers are trying to maintain the code base on a purely
part time basis, when the amount of work to sustain these users
clearly points to it needing to
be full time.  Furthermore, much of our original code base is written
for Hadoop MapReduce 1.0, which many in the community have come to realize
is not well-suited for solving the kinds of problems that Mahout has set
out to solve.  There have been several lengthy discussions and prototypes
going on to work out next directions along the lines of the Spark and
0xData contributions (there are numerous threads on the dev@mahout.a.o
mailing list.)

The PMC does not think this requires Board intervention at this time
as the debate is, as far as we can tell, healthy.  We do, however,
expect that this debate will take some time to resolve and may mean we
won't be shipping a 1.0 release any time soon.  We will keep the Board
apprised of our next steps as we work through the process.

15 Jan 2014 [Grant Ingersoll / Roy]

Apache Mahout has implementations of a wide range of machine learning and
data mining algorithms: clustering, classification, collaborative filtering
and frequent pattern mining

Project Status
--------------

The project continues to have a large and active user base and the developer
base continues to grow, as well.



Community
---------

* On November 28th Frank Scholten was voted in as new committer.
* No changes to the PMC in the reporting period.

* With Suneel Marthi now working full time on the project there has
 been a flurry of patches reviewed and committed.
* The project has moved to Apache CMS, is in the process of tidying
 most of the wiki based documentation.
* After a small Hackathon in Berlin pre-Christmas activity has been
 steady even during the holiday season.



Community Objectives
--------------------

With most committers not working on Mahout full time there is always a
lack of time on lists as well as when it comes to dealing with patches
submitted quickly. The current goal is to grow the committer base to
deal with that issue.

As for students that would like to contribute the problem remains that
the most interesting work seems to be adding new algorithms and
implementations. It remains a challenge to motivate those interested in
contributing to work on getting existing implementations stable,
improving documentation and reviewing incoming patches.


Releases
--------

The community is actively working on getting the 0.9 release out the
door with just one scaling issue remaining the the k-means++ code newly
added as part of the 0.8 release (June 2013).

This is supposed to be the last release before 1.0.



Issues
------

There are no issues requiring board attention at this time.

16 Oct 2013 [Grant Ingersoll / Sam]

Apache Mahout has implementations of a wide range of machine learning and
data mining algorithms: clustering, classification, collaborative filtering
and frequent pattern mining

Project Status
--------------

The project continues to have a large and active user base and the developer
base continues to grow, as well.

Suneel Marthi was added to the PMC.

Community
---------

The third quarter of 2013 has seen continued activity on par with the
last report.  We are primarily working on 0.9 release, some new
recommendation integration with Solr.  The user list is quite
active with a mix of new and experienced users.

No new committers have been added since the last report.

If all goes well one of the committers will be having her first baby
early April 2014. Patches/commits from her will need some extra
careful review from the community. [Disclaimer: Due to timing issues
this amendment was added to the report after it was submitted by the
committer in question. Sorry for the additional noise.]

Community Objectives
--------------------

Our main focus is on cleanup and preparation of 0.9 and 1.0 releases, as
well as the usual bug fixes.


Releases
--------

None since last report.  Next likely one is sometime between Nov. '13 and Jan. '14.


Issues
------

There are no issues requiring board attention at this time.

18 Sep 2013

Change the Apache Mahout Project Chair

 WHEREAS, the Board of Directors heretofore appointed Jake Mannix
 to the office of Vice President, Apache Mahout, and

 WHEREAS, the Board of Directors is in receipt of the resignation
 of Jake Mannix from the office of Vice President, Apache Mahout,
 and

 WHEREAS, the Project Management Committee of the Apache Mahout
 project has chosen by vote to recommend Grant Ingersoll as the
 successor to the post;

 NOW, THEREFORE, BE IT RESOLVED, that Jake Mannix is relieved and
 discharged from the duties and responsibilities of the office
 of Vice President, Apache Mahout, and

 BE IT FURTHER RESOLVED, that Grant Ingersoll be and hereby is
 appointed to the office of Vice President, Apache Mahout, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification, or
 until a successor is appointed.

 Special Order 7C, Change the Apache Mahout Project Chair, was
 approved by Unanimous Vote of the directors present.

18 Sep 2013 [Jake Mannix / Doug]

Apache Mahout has implementations of a wide range of machine learning and
data mining algorithms: clustering, classification, collaborative filtering
and frequent pattern mining

Project Status
--------------

The project continues to have a large and active user base. With the book
Mahout in Action it has become simpler for beginners to get started using
the project.

Community
---------

The third quarter of 2013 was seen continued activity on par with the
second quarter, with our first new release in more than a year, a
new committer, and our first live google hangout for users and
developers. There has been continued effort fixing bugs and reviewing
contributor patches, especially with the recent release.

We added one new committer to the project: Ellen Friedman

There is a SF-Bay Area Mahout MeetUp scheduled for August 27 in Redwood
City. Sebastian Schelter will be the main speaker, talking about new
directions with Mahout recommendation. Grant Ingersoll, Ted Dunning and
Ellen Friedman be there to do a short introduction for the meet-up and
update on the 0.8 release.


Community Objectives
--------------------

Discussions regarding the 0.9 planning and 1.0 release has continued on
the mailing list, revolving significantly around what features/algorithms
will be supported in 1.0 and onward, with an eye toward streamlining the
scope of the project to not contain as many rarely used / unsupported
algorithms.

The PMC and especially the PMC Chair apologize for missing the last several
Board Reports, and we have discussed internally as a PMC the need for a
new PMC chair who is a bit more "bureaucratically minded", and with
several experienced volunteers stepping forward, we should be calling a
vote and moving forward with this by the end of August.

Releases
--------

Mahout 0.8 was released in July, see below for details, and
https://cwiki.apache.org/confluence/display/MAHOUT/Release+0.8 for
release notes.

Code
----

The 0.8 release contains one significant new algorithm
implementation, Streaming K-Means ( MAHOUT-1154 ), as well as numerous
performance enhancements and API improvements to the core linear algebra
library and many bugfixes. Additionally, two new directions have started
up, regarding visualization of recommender and co-occurrence calculations
(http://s.apache.org/mahout_viz_thread); and creating a scala DSL for
some Mahout calculations (http://s.apache.org/mahout_scala_dsl). Both of
these are at the design and prototyping phase, but seem promising.


Issues
------

There are no issues requiring board attention at this time.

21 Aug 2013 [Jake Mannix / Brett]

No report was submitted.

Report was not received and is expected next month.

17 Jul 2013 [Jake Mannix / Doug]

No report was submitted.

AI: Doug to pursue a report for Mahout

19 Jun 2013 [Jake Mannix / Brett]

Apache Mahout has implementations of a wide range of machine learning and
data mining algorithms: clustering, classification, collaborative filtering
and frequent pattern mining

Project Status
--------------

The project continues to have a large and active user base. With the book
Mahout in Action it has become simpler for beginners to get started using
the project.

Community
---------

The second quarter of 2013 was relatively more active, with many committers
and PMC members fixing bugs, reviewing contributor patches, and slowly
removing old dead code.

We added four new committers to the project: Suneel Marthi, Dan Filimon,
Gokhan Capan, and Stevo Slavic.

There are a few committers who volunteered to become GSoC mentors. As for
them it will be the first year participating as mentors on behalf of Mahout
they will need some guidance on what the process looks like at the ASF.

Community Objectives
--------------------

Discussions regarding the 0.9 planning and 1.0 release happened in person
among many of the core committers at Berlin Buzzwords, and has continued on
the mailing list, revolving significantly around what features/algorithms
will be supported in 1.0 and onward, with an eye toward streamlining the
scope of the project to not contain as many rarely used / unsupported
algorithms.

The PMC and especially the PMC Chair apologize for missing the last two
Board Reports, and we have discussed internally as a PMC whether we should
make any changes and are working to make sure it doesn't happen again.


Code
----

The upcoming 0.8 release contains one significant new algorithm
implementation, Streaming K-Means ( MAHOUT-1154 ), as well as numerous
performance enhancements and API improvements to the core linear algebra
library and many bugfixes.

Releases
--------

No releases since the last report. 0.8 is targeted for the end of June, and
currently bugfixes are the primary focus. Only two open issues remaining at
the time of this writing ( http://s.apache.org/mahout_0.8_issues )


Issues
------

There are no issues requiring board attention at this time.

15 May 2013 [Jake Mannix / Ross]

No report was submitted.

AI: Ross to pursue a report for Mahout

17 Apr 2013 [Jake Mannix / Ross]

No report was submitted.

AI: Ross to pursue a report for Mahout

20 Mar 2013 [Jake Mannix / Jim]

Apache Mahout provides implementations of machine learning algorithms
(collaborative filtering, clustering, classification, and more) for
large-scale data, mostly via Hadoop-based implementations.

Issues:

Sean Owen wishes to leave the Mahout PMC (but retain his commit rights),
but this is the only issue which needs the Board attention.

Current Activity: How has the community developed since the last
report? In February:

Originally planned for 0.8 release by March 8, but will be letting that
slip forward a few weeks.

Selection of Presentations, Articles and Outreach:

* Ted Dunning on new fast streaming clustering
(http://www.slideshare.net/tdunning/news-frommahout20130305)
* Fast clustering at ACM http://www.slideshare.net/tdunning/acm-20130225
* Real time learning http://www.slideshare.net/tdunning/real-time-learning
* MapR-Lucidworks on reflected intelligence
http://www.slideshare.net/tdunning/mapr-lucidworks-joint-webinar
* Ted Dunning at Strata on Mahout
http://www.slideshare.net/tdunning/strata-newyork2012
* Ted Dunning on fast clustering at Oxford
http://www.slideshare.net/tdunning/oxford-05oct2012
* MapR and Amex speak about large-scale analytics with Mahout
http://www.slideshare.net/tdunning/customer-analysisatscalestrata10022012
* Overstock and Mahout
http://www.wired.com/wiredenterprise/2012/12/mahout/
* Advanced Analytics in Mahout
http://portfortune.wordpress.com/2012/12/05/advanced-analytics-in-hadoop-part-one
* London Data Science http://datasciencelondon.org/tag/mahout/
* Mahout Updated in CDH 4.1
http://blog.cloudera.com/blog/2012/11/whats-new-in-cdh4-1-mahout/

Scientific publications based on Mahout

* Sebastian Schelter, Sean Owen: Collaborative Filtering with Apache Mahout,
Recommender Systems Challenge Workshop in conjunction with ACM RecSys 2012
http://ssc.io/wp-content/uploads/2013/02/cf-mahout.pdf
* Sebastian Schelter, Christoph Boden, Volker Markl: Scalable
Similarity-Based Neighborhood Methods with MapReduce,
ACM Conference on Recommender Systems 2012, Dublin
http://dl.acm.org/citation.cfm?id=2365984
http://ssc.io/wp-content/uploads/2012/06/rec11-schelter.pdf

Code

We were able to attract the developer of one of the leading scientific
recommender libraries [http://mymedialite.net/] to port a few
implementations to Mahout
(https://issues.apache.org/jira/browse/MAHOUT-1106,
 https://issues.apache.org/jira/browse/MAHOUT-1089)

However, new code contributions have slowed to a crawl, the number of
commits in the past few months, compared to prior years:

Feb 2013, 7
Jan 2013, 20
Dec 2012, 7

Feb 2012, 98
Jan 2012, 27
Dec 2011, 99

Feb 2011, 35
Jan 2011, 52
Dec 2010, 37

Feb 2010, 207
Jan 2010, 132
Dec 2009, 135

New Commercial Integrations

* Predixion Readmission Insight, a "a preventable readmission healthcare
solution" announced
http://www.virtual-strategy.com/2013/03/05/predixion-software-wins-microsoft-health-users-group-innovation-award
integration with Mahout, Greenplumb, Hive, and Microsoft's BI stack.
* Overstock and Mahout http://www.wired.com/wiredenterprise/2012/12/mahout

New Open Source Integrations

* The recommendation and advertisement network http://www.plista.com/en
has built an open source weblayer for Mahout's recommenders
https://github.com/plista/kornakapi
* Mahout seems to be the framework of choice for PredictionIO
http://prediction.io/, an open source prediction server for software
developers to create predictive features, such as personalization,
recommendation and content discovery


Mailing List Summary:

User list discussions are currently focussed primarily on bug reporting
and helping new users, but very little about future feature work.

Developer Mailing List Posting:

http://mail-archives.apache.org/mod_mbox/mahout-dev/
February 2013, 123
January 2013, 213
Dec 2012, 155

as compared to the same months in previous years:
Feb 2012, 578
Jan 2012, 545
Dec 2011, 1079

and

Feb 2011, 352
Jan 2011, 473
Dec 2010, 267

We've not had this low developer involvement since the first half of 2009.

User Mailing List Posting

http://mail-archives.apache.org/mod_mbox/mahout-user/
User list discussions are primarily in support of very new users, as well
as bug reporting on released versions (0.6 and sometimes even 0.5),
highlighting the need for 0.8 to be released.

While the traffic to the user mailing list has gone down slightly from
previous years:

Feb 2012, 288
Jan 2012, 367

Feb 2011, 359
Jan 2011, 458

Feb 2010, 497
Jan 2010, 272

This is not a dramatic decrease, as there is still considerable
interest in the user community.

Summary: How has the project developed since the last report:

A 1.0 release is not yet on the horizon.

== Milestones ==
1.) Working towards a 0.8 release
2.) Development on new, faster clustering code

20 Feb 2013 [Jake Mannix / Ross]

No report was submitted.

16 Jan 2013 [Jake Mannix / Greg]

No report was submitted.

17 Oct 2012

Change the Apache Mahout Project Chair

 WHEREAS, the Board of Directors heretofore appointed Jeff Eastman
 to the office of Vice President, Apache Mahout, and

 WHEREAS, the Board of Directors is in receipt of the resignation
 of Jeff Eastman from the office of Vice President, Apache Mahout,
 and

 WHEREAS, the Project Management Committee of the Apache Mahout
 project has chosen by vote to recommend Jake Mannix as the successor
 to the post;

 NOW, THEREFORE, BE IT RESOLVED, that Jeff Eastman is relieved and
 discharged from the duties and responsibilities of the office
 of Vice President, Apache Mahout, and

 BE IT FURTHER RESOLVED, that Jake Mannix be and hereby is
 appointed to the office of Vice President, Apache Mahout, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification, or
 until a successor is appointed.

 Special Order 7B, Change the Apache Mahout Project Chair, was
 approved by Unanimous Vote of the directors present.

17 Oct 2012 [Jeff Eastman / Roy]

Apache Mahout provides implementations of machine learning algorithms
(collaborative filtering, clustering, classification, and more) for
large-scale data, mostly via Hadoop-based implementations.

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

Activity has remained high during the past 3 months.

The user@mahout.a.o mailing list has 1448 current subscribers.
The dev@mahout.a.o mailing list has 734 current subscribers.

Now we are embarked upon a new 0.8 release. A goal of 0.8
is to continue clean up of existing functionality to improve
consistency and improve user experience. In this release, some
new additions to Mahout functionality are also planned.

Code freeze for 0.8 is targeted for Nov 15.

A 1.0 release is not yet on the horizon.

COMMUNITY

Jake Mannix has been elected to be the new Mahout PMC Chair.
Paritosh Ranjan has been elected to the Mahout PMC.
We have no new committers since our July report

25 Jul 2012 [Jeff Eastman / Sam]

Apache Mahout provides implementations of machine learning algorithms
(collaborative filtering, clustering, classification, and more) for
large-scale data, mostly via Hadoop-based implementations.

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

Activity has remained high during the past 3 months. We completed our
0.7 release on June 16th that closed 63 JIRA issues.

The user@mahout.a.o mailing list has 1379 current subscribers

Now we are embarked upon a new 0.8 release. A goal of 0.8
is to continue clean up of existing functionality to improve
consistency and improve user experience. In this release, some
new additions to Mahout functionality are also planned.

Code freeze for 0.8 is targeted for Nov 15.

A 1.0 release is not yet on the horizon.

COMMUNITY

We have no new committers since our April report.

(Mahout)

18 Apr 2012 [Jeff Eastman / Greg]

Apache Mahout provides implementations of machine learning algorithms
(collaborative filtering, clustering, classification, and more) for
large-scale data, mostly via Hadoop-based implementations.

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

Activity has remained high during the past 3 months. We completed our
0.6 release on Feb. 6th that closed 182 JIRA issues.

The user@mahout.a.o mailing list has 1271 current subscribers
The dev@mahout.a.o mailing list has 661 current subscribers

Now we are embarked upon a new 0.7 release. The goal of 0.7
is to clean up and refactor existing functionality to improve
consistency and improve user experience.

Code freeze for 0.7 is targeted for May 15.

A 1.0 release is not yet on the horizon.

COMMUNITY

We have two new committers since our January report:
- Paritosh Ranjan
- Tom Pierce


MAHOUT DISTRIBUTIONS

At least two commercially-supported Hadoop distributions now include
Mahout in their offerings (Cloudera, MapR). We will keep an eye out to
make sure they are distributed in accordance with Apache trademark
guidelines.


MAHOUT IN PRINT

"Mahout in Action", Owen, Anil, Dunning & Friedman is being well
received.
(http://manning.com/owen/)

21 Mar 2012 [Jeff Eastman / Greg]

Apache Mahout provides implementations of machine learning algorithms
(collaborative filtering, clustering, classification, and more) for
large-scale data, mostly via Hadoop-based implementations.

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

Activity has remained high during the past 3 months. We completed our
0.6 release on Feb. 6th that closed 182 JIRA issues.

Now we are embarked upon a new 0.7 release. The goal of 0.7
is to clean up and refactor existing functionality to improve
consistency and improve user experience.

Code freeze for 0.7 is targeted for May 15.

A 1.0 release is not yet on the horizon.

COMMUNITY

We have two new committers since our last report:
- Paritosh Ranjan
- Tom Pierce


MAHOUT DISTRIBUTIONS

Mahout now has multiple commercial distributions.

MAHOUT IN PRINT

"Mahout in Action", Owen, Anil, Dunning & Friedman is being well
received.
(http://manning.com/owen/)

AI: Shane ask PMC to update agenda

24 Jan 2012 [Jeff Eastman / Shane]

Apache Mahout provides implementations of machine learning algorithms
(collaborative filtering, clustering, classification, and more) for
large-scale data, mostly via Hadoop-based implementations.

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

Activity has been high during the past 3 months as we have begun
the release process for 0.6. Of 181 issues targeted for this release,
there are only 4 remaining.

Code freeze, originally targeted for Jan 1, is currently being delayed
by these outstanding issues. We still expect to release 0.6 in the
near future.

A 1.0 release is not yet on the horizon.

COMMUNITY

There are no new committers since last report.

Dmitriy Lyubimov has been elected a member of the Mahout PMC.

MAHOUT DISTRIBUTIONS

Mahout has been included in the Cloudera CDH3u2 release.
(http://www.cloudera.com/blog/2011/11/cdh3u2-apache-mahout-integration)

As with other commercial distributions we will keep an eye out to
make sure it is distributed in accordance with Apache trademark
guidelines.

MAHOUT IN PRINT

"Mahout in Action", Owen, Anil, Dunning & Friedman has been published
and is being well received.
(http://manning.com/owen/)

16 Nov 2011

Change the Apache Mahout Project Chair

 WHEREAS, the Board of Directors heretofore appointed Sean Owen to the
 office of Vice President, Apache Mahout, and

 WHEREAS, the Board of Directors is in receipt of the resignation of
 Sean Owen from the office of Vice President, Apache Mahout, and

 WHEREAS, the Project Management Committee of the Apache Mahout project
 has chosen by vote to recommend Jeff Eastman as the Successor to the
 post;

 NOW, THEREFORE, BE IT RESOLVED, that Sean Owen is relieved and
 discharged from the duties and responsibilities of the office of Vice
 President, Apache Mahout, and

 BE IT FURTHER RESOLVED, that Jeff Eastman be and hereby is appointed
 to the office of Vice President, Apache Mahout, to serve in accordance
 with and subject to the direction of the Board of Directors and the
 Bylaws of the Foundation until death, resignation, retirement, removal
 or disqualification, or until a successor is appointed.

 Special Order 7C, Resolution to Change the Apache Mahout
 Project Chair, was approved by Unanimous Vote of the directors
 present.

26 Oct 2011 [Sean Owen / Bertrand]

Apache Mahout provides implementations of machine learning algorithms
(collaborative filtering, clustering, classification, and more) for
large-scale data, mostly via Hadoop-based implementations.

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

Activity has been moderate during the past 3 months. There were no
new releases, and the 0.6 release process is not yet begun, though
will likely start within the next 2 months. Judging by Fixed issue
count, 0.6 is about 60% as far along as previous releases.

A 1.0 release is not yet on the horizon.

MAHOUT DISTRIBUTIONS

It appears that Mahout will be bundled with Cloudera soon.
(https://groups.google.com/a/cloudera.org/group/cdh-user/
 browse_thread/thread/5df8c1cb6d39288d?pli=1)

As with other commercial distributions we'll keep an eye out to
make sure it's distributed in accordance with Apache trademark
guidelines.

MAHOUT IN PRINT

Mahout in Action has at last been published.
http://manning.com/owen/

Bertrand notes that the community section is missing.

AI Bertrand: ask Mahout PMC chair for a community report next time

20 Jul 2011 [Sean Owen / Doug]

=== Apache Mahout Status Report: July 2011 ===

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

Apache Mahout 0.5 was released on May 27 2011. It resolved 137
issues:

https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true
 &jqlQuery=project+%3D+MAHOUT+AND+fixVersion+%3D+%220.5%22

The PMC plans an 0.6 release at the end of the year. The focus
continues to be on polish and refinement in advance of a 1.0 release;
A 1.0 release may come in mid 2012 but is not yet being planned.

The community continues to grow steadily. The user and dev lists
contained 793 and 470 subscribers, respectively, in January 2011.
They now contain 983 and 557 respectively. We've seen healthy
community activity around the world, including new talks at events
from Berlin, Seoul, London and Chicago.

The project has one area of significant new activity: graph mining
and graph-related algorithms. For example, Mahout has a
PageRank-like implementation now.

MAHOUT PMC

Sebastian Schelter was added to the PMC in May 2011.

PROJECT BRANDING

The project made changes to comply with Apache branding guidelines
earlier in the year, but reconfirms that the site is in compliance
with http://www.apache.org/foundation/marks/pmcs#checklist

20 Apr 2011 [Sean Owen / Jim]

=== Apache Mahout Status Report: April 2011 ===

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

The project expects to continue with an 0.5 release around May 2011.
115 issues have been resolved for 0.5, with 7 more planned before the
release:
https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true
 &jqlQuery=project+%3D+MAHOUT+AND+fixVersion+%3D+%220.5%22

After that, we believe, will be a 1.0 release, though it is possible the
PMC will elect to issue an interim 0.6 release later in the year.
The focus will change to making the code base stable and 1.0-ready.

NEW MAHOUTS

Apache Mahout added Dmitriy Lyubimov and Shannon Quinn as new committers
in February 2011.

MAHOUT ON THE GO

The community has recorded 12 talks on Mahout since the last release, a
substantial increase in volume and diversity:
https://cwiki.apache.org/MAHOUT/books-tutorials-and-talks.html

MAHOUT IN PRINT

The book "Mahout in Action", published by Manning, has been completed and
will be published in July 2011.

The book "Taming Text", also published by Manning, is also nearing completion
and contains substantial coverage of Mahout and text clustering.

19 Jan 2011 [Sean Owen / Greg]

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

Apache Mahout released version 0.4 on October 31, 2010. 0.4 included changes
related to 153 issues, summarized here:
https://issues.apache.org/jira/browse/MAHOUT/fixforversion/12314396

It continues to change significantly and across the board, though a certain
consistent scope and identity is confirming itself at this stage. It is
a Java-based scalable data mining library that currently has much of its
implementation based on Apache Hadoop 0.20.x. It currently covers, primarily,
collaborative filtering, clustering, classification, frequent itemset mining,
and some related and supporting algorithms.

The project expects to continue with an 0.5 release around May 2011.
The 57 issues to date that are resolved or are being worked on for 0.5 are:
https://issues.apache.org/jira/secure/IssueNavigator.jspa?pid=12310751
 &fixfor=12315255

After that, we believe, will be a 1.0 release. From 0.5, the focus will
change to making the code base stable and 1.0-ready.

MAHOUT IN ACTION

The book "Mahout in Action", published by Manning, has been completed and
will be published in February 2011.

20 Oct 2010 [Sean Owen / Roy]

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

The project is in "code freeze" leading up to a final 0.4 release
planned for this week. The 150 issues resolved for this release
can be viewed here:

https://issues.apache.org/jira/secure/IssueNavigator.jspa?
 pid=12310751&fixfor=12314396

As of 0.4, the project will still be in a state of significant
change and evolution. We still plan an 0.5 release in 6 months
before contemplating a 1.0 release. However we believe the project's
code base is beginning to stabilize, as relatively more effort is
going into code cleanup, tests, polishing, removal of stale code.

Judging by volume of mailing list messages and diversity of senders
we have reason to believe usage of Apache Mahout is beginning to
significantly expand.

NEW COMMITTERS

Sebastian Schelter was elected as a new committer in recognition of
work on distributed recommender implementations.

GOOGLE SUMMER OF CODE

Mahout completed its GSoC projects. Two did not complete due to lack of
student participation. Two completed successfully. One remains in progress.

MAHOUT IN ACTION

The book "Mahout in Action", published by Manning, has reached 15/16
chapters complete and will soon enter final review.

PROJECT BRANDING

We've reviewed the Apache Mahout home page (http://mahout.apache.org)
just this week, per the e-mail request regarding branding.

Project committer Robin Anil is addressing the following issues in
this regard:

- Add standard www.apache.org links to navigation
- Ensure "TM" is used appropriate in names and logos
- Add a DOAP file (we are having issues with the generator but that
 can be taken up offline)

Shane appreciates Mahout's being proactive on implementing the new branding policy.

21 Jul 2010 [Sean Owen / Bertrand]

=== Mahout Status Report: July 2010 ===

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

The project continues to target September, 2010 for release of version 0.4.
This is unchanged since the last report. Recent activity in the project
can be viewed here:

https://issues.apache.org/jira/secure/IssueNavigator.jspa?
 pid=12310751&fixfor=12314396&resolution=1

WEBSITE

The project's website at mahout.apache.org has been completely
redesigned:
http://mahout.apache.org/

GOOGLE SUMMER OF CODE

As part of Google's Summer of Code program, Mahout is halfway through
mentoring five projects. The projects will add or enhance capability in
the specific areas of:

- Boltzmann Machines
- Support Vector Machines
- Singular Value Decomposition for recommendations
- Neural network with back propagation learning
- Eigencuts spectral clustering

MAHOUT IN ACTION

The book "Mahout in Action", published by Manning, continues to be
written and is in 2/3 completion review with the publisher.

EXTERNAL EVENTS

Mahout's recommender system was presented in the key note and two talks
at the Berlin Buzzwords 2010 event.

Jim complemented the project on the format of their report.

16 Jun 2010 [Sean Owen / Roy]

ISSUES

There are no issues requiring board attention at this time.

CURRENT ACTIVITY

The project continues to target September, 2010 for release of version 0.4.
Recent activity in the project can be viewed here:

https://issues.apache.org/jira/secure/IssueNavigator.jspa?pid=12310751&fixfor=12314396&resolution=1

In particular:
- First real support for distributed recommenders has been released

The project has completed migration of mailing lists and website
to mahout.apache.org.

GOOGLE SUMMER OF CODE

As part of Google's Summer of Code program, Mahout has begun work
mentoring five projects.
The projects will add or enhance capability in the specific
areas of:

- Boltzmann Machines
- Support Vector Machines
- Singular Value Decomposition for recommendations
- Neural network with back propagation learning
- Eigencuts spectral clustering

MAHOUT IN ACTION

The book "Mahout in Action", published by Manning, continues to be
written and is entering 2/3 completion review with the publisher.

19 May 2010 [Sean Owen / Roy]

=== Mahout Status Report: May 2010 ===

(This is the first report from Mahout as a top-level Apache project;
previously it was a subproject of Apache Lucene. Mahout
recently reported status with Lucene's special April report. We take the
opportunity to summarize Mahout state and restate recent activity.)

ISSUES

There are no issues requiring board attention at this time.

OVERVIEW

Mahout's goal is to build scalable implementations of machine learning and
data mining algorithms. "Scalable" means designed with exceptional scale in
mind, for efficiency and low memory consumption, and in many cases means
providing Hadoop-based implementations. The "machine learning" implemented
to date has been primarily in the broad areas of:

- Collaborative filtering / recommender engines
- Clustering
- Classification
- Frequent item set mining
- Evolutionary algorithms

CURRENT ACTIVITY

Mahout has created a release approximately every six months, most recently
releasing version 0.3 in March 2010. The project remains in a state of
rapid change and evolution, and looks to release 0.4 in September, 2010.
Recent activity in the project can be viewed here:

https://issues.apache.org/jira/secure/IssueNavigator.jspa?
 pid=12310751&fixfor=12314396&resolution=1

This month, Mahout will complete migration of website, mailing lists,
SVN, and other information to reflect its status as a top-level project.

GOOGLE SUMMER OF CODE

Mahout will mentor five projects as part of Google's Summer of Code
program. The projects will add or enhance capability in the specific
areas of:

- Boltzmann Machines
- Support Vector Machines
- Singular Value Decomposition for recommendations
- Neural network with back propagation learning
- Eigencuts spectral clustering

MAHOUT IN ACTION

The book "Mahout in Action", published by Manning, continues to be written
and is approximately half complete. It has received some favorable feedback
via Manning's early access program.

Great progress!

21 Apr 2010

Establish the Apache Mahout Project

 WHEREAS, the Board of Directors deems it to be in the best
 interests of the Foundation and consistent with the
 Foundation's purpose to establish a Project Management
 Committee charged with the creation and maintenance of
 open-source software related to a machine learning platform
 for distribution at no charge to the public.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management
 Committee (PMC), to be known as the "Apache Mahout Project",
 be and hereby is established pursuant to Bylaws of the
 Foundation; and be it further

 RESOLVED, that the Apache Mahout Project be and hereby is
 responsible for the creation and maintenance of software
 related to a machine learning platform; and be it further

 RESOLVED, that the office of "Vice President, Apache Mahout" be
 and hereby is created, the person holding such office to
 serve at the direction of the Board of Directors as the chair
 of the Apache Mahout Project, and to have primary responsibility
 for management of the projects within the scope of
 responsibility of the Apache Mahout Project; and be it further

 RESOLVED, that the persons listed immediately below be and
 hereby are appointed to serve as the initial members of the
 Apache Mahout Project:

   * Abdelhakim Deneche <adeneche@apache.org>>
   * Isabel Drost <isabel@apache.org>
   * Ted Dunning <tdunning@apache.org>
   * Jeff Eastman <jeastman@apache.org>
   * Drew Farris <drew@apache.org>
   * Grant Ingersoll <gsingers@apache.org>
   * Benson Margulies <bimargulies@apache.org>
   * Sean Owen <srowen@apache.org>
   * Robin Anil <robinanil@apache.org>
   * Jake Mannix <jmannix@apache.org>

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that Sean Owen
 be appointed to the office of Vice President, Apache Mahout, to
 serve in accordance with and subject to the direction of the
 Board of Directors and the Bylaws of the Foundation until
 death, resignation, retirement, removal or disqualification,
 or until a successor is appointed; and be it further

 RESOLVED, that the initial Apache Mahout PMC be and hereby is
 tasked with the creation of a set of bylaws intended to
 encourage open development and increased participation in the
 Apache Mahout Project; and be it further

 RESOLVED, that the Apache Mahout Project be and hereby
 is tasked with the migration and rationalization of the Apache
 Lucene Mahout sub-project; and be it further

 RESOLVED, that all responsibilities pertaining to the Apache
 Lucene Mahout sub-project encumbered upon the
 Apache Lucene Project are hereafter discharged.

 Special Order 7A, Establish the Apache Mahout Project, was
 approved by Unanimous Vote of the directors present.