This was extracted (@ 2024-11-19 16:10) from a list of minutes
which have been approved by the Board.
Please Note
The Board typically approves the minutes of the previous meeting at the
beginning of every Board meeting; therefore, the list below does not
normally contain details from the minutes of the most recent Board meeting.
WARNING: these pages may omit some original contents of the minutes.
Meeting times vary, the exact schedule is available to ASF Members and Officers, search for "calendar" in the Foundation's private index page (svn:foundation/private-index.html).
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. ### Three most important unfinished issues to address before graduating: 1. Revitalize the project by resuming development 2. Enlarge the user base and contributing community ### Are there any issues that the IPMC or ASF Board need to be aware of? None. ### How has the community developed since the last report? - Mailing list activity: - @dev: 4 messages ### How has the project developed since the last report? We are working in an implementation of adaptive random forests. ### How would you assess the podling's maturity? Please feel free to add your own commentary. - [ ] Initial setup - [ ] Working towards first release - [X] Community building - [ ] Nearing graduation - [ ] Other: ### Date of last release: 2020-04-23 ### When were the last committers or PPMC members elected? August 2019 ### Have your mentors been helpful and responsive? There are no mentors since June. ### Is the PPMC managing the podling's brand / trademarks? Yes. There are no 3rd parties incorrectly using the podling‘s name and brand for now. ### Signed-off-by: ### IPMC/Shepherd notes: Justin Mclean: I suggest you reach out the the IPMC general list and ask for more mentors. Given the low activity it may be hard to attract one. Can the project point to me to where this development on adaptive random forests is? There doesn't seem to be any commits for almost a year.
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. ### Three most important unfinished issues to address before graduating: 1. Revitalize the project by resuming development 2. Enlarge the user base and contributing community ### Are there any issues that the IPMC or ASF Board need to be aware of? None. ### How has the community developed since the last report? - Mailing list activity: - @dev: 14 messages ### How has the project developed since the last report? We are working in an implementation of adaptive random forests. ### How would you assess the podling's maturity? Please feel free to add your own commentary. - [ ] Initial setup - [ ] Working towards first release - [X] Community building - [ ] Nearing graduation - [ ] Other: ### Date of last release: 2020-04-23 ### When were the last committers or PPMC members elected? August 2019 ### Have your mentors been helpful and responsive? Yes, the mentors have been helpful and responsive. ### Is the PPMC managing the podling's brand / trademarks? Yes. There are no 3rd parties incorrectly using the podling‘s name and brand for now. ### Signed-off-by: ### IPMC/Shepherd notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. ### Three most important unfinished issues to address before graduating: 1. Revitalize the project by resuming development 2. Enlarge the user base and contributing community ### Are there any issues that the IPMC or ASF Board need to be aware of? None. ### How has the community developed since the last report? - Mailing list activity: - @dev: 11 messages ### How has the project developed since the last report? With the help of the new commiter Corey Sterling, we prepared a new release of Apache SAMOA. ### How would you assess the podling's maturity? Please feel free to add your own commentary. - [ ] Initial setup - [ ] Working towards first release - [X] Community building - [ ] Nearing graduation - [ ] Other: ### Date of last release: 2020-04-23 ### When were the last committers or PPMC members elected? August 2019 ### Have your mentors been helpful and responsive? Yes, the mentors have been helpful and responsive. ### Is the PPMC managing the podling's brand / trademarks? Yes. There are no 3rd parties incorrectly using the podling‘s name and brand for now. ### Signed-off-by: - [X] (samoa) Alan Gates Comments: Good to see an initial release. ### IPMC/Shepherd notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. ### Three most important unfinished issues to address before graduating: 1. Revitalize the project by resuming development 2. Enlarge the user base and contributing community ### Are there any issues that the IPMC or ASF Board need to be aware of? None. ### How has the community developed since the last report? - Mailing list activity: - @dev: 31 messages ### How has the project developed since the last report? With the help of the new commiter Corey Sterling, we prepared a new release of Apache SAMOA, that is already being discussed and voted (RC3). ### How would you assess the podling's maturity? Please feel free to add your own commentary. - [ ] Initial setup - [ ] Working towards first release - [X] Community building - [ ] Nearing graduation - [ ] Other: ### Date of last release: 2016-09-30 ### When were the last committers or PPMC members elected? August 2019 ### Have your mentors been helpful and responsive? Yes, the mentors have been helpful and responsive. ### Is the PPMC managing the podling's brand / trademarks? Yes. There are no 3rd parties incorrectly using the podling‘s name and brand for now. ### Signed-off-by: - [X] (samoa) Alan Gates Comments: With the election of the new committer it looks like the project is slowly making some progress. I'm happy to see it start moving forward. ### IPMC/Shepherd notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. ### Three most important unfinished issues to address before graduating: 1. Revitalize the project by resuming development 2. Enlarge the user base and contributing community ### Are there any issues that the IPMC or ASF Board need to be aware of? Nothing important. It took more than 2 months to create the account of Corey Sterling, as it seems that some things are still done manually, and there was a mistake with his email address. Finally, we could solve it out. ### How has the community developed since the last report? * Mailing list activity: * @dev: 33 messages ### How has the project developed since the last report? * Retirement has been suggested due to its very low activity * To restart engagement, Corey Sterling has been elected as commiter in late August. With his help, we prepared a new release of Apache SAMOA, that is already being discussed and voted (RC). ### How would you assess the podling's maturity? Please feel free to add your own commentary. - [ ] Initial setup - [ ] Working towards first release - [X] Community building - [ ] Nearing graduation - [ ] Other: ### Date of last release: 2016-09-30 ### When were the last committers or PPMC members elected? August 2019 ### Have your mentors been helpful and responsive? Yes, the mentors have been helpful and responsive. ### Is the PPMC managing the podling's brand / trademarks? Yes. There are no 3rd parties incorrectly using the podling‘s name and brand for now. ### Signed-off-by: - [X] (samoa) Alan Gates Comments: - [ ] (samoa) Ashutosh Chauhan Comments: ### IPMC/Shepherd notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. ### Three most important unfinished issues to address before graduating: 1. Revitalize the project by resuming development 2. Enlarge the user base and contributing community ### Are there any issues that the IPMC or ASF Board need to be aware of? None ### How has the community developed since the last report? * Mailing list activity: * @dev: 12 messages ### How has the project developed since the last report? * Retirement has been suggested due to its very low activity * To restart engagement, Corey Sterling has been elected as commiter in late August. With his help, we are preparing a new release of Apache SAMOA in early September. Corey Sterling has experience on open source software, as an example, he did the last release of MOA, an open source software non-distributed for data streams very related to SAMOA. He works at University of Waikato. ### How would you assess the podling's maturity? Please feel free to add your own commentary. - [ ] Initial setup - [ ] Working towards first release - [X] Community building - [ ] Nearing graduation - [ ] Other: ### Date of last release: 2016-09-30 ### When were the last committers or PPMC members elected? August 2019 ### Have your mentors been helpful and responsive? Yes, the mentors have been helpful and responsive. ### Signed-off-by: - [X] (samoa) Alan Gates Comments: Happy to see a new committer, hopefully this will help kickstart the project. - [ ] (samoa) Ashutosh Chauhan Comments: ### IPMC/Shepherd notes: Dave Fisher: Let's see if bringing on a new committer revitalizes the podling.
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. ### Three most important unfinished issues to address before graduating: 1. Revitalize the project by resuming development 2. Enlarge the user base and contributing community ### Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None ### How has the community developed since the last report? * Mailing list activity: * @dev: 8 messages ### How has the project developed since the last report? * 1 new PRs created * Working on code for supporting Apache Heron ### How would you assess the podling's maturity? Please feel free to add your own commentary. - [ ] Initial setup - [ ] Working towards first release - [X] Community building - [ ] Nearing graduation - [ ] Other: ### Date of last release: 2016-09-30 ### When were the last committers or PPMC members elected? January 2018 ### Have your mentors been helpful? Yes, the mentors have been helpful and responsive. ### Signed-off-by: - [X] (samoa) Alan Gates Comments: Dave Fisher (shepherd for this reporting cycle) recently suggested that this podling consider retirement as there is very low activity and it has been incubator for quite some time. Whether and how to attempt to restart engagement is being discussed. - [ ] (samoa) Ashutosh Chauhan Comments: ### IPMC/Shepherd notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Revitalize the project by resuming development 2. Enlarge the user base and contributing community Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? * Mailing list activity (September 2018 - November 2018): * @dev: 8 messages How has the project developed since the last report? * 1 new PRs created * Working on code for supporting Apache Heron How would you assess the podling's maturity? Please feel free to add your own commentary. [ ] Initial setup [ ] Working towards first release [x] Community building [ ] Nearing graduation [ ] Other: Date of last release: 2016-09-30 When were the last committers or PPMC members elected? January 2018 Have your mentors been helpful and responsive or are things falling through the cracks? In the latter case, please list any open issues that need to be addressed. Yes, the mentors have been helpful and responsive. Signed-off-by: [X](samoa) Alan Gates Comments: [ ](samoa) Ashutosh Chauhan Comments: IPMC/Shepherd notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Revitalize the project by resuming development 2. Enlarge the user base and contributing community Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? * Mailing list activity (September 2018 - November 2018): * @dev: 30 messages How has the project developed since the last report? * 2 new PRs created * Working on code for supporting Apache Heron How would you assess the podling's maturity? Please feel free to add your own commentary. [ ] Initial setup [ ] Working towards first release [x] Community building [ ] Nearing graduation [ ] Other: Date of last release: 2016-09-30 When were the last committers or PPMC members elected? January 2018 Have your mentors been helpful and responsive or are things falling through the cracks? In the latter case, please list any open issues that need to be addressed. Yes, the mentors have been helpful and responsive. Signed-off-by: [X](samoa) Alan Gates Comments: [ ](samoa) Ashutosh Chauhan Comments: IPMC/Shepherd notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Revitalize the project by resuming development 2. Enlarge the user base and contributing community Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? * Mailing list activity (June 2018 - August 2018): * @dev: 19 messages How has the project developed since the last report? * 2 new PRs created * Presentation at KDD 2018 (MUD3 workshop) on urban data mining using Apache SAMOA How would you assess the podling's maturity? Please feel free to add your own commentary. [ ] Initial setup [ ] Working towards first release [X] Community building [ ] Nearing graduation [ ] Other: Date of last release: 2016-09-30 When were the last committers or PPMC members elected? January 2018 Signed-off-by: [X](samoa) Alan Gates Comments: I agree with Ted, I remain concerned about the lack of activity in this podling. [ ](samoa) Ashutosh Chauhan Comments: [ ](samoa) Enis Soztutar Comments: [x](samoa) Ted Dunning Comments: IPMC/Shepherd notes: ted: Almost no mailing list activity since April. March looked promising, but after voting in a new committer, nothing much else happened.
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Revitalise the project by resuming development 2. Enlarge the user base and contributing community Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? * Mailing list activity (March 2018 - May 2018): * @dev: 25 messages How has the project developed since the last report? * Work on the project has slowed down, no new development since the last report How would you assess the podling's maturity? Please feel free to add your own commentary. [ ] Initial setup [ ] Working towards first release [X] Community building [ ] Nearing graduation [ ] Other: Date of last release: 2016-09-30 When were the last committers or PPMC members elected? January 2018 Signed-off-by: [X](samoa) Alan Gates Comments: I am glad to see the project is aware it needs to increase the pace of development. Hopefully this will lead to more contributions from the core team. [ ](samoa) Ashutosh Chauhan Comments: [ ](samoa) Enis Soztutar Comments: [ ](samoa) Ted Dunning Comments: IPMC/Shepherd notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache Flink, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Enlarge the contributing community 2. Have new companies and organizations using the Apache Samoa technology Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? Mailing list activity (December 2017 - February 2018): * @dev: 23 messages Jira issues backlog (December 2017 - February 2018): * Created: 2 * Resolved: 1 How has the project developed since the last report? * Just elected two new members to the Apache Samoa PPMC * Planning a new release in the next month * Worked on consolidating input and output API, to work better with multiple types of data How would you assess the podling's maturity? Please feel free to add your own commentary. [ ] Initial setup [ ] Working towards first release [X] Community building [ ] Nearing graduation [ ] Other: Date of last release: 2016-09-30 When were the last committers or PPMC members elected? January 2018 Signed-off-by: [ ](samoa) Alan Gates Comments: [ ](samoa) Ashutosh Chauhan Comments: [ ](samoa) Enis Soztutar Comments: [X](samoa) Ted Dunning Comments: Is this community really generating enough momentum to graduate?
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache Flink, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Elect new PPMC members 2. Enlarge the community 3. Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? Mailing list activity (September 2017 - November 2017): * @dev: 49 messages Jira issues backlog (September 2017 - November 2017): * Created: 5 * Resolved: 1 How has the project developed since the last report? * Planning a new release in the next month. * Presentation at the Flink Forward Conference from Orange's team * Integration of a new Boosting algorithm * Improved Apache Kafka support and support for JSON and Avro. * Improved SAMOA instances. How would you assess the podling's maturity? Please feel free to add your own commentary. [ ] Initial setup [ ] Working towards first release [X] Community building [ ] Nearing graduation [ ] Other: Date of last release: 2016-09-30 When were the last committers or PPMC members elected? None Signed-off-by: [X](samoa) Alan Gates Comments: In response to shepherd's comments (see below) community started a vote on a couple of contributors becoming committers. [ ](samoa) Ashutosh Chauhan Comments: [ ](samoa) Enis Soztutar Comments: [ ](samoa) Ted Dunning Comments: IPMC/Shepherd notes: Concerned about the lack of community growth. Very few commits this year yet there are pull requests by contributors. Sent an email to private@samoa
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache Flink, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Elect new PPMC members 2. Enlarge the community 3. Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? Mailing list activity (June 2017 - August 2017): * @dev: 120 messages Jira issues backlog (June 2017 - August 2017): * Created: 4 * Resolved: 5 We saw the engagement of new contributors, who are plausible candidates for the PPMC. How has the project developed since the last report? * Planning a new release in the next month. Work on the release was delayed due to summer holidays for most of the contributors. * Integrated Apache Kafka support and support for JSON and Avro. * Developed a new Boosting algorithm that will be integrated shortly. * Integrated new version of the SAMOA instances. * Integrated ability to store predictions on disk. How would you assess the podling's maturity? Please feel free to add your own commentary. [ ] Initial setup [ ] Working towards first release [X] Community building [ ] Nearing graduation [ ] Other: Date of last release: 2016-09-30 When were the last committers or PPMC members elected? None Signed-off-by: [X](samoa) Alan Gates Comments: [ ](samoa) Ashutosh Chauhan Comments: [ ](samoa) Enis Soztutar Comments: [ ](samoa) Ted Dunning Comments:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Flink, Apache Storm, Apache Apex and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Grow the developer base 2. Grow the user base 3. Add some more ML techniques Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? Mailing list activity (March 2017 - May 2017): * @dev: 38 messages Jira issues backlog (March 2017 - May 2017): * Created: 6 * Resolved: 1 - We organized a 2-day internal workshop on SAMOA at Telefonica I+D with researchers of Orange Labs, Telecom Paris and QCRI. Various have been discussed to improve SAMOA's ML and other interfaces with systems like Kafka, etc. - We submitted a proposal for funding which will help the development of new features on Samoa. - We have been working on a new release 0.5.0, probably coming up in one month. This release will have support for Kafka, a more well-rounded support for Avro and Json formats, as well as support to store predictions outputted from the model. - There was also significant work done to integrate correctly the instances between MOA and SAMOA, as there was a deviation of how they were defined and hindered the portability of new methods from MOA to SAMOA. - Early discussions to increase the committers / developers team by inviting new contributors to the panel. How has the project developed since the last report? - We are looking into new ML techniques for development. - Worked on the integration of Samoa-MOA instances - Engaging interactions with new parties (Orange Labs) and potential collaborations How would you assess the podling's maturity? Please feel free to add your own commentary. [ ] Initial setup [ ] Working towards first release [X] Community building [ ] Nearing graduation [ ] Other: Date of last release: 2016-09-30 When were the last committers or PMC members elected? None Signed-off-by: [X](samoa) Alan Gates Comments: [ ](samoa) Ashutosh Chauhan Comments: [ ](samoa) Enis Soztutar Comments: [ ](samoa) Ted Dunning Comments: IPMC/Shepherd notes: Drew Farris (shepherd): One mentor active. Light activity on the project in general.
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Flink, Apache Storm, Apache Apex and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Grow the developer base 2. Grow the user base 3. Add some more ML techniques Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? Mailing list activity (December 2016 - February 2017): * @dev: 24 messages Jira issues backlog (December 2016 - February 2017): * Created: 2 * Resolved: 0 - We are organizing an internal workshop on SAMOA at Telefonica I+D with researchers of Orange Labs - We had a presentation of Apache Samoa at Paris Machine Learning Meetup and Hamburg Machine Learning Meetup - Bhupesh Chawda has presented Apache SAMOA in different venues with his presentation "Machine Learning Support in Apache Apex (Next Gen Hadoop) with Apache SAMOA" - We invited edi_bice who made several contributions last spring to become a committer. - We published a scientific paper using Apache Samoa and one of its machine learning techniques (Vertical Hoeffding Tree) presented in IEEE BigData Conference last December 2016. - We submitted a paper on SAMOA to a post-proceedings book of the workshops MUSE/MSM 2015/2016 in the Springer LNCS/LNAI series. How has the project developed since the last report? - We are looking into new ML techniques for development. How would you assess the podling's maturity? Please feel free to add your own commentary. [ ] Initial setup [ ] Working towards first release [X] Community building [ ] Nearing graduation [ ] Other: Date of last release: 2016-09-30 When were the last committers or PMC members elected? None Signed-off-by: [X](samoa) Alan Gates Comments: Activity on this podling remains low. It is good to see a new committer being elected. Shouldn't that be noted in the final section on when new committers and PMC members were elected? [ ](samoa) Ashutosh Chauhan Comments: [ ](samoa) Enis Soztutar Comments: [ ](samoa) Ted Dunning Comments: -------------------- Singa Singa is a distributed deep learning platform. Singa has been incubating since 2015-03-17. Three most important issues to address in the move towards graduation: 1. Improve distributed training in SINGA V1.2 2. Improve the documentation and add more examples 3. Attract more contributors Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? No How has the community developed since the last report? There were 66, 60, and 43 emails from dev@ list in December 2016, January 2017 and February 2017 respectively. There are 67 new commits since the last report. One new committer (Li Boon Tan) was added. How has the project developed since the last report? We released the V1.1 version after the last report. The following features were added after last report + Ease the installation process via Docker images, debian packages, conda packages and Amazon AMI (CPU version) + Integrate with Jenkins for automatically generating convenient packages and updating the website. + Improve the model classes: adding debug mode, adding the Concat and Slice layers, and supporting model loading and saving via the Snapshot API + Add image_tool.py for image augmentation and rafiki sub-package for providing RESTFul APIs. + Enable Java binding (basic) for SINGA + Add examples pre-trained from Caffe, e.g., GoogleNet, and examples pre-trained from torch, e.g. ResNet How would you assess the podling's maturity? Please feel free to add your own commentary. [ ] Initial setup [ ] Working towards first release [X] Community building [ ] Nearing graduation [ ] Other: Date of last release: 2017-02-12 When were the last committers or PMC members elected? 2017-02-26 Signed-off-by: [ ](singa) Daniel Dai Comments: [X](singa) Alan Gates Comments: On the maturity assessment I would say the community is in the "Community building" phase but not far from the "Nearing graduation" phase. [ ](singa) Ted Dunning Comments: [ ](singa) Thejas Nair Comments:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Flink, Apache Storm, Apache Apex and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Grow the developer base 2. Grow the user base 3. Add some more ML techniques Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? Mailing list activity (September 2016 - November 2016): * @dev: 87 messages Jira issues backlog (September 2016 - November 2016): * Created: 0 * Resolved: 1 - We manage to resolve the pending issue regarding the integration of Apache Apex with Samoa. - Also, we made some testing progress with Apache Gearpump. - We also had a presentation of Apache Samoa and its integration with Apache Apex in Apache BigData Europe 2016 - We also published a scientific paper using Apache Samoa and one of its machine learning techniques (Vertical Hoeffding Tree) (to be presented in IEEE BigData Conference in December 2016) How has the project developed since the last report? - We have performed a new release with several bug fixes and new features (0.4.0) - We are looking into new ML techniques for development. Date of last release: 2016-09-30 When were the last committers or PMC members elected? None Signed-off-by: [X](samoa) Alan Gates [ ](samoa) Ashutosh Chauhan [ ](samoa) Enis Soztutar [X](samoa) Ted Dunning Shepherd/Mentor notes: Alan Gates: Very low activity on this project over the last few months, with last commit showing as Sept 22 and very few commits since last March. A big need here beyond activity is to grow the community. There is a one individual who made several contributions last spring; I sent an email to the mailing list asking if it made sense to invite him to become a committer.
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Perform another release 2. Grow the developer base 3. Grow the user base Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? Mailing list activity (June 2016 - August 2016): * @dev: 77 messages Jira issues backlog (June 2016 - August 2016): * Created: 1 * Resolved: 0 We have had interest from both the Apex and Gearpump communities. How has the project developed since the last report? We have been preparing a second release of the project, which was delayed due to some minor issues in the RC and then the summer break for most developers. Date of last release: 2015-07-21 When were the last committers or PMC members elected? None Signed-off-by: [X](samoa) Alan Gates [ ](samoa) Ashutosh Chauhan [ ](samoa) Enis Soztutar [ ](samoa) Ted Dunning
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache Flink, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Grow the community 2. Elect new PMC members 3. Release more often Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? Mailing list activity (March 2016 - May 2016): * @dev: 202 messages Jira issues backlog (March 2016 - May 2016): * Created: 3 * Resolved: 9 We have a number of new users showing up on the mailing list, which is encouraging. How has the project developed since the last report? After the decision taken by the community to drop support for S4, we removed the support for S4. We fixed automatic build issues, we simplified input from HDFS, and we worked on adding support for Apache Gearpump. Two talks were given to advertise Apache SAMOA, one at ApacheCon NA Big Data by Nicolas Kourtellis, and another one at the "J on the beach" conference by Albert Bifet. Date of last release: 2015-07-21 When were the last committers or PMC members elected? None Signed-off-by: [X](samoa) Alan Gates [ ](samoa) Ashutosh Chauhan [ ](samoa) Enis Soztutar [X](samoa) Ted Dunning Shepherd/Mentor notes: Alan Gates: I sent a mail to the community poking on the long wait since the last release and whether they think there will be a release soon. I also expressed my concern on not having added any committers and asked if there were any contributors who looked like they might be a good candidate for committership.
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Grow the community 2. Elect new PMC members 3. Release more often Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? Mailing list activity (December 2015 - February 2016): * @dev: 114 messages Jira issues backlog (December 2015 - February 2016): * Created: 7 * Resolved: 4 We have a number of new users showing up on the mailing list, which is encouraging. How has the project developed since the last report? The community has decided to drop support for S4, which is now inactive. There is also some continued interest by Apache Apex about a possible integration. Internally, we have been working on strengthening our main classifier (VHT). There has also been work on improving the connection with Avro and Kafka. The adapted for Flink has been updated to the latest stable version (0.10). Date of last release: 2015-07-21 When were the last committers or PMC members elected? None Signed-off-by: [X](samoa) Alan Gates [ ](samoa) Ashutosh Chauhan [ ](samoa) Enis Soztutar [X](samoa) Ted Dunning Shepherd/Mentor notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Flink, Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Grow the community 2. Elect new PMC members 3. Following up our first release with further releases Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None. How has the community developed since the last report? Mailing list activity (September-October-November 2015): * @dev: 120 messages Jira issues backlog (September-October-November 2015): * Created: 8 * Resolved: 6 Our main goal is to grow the community, which is still pretty small. We are doing a large amount of dissemination work in conferences and events to promote SAMOA. For example, we had two talks at technical conferences: 1) A talk at the Apache Conference BigData, in Budapest, Hungary, during the week of September 28-30, 2015. 2) A talk at the Flink Forward, a conference for the Apache Flink DSPE, in Berlin, Germany, October 12-13, 2015 We have had contributions from outside the PPMC. In particular, we have had contributions for: 1) integrating Apache Avro input with Apache Samoa, 2) a proposal for integrating Apache APEX as a new DSPE, 3) continued the collaboration with the Apache Flink community. How has the project developed since the last report? Main developments: * Performed development and testing of the VHT module. * Fixed various bugs and improved ensemble and bagging methods. * Worked on the integration of new data sources like Apache Avro. * Improved the website to include more material for new contributors. Date of last release: 2015-07-21 When were the last committers or PMC members elected? None Signed-off-by: [X](samoa) Alan Gates [ ](samoa) Ashutosh Chauhan [ ](samoa) Enis Soztutar [x](samoa) Ted Dunning Shepherd/Mentor notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Flink, Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Grow the community 2. Elect new PMC members 3. Following up our first release with further releases Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? Mailing list activity (June-July-August 2015): * @dev 225 messages Jira issues backlog (June-July-August 2015): * Created: 10 * Resolved: 5 Our main goal is still to grow the community, which is still pretty small. We are doing a large amount of dissemination work in conferences and events to promote SAMOA. We have had contributions from outside the PMC. In particular, we have been collaborating with the Apache Flink community. How has the project developed since the last report? Main developments: * First release 0.3.0 in July. * Cleanup of codebase to ease adoption by new contributors. We should increase the rate of technical contribution to the project and move to less incremental ones. Date of last release: 2015-07-21 When were the last committers or PMC members elected? None Signed-off-by: [X](samoa) Alan Gates [ ](samoa) Ashutosh Chauhan [ ](samoa) Enis Soztutar [ ](samoa) Ted Dunning -------------------- Sentry Sentry is a highly modular system for providing fine grained role based authorization to both data and metadata stored on an Apache Hadoop cluster. Sentry has been incubating since 2013-08-08. Three most important issues to address in the move towards graduation: 1. Encourage more feature and direction discussions on the dev list rather than jira. 2. Continue reporting on time 3. Continue making periodic releases following the Apache guidelines. Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? Community has made been making good progress on all of the above mentioned items. 1. There have been multiple discussions on the dev list about features/ design decisions/ roadmap/ dev practices. 2. Community has been doing timely monthly reporting. 3. Community is actively working on bug fixes and working to release 1.6.0 next month How has the community developed since the last report? We had 160 messages on dev list last month. (Got number from http://markmail.org/list/org.apache.sentry.dev) How has the project developed since the last report? About 42 issues were created and about 42 resolved(Numbers from jira). Date of last release: 2015-07-14 When were the last committers or PMC members elected? Colin Ma, Dapeng Sun, Guoquan Shen and Xiaomeng Huang were added as committers on 12/24/2014. No new PPMC members have been added since the project has entered the incubator. Signed-off-by: [X](sentry) Arvind Prabhakar [ ](sentry) Joe Brockmeier [X](sentry) David Nalley [ ](sentry) Olivier Lamy [X](sentry) Patrick Hunt [ ](sentry) Thomas White
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Roll an Apache release 2. Elect new PMC members 3. Grow the community Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? Two of our current PMC members (Olivier and Matthieu) have new jobs and have declared that they will not be able to actively contribute to the project for the moment being. This leaves 4 active PMC members. How has the community developed since the last report? Mailing list activity (March-April-May 2015): * @dev 375 messages Jira issues backlog (March-April-May 2015): * Created: 14 * Resolved: 19 We have had contributions from outside the PMC. In particular, we have been collaborating with the Apache Flink community. Our main goal is still to grow the community, which is still pretty small. To achieve this goal, we are doing a large amount of dissemination work in conferences and events to promote SAMOA. How has the project developed since the last report? We are fully operational on the new infrastructure: * Completed migration of issues from old GitHub repository. * Completed migration of documentation from old wiki. * Created new Confluence wiki with roadmap and contribution instruction: https://cwiki.apache.org/confluence/display/SAMOA/Samoa+Home Main developments: * New adaptor for Flink integrated. * Simplified the website and solved several bugs. * Preparing for our first release in June. We should increase the rate of technical contribution to the project and move to less incremental ones. Codebase still need cleanup to ease adoption by new contributors. Date of last release: None, plan to have one in June. When were the last committers or PMC members elected? None. Signed-off-by: [ ](samoa) Alan Gates [ ](samoa) Ashutosh Chauhan [ ](samoa) Enis Soztutar [x](samoa) Ted Dunning Shepherd/Mentor notes:
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Start committing patches 2. Discuss roadmap next release 3. Grow the community Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? The community is growing, and as an example, there are new people working on an adapter for Apache Flink-Streaming. Also, we have been giving talks at University of Waikato, and University of Auckland to try to attract more people to the community. Mailing list activity (since February 2015): * @dev 106 messages Jira issues backlog (since February 2015): * Created: 6 * Resolved: 4 How has the project developed since the last report? The project setup is going on nicely. We created bylaws for the project and we migrated documentation from the old website. Date of last release: None When were the last committers or PMC members elected? None Signed-off-by: [X](samoa) Alan Gates [ ](samoa) Ashutosh Chauhan [ ](samoa) Enis Soztutar [X](samoa) Ted Dunning Shepherd/Mentor notes: P. Taylor Goetz (ptgoetz): The SAMOA poddling is still ramping up and there are no apparent issues requiring mentor guidance.
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15. Three most important issues to address in the move towards graduation: 1. Create bylaws for the project 2. Migrate documentation from the old website 3. Start committing patches Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? We have started using the official Apache channels to discuss about SAMOA. A couple of new people have showed up on the mailing lists. We have one new contributor submitting patches. Mailing list activity (since January 2015): * @dev 101 messages Jira issues backlog (since January 2015): * Created: 13 * Resolved: 2 How has the project developed since the last report? The project setup is going on nicely. We got the SGA from Yahoo and migrated the code to the Apache git repository, enabled GitHub integration (https://github.com/apache/incubator-samoa), migrated the website to the Apache infrastructure and added the Incubator branding (http://samoa.incubator.apache.org), enabled testing via Travis CI. Date of last release: None When were the last committers or PMC members elected? None Signed-off-by: [ ](samoa) Alan Gates [ ](samoa) Ashutosh Chauhan [ ](samoa) Enis Soztutar [X](samoa) Ted Dunning Shepherd/Mentor notes: Justin Mclean (jmclean): Just starting, mentors active.
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. SAMOA has been incubating since 2014-12-15 and is not fully functioning as a project yet. Three most important issues to address in the move towards graduation: 1. Get IP clearance (SGA) from Yahoo 2. Move the current code into ASF's git repository 3. Start working as an Apache project Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? None How has the community developed since the last report? This is the first report. How has the project developed since the last report? This is the first report. Date of last release: No incubator release yet. When were the last committers or PMC members elected? We just established the initial PPMC. Signed-off-by: [ ](samoa) Alan Gates [ ](samoa) Ashutosh Chauhan [ ](samoa) Enis Soztutar [X](samoa) Ted Dunning