This was extracted (@ 2025-10-25 13:10) from a list of minutes
which have been approved by the Board.
Please Note
The Board typically approves the minutes of the previous meeting at the
beginning of every Board meeting; therefore, the list below does not
normally contain details from the minutes of the most recent Board meeting.
ASF Members may have access to a
private draft
WARNING: these pages may omit some original contents of the minutes.
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
1. Revive community activity (Discussion in mailing lists, increase
frequency of commits)
2. Create a new roadmap and release plan that will improve adoption
3. Make it easier to contribute to the project (eg, documentation,
framework)
### Are there any issues that the IPMC or ASF Board need to be aware of?
No notification was sent about our upcoming March 2022 report and as result
we missed submitting our report.
### How has the community developed since the last report?
We're continuing to solidify a short term and long term on a roadmap with
the community that will allow more contributors to focus and work on
different levels of the project (i.e. data modeling and schema design,
use-cases, ingestion, ml, ui, dev ops, docs, etc). We're working to
prioritize these discussions, onboarding new committers, and getting
schedules aligned relative to everyone's other workloads.
### How has the project developed since the last report?
The master branch of Apache Spot was largely developed for commercial but
freely available Hadoop software (i.e. Cloudera, Hortonworks) are now in an
end of life (eol) state. In the next several quarters there should be some
priority to shift and add more native implementations on cloud providers
(i.e. AWS, Azure, GCP).
In the short-term it may be feasible to get things working in AWS EMR, and
specifically focus on fixing any issues related to getting the existing
Spot software installed and working. We anticipate lots of compatibility
issues such as working with newer versions of Spark with the existing code
base, as well as updating python code to be compatible with Python 3.9 or
later. Once we get this working we can also release a Cloudformation
template to the public community so that getting Spot setup is much less
labor intesnive than in the past.
In the long term we will want to support several cloud providers and
on-premise options depending on the interest of the community.
Investigate and exploring a replacement for the Spot UI is also still a
high priority. In the short-term, exploring Apache Supersets might be the
best approach. Apache Supersets would give us a basic authentication and
visualization layer into the data landed and processed by Spot.
In the long-term we will need to develop a more comprehensive security and
use-case roadmap for the UI and determine if it makes sense to develop
something from scratch. This will require more involvement and feedback in
the community before we can start designing and planning.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [X] Initial setup
- [X] Working towards first release
- [X] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2017-09-08
### When were the last committers or PPMC members elected?
2019-09-02
### Have your mentors been helpful and responsive?
Our mentor has been helpful. However, the board and our mentor have
suggested we request an additional mentor to help us keep on top of things.
We would like make this a high priority during the next quarter.
### Is the PPMC managing the podling's brand / trademarks?
Yes
### Signed-off-by:
- [ ] (spot) Uma Maheswara Rao G
Comments:
### IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
1. Revive community activity (Discussion in mailing lists, increase
frequency of commits)
2. Create a new roadmap and release plan that will improve adoption
3. Make it easier to contribute to the project (eg, documentation,
framework)
### Are there any issues that the IPMC or ASF Board need to be aware of?
We last reported on June 2021. Our ordinarily scheduled quarterly report
was due in October 2021. The ASF board meeting notes for October 2021
noted that the report reminders did not go out, and so we did not report as
expected. We were expected to report for November 2021. On November 6,
2021, we were informed that our quarterly report was due to be completed on
November 3, 2021. Therefore, this report is being done quickly, to do as
much as possible to comply with ASF expectations. Regrettably this means
this is being done with minimum community discussion.
### How has the community developed since the last report?
We're continuing to solidify a short term and long term on a roadmap with
the community that will allow more contributors to focus and work on
different levels of the project (i.e. data modeling and schema design,
use-cases, ingestion, ml, ui, dev ops, docs, etc). These discussions have
been delayed this quarter due to COVID and getting schedules aligned
relative to other workloads.
### How has the project developed since the last report?
The master branch of Apache Spot was largely developed for commercial but
freely available Hadoop software (i.e. Cloudera, Hortonworks) that now have
been either End of Life’d (or will be in 2022) in favor of the newer
Cloudera Data Platform (CDP), or have been put behind a paywall to get
updates. The cloud provider landscape has also changed drastically in the
last few years and it's time that we start looking at better support for
Quickstart options in each major cloud providers so we can further develop
the project for the community with wider platform support, and give
committers better options for creating development environments to more
efficiently contribute to the project.
The following topics are planned for the next set of discussions with the
community to determine what we focus on for the next several quarters:
Add more native support on cloud infrastructure and software (i.e. AWS,
Azure, GCP)
In the short-term we can focus on AWS EMR on standing up Spot and fixing
any issues related to getting the software installed and working.
In the long term we will want to support several cloud providers and
on-premise options depending on the interest of the community.
Add an infrastructure module to Spot to automate creating clusters in
each providers we plan to support.
Initially we can focus on Cloudformation support with AWS EMR.
Our goal will be to generalize further in the long term with Terraform
across each provider.
Investigate and explore alternative replacement for Spot UI (i.e. Apache
Supersets)
In the short-term Apache Supersets may give us a basic authentication and
visualization layer into the data landed and processed by Spot.
In the long-term we will need to develop a more comprehensive security
and use-case roadmap for the UI and determine if it makes sense to develop
something from scratch.
The next step will be to create Jira Epic’s and tasks for these items so
we can start to distribute the work across the community.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [ ] Initial setup
- [ ] Working towards first release
- [X] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2017-09-08
### When were the last committers or PPMC members elected?
2019-09-02
### Have your mentors been helpful and responsive?
Our mentor has been helpful. However, we notice the board has suggested
we request an additional mentor to help us keep on top of things. We will
undertake this for the next quarter.
### Is the PPMC managing the podling's brand / trademarks?
Yes
### Signed-off-by:
- [X] (spot) Uma Maheswara Rao G
Comments: I don't think this project is anywhere near to graduation.
And I don't see any noticeable activities in mailing lists. This project
needs additional mentors.
### IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
1. Revive community activity (Discussion in mailing lists, increase
frequency of commits)
2. Create a new roadmap and release plan that will improve adoption
3. Make it easier to contribute to the project (e.g. documentation,
framework).
### Are there any issues that the IPMC or ASF Board need to be aware of?
There is confusing communication with the organization. We submitted our
quarterly
report last month. The final report from last month noted that our
report was not
accepted because it was not signed off. However, we read the minutes
from the Board
that said all submitted reports were accepted, so we assumed we didn't
need to worry
about a report for another quarter. Last night at midnight we were
notified for the
first time that our report was due the same day. With less than 24 hours
notice, it
seems unlikely we'll be able to get sign-off from our mentor.
### How has the community developed since the last report?
We're continuing to solidify a short term and long term on a roadmap with
the community
that will allow more contributors to focus and work on different levels
of the project
(i.e. data modeling and schema design, use-cases, ingestion, ml, ui, dev
ops, docs, etc).
These discussions have been delayed this quarter due to COVID and getting
schedules aligned
relative to other workloads.
### How has the project developed since the last report?
The master branch of Apache Spot was largely developed for commercial but
freely
available Hadoop software (i.e. Cloudera, Hortonworks) that now have been
either
End of Life’d (or will be in 2022) in favor of the newer Cloudera Data
Platform
(CDP), or have been put behind a paywall to get updates. The cloud
provider
landscape has also changed drastically in the last few years and it's
time that
we start looking at better support for Quickstart options in each major
cloud
providers so we can further develop the project for the community with
wider
platform support, and give committers better options for creating
development
environments to more efficiently contribute to the project.
The following topics are planned for the next set of discussions with the
community
to determine what we focus on for the next several quarters:
* Add more native support on cloud infrastructure and software (i.e. AWS,
Azure, GCP)
* In the short-term we can focus on AWS EMR on standing up Spot and
fixing any issues
related to getting the software installed and working.
* In the long term we will want to support several cloud providers and
on-premise
options depending on the interest of the community.
* Add an infrastructure module to Spot to automate creating clusters in
each providers
we plan to support.
* Initially we can focus on Cloudformation support with AWS EMR.
* Our goal will be to generalize further in the long term with
Terraform across each provider.
* Investigate and explore alternative replacement for Spot UI (i.e.
Apache Supersets)
* In the short-term Apache Supersets may give us a basic authentication
and visualization
layer into the data landed and processed by Spot.
* In the long-term we will need to develop a more comprehensive
security and use-case
roadmap for the UI and determine if it makes sense to develop
something from scratch.
The next step will be to create Jira Epic’s and tasks for these items so
we can start to distribute the work across the community.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [ ] Initial setup
- [ ] Working towards first release
- [X] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2017-09-08
### When were the last committers or PPMC members elected?
2019-09-02
### Have your mentors been helpful and responsive?
Are things falling through the cracks? If so, please list any
open issues that need to be addressed.
Yes our mentors have been helpful. We will be engaging them more soon for
guidance
on on-boarding new committers, as well as a few other items related to
the Whimsy
and Clutch reports and also updating the Apache Spot Podling website as
this is missing a number of updates.
### Is the PPMC managing the podling's brand / trademarks?
Are 3rd parties respecting and correctly using the podlings
name and brand? If not what actions has the PPMC taken to
correct this? Has the VP, Brand approved the project name?
Yes.
### Signed-off-by:
- [X] (spot) Uma Maheswara Rao G
Comments:
### IPMC/Shepherd notes:
Justin Mclean: The board signed off the incubator report but it can still
have reportsnot signed off by mentor. In your case it looks liked the
sign off come in late after the Incubator report was submitted, so sorry
for asking you to report again. Best to try and submit the report and have
it signed off by the due date if possible. Reminders are sent
to the incubator general list and you mentor(s) should be watching for
those. I would suggest you ask for another mentor so that you are not
dependant on just one person.
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
1. Revive community activity (Discussion in mailing lists, increase
frequency of commits)
2. Create a new roadmap and release plan that will improve adoption
3. Make it easier to contribute to the project (e.g. documentation,
framework).
### Are there any issues that the IPMC or ASF Board need to be aware of?
Not at this time
### How has the community developed since the last report?
We're continuing to solidify a short term and long term on a roadmap with
the community that will allow more contributors to focus and work on
different levels of the project (i.e. data modeling and schema design,
use-cases, ingestion, ml, ui, dev ops, docs, etc). These discussions have
been delayed this quarter due to COVID and getting schedules aligned
relative to other workloads.
### How has the project developed since the last report?
The master branch of Apache Spot was largely developed on commercial but
freely available Hadoop software (i.e. Cloudera, Hortonworks) that now have
been either End of Life’d (or will be in 2022) in favor of the newer
Cloudera Data Platform (CDP), or have been put behind a paywall to get
updates. The cloud provider landscape has also changed drastically in the
last few years and it's time that we start looking at better support for
Quickstart options in each major cloud providers so we can further develop
the project for the community with wider platform support, and give
committers better options for creating development environments to more
efficiently contribute to the project.
The following topics are planned for the next set of discussions with the
community to determine what we focus on for the next several quarters:
- Add more native support on cloud infrastructure and software (i.e. AWS,
Azure, GCP)
- In the short-term we can focus on AWS EMR on standing up Spot and
fixing any issues related to getting the software installed and
working.
- In the long term we will want to support several cloud providers and
on-premise options depending on the interest of the community.
- Add an infrastructure module to Spot to automate creating clusters in each
providers we plan to support.
- Initially we can focus on Cloudformation support with AWS EMR.
- Our goal will be to generalize further in the long term with Terraform
across each provider.
- Investigate and explore alternative replacement for Spot UI
(i.e. Apache Supersets)
- In the short-term Apache Supersets may give us a basic authentication
and visualization layer into the data landed and processed by Spot.
- In the long-term we will need to develop a more comprehensive security
and use-case roadmap for the UI and determine if it makes sense to
develop something from scratch.
The next step will be to create Jira Epic’s and tasks for these items so we
can start to distribute the work across the community.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [ ] Initial setup
- [ ] Working towards first release
- [X] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2017-09-08
### When were the last committers or PPMC members elected?
2019-09-02
### Have your mentors been helpful and responsive?
Are things falling through the cracks? If so, please list any open issues
that need to be addressed.
Yes our mentors have been helpful. We will be engaging them more soon for
guidance on on-boarding new committers, as well as a few other items related
to the Whimsy and Clutch reports and also updating the Apache Spot Podling
website as this is missing a number of updates.
### Is the PPMC managing the podling's brand / trademarks?
Are 3rd parties respecting and correctly using the podlings name and brand?
If not what actions has the PPMC taken to correct this? Has the VP, Brand
approved the project name?
Yes.
### Signed-off-by:
- [X] (spot) Uma Maheswara Rao G Comments: Need few more mentors. Very less
IPMC involvement currently. Need more efforts from IPMC in building
community. IPMC roll call was going on.
### IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
1. Increase community activity in mailing list and commits
2. Make the product more approachable for new contributors and users
3. Develop a roadmap that focuses on delivering one use case effectively
### Are there any issues that the IPMC or ASF Board need to be aware of?
Not at this time.
### How has the community developed since the last report?
We're continuing to work on a roadmap with the community that will allow
more contributors to focus and work on different levels of the project
(i.e. data modeling and schema design, use-cases, ingestion, ml, ui, dev
ops, docs, etc).
### How has the project developed since the last report?
We've started regular weekly discussions around several topics, including
configuration management, authentication, and authorization models within
Spot so that we have a better backbone for adding more features and can
also backport existing functionality to a stronger foundation. These
discussions were delayed this quarter due to COVID and getting schedules
aligned relative to other workloads so we don't have any proposals
documented yet, but that is our next step.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [X] Initial setup
- [X] Working towards first release
- [X] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2017-09-08
### When were the last committers or PPMC members elected?
2020-08-24 - We removed PPMC members who were not subscribed to the
private mailing list and who did not respond to an email asking them to
subscribe
### Have your mentors been helpful and responsive?
Are things falling through the cracks? If so, please list any
open issues that need to be addressed.
Our mentors have been helpful in onboarding new contributors, and in
getting us the Whimsy and Clutch reports and updating the website so we
more completely conform to ASF policy requirements for podlings.
### Is the PPMC managing the podling's brand / trademarks?
Are 3rd parties respecting and correctly using the podlings
name and brand? If not what actions has the PPMC taken to
correct this? Has the VP, Brand approved the project name?
We have no knowledge of any issues with the brand or trademark.
### Signed-off-by:
- [X] (spot) Uma Maheswara Rao G
Comments: This community needs few more mentors.
### IPMC/Shepherd notes:
Justin Mclean: Where are these weekly discussion taking place? Is what is
discussed in them brought back to the mailing list?
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
1. Increase community activity in mailing list and commits
2. Make the product more approachable for new contributors and users
3. Develop a roadmap that focuses on delivering one use case effectively
### Are there any issues that the IPMC or ASF Board need to be aware of?
Not at this time.
### How has the community developed since the last report?
Our new contributors have helped us increase our capacity to fix and
resolve issues. Our goal in the near future is to work with the
community on a roadmap that will allow more contributors to focus
and work on different levels of the project (i.e. data modeling and
schema design, use-cases, ingestion, ml, ui, dev ops, docs, etc).
### How has the project developed since the last report?
We have increased our conformance with ASF podling policies by removing
from the PPMC all who are not on the private mailing list. This has
been the longest-standing caution against us.
We have significantly revamped the project website with the object of
making it more approachable for new contributors. We have used whimsy
to ensure that all ASF requirements are complied with.
We have captured all of the sample data used by the project and are now
hosting them using ASF resources, instead of a third party AWS S3 link,
which was hosted by unknown persons.
We have cleared away a large development effort that was initiated by
a large company but not completed, and no contributors appear to be
ready to resume work on it. Discussions have been started to take
this feature in a direction fulfilling different use cases.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [ ] Initial setup
- [ ] Working towards first release
- [X] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2017-09-08
### When were the last committers or PPMC members elected?
2020-08-24 - We removed PPMC members who were not subscribed
to the private mailing list and who did not respond to
an email asking them to subscribe
### Have your mentors been helpful and responsive?
Are things falling through the cracks? If so, please list any
open issues that need to be addressed.
Our mentors have been helpful in onboarding new contributors,
and in getting us the Whimsy and Clutch reports and updating
the website so we more completely conform to ASF policy
requirements for podlings.
### Is the PPMC managing the podling's brand / trademarks?
Are 3rd parties respecting and correctly using the podlings
name and brand? If not what actions has the PPMC taken to
correct this? Has the VP, Brand approved the project name?
We have no knowledge of any issues with the brand or trademark.
### Signed-off-by:
- [ ] (spot) Uma Maheswara Rao G
Comments:
### IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
1. Increase community activity in mailing list and commits
2. Make the product more approachable for new contributors and users
3. Develop a roadmap that focuses on delivering one use case effectively
### Are there any issues that the IPMC or ASF Board need to be aware of?
Not at this time.
### How has the community developed since the last report?
We have on-boarded new contributors in the past quarter, who are very
excited to help us increase our capacity to address issues and move
us to a new level of maturity. One of the new contributors has adopted
the responsibility of secretary and ensuring process compliance and
improvement for the project facing Apache leadership.
We have continued to make further strides towards compliance with all
Apache Software Foundation requirements for incubating projects,
particularly ensuring that all contributors are on the private list.
### How has the project developed since the last report?
As the project has a large backlog of issues inherited from before its
contribution, we continue to emphasize finding issues that are still
relevant and blocking progress, and addressing them. We have engaged
with a dozen issues and resolved quite a few of them. There is work
that is ongoing to remove a third party dependency which is a major
impediment to installing the product.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [X] Initial setup
- [X] Working towards first release
- [X] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2017-09-08
### When were the last committers or PPMC members elected?
2020-05-14 - Jeremy Nelson
### Have your mentors been helpful and responsive?
Our mentors have been helpful in on-boarding new contributors, and
in getting us the Whimsy and Clutch reports and updating the website
so we more completely conform to ASF policy requirements for podlings.
### Is the PPMC managing the podling's brand / trademarks?
Yes
### Signed-off-by:
- [X] (spot) Uma Maheswara Rao G
Comments: Recently project improved on mailing list activities and
project needs few more mentors.
### IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
- Revive community activity (Discussion in mailing lists, increase
frequency of commits)
- Create a new roadmap and release plan that will improve adoption.
- Make it easier to contribute to the project (e.g. documentation,
framework).
### Are there any issues that the IPMC or ASF Board need to be aware of?
No
### How has the community developed since the last report?
The number of active committers has increased over the past quarter, and
we also have several new potential community members looking to become
committers. We will work with our mentors to go through the process to
onboard them.
Much of the activity from the community has been focused on stabilizing
the codebase and fixing known issues in the documentation and Spot website.
This will greatly benefit new users and reduce confusion when standing up
the Spot different modules.
### How has the project developed since the last report?
Several projects have been proposed, discussed, and completed during the
past quarter. We're also working to close the remaining open PRs as soon
once the community finishes its review. Additionally there were some
Podling Website check failures in the Whimsy report that we began to look
into and resolve. These should be mostly resolved by the next Podling
report update.
We've also begun discussions on the second version of the Open Data Model
(ODM v2). There were several problems seen in the implementation with ODM
v1 that need to be resolved before substantial adoption of Apache Spot
can take place. Its our goal to come to a decision on the design of ODM
v2 this next quarter. Once a data model redesign is agreed upon this will
serve as a catalyst to modernizing other Spot modules so everything is
properly integrated.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [X] Initial setup
- [X] Working towards first release
- [X] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
- 2017-09-08
### When were the last committers or PPMC members elected?
- 2019-09-02 - Mark Schoeni
### Have your mentors been helpful and responsive?
Yes our mentors have been helpful. We will be engaging them more soon for
guidance on on-boarding new committers, as well as a few other items
related to the Whimsy and Clutch reports and also updating the Apache
Spot Podling website as this is missing a number of updates.
### Is the PPMC managing the podling's brand / trademarks?
Yes
### Signed-off-by:
- [X] (spot) Uma Maheswara Rao G
Comments: There is some improvement on mailing lists recently.
IMO, Still needs improvement in community development.
### IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
1. Revive community activity (Discussion in mailing lists, increase
frequency of commits)
2. Create a new roadmap and release plan that will improve adoption.
3. Make it easier to contribute to the project (e.g. documentation,
framework).
### Are there any issues that the IPMC or ASF Board need to be aware of?
No
### How has the community developed since the last report?
There are community members that have reached out and are interested in
contributing, but there are only a couple of active committers at this
time. Currently the main hurdle for new contributors is the amount of
effort required today to deploy Spot and build development environments.
### How has the project developed since the last report?
Open PRs with sufficient votes against the master branch and SPOT-181
epic have been merged. There's also been a substantial amount of work
to categorize all of the open JIRAs into a Project board on Github to
help the community more easily triage high priority issues, organize
feature enhancements into future epics, clean-up closed tickets, and
also group issues that require more information from the community
before taking action. Additionally there's been some work to clean-up
the asf-site branch, and minor updates (PRs are being pushed soon).
There's a larger project to re-organize the asf-site to make it easier
for the community to update and add new pages, but that hasn't started
yet.
The project with the most priority going forward is to make Spot easier
to deploy. There's a large learning curve for deploying Spot today, which
makes it difficult for both users and committers to be successful when
standing up Spot, using certain features, testing, and making
contributions back into the project. Using Cloudbreak to stand-up an
environment and deploying Spot as an Ambari package will help lower the
barriers to entry for new and existing users and contributors alike.
We expect during this project that we'll also uncover a number of deployment
issues and incompatibilities along the way and by resolving these will also
help bring confidence to users that this new deployment pattern will be
better tested and reliable.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [x] Initial setup
- [x] Working towards first release
- [x] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2017-09-08
### When were the last committers or PPMC members elected?
2018-01-18
### Have your mentors been helpful and responsive?
Yes
### Is the PPMC managing the podling's brand / trademarks?
Yes
### Signed-off-by:
- [X] (spot) Uma Maheswara Rao G
Comments: Community is not active in mailing lists. I have not seen
any discussions on release plans etc. I don't see committers/PPMC
responding to queries on dev list. I am concerned on community
building as well. With this pace, I don't think project can apply
for graduation in near future time.
### IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
1. Revive community activity (Discussion in mailing lists, increase
frequency of commits)
2. Create a new roadmap and release plan that will improve adoption.
3. Make it easier to contribute to the project (e.g. documentation,
framework).
### Are there any issues that the IPMC or ASF Board need to be aware of?
No
### How has the community developed since the last report?
Unfortunately there has not been much more community activity. However,
we are need of more committers as there are only a couple of active
committers at this time. This creates a bottleneck which makes it hard
for the community to grow in interest.
### How has the project developed since the last report?
There are a series of PRs that have been evaluated for the SPOT-181 epic,
and Spot website branch (asf-site) that are in the process of being
merged. We have enough votes to merge, and have put out a request to
committers to review so they can be auto-merged if possible, otherwise we
will just manually merge them via cli.
Once those are merged, we will work on the plan to merge SPOT-181 into
master so that the Open Data Model is consolidated into the main branch
of our code.
We're also working on re-focusing the project release roadmap. Our first
step will be to re-engage with the community to better understand
how/where Spot is being used today, what features/enhancements would benefit
the community best, and which committers (existing and potential) are
interested in contributing to the roadmap and releases moving forward.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [x] Initial setup
- [x] Working towards first release
- [x] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2017-09-08
### When were the last committers or PPMC members elected?
2019-09-05
### Have your mentors been helpful and responsive?
Yes, but we would welcome more mentors and further guidance on
positioning the project for future graduation.
### Signed-off-by:
- [X] (spot) Uma Maheswara Rao G
Comments: Low/no activities on the lists. Raised the same concern on spot dev lists. PPMC is planning to address this concern.
### IPMC/Shepherd notes:
Justin Mclean: Please ask on the incubator general@ list for more mentors.
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
### Three most important unfinished issues to address before graduating:
1. Revive community activity (Discussion in mailing lists, increase
frequency of commits)
2. Make it easier for Devs to contribute to the project (e.g.
documentation, framework).
3. Developing a workflow in Spot that allows for intuitive analytics.
### Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?
No
### How has the community developed since the last report?
Unfortunately there has not been much more community activity. However,
we are need of more committers as there are only a couple of active
committers at this time. This creates a bottleneck which makes it hard
for the community to grow in interest.
### How has the project developed since the last report?
There's been discussions on the dev list to merge the existing SPOT-181
ODM branch into the master branch so that its easier to use the Open Data
Model within the existing architecture. This should avoid a lot of
confusion among new community members in the future as most of the recent
work as been done in the SPOT-181 branch. A series of PRs have also been
evaluated and we're waiting for more votes before starting the merge
process on those PR, and also the PR for moving SPOT-181 into master.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [x] Initial setup
- [x] Working towards first release
- [x] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2017-09-08
### When were the last committers or PPMC members elected?
2018-01-18
### Have your mentors been helpful?
Yes our single mentor is responsive, however we only have one and may
need 1 more to help.
### Signed-off-by:
- [X] (spot) Uma Maheswara Rao G
Comments: I don't see much progress in community building. Very low
activities on mailing lists. Also this project needs additional mentors.
### IPMC/Shepherd notes:
Justin Mclean: Please ask for more mentors the incubators general@ list.
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
Three most important unfinished issues to address before graduating:
1. Revive community activity (Discussion in mailing lists, increase
frequency of commits)
2. Make it easier for Devs to contribute to the project (e.g.
documentation, framework).
3. Developing a workflow in Spot that allows for intuitive analytics.
Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?
No
How has the community developed since the last report?
Unfortunately there has not been much more community activity. However, we
are need of more committers as there are only a couple of active committers
at this time. This creates a bottleneck which makes it hard for the
community to grow in interest.
How has the project developed since the last report?
A few members have been looking into merging one of the major branches that
the community has been waiting on into master. This should avoid a lot of
confusion among new community members in the future.
How would you assess the podling's maturity?
Please feel free to add your own commentary.
[x] Initial setup
[x] Working towards first release
[x] Community building
[ ] Nearing graduation
[ ] Other:
Date of last release:
2017-09-08
When were the last committers or PPMC members elected?
2018-01-18
Have your mentors been helpful and responsive or are things falling
through the cracks? In the latter case, please list any open issues
that need to be addressed.
Yes our single mentor is responsive, however we only have one and may need
1 more to help.
Signed-off-by:
[X](spot) Uma Maheswara Rao G
Comments:
Very low activity on mailing lists. Some times we don't even get replies
for questions in private lists ( May be all PPMC busy with other activities).
I don't see any steps or thoughts being discussed on Graduation.
Considering the community activity, I don't think its ready yet though.
Hope to see better responses in mailing lists and community growth.
IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
Three most important issues to address in the move towards graduation:
1. Revive community activity (Discussion in mailing lists, increase
frequency of commits)
2. Make it easier for Devs to contribute to the project (e.g.
documentation, framework).
3. Developing a workflow in Spot that allows for intuitive analytics.
Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?
We are currently down to 1 mentor. Although we put a request for an
additional mentor, we are not going to push the request until we wrap up
what we need to do for the next release. Once we have finished up work
and are ready to set up a vote we will push harder to invite another
mentor to help us with the process. We anticipate having a lot of
questions around the release process.
How has the community developed since the last report?
Since having pushed through some stagnant pull requests we have noticed a
little more activity. We are trying out different ways to make it easier
for developers to contribute. We are talking to users to understand what
pieces of Spot are important to them.
How has the project developed since the last report?
We have identified issues we think would block our release; mainly around
the ease of installation and use. We are doing some documentation
write-ups and code updates to make the Install process easier. We’re also
testing out setup on a VM with the aim to make it easier for developers to
contribute.
How would you assess the podling's maturity?
Please feel free to add your own commentary.
[x] Initial setup
[x] Working towards first release
[x] Community building
[ ] Nearing graduation
[ ] Other:
Date of last release:
2017-09-08
When were the last committers or PPMC members elected?
2018-01-18
Have your mentors been helpful and responsive or are things falling
through the cracks? In the latter case, please list any open issues
that need to be addressed.
Holding off on pushing for more mentors until we are ready to start the
official release process.
Signed-off-by:
[X](spot) Uma Maheswara Rao G
Comments: Project is seeking for additional mentors.
IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
Three most important issues to address in the move towards graduation:
1. Revive community activity (Discussion in mailing lists, increase
frequency of commits).
2. Make it easier for Devs to create plug-ins for ingest of data.
3. Developing a workflow in Spot that allows intuitive analytics without
the need of licensed software.
Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware
of?
N/A
Have your mentors been helpful and responsive or are things falling through
the cracks?
Due to the community’s irregular activity level over the past months we
have had trouble aligning with our mentors.
The project is in the process of establishing a cadence to our activities
and we welcome help and advice from mentors.
How has the community developed since the last report?
Project wide meetings are continuing every 2 weeks. We are starting to see
more Pull Requests come in. Hoping that the end of summer (end of
vacations) will bring further participation from the community. We are
working on updating the Spot website and confluence pages to reflect
changes in the project.
How has the project developed since the last report?
A roadmap draft has been constructed. Our next focus is creating a new
release. Therefore, work is underway on closing some long running Pull
Requests and important bug fixes.
How would you assess the podling's maturity?
Please feel free to add your own commentary.
[ ] Initial setup
[ ] Working towards first release
[X] Community building
[ ] Nearing graduation
[ ] Other:
Date of last release:
2017-09-08
When were the last committers or PPMC members elected?
2018-01-18
Signed-off-by:
[ ](spot) Jarek Jarcec Cecho
Comments:
[ ](spot) Brock Noland
Comments:
[ ](spot) Andrei Savu
Comments:
[X](spot) Uma Maheswara Rao G
Comments:
IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
Three most important issues to address in the move towards graduation:
1. More activity in the User and Dev mail lists
2. Revive Community activity (Discussion, commits)
3. Develop a better release process
Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?
Lack of community discussion and activity is being addressed.
How has the community developed since the last report?
Started Bi-weekly (2 weeks) project wide community live meetings to try to
revive development on the project.
How has the project developed since the last report?
Community revival is underway. Bi-weekly meetings are taking place in order
to achieve momentum in the project. Goals are to prioritize Pull Requests
and promote the creation of new features in the community.
How would you assess the podling's maturity?
Please feel free to add your own commentary.
[ ] Initial setup
[ ] Working towards first release
[X] Community building
[ ] Nearing graduation
[ ] Other:
Date of last release:
2017-09-08
When were the last committers or PPMC members elected?
2018-01-18
Signed-off-by:
[ ](spot) Jarek Jarcec Cecho
Comments:
[ ](spot) Brock Noland
Comments:
[ ](spot) Andrei Savu
Comments:
[X](spot) Uma Maheswara Rao G
Comments: Recently there was some improvement in discussions and
conducting community meetings etc.
IPMC/Shepherd notes:
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
Three most important issues to address in the move towards graduation:
1. More activity in the User and Dev mail lists
2. Integrate Open Data Model on top of Spot modules
3. Develop a better release process
Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?
How has the community developed since the last report?
A. more interaction from partners
B. Seeing more interest and usage of Spot leading to more contributions in on the way
How has the project developed since the last report?
A. More development around Open Data Model
B. New data sources being integrated with ODM is maturing the structure
C. Adding Kerberos support to the project
D. Ingest Redesign
How would you assess the podling's maturity?
Please feel free to add your own commentary.
[ ] Initial setup
[ ] Working towards first release
[X] Community building
[ ] Nearing graduation
[ ] Other:
Date of last release:
2017-09-08
When were the last committers or PPMC members elected?
2018-01-18
Signed-off-by:
[ ](spot) Jarek Jarcec Cecho
Comments:
[ ](spot) Brock Noland
Comments:
[ ](spot) Andrei Savu
Comments:
[X](spot) Uma Maheswara Rao G
Comments: Dev list discussions are low and community should give more attention to this point.
Apache Spot is a platform for network telemetry built on an open data model
and Apache Hadoop.
Spot has been incubating since 2016-09-23.
Three most important issues to address in the move towards graduation:
1. Develop a better release process
2. Handle additional data types for ingestion and enrichment into ODM shema
3. Fostering more activity in the user, dev and private mail lists
Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?
a. Issues providing podling reports during December, January.
This is being addressed moving forward.
How has the community developed since the last report?
a. Seeing more pull requests from new contributors
How has the project developed since the last report?
a. Development on the ODM branch has been moving forward and will continue to push towards adoption into the master branch.
b. Ingest redesign underway
How would you assess the podling's maturity?
Please feel free to add your own commentary.
[ ] Initial setup
[ ] Working towards first release
[x] Community building
[ ] Nearing graduation
[ ] Other:
Date of last release:
2017-09-08
When were the last committers or PPMC members elected?
2018-01-18
Signed-off-by:
[ ](spot) Jarek Jarcec Cecho
Comments:
[X](spot) Brock Noland
Comments:
[ ](spot) Andrei Savu
Comments:
[ ](spot) Uma Maheswara Rao G
Comments:
IPMC/Shepherd notes:
Dave Fisher: It's not clear to me where this podling has development discussions.
It would be good for the mentors to guide Spot to at least reflect discussions
on the dev@ list from Slack, Github or wherever these are happening.
Apache Spot is a solution stack that provides the capability to ingest IT related telemetry (network flows, domain name service DNS, and proxy server logs) and provide unsupervised machine learning capabilities to identify suspicious activity. The information is organized and presented using operational analytics so that a security analyst can investigate the most suspicious connections. Apache Spot is built on and using Apache Spark and Apache Hadoop.
Spot has been incubating since 2016-09-23.
Three most important issues to address in the move towards graduation:
1. Build diverse community
2. Continue building a strong and diverse body of PPMC members for long term project continuity
3. Demonstrate ability to create releases
Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?
* Recently Facebook decided to re-license React under MIT license, to enable compatibility with Apache licensed projects and many others. If there are not other indications from Incubator board we’ll call out a vote to keep it for now: https://code.facebook.com/posts/300798627056246/relicensing-react-jest-flow-and-immutable-js/
How has the community developed since the last report?
* Committers that are specialized on Ingest functions are merging branches for Open Data Model (ODM) development.
* Same case for views in order to simplify the process of normalization through the UI.
* Datasets with both benign and malicious traffic have been crafted to allow community to test/evaluate ML executions, and facilitate visualization exercises when doing Threat Hunting .
* Number of issues and discussion have increased with both feedback/RFE and bug fixes, which probes adoption, of course Community growth also consists on continue expanding the Committer baseline.
How has the project developed since the last report?
* The Spot project completed the first official Apache release for Spot v1.0, approved and voted by Incubator PMC.
* Open Data Model, development efforts continue over its branch in both directions specification and code pushes.
How would you assess the podling's maturity?
Please feel free to add your own commentary.
[ ] Initial setup
[ ] Working towards first release
[X] Community building
[ ] Nearing graduation
[ ] Other:
Comments:
* Completion of the first Apache release was a significant milestone. The focus now needs to be converting the large number of project observers into project committers to foster a more diverse community, creating project awareness through technology blog posts, and Meetups is a significant effort that needs to happen for project evangelization
Date of last release:
2017-09-08
When were the last committers or PPMC members elected?
2017-06-02
Signed-off-by:
[X](spot) Jarek Jarcec Cecho
Comments:
[ ](spot) Brock Noland
Comments:
[ ](spot) Andrei Savu
Comments:
[X](spot) Uma Maheswara Rao G
Comments:
IPMC/Shepherd notes:
Project is active and moving forward well. They do use ReactJS and will need to upgrade and release as soon as the relicensed version is available.
Dave Fisher
IPMC/Shepherd notes: - Drew Farris (shepherd): No mentor activity apparent from the mailing lists, ongoing project activity observed. - John D. Ament: Podling needed some reminding about the board report, I suspect next report will be in and of high quality.
Apache Spot is a solution stack that provides the capability to ingest IT
related telemetry (network flows, domain name service DNS, and proxy server
logs) and provide unsupervised machine learning capabilities to identify
suspicious activity. The information is organized and presented using
operational analytics so that a security analyst can investigate the most
suspicious connections. Apache Spot is built on an open data model using
Apache Spark and Apache Hadoop.
Spot has been incubating since 2016-09-23.
Three most important issues to address in the move towards graduation:
1. Move infrastructure and development to ASF (code, issues, mailing list,
...)
2. Build diverse community
3. Demonstrate ability to create releases
Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware
of? Two out of the four mentors are the ones showing the most active
participation and involvement on the project so we need to discuss with IPMC
our options to get the other two replaced.
How has the community developed since the last report? By remaking the
Community section on the Project webpage and driving users on Slack to limit
the usage of the same tool just for quick interactions, the traffic over Mail
Lists has increased, we still need to continue driving usage questions to USER
list. A group of Developers from Brazil is coming with a proposal for UX and
UI, based on the merits of the proposal, and the potential that the same has
we have invited them to become Committers on the projects.
The initial purpose of the Project on Network & Security needs to expand the
scope to IT Analytics to cover other uses cases and/or enrich content with
User and Endpoint data. While a normalization framework has been drafted to
what we call Open Data Model we still need more work and contributions on
technical delivery to make it a reality. Contributions to improve the concept
of Ingestion Pipeline & Data Lake are delivered with more cadence, there is a
recognition from Community to normalize data to open the space for other
sources of Ingestion
How has the project developed since the last report? The team has created the
epic on ASF JIRA towards the first Apache release, we’re working now including
the user stories that will be part of release, and doing checks to what can be
delivered before we do the the formal announcement on DEV for the code freeze.
Integrations for context enrichment are critical, however also the need of
bringing a plugin manager where does can be handled, the Spec was drafted, and
the team now is finalizing the last Pull requests to begin merging
functionality onto project code. Documentation is an area of focus so from May
to July committers we’ll continue improving & detailing more, architecture
definitions, deployment options, and UI usage sections. As the whole delivery
is concluded we’ll vote for a FAQ section which can be a point of reference
for common issues/situations. Being part of ASF is important Community
requests that project enables integration/consumption of other Apache
projects, one of our DEVOPS committers, for that reason having an API
framework now will open the door for more smooth integrations.
Date of last release: N/A
When were the last committers or PMC members elected? Last committer was
elected on 06/02
How does the podling rate their own maturity? By know we’re almost ready to do
our first Apache Release which is good, but we still need to mature Community,
the ramping of other Contributors has been slow, we have a strong base of
Consumer positive aspect as it probed consumption, however more Commits from
other interested parties will accelerate functionality delivery.
Signed-off-by:
[X](spot) Jarek Jarcec Cecho
Comments:
[ ](spot) Brock Noland
Comments:
[ ](spot) Andrei Savu
Comments:
[X](spot) Uma Maheswara Rao G
Comments:
Apache Spot is a solution stack that provides the capability to ingest IT
related telemetry (network flows, domain name service information and proxy
server logs) and provide unsupervised machine learning capabilities to
identify suspicious activity. The information is organized and presented
using operational analytics so that a security analyst can investigate the
most suspicious connections. Apache Spot is built on an open data model using
Apache Spark and Apache Hadoop.
Spot has been incubating since 2016-09-23.
Three most important issues to address in the move towards graduation:
1. Move infrastructure and development to ASF (code, issues, mailing
list, ...)
2. Build diverse community
3. Demonstrate ability to create releases
Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of?
None
How has the community developed since the last report?
Based on initial feedback, process for issues was updated on the project
website, with primary objective to facilitate the workflow and interaction
with Community, and having diverse ownership of them until resolution.
Slack channel has become very popular, however in order to adhere to ASF
official communications channels, those conversations that involve quick
question will be kept there, any extended discussion on code, pipeline, and
architecture, are being ported to Dev mail list.
There has been a concern to get continuous improvement of the UI on Apache
Spot, committers have been working on process to get UX interviews that
help to understand and address community and compliance needs.
How has the project developed since the last report?
The project got a major architecture change in order to remove CSV’s,
offload storage from the OS file system and instead leverage HDFS (for
distributed architecture).
Per community request through JIRA issue, and in order to facilitate the
installation/adoption of the project for scale deployments, an installation
script consolidating the ones that were built per pipeline component has
been pulled to repository, it will reduce the complexity of porting files
and configuration variables, by having a central point of deployment.
Open Data Models, which serves as a framework for data sources
normalization, initially for ingestion, was ported through its
Specification to the project repository, this will bring more contributions
to facilitate the design of the NoSQL table, and then determine technology
selection through discussion and voting through mailing list.
February Metrics:
+---------------------------------------------+
| Metric | counts |
+---------------------------------------------+
| Active Contributors (incl mentors)| 14 |
| Jira New Issues | 6 |
| Resolved Issues | 1 |
| Pull Requests merged | 3 |
| Pull Requests proposed | 3 |
+---------------------------------------------+
Date of last release:
N/A
When were the last committers or PMC members elected?
Last committer was elected on 2/24
How does the podling rate their own maturity?
There was a strong focus on code delivery over the last quarter, towards
Apache release, community adoption keeps growing, now by adding a layer of
process for opening issues and commits, plus interaction over channels
we're bringing contributions from others, towards Apache release.
Signed-off-by:
[x](spot) Jarek Jarcec Cecho
Comments:
[ ](spot) Brock Noland
Comments:
[ ](spot) Andrei Savu
Comments:
[x](spot) Uma Maheswara Rao G
Comments:
Apache Spot is a solution stack that provides the capability to ingest IT
related telemetry (network flows, domain name service information and proxy
server logs) and provide unsupervised machine learning capabilities to
identify suspicious activity. The information is organized and presented
using operational analytics so that a security analyst can investigate the
most suspicious connections. Apache Spot is built on an open data model
using Apache Spark and Apache Hadoop.
Spot has been incubating since 2016-09-23.
Three most important issues to address in the move towards graduation:
1. Move infrastructure and development to ASF (code, issues, mailing
list, ...)
2. Build diverse community
3. Demonstrate ability to create releases
Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be
aware of?
IPMC this point has been posted on the last two report, we have not
received any feedback in terms of how it goes, please comment.
Please elaborate on the process for code scanning and IP reviews. On the
last report this was brought to attention but no communication was
received.
How has the community developed since the last report?
Community is pushing more aggressively on project interaction which is
good, by opening issues on ASF JIRA, some of them have been closed with
their respective PR for bug fixing. Also developers in Brazil are
proposing a new UI schema by their own, which once is voted will
incorporate as contribution/new functionality towards our first Apache
release.
Adoption rate and popularity of the project has been recognized by
Industry receiving an award from Infoworld as Technology of the year and
also finalist on Edison Awards.
How has the project developed since the last report?
The Spot team launched a new project webpage version which incorporate
the long term vision and objectives of Apache Spot, and also includes a
new documentation package which is similar in structure to other ASF
projects.
Two new processes were published to explain on commits and how to open
issues, providing a structured frame that Community in general can follow
up in order to facilitate follow-up in combination of technology tools,
that can facilitate the life cycle continuity.
During this period 14 pull request were merged over different pipeline
components , as committers work also on the new design for data storage
through non-relational database technologies.
Also facilitating a quick demo of the project, we have updated our
container demo, and added automation in order to reflect those changes as
commits to UI are pulled to ASF Git repository.
Date of last release:
N/A
When were the last committers or PMC members elected?
Last committer was elected on 1/24
Signed-off-by:
[ ](spot) Jarek Jarcec Cecho
[ ](spot) Brock Noland
[ ](spot) Andrei Savu
[X](spot) Uma Maheswara Rao G
Apache Spot is a solution stack that provides the capability to ingest network related telemetry (network flows, domain name service information and proxy server logs) and provide unsupervised machine learning capabilities to identify suspicious activity. The information is organized and presented using operational analytics so that a security analyst can investigate the most suspicious connections. Apache Spot is built on an open data model using Apache Spark and Apache Hadoop. Spot has been incubating since 2016-09-23. Three most important issues to address in the move towards graduation: 1. Move infrastructure and development to ASF (code, issues, mailing list, ...) 2. Build diverse community 3. Demonstrate ability to create releases Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? Please elaborate on the process for code scanning and IP reviews. How has the community developed since the last report? Community interaction has ramped velocity, as new adopters of the project have been posting questions through Twitter and Slack. We have added approximately 20% to our Slack channel and added several contributors through the ICLA. How has the project developed since the last report? We have moved our repo to ASF as of December 8, 2016. We have removed our LDA-C dependencies by switching to Spark LDA, eliminating licensing issues on this particular branch. Date of last release: N/A When were the last committers or PMC members elected? N/A Signed-off-by: [ ](spot) Jarek Jarcec Cecho [ ](spot) Brock Noland [X](spot) Andrei Savu [X](spot) Uma Maheswara Rao G
Apache Spot is a solution stack that provides the capability to ingest network related telemetry (network flows, domain name service information and proxy server logs) and provide unsupervised machine learning capabilities to identify suspicious activity. The information is organized and presented using operational analytics so that a security analyst can investigate the most suspicious connections. Apache Spot is built on an open data model using Apache Spark and Apache Hadoop. Spot has been incubating since 2016-09-23. Three most important issues to address in the move towards graduation: - Move infrastructure and development to ASF (code, issues, mailing list, …) - Build diverse community - Demonstrate ability to create releases Any issues that the Incubator PMC (IPMC) or ASF Board wish/need to be aware of? We currently have a hard dependency via our LDA implementation that requires a GPL license. A rewrite is in progress to move the code from LDA-C to Spark LDA. How has the community developed since the last report? This is first report after accepting Apache Spot to incubator and we still have not transitioned everything to ASF. We do however see increased interest in the project, primarily on our Slack channel. How has the project developed since the last report? This is first report. Date of last release: N/A When were the last committers or PMC members elected? N/A Signed-off-by: [x](spot) Jarek Jarcec Cecho [ ](spot) Brock Noland [ ](spot) Andrei Savu [X](spot) Uma Maheswara Rao G