Statistics

Count: 41 PPMCs (history)

Mean age: 291 days

Median age: 470 days

Currently in incubation, sorted by age

Project Description Sponsor (Champion) Mentors Start Date
Droids Droids aims to be an intelligent standalone robot framework that allows to create and extend existing droids (robots). HttpComponents, Lucene Thorsten Scherler, Richard Frovarp 2008-10-09
Wave A wave is a hosted, live, concurrent data structure for rich communication. It can be used like email, chat, or a document. Incubator Christian Grobmeier, Upayavira 2010-12-04
ODF Toolkit Java modules that allow programmatic creation, scanning and manipulation of OpenDocument Format (ISO/IEC 26300 == ODF) documents Incubator Sam Ruby, Nick Burch, Yegor Kozlov 2011-08-01
Kalumet Kalumet a complete environment manager and deployer including J2EE environments (application servers, applications, etc), softwares, and resources. Incubator Jim Jagielski, Henri Gomez, Jean-Baptiste Onofre, Olivier Lamy 2011-09-20
Blur Blur is a search platform capable of searching massive amounts of data in a cloud computing environment. Incubator(Patrick Hunt) Doug Cutting, Patrick Hunt, Tim Williams 2012-07-24
Ripple Ripple is a browser based mobile phone emulator designed to aid in the development of HTML5 based mobile applications. Ripple is a cross platform and cross runtime testing/debugging tool. It currently supports such runtimes as Cordova, WebWorks aand the Mobile Web. Incubator(Ross Gardler) Jukka Zitting, Christian Grobmeier, Andrew Savory 2012-10-16
Streams Apache Streams is a lightweight server for ActivityStreams. Incubator(Matt Franklin) Matt Franklin, Ate Douma, Craig McClanahan 2012-11-20
MRQL MRQL is a query processing and optimization system for large-scale, distributed data analysis, built on top of Apache Hadoop, Hama, Spark, and Flink. Incubator(Edward J. Yoon) Alan Cabrera, Alex Karasulu, Mohammad Nour El-Din 2013-03-13
Sentry Sentry is a highly modular system for providing fine grained role based authorization to both data and metadata stored on an Apache Hadoop cluster. Incubator Arvind Prabhakar, Joe Brockmeier, David Nalley, Olivier Lamy, Patrick Hunt, Thomas White 2013-08-08
BatchEE BatchEE projects aims to provide a JBatch implementation (aka JSR352) and a set of useful extensions for this specification. Incubator(FIXME) Jean-Baptiste Onofré, Olivier Lamy, Mark Struberg 2013-10-03
Usergrid Usergrid is Backend-as-a-Service (BaaS) composed of an integrated database (Cassandra), application layer and client tier with SDKs for developers. Incubator Dave Johnson, Jake Farrell, Jim Jagielski, John D. Ament, Lewis John Mcgibbney, Luciano Resende 2013-10-03
Sirona Monitoring Solution. Incubator(Olivier Lamy) Olivier Lamy, Henri Gomez, Jean-Baptiste Onofre, Tammo van Lessen, Mark Struberg 2013-10-15
Twill Twill is an abstraction over Apache Hadoop YARN that reduces the complexity of developing distributed applications, allowing developers to focus more on their business logic Incubator(Vinod K) Arun C Murthy, Tom White, Patrick Hunt, Andrei Savu 2013-11-14
log4cxx2 Logging for C++ Logging Services(Christian Grobmeier) Christian Grobmeier, Scott Deboy 2013-12-09
DataFu DataFu provides a collection of Hadoop MapReduce jobs and functions in higher level languages based on it to perform data analysis. It provides functions for common statistics tasks (e.g. quantiles, sampling), PageRank, stream sessionization, and set and bag operations. DataFu also provides Hadoop jobs for incremental data processing in MapReduce. Incubator(Jakob Homan) Ashutosh Chauhan, Roman Shaposhnik, Ted Dunning 2014-01-05
Slider Slider is a collection of tools and technologies to package, deploy, and manage long running applications on Apache Hadoop YARN clusters. Incubator(Vinod K) Arun C Murthy, Devaraj Das, Jean-Baptiste Onofré, Mahadev Konar 2014-04-29
Brooklyn Brooklyn is a framework for modelling, monitoring, and managing applications through autonomic blueprints. Incubator(Chip Childers) Matt Hogstrom, Alex Karasulu, David Nalley, Marcel Offermans, Jean-Baptiste Onofré, Olivier Lamy, Chip Childers, Andrei Savu, Joe Brockmeier, Jim Jagielski 2014-05-01
Calcite Calcite is a highly customizable engine for parsing and planning queries on data in a wide variety of formats. It allows database-like access, and in particular a SQL interface and advanced query optimization, for data not residing in a traditional database. (Renamed from Optiq on 2014-09-30.) Incubator(Ashutosh Chauhan) Ted Dunning, Alan Gates, Steven Noels 2014-05-19
Johnzon Implementation of JSR-353 JavaTM API for JSON Processing (Renamed from Fleece) Incubator(Mark Struberg) Justin Mclean, Daniel Kulp 2014-06-09
Ranger The Ranger project is a framework to enable, monitor and manage comprehensive data security across the Hadoop platform. Incubator(Owen O'Malley) Alan Gates, Daniel Gruno, Devaraj Das, Jakob Homan, Owen O'Malley 2014-07-24
REEF REEF (Retainable Evaluator Execution Framework) is a scale-out computing fabric that eases the development of Big Data applications on top of resource managers such as Apache YARN and Mesos. Incubator Chris Douglas, Chris Mattmann, Ross Gardler, Owen O'Malley 2014-08-12
Ignite A unified In-Memory Data Fabric providing high-performance, distributed in-memory data management software layer between various data sources and user applications. Incubator(Konstantin Boudnik) Branko Čibej, Konstantin Boudnik, Henry Saputra, Roman Shaposhnik, Michael Stack 2014-10-01
Lens Lens is a platform that enables multi-dimensional queries in a unified way over datasets stored in multiple warehouses. Lens integrates Apache Hive with other data warehouses by tiering them together to form logical data cubes. Incubator(Vinod Kumar Vavilapalli) Christopher Douglas, Jakob Glen Homan, Jean-Baptiste Onofre 2014-10-10
Taverna Taverna is a domain-independent suite of tools used to design and execute data-driven workflows. Incubator(Andy Seaborne) Andy Seaborne, Chris Mattmann, Suresh Srinivas, Suresh Marru, Marlon Pierce 2014-10-20
HTrace HTrace is a tracing framework intended for use with distributed systems written in java. Incubator(Roman Shaposhnik) Jake Farrell, Todd Lipcon, Lewis John Mcgibbney, Andrew Purtell, Billie Rinaldi, Michael Stack 2014-11
Tamaya Tamaya is a highly flexible configuration solution based on an modular, extensible and injectable key/value based design, which should provide a minimal but extendible modern and functional API leveraging SE, ME and EE environments. Incubator(David Blevins) John D. Ament, Mark Struberg, Gerhard Petracek, David Blevins 2014-11-14
NiFi NiFi is a dataflow system based on the concepts of flow-based programming. Incubator(Benson Margulies) Billie Rinaldi, Arvind Prabhakar, Sergio Fernandez, Benson Margulies, Brock Noland, Drew Farris, Andrew Purtell 2014-11-24
Kylin Kylin is a distributed and scalable OLAP engine built on Hadoop to support extremely large datasets. Incubator(Owen O’Malley) Owen O'Malley, Ted Dunning, Henry Saputra 2014-11-25
Corinthia Corinthia is a toolkit/application for converting between and editing common office file formats, with an initial focus on word processing. It is designed to cater for multiple classes of platforms - desktop, web, and mobile - and relies heavily on web technologies such as HTML, CSS, and JavaScript for representing and manipulating documents. The toolkit is small, portable, and flexible, with minimal dependencies. The target audience is developers wishing to include office viewing, conversion, and editing functionality into their applications. Incubator(Jan Iversen) Daniel Gruno, Jan Iversen, Dave Fischer 2014-12-08
SAMOA SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. Incubator(Daniel Dai) Alan Gates, Ashutosh Chauhan, Enis Soztutar, Ted Dunning 2014-12-15
Zeppelin A collaborative data analytics and visualization tool for distributed, general-purpose data processing systems such as Apache Spark, Apache Flink, etc. Incubator(Roman Shaposhnik) Konstantin Boudnik, Henry Saputra, Roman Shaposhnik, Ted Dunning, Hyunsik Choi 2014-12-23
TinkerPop TinkerPop is a graph computing framework written in Java Incubator(David Nalley) Rich Bowen, Daniel Gruno, Hadrian Zbarcea, Matt Franklin, David Nalley 2015-01-16
OpenAz Tools and libraries for developing Attribute-based Access Control (ABAC) Systems in a variety of languages. Incubator(Paul Fremantle) Emmanuel Lecharny, Colm O Heigeartaigh, Hadrian Zbarcea 2015-01-20
AsterixDB Apache AsterixDB is a scalable big data management system (BDMS) that provides storage, management, and query capabilities for large collections of semi-structured data. Incubator(Chris Mattmann) Ate Douma, Chris Mattmann, Henry Saputra, Jochen Wiedmann, Ted Dunning 2015-02-28
Myriad Myriad enables co-existence of Apache Hadoop YARN and Apache Mesos together on the same cluster and allows dynamic resource allocations across both Hadoop and other applications running on the same physical data center infrastructure. Incubator(Benjamin Hindman) Benjamin Hindman, Danese Cooper, Ted Dunning, Luciano Resende 2015-03-01
CommonsRDF Commons RDF is a set of interfaces and classes for RDF 1.1 concepts and behaviours. The commons-rdf-api module defines interfaces and testing harness. The commons-rdf-simple module provides a basic reference implementation to exercise the test harness and clarify API contracts. Incubator(Lewis John McGibbney) Rob Vesse, John D Ament, Gary Gregory 2015-03-06
Groovy Groovy is an object-oriented programming language for the Java platform. It is a language with features similar to those of Python, Ruby, Java, Perl, and Smalltalk. Incubator(Roman Shaposhnik) Andrew Bayer, Konstantin Boudnik, Bertrand Delacretaz, Jim Jagielski, Emmanuel Lecharny, Roman Shaposhnik 2015-03-17
Singa Singa is a distributed deep learning platform. Incubator(Thejas Nair) Daniel Dai, Alan Gates, Ted Dunning, Thejas Nair 2015-03-17
Geode Geode is a data management platform that provides real-time, consistent access to data-intensive applications throughout widely distributed cloud architectures. Incubator(Roman Shaposhnik) Konstantin Boudnik, Chip Childers, Justin Erenkrantz, Jan Iversen, Chris Mattmann, William A. Rowe Jr., Henry Saputra, Roman Shaposhnik 2015-04-27
Atlas Apache Atlas is a scalable and extensible set of core foundational governance services that enables enterprises to effectively and efficiently meet their compliance requirements within Hadoop and allows integration with the complete enterprise data ecosystem Incubator(Jitendra Nath Pandey) Arun Murthy, Chris Douglas, Jakob Homan, Vinod Kumar Vavilapalli 2015-05-05
Climate Model Diagnostic Analyzer CMDA provides web services for multi-aspect physics-based and phenomenon-oriented climate model performance evaluation and diagnosis through the comprehensive and synergistic use of multiple observational data, reanalysis data, and model outputs. Incubator(Chris Mattmann) James W. Carman, Chris Mattmann, Michael James Joyce, Kim Whitehall, Gregory D. Reddin 2015-05-08
Trafodion Trafodion is a webscale SQL-on-Hadoop solution enabling transactional or operational workloads on Hadoop. Incubator(Michael Stack) Andrew Purtell, Devaraj Das, Enis Söztutar, Lars Hofhansl, Michael Stack, Roman Shaposhnik 2015-05-24