Project |
Description |
Sponsor (Champion) |
Mentors |
Start Date |
SAMOA |
SAMOA provides a collection of distributed streaming algorithms for the most common data mining and machine learning tasks such as classification, clustering, and regression, as well as programming abstractions to develop new algorithms that run on top of distributed stream processing engines (DSPEs). It features a pluggable architecture that allows it to run on several DSPEs such as Apache Storm, Apache S4, and Apache Samza. |
Incubator
(Daniel Dai)
|
|
2014-12-15 |
Toree |
Toree provides applications with a mechanism to interactively and remotely access Apache Spark. |
Incubator
(Sam Ruby)
|
Luciano Resende, Julien Le Dem, Ryan Blue
|
2015-12-02 |
Milagro |
Milagro provides core security infrastructure and crypto libraries for decentralized networks and distributed systems. |
Incubator
(Nick Kew)
|
Nick Kew, Jean-Frederic Clere
|
2015-12-21 |
Pony Mail |
Pony Mail is a mail-archiving, archive-viewing, and interaction service that can be integrated with many email platforms. |
Incubator
(Suneel Marthi)
|
John D. Ament, Sharan Foga
|
2016-05-27 |
Flagon |
Flagon is a usability testing platform for software tools. |
Incubator
(Lewis John McGibbney)
|
Lewis John McGibbney, David Meikle, Tim Allison, Furkan Kamaci
|
2016-07-13 |
Annotator |
Annotator provides annotation-enabling code for browsers, servers, and humans. |
Incubator
(Daniel Gruno)
|
Nick Kew, Tommaso Teofili, Benjamin Young
|
2016-08-30 |
Hivemall |
Hivemall is a library for machine learning implemented as Hive UDFs/UDAFs/UDTFs. |
Incubator
(Roman Shaposhnik)
|
Daniel Dai, Koji Sekiguchi
|
2016-09-13 |
Spot |
Apache Spot is a platform for network telemetry built on an open data model and Apache Hadoop. |
Incubator
(Doug Cutting)
|
Uma Maheswara Rao G
|
2016-09-23 |
Weex |
Weex is a framework for building high-performance, cross-platform mobile UIs. |
Incubator
(Edward J. Yoon)
|
Willem Ning Jiang, Myrle Krantz
|
2016-11-30 |
Ratis |
Ratis is a Java implementation of the Raft consensus protocol. |
Incubator
(Jitendra Pandey)
|
Uma Maheswara Rao G, Devaraj Das, Arpit Agarwal
|
2017-01-03 |
MXNet |
A flexible and efficient library for deep learning. |
Incubator
(Henri Yandell)
|
Markus Weimer, Bob Paulin, Jason Dai, Furkan Kamaci
|
2017-01-23 |
Livy |
Livy is a web service that exposes a REST interface for managing long-running Apache Spark contexts in your cluster. With Livy, new applications that require fine-grained interaction with many Spark contexts can be built on top of Apache Spark. |
Incubator
(Sean Busbey)
|
Bikas Saha, Luciano Resende, Jean-Baptiste Onofre
|
2017-06-05 |
Heron |
A real-time, distributed, fault-tolerant stream processing engine. |
Incubator
(Julien Le Dem)
|
Jake Farrell, Julien Le Dem, P. Taylor Goetz, Dave Fisher, Ming Wen, Kevin Ratnasekera
|
2017-06-23 |
PageSpeed |
PageSpeed represents a series of open source technologies to help make the web faster by rewriting web pages to reduce latency and bandwidth. |
Incubator
(Leif Hedstrom)
|
Jukka Zitting, Leif Hedstrom, Nick Kew
|
2017-09-30 |
SDAP |
SDAP is an integrated data analytic center for Big Science problems. |
Incubator
(Lewis John McGibbney)
|
Jörn Kottmann, Trevor Grant
|
2017-10-22 |
Crail |
Crail is a storage platform for sharing performance-critical data in distributed data processing jobs at very high speed. |
Incubator
(Luciano Resende)
|
Julian Hyde, Luciano Resende, Felix Cheung
|
2017-11-01 |
Nemo |
Nemo is a data processing system to flexibly control the runtime behaviors of a job to adapt to varying deployment characteristics. |
Incubator
(Byung-Gon Chun)
|
Hyunsik Choi, Byung-Gon Chun, Jean-Baptiste Onofre, Markus Weimer
|
2018-02-04 |
Doris |
Doris is an MPP-based interactive SQL data warehouse for reporting and analysis. |
Incubator
(Dave Fisher)
|
Willem Ning Jiang, Shao Feng Shi, Ming Wen
|
2018-07-18 |
DataLab |
DataLab is a platform for creating self-service, exploratory data science environments in the cloud using best-of-breed data science tools. |
Incubator
(P. Taylor Goetz)
|
P. Taylor Goetz, Henry Saputra, Konstantin I Boudnik, Furkan Kamaci
|
2018-08-20 |
Marvin-AI |
Marvin-AI is an open-source artificial intelligence (AI) platform that helps data scientists prototype and productionize complex solutions with a scalable, low-latency, language-agnostic, and standardized architecture while simplifying the process of exploration and modeling. |
Incubator
(Luciano Resende)
|
Luciano Resende, William Colen
|
2018-08-21 |
Pinot |
Pinot is a distributed columnar storage engine that can ingest data in real-time and serve analytical queries at low latency. |
Incubator
(Olivier Lamy)
|
Kishore Gopalakrishna, Jim Jagielski, Olivier Lamy, Felix Cheung
|
2018-10-17 |
brpc |
brpc is an industrial-grade RPC framework for building reliable and high-performance services. |
Incubator
|
Kevin A. McGrail, Jean-Baptiste Onofré, Von Gosling
|
2018-11-13 |
Training |
The Training project aims to develop resources which can be used for training purposes in various media formats, languages and for various Apache and non-Apache target projects. |
Incubator
(Lars Francke)
|
Craig Russell, Christofer Dutz, Justin Mclean, Lars Francke
|
2019-02-21 |
Tuweni |
Tuweni is a set of libraries and other tools to aid development of blockchain and other decentralized software in Java and other JVM languages. |
Incubator
(Jim Jagielski)
|
Jean-Baptiste Onofré, Furkan Kamaci
|
2019-03-25 |
Teaclave |
Teaclave is a universal secure computing platform. |
Incubator
(Zhijie Shen)
|
Felix Cheung, Furkan Kamaci, Jianyong Dai, Matt Sicker, Zhijie Shen
|
2019-08-20 |
DolphinScheduler |
DolphinScheduler is a distributed ETL scheduling engine with a powerful DAG visualization interface. |
Incubator
(Sheng Wu)
|
Sheng Wu, ShaoFeng Shi, Liang Chen, Furkan Kamaci, Kevin Ratnasekera
|
2019-08-29 |
TubeMQ |
TubeMQ is a distributed messaging queue (MQ) system. |
Incubator
(David Nalley)
|
Junping Du, Justin Mclean, Sijie Guo, Zhijie Shen, Jean-Baptiste Onofre
|
2019-11-03 |
StreamPipes |
StreamPipes is a self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore (Industrial) IoT data streams. |
Incubator
(Christofer Dutz)
|
Christofer Dutz, Jean-Baptiste Onofré, Julian Feinauer, Justin Mclean, Kenneth Knowles
|
2019-11-11 |
NuttX |
NuttX is a mature, real-time embedded operating system (RTOS). |
Incubator
(Junping Du)
|
Junping Du, Justin Mclean, Mohammad Asif Siddiqui, Flavio Paiva Junqueira, Duo Zhang
|
2019-12-09 |
YuniKorn |
YuniKorn is a standalone resource scheduler responsible for scheduling batch jobs and long-running services on large-scale distributed systems running in on-premises environments as well as in different public clouds. |
Incubator
(Vinod Kumar Vavilapalli)
|
Junping Du, Felix Cheung, Jason Lowe, Holden Karau
|
2020-01-21 |
NLPCraft |
A Java API for NLU applications. |
Incubator
(Konstantin Boudnik)
|
Roman Shaposhnik, Furkan Kamaci, Evans Ye, Paul King, Konstantin I Boudnik
|
2020-02-13 |
AGE |
AGE is a multi-model database that enables graph and relational models built on PostgreSQL. |
Incubator
(Jim Jagielski)
|
Kevin Ratnasekera, Von Gosling, Raphael Bircher, Felix Cheung
|
2020-04-29 |
Liminal |
Apache Liminal is an end-to-end platform for data engineers and scientists, allowing them to build, train and deploy machine learning models in a robust and agile way. |
Incubator
(Jean-Baptiste Onofre)
|
Jean-Baptiste Onofre, Henry Saputra, Uma Maheswara Rao G, Davor Bonaci, Liang Chen
|
2020-05-23 |
BlueMarlin |
BlueMarlin will develop a web service to add intelligence functionality to a plain ad system. |
Incubator
(Dave Fisher)
|
Craig Russell, Jean-Baptiste Onofré, Von Gosling, Junping Du, Uma Maheswara Rao G
|
2020-06-09 |
Pegasus |
Pegasus is a distributed key-value storage system which is designed to be simple, horizontally scalable, strongly consistent and high-performance. |
Incubator
(Von Gosling)
|
Kevin A. McGrail, Duo Zhang, Liang Chen, Von Gosling
|
2020-06-28 |
Sedona |
Sedona is a big geospatial data processing engine. It provides easy-to-use APIs for spatial data scientists to manage, wrangle, and process geospatial data. |
Incubator
(Felix Cheung)
|
Felix Cheung, Jean-Baptiste Onofré, George Percivall, Von Gosling
|
2020-07-19 |
Hop |
Hop is short for the Hop Orchestration Platform. Written completely in Java, it aims to provide a wide range of data orchestration tools, including a visual development environment, servers, metadata analysis, auditing services and so on. As a platform, Hop also aims to be a library that other software can easily reuse. |
Incubator
(Maximilian Michels)
|
Tom Barber, Julian Hyde, Maximilian Michels, Francois Papon, Kevin Ratnasekera
|
2020-09-24 |
Wayang |
Wayang is a cross-platform data processing system that aims at decoupling the business logic of data analytics applications from concrete data processing platforms, such as Apache Flink or Apache Spark. Hence, it tames the complexity that arises from the "Cambrian explosion" of novel data processing platforms that we currently witness. |
Incubator
(Christofer Dutz)
|
Christofer Dutz, Lars George, Bernd Fondermann, Jean-Baptiste Onofré
|
2020-12-16 |
EventMesh |
EventMesh is a dynamic cloud-native basic service runtime used to decouple the application and middleware layer. |
Incubator
(Von Gosling)
|
Francois Papon, Junping Du, Jean-Baptiste Onofre, Justin Mclean, Von Gosling
|
2021-02-18 |