This was extracted (@ 2025-10-29 19:10) from a list of minutes
which have been approved by the Board.
Please Note
The Board typically approves the minutes of the previous meeting at the
beginning of every Board meeting; therefore, the list below does not
normally contain details from the minutes of the most recent Board meeting.
WARNING: these pages may omit some original contents of the minutes.
Texera is an open-source system to support collaborative data science, AI,
and ML using GUI-based workflows. Our vision is to develop a system to
support cloud platforms on which users can easily analyze data and use AI/ML
techniques provided as operators. Users with various backgrounds,
irrespective of whether they know coding or not, can collaborate on the same
project to construct a pipeline. Experienced users can use programming
languages such as Python, R, Java, and Scala to implement customized
computation logic. The platform allows users to pause the execution of a
workflow to investigate the operator states, and resume the execution at a
later time. The platform can be used by a research community to publish
valuable resources such as data sets, workflows, and ML models to share
their domain-specific knowledge and support reproducibility of scientific
research. The platform also allows users to elastically request computing
resources from public clouds for computationally-intensive tasks.
Texera has been incubating since 2025-04-12.
### Three most important unfinished issues to address before graduating:
1. Set up the Apache Texera webpage
[http://texera.apache.org/](http://texera.apache.org/).
Work is underway to consolidate user guides, developer setup
instructions, and governance policies.
We are analyzing the documentation structures of successful projects
like Apache Flink and Spark to
ensure an optimal user experience.
### Are there any issues that the IPMC or ASF Board need to be aware of?
No.
### How has the community developed since the last report?
1. **August 06, 2025:** The repository was officially transferred to the
Apache organization.
2. **August 28, 2025:** We conducted our monthly group sync meeting with
the PPMC members, committers, and contributors.
### How has the project developed since the last report?
* Cloud Deployment: Investigated AWS ECS for low-cost deployment of
Texera.
* Preparing for NIH: Added an option that allows owners to grant download
access to public datasets.
* UI Enhancements:
  * Added user activity to the admin dashboard
  * Displayed file upload speed and time
  * Added a dashboard tab for computing units
* Docs: All design decisions are documented in the `issues` tab on GitHub.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [X] Initial setup
- [ ] Working towards first release
- [ ] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release: 2025-04-04 (not an ASF release)
### When were the last committers or PPMC members elected?
July 4, 2025.
### Have your mentors been helpful and responsive?
Yes, our mentors were highly responsive and joined our meetings. They
also actively worked on our JIRA tickets.
### Signed-off-by:
- [X] (texera) PJ Fanning
Comments: Next priority must be to get the web site working
- [X] (texera) Ian Maxon
Comments: It's great to see the code donation cleared up. Looking
forward to an incubating release in the near future!
### IPMC/Shepherd notes:
Texera is an open-source system to support collaborative data science, AI,
and ML using GUI-based workflows. Our vision is to develop a system to
support cloud platforms on which users can easily analyze data and use AI/ML
techniques provided as operators. Users with various backgrounds,
irrespective of whether they know coding or not, can collaborate on the same
project to construct a pipeline. Experienced users can use programming
languages such as Python, R, Java, and Scala to implement customized
computation logic. The platform allows users to pause the execution of a
workflow to investigate the operator states, and resume the execution at a
later time. The platform can be used by a research community to publish
valuable resources such as data sets, workflows, and ML models to share
their domain-specific knowledge and support reproducibility of scientific
research. The platform also allows users to elastically request computing
resources from public clouds for computationally-intensive tasks.
Texera has been incubating since 2025-04-12.
### Three most important unfinished issues to address before graduating:
1. Donate the codebase to Apache.
2. Clean up parts in the codebase that are conflicting with Apache
3. Set up the Apache Texera webpage http://texera.apache.org/. Work is
underway to consolidate user guides, developer setup instructions, and
governance policies. We are analyzing the documentation structures of
successful projects like Apache Flink and Spark to ensure an optimal user
experience.
### Are there any issues that the IPMC or ASF Board need to be aware of?
No.
### How has the community developed since the last report?
* August 02, 2025: We have completed the SGA form. It has been submitted
to and documented by Apache. We are now ready to donate the codebase!
* July 30, 2025: We conducted our monthly group sync meeting with the
PPMC members, committers, and contributors.
* July 31, 2025: Zuozhi Wang joined as a PPMC member.
* We engaged in significant community outreach and educational
activities this month, expanding our user base to include 17 undergraduate
students, 34 students from high schools and community colleges, and 32
middle school students.
* Texera has been deployed and used for research at the UCI Department of
Ophthalmology.
### How has the project developed since the last report?
* Cloud Deployment: Texera now runs on AWS EKS with better resource
management.
* Machine Learning: Added scikit-learn training operators.
* UI Enhancements: Dynamic workflow setup and richer port-level metrics.
* Performance: Faster Texera Hub resource access.
* Docs: New tutorial videos in the works.
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [X] Initial setup
- [ ] Working towards first release
- [ ] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2025-04-04 (non-ASF)
### When were the last committers or PPMC members elected?
The PPMC voted to add a new member on July 4, 2025.
### Have your mentors been helpful and responsive?
Yes, our mentors were highly responsive and joined our meetings. We thank
PJ for guiding us on the SGA form. We also thank Ian for sharing his
experience from Apache AsterixDB.
### Is the PPMC managing the podling's brand / trademarks?
No answer.
### Signed-off-by:
- [ ] (texera) Cezar Andrei
Comments:
- [ ] (texera) Gordon King
Comments:
- [X] (texera) PJ Fanning
Comments: SGA is now accepted by ASF Secretary and Git repo has been
moved to ASF
- [ ] (texera) Ian Maxon
Comments:
### IPMC/Shepherd notes:
Texera is an open-source system to support collaborative data science, AI,
and ML using GUI-based workflows. Our vision is to develop a system to
support cloud platforms on which users can easily analyze data and use AI/ML
techniques provided as operators. Users with various backgrounds,
irrespective of whether they know coding or not, can collaborate on the same
project to construct a pipeline. Experienced users can use programming
languages such as Python, R, Java, and Scala to implement customized
computation logic. The platform allows users to pause the execution of a
workflow to investigate the operator states, and resume the execution at a
later time. The platform can be used by a research community to publish
valuable resources such as data sets, workflows, and ML models to share
their domain-specific knowledge and support reproducibility of scientific
research. The platform also allows users to elastically request computing
resources from public clouds for computationally-intensive tasks.
Texera has been incubating since 2025-04-12.
### Three most important unfinished issues to address before graduating:
1. Complete the SGA form.
2. Donate the Texera codebase from the
[Texera/texera](https://github.com/Texera/texera) GitHub repo to the
Apache/Texera repo.
3. Set up the Apache Texera webpage http://texera.apache.org/.
### Are there any issues that the IPMC or ASF Board need to be aware of?
No.
### How has the community developed since the last report?
- 2025-06-12: Ian Maxon (imaxon) has been added as a Texera mentor.
- 2025-06-25: We got approval from UCI to release their copyright to us.
We are finalizing the paperwork. After this, we can sign and submit the SGA
form.
- 2025-06-30: We conducted a monthly group sync meeting. The meeting minutes
and recording can be found in [this email
thread](https://lists.apache.org/thread/hq6tsnzw05c00lqgb1trwf5w31kvvzg2).
- 2025-06-30: The PPMC started voting for a new PPMC member.
- 2025-06-30: Six more contributors are onboarding.
- 2025-06-30: Four more users have shown interest through other
channels. We will gradually move to general@texera.apache.org as the way
for potential users to contact us.
### How has the project developed since the last report?
1. Logo & Branding: Complete refresh of branding assets (**NEW LOGO!**):
   - Replaced the old logo with a redesigned new version and improved logo
     loading behavior. [#3481](https://github.com/Texera/texera/pull/3481),
     [#3483](https://github.com/Texera/texera/pull/3483),
     [#3485](https://github.com/Texera/texera/pull/3485),
     [#3486](https://github.com/Texera/texera/pull/3486),
     [#3487](https://github.com/Texera/texera/pull/3487)
2. Execution Engine
   - Improved schema propagation.
     [#3501](https://github.com/Texera/texera/pull/3501)
   - Refactored messaging classes for better semantic clarity:
     `ControlPayload` → Direct Control Message (DCM), `ChannelMarker` →
     Embedded Control Message (ECM).
     [#3492](https://github.com/Texera/texera/pull/3492),
     [#3472](https://github.com/Texera/texera/pull/3472)
   - Enhanced execution stability and correctness.
     [#3488](https://github.com/Texera/texera/pull/3488),
     [#3491](https://github.com/Texera/texera/pull/3491),
     [#3470](https://github.com/Texera/texera/pull/3470),
     [#3471](https://github.com/Texera/texera/pull/3471),
     [#3482](https://github.com/Texera/texera/pull/3482)
3. Services & Configuration
   - Introduced and extended Admin Site Settings service.
     [#3453](https://github.com/Texera/texera/pull/3453),
     [#3480](https://github.com/Texera/texera/pull/3480),
     [#3489](https://github.com/Texera/texera/pull/3489)
4. UI & Usability
   - Added Parquet as a supported export format and alpha channel support
     for scatter plots. [#3478](https://github.com/Texera/texera/pull/3478),
     [#3498](https://github.com/Texera/texera/pull/3498)
   - Improved operator usability.
     [#3479](https://github.com/Texera/texera/pull/3479),
     [#3411](https://github.com/Texera/texera/pull/3411)
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [X] Initial setup
- [ ] Working towards first release
- [ ] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2025-04-04
### When were the last committers or PPMC members elected?
We conducted a group sync meeting on 2025-06-30 (see [email
thread](https://lists.apache.org/thread/hq6tsnzw05c00lqgb1trwf5w31kvvzg2)
for meeting minutes, slides and recordings). We discussed adding a new PPMC
member and voting on the case on the mailing list. The vote is to be
conducted on 2025-07-03.
### Have your mentors been helpful and responsive?
Yes, our mentors have been both helpful and highly responsive, and we
sincerely appreciate their continued support throughout the incubation
process. We would especially like to thank PJ Fanning for his outstanding
guidance and active engagement. We also warmly welcome our new mentor, Ian
Maxon, and look forward to benefiting from his valuable experience with the
successful incubation of Apache AsterixDB.
### Is the PPMC managing the podling's brand / trademarks?
The Texera PPMC just released a new logo for branding purposes. The [new
logo](https://github.com/Texera/texera/raw/master/core/gui/src/assets/logos/full_logo_small.png)
is based on a peacock. The Texera PPMC is working with third parties who
wish to use the Texera brand/trademarks, in accordance with
https://incubator.apache.org/guides/publicity.html and
https://incubator.apache.org/guides/branding.html.
### Signed-off-by:
- [ ] (texera) Cezar Andrei
Comments:
- [X] (texera) Gordon King
Comments:
- [X] (texera) PJ Fanning
Comments:
- [X] (texera) Ian Maxon
Comments: Looks good to me. I think the project is making great
progress toward adopting good practices around project discussions and
license management.
### IPMC/Shepherd notes:
Texera is an open-source system to support collaborative data science, AI,
and ML using GUI-based workflows. Our vision is to develop a system that
supports cloud platforms on which users can easily analyze data and use
AI/ML techniques provided as operators. Users with various backgrounds,
irrespective of whether they know coding or not, can collaborate on the
same project to construct a pipeline. Experienced users can use programming
languages such as Python, R, Java, and Scala to implement customized
computation logic. The platform allows users to pause the execution of a
workflow to investigate operator states and resume execution at a later
time. The platform can be used by a research community to publish valuable
resources such as data sets, workflows, and ML models to share their
domain-specific knowledge and support reproducibility of scientific
research. The platform also allows users to elastically request computing
resources from public clouds for computationally intensive tasks.
Texera has been incubating since 2025-04-12.
### Three most important unfinished issues to address before graduating:
1. Complete the SGA form.
2. Donate the Texera codebase from the
[Texera/texera](https://github.com/Texera/texera) GitHub repo to the
Apache/Texera repo.
3. Set up the Apache Texera webpage http://texera.apache.org/.
### Are there any issues that the IPMC or ASF Board need to be aware of?
No
### How has the community developed since the last report?
- 2025-04-12: The project enters the incubation phase.
- 2025-05-12: We have expanded the PPMC to 12 members. All of them
have signed and submitted their iCLA forms. The latest members (excluding
mentors) are listed as follows:
| # | ID | GitHub Username | Public Name |
|----|--------------|------------------|----------------|
| 1 | aicam | aicam | Ali Risheh |
| 2 | chenli | | Chen Li |
| 3 | kunwp1 | | Chris Park |
| 4 | jiadongb | | Jiadong Bai |
| 5 | shengquan | | Shengquan Ni |
| 6 | xiaozhenliu | | Xiaozhen Liu |
| 7 | linxinyuan | aglinxinyuan | Xinyuan Lin |
| 8 | yiconghuang | Yicong-Huang | Yicong Huang |
| 9 | yunyd | | Yunyan Ding |
- 2025-05-12: We have identified 3 mentors:
| # | ID | GitHub Username | Public Name |
|----|------------|------------------|------------------------|
| 1 | cezar | cezarfx | Cezar Cristian Andrei |
| 2 | garyw | bigdata-memory | Gordon King |
| 3 | fanningpj | pjfanning | PJ Fanning |
- 2025-05-12: 10/12 PPMC members (excluding two mentors) have subscribed
to mailing lists.
- 2025-05-18: We have gathered 52 approvals (in the form of email replies)
from past codebase committers to donate the
[Texera/texera](https://github.com/Texera/texera) codebase to
Apache/Texera.
- 2025-05-25: We are in discussion with Ian Maxon (Apache ID: imaxon), PMC
Chair of Apache AsterixDB and a long-time contributor, about his joining as
a mentor for Apache Podling-Texera. Ian is currently in the process of
self-nominating to join the Incubator PMC in order to become eligible to
serve as a mentor.
- 2025-06-01: As Texera is a research project conducted at the University of
California, Irvine (UCI), the university owns the copyright of the
codebase/software. We are talking to the UCI licensing office about
releasing their copyright to the committers of the codebase, with Chen Li
as the representative. After the UCI licensing office releases the
copyright, we will prepare the SGA form.
- 2025-06-01: As of today, we are actively designing a new logo for
Texera.
### How has the project developed since the last report?
- We released Texera
[v1.0.0](https://github.com/Texera/texera/releases/tag/1.0.0) on April 04,
2025.
- Many codebase cleanups to prepare the codebase for donation to Apache:
  - Created a `NOTICE` file to comply with ASF requirements
    [#3388](https://github.com/Texera/texera/pull/3388)
  - Added ASF license headers to JOOQ-generated files
    [#3414](https://github.com/Texera/texera/pull/3414)
  - Added ASF license headers to source files and enabled Release Audit
    Tool (RAT) in CI [#3415](https://github.com/Texera/texera/pull/3415)
- New major features:
  - Added new visualization operators: Bullet Chart
    [#3362](https://github.com/Texera/texera/pull/3362), RangeSlider
    [#3374](https://github.com/Texera/texera/pull/3374), Nested Table
    [#3397](https://github.com/Texera/texera/pull/3397), and Volcano Plot
    [#3418](https://github.com/Texera/texera/pull/3418)
  - Improved file handling with MultipartUpload implementation for file
    uploads [#3396](https://github.com/Texera/texera/pull/3396) and
    pre-checks for file size/type before preview
    [#3427](https://github.com/Texera/texera/pull/3427)
  - Enhanced result exportation with support for the HTML format and
    large Apache Arrow files
    [#3429](https://github.com/Texera/texera/pull/3429)
  - Added size metadata to a workflow JSON for better visibility
    [#3256](https://github.com/Texera/texera/pull/3256)
  - Enabled email notifications to users and administrators for inactive
    or unauthorized login attempts
    [#3391](https://github.com/Texera/texera/pull/3391),
    [#3393](https://github.com/Texera/texera/pull/3393)
  - Added admin account configuration and restricted local sign-ups
    [#3387](https://github.com/Texera/texera/pull/3387)
  - Added support for new R libraries and fixed job ordering in the Helm
    chart [#3386](https://github.com/Texera/texera/pull/3386)
  - Enabled adjustable shared memory settings for the computing unit
    [#3451](https://github.com/Texera/texera/pull/3451)
- UI improvements
  - Improved the layout and interface, including the change to the
    "terminate button icon"
    [#3424](https://github.com/Texera/texera/pull/3424)
    and dynamic height adjustment of the operator property sidebar
    [#3437](https://github.com/Texera/texera/pull/3437)
  - Enhanced interactivity with auto-expansion of the result panel on an
    operator click [#3435](https://github.com/Texera/texera/pull/3435) and
    improved the comment box usability
    [#3436](https://github.com/Texera/texera/pull/3436)
  - Added individual progress indicators for concurrent file uploads
    [#3426](https://github.com/Texera/texera/pull/3426)
  - General UI consistency and refinements
    [#3359](https://github.com/Texera/texera/pull/3359),
    [#3372](https://github.com/Texera/texera/pull/3372)
- Refactoring
  - Reimplemented the User Quota Service to remove the MongoDB dependency
    [#3402](https://github.com/Texera/texera/pull/3402)
  - Replaced cache source operators with input port materialization
    reader threads [#3425](https://github.com/Texera/texera/pull/3425)
- Bug fixes
  - Fixed the Helm chart metric resolution, cluster export issues, and
    Envoy HTTP timeout/configuration problems
    [#3376](https://github.com/Texera/texera/pull/3376),
    [#3378](https://github.com/Texera/texera/pull/3378),
    [#3380](https://github.com/Texera/texera/pull/3380),
    [#3382](https://github.com/Texera/texera/pull/3382)
  - Updated the base image in Dockerfile for compatibility
    [#3413](https://github.com/Texera/texera/pull/3413)
  - Fixed the UI layout and state issues, including centering operators,
    incorrect workflow reloading, and auto-persistence logic
    [#3342](https://github.com/Texera/texera/pull/3342),
    [#3344](https://github.com/Texera/texera/pull/3344),
    [#3412](https://github.com/Texera/texera/pull/3412)
  - Resolved errors in dataset display for unauthenticated users and
    improved handling of compressed files in File Scan operator
    [#3373](https://github.com/Texera/texera/pull/3373),
    [#3385](https://github.com/Texera/texera/pull/3385)
  - Fixed multiple ChannelMarker issues across Java and Python layers,
    including transmission, partitioning, and worker routing
    [#3434](https://github.com/Texera/texera/pull/3434),
    [#3439](https://github.com/Texera/texera/pull/3439),
    [#3445](https://github.com/Texera/texera/pull/3445),
    [#3448](https://github.com/Texera/texera/pull/3448)
  - Corrected backend query versioning and execution order, and fixed
    Python Lambda operator compilation
    [#3410](https://github.com/Texera/texera/pull/3410),
    [#3438](https://github.com/Texera/texera/pull/3438)
### How would you assess the podling's maturity?
Please feel free to add your own commentary.
- [X] Initial setup
- [ ] Working towards first release
- [ ] Community building
- [ ] Nearing graduation
- [ ] Other:
### Date of last release:
2025-04-04
### When were the last committers or PPMC members elected?
On 2025-05-12, the last Texera PPMC members (yunyad, aicam) were added.
### Have your mentors been helpful and responsive?
Yes, our mentors have been both helpful and highly responsive. We are
grateful for their continued support throughout the incubation process. In
particular, we would like to extend a special thanks to PJ Fanning for his
outstanding guidance and engagement.
### Is the PPMC managing the podling's brand / trademarks?
The Texera PPMC is working with third parties who wish to use the Texera
brand/trademarks, in accordance with
https://incubator.apache.org/guides/publicity.html and
https://incubator.apache.org/guides/branding.html. The Texera PPMC is also
working on a new design of the Texera logo/trademark.
Name is approved:
https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-233
### Signed-off-by:
- [ ] (texera) Cezar Andrei
Comments:
- [ ] (texera) Gordon King
Comments:
- [X] (texera) PJ Fanning
Comments: I will ask the Texera team to provide a shorter report in
future. The SGA is not yet signed so we have not yet transferred the
code.
### IPMC/Shepherd notes: