A youth-led research collective · Est. 2025

hi! we're students
building useful
things, openly.

We're a nonprofit collective of high schoolers and undergrads working on research, software, and community projects across machine learning, language preservation, education, and accessibility — and giving everything we make back to the public.

Structure
501(c)(3) nonprofit · Student-led
Output
Open research, code, and datasets
Where
California, USA
Open Research
Bhutan-focused NLP
ML Education
Save CGA
Accessible Tools
Knowledge Graphs
Youth-Led
Public Models
Open Research
Bhutan-focused NLP
ML Education
Save CGA
Accessible Tools
Knowledge Graphs
Youth-Led
Public Models
01 · About

a small collective with a broad mandate.

One Sphere Array Inc. is run by students, for the public. We don't have a single product or vertical — we have a worldview: that research, education, and software should be made by the people they serve, and shared without gatekeeping. We work across a range of ventures depending on what the moment calls for.

01

Open by default

Code, datasets, papers, and process notes go out under permissive licenses. If we make it, you can use it.

02

Locally rooted

We pick problems because they matter to a community we know — not because they're trending on a leaderboard.

03

Student-built

Everything ships from young people learning out loud. Mistakes included, written down, and learned from publicly.

02 · Ventures

what we're working on, right now.

Our latest set of long-running projects designed by students and shipped openly. Most are still works in progress.

// Language
In Development

Bhutan-focused NLP

Datasets, embeddings, and small language models for Dzongkha and other under-served languages of the Himalayas — built with native speakers, not despite them.

// Education
Active

ML Education & Research

Workshops, reading groups, and original research papers aimed at students who don't have a research lab next door. Free, asynchronous, and rigorous.

// Tools
In Development

Accessible Applications

Small, free utilities — assistive tools, study aids, language helpers — built for users who get ignored by the rest of the software industry.

// Infrastructure
Planning

Open Knowledge Graph

An experiment in using LLMs to build cross-language knowledge graphs that help under-resourced communities surface and connect their own information.

03 · Focus areas

our values and topics.

We're deliberately not a single-vertical organization. Different ventures call for different muscles, and we like it that way.

Machine Learning

Small models, applied research, and honest evaluation — over hype.

Language & NLP

Especially for languages and dialects that don't get attention.

Education

Free curricula, peer mentorship, and pathways into research.

Cultural Heritage

Archives, oral histories, digitization, preservation tools.

Software Tools

Useful, accessible, free. Minimum viable, maximum reach.

Accessibility

Designing first for users that mainstream tooling ignores.

Applied Research

Original investigations, written up and shared in full.

Community

Mentorship, hosting, programs, and just being good neighbors.

04 · Why we exist

we're not a startup or a data company.

we're a group of young people who think research, software, and learning should belong to the public, not large corporations.

05 · Contact

let's get in touch.

We're a very small team and we read everything. Reach out about partnerships, questions, joining a venture, or just to say hi.

SMS & Voicemail +1 (321) 234-7586
Response time ~24 hours