Linux & Edris | Character dialogue

Linux

Hey Edris, I’ve been tinkering with an open‑source tool that could help us digitize and share endangered language resources—want to see if we can make it better together?

Edris

Sure thing! What does the tool do right now, and where do you feel it’s missing or could use a tweak? I’m all ears for how we can make it a better fit for preserving those living, breathing languages.

Linux

Right now it just pulls audio files, chops them into snippets, and tags them with a basic metadata form. The interface is fine, but the speech‑to‑text engine is weak on tonal languages and the library of phonetic symbols is incomplete. I’d love a tighter workflow that lets a community member annotate pronunciations directly on the waveform, and a plugin that pulls in the local dialect dictionary for auto‑suggestions. Also, versioning each annotation set would help keep the data reproducible. What do you think about adding a lightweight editor and a simple API to hook in those tools?

Edris

Sounds like a solid plan. The annotation step is where the community really gets to shape the data, so a waveform‑based editor would let them see the exact prosody. I can imagine a thin overlay that lets a volunteer click a tone marker or add a glottal stop in real time, and the system could lock that annotation so future users see the history. Pulling in a local dialect dictionary as a plugin would give auto‑suggestions right on the spot, but we’d need to keep the interface uncluttered. Maybe just a dropdown that appears when you hover over a segment. As for the API, a REST endpoint that returns the audio file plus its current metadata and all annotation layers would let other apps—like your analysis scripts—fetch the data cleanly. And versioning each annotation set is essential; we could keep a simple JSON log that records who edited what and when. Overall, the lighter the editor, the more likely community members will use it. I’ll start sketching a design that keeps the UI minimal, maybe a few buttons for “add tone,” “add glottal stop,” and “save.” Let me know if you want to dive into any specific part.

Linux

Nice outline, Edris. I’d suggest we prototype the overlay first—just a quick canvas on the waveform where a click drops a tiny icon. That way we can test latency and see if the community finds it intuitive before adding the drop‑down logic. Also, let’s keep the JSON log schema lightweight; a single array of edits with user, timestamp, and change type will keep the files small and still give us traceability. Once the core editor is stable, we can layer in the dictionary plugin and the REST API. What do you think about a quick test with one of the pilot languages?