The big data layer
Translation-specific interlinear data
How it is built
OpenScripture is building its own word-level alignment layer for every supported translation. The system starts with source-language morphology and Strong's-linked source data, then maps each source token to the actual English token or phrase used by a translation. Reviewed rows become runtime gold data; generated rows stay labelled as approximate until they pass audit.
Technical complications
Bible translation is not a neat word-for-word spreadsheet. One Hebrew or Greek word can become a phrase. English can add helper words that are not separate source tokens. Word order changes. Some translations follow different textual bases or versification systems. The alignment model has to preserve direct word anchors, phrase groups, supplied English, and untranslated source words without pretending the problem is simpler than it is.
Desired result
The desired result is native original-language word data beneath the reader, not a generic gloss pasted under every translation. A reader should be able to tap a word in the English text, see the matching Hebrew, Aramaic, or Greek data, switch interlinear bases, and trust that the app is describing that translation rather than a nearby approximation.