Hieroglyphica · ΙΕΡΟΓΛΥΦΩ
A Browser-Native Composition Environment for Egyptian Epigraphy
Format: .docx | Draft v0.3
Abstract
This paper introduces Hieroglyphica (ΙΕΡΟΓΛΥΦΩ), a browser-native composition environment for Ancient Egyptian writing, and situates it within the existing landscape of digital epigraphy tools. For two decades the desktop application JSesh (Rosmorduc 2014) has served as the de facto reference implementation for Manuel de Codage (MdC) authoring. Hieroglyphica retains the strengths of that lineage while extending the model in four directions relevant to contemporary epigraphic practice: (1) zero-install deployment via the web platform, removing the licensing, runtime, and font-configuration overhead that constrains teaching and peer review; (2) first-class support for the Egyptian Hieroglyphs Extended-A Unicode block (U+13460–U+143FF), enabling composition of the approximately 4,897 signs that fall outside the Gardiner core; (3) a linked three-row block primitive that binds glyph row, transliteration, and translation as a single editorial unit, mirroring the conventional publication layout; and (4) a hybrid PDF export that embeds an invisible Unicode text layer over the rasterised composition, producing artefacts that are simultaneously visually faithful in print and machine-searchable in digital indexes. We document the architecture, present a feature-by-feature comparison with JSesh, and discuss limitations and the planned roadmap.
Keywords: digital epigraphy, Egyptian hieroglyphs, Manuel de Codage, JSesh, Unicode, Leiden conventions, Thesaurus Linguae Aegyptiae, Egyptology, web tools.
1. Introduction
The digital encoding of Egyptian hieroglyphic writing has matured significantly since the codification of the Manuel de Codage (Buurman, Grimal, et al. 1988) and the subsequent Unicode allocations for Egyptian Hieroglyphs (U+13000–U+1342F, 2009) and Egyptian Hieroglyphs Extended-A (U+13460–U+143FF, 2022). Throughout that period JSesh has served as the de facto desktop authoring environment for epigraphers rendering MdC strings as typeset glyph blocks (Rosmorduc 2014). JSesh remains the reference implementation against which any new tool must be measured.
Two recent shifts have changed the constraints on tooling. First, browsers now ship font shaping, complex-script support, canvas APIs, and PDF generation libraries adequate for typesetting tasks that previously required native applications. Second, the editorial expectations placed on digital epigraphic publication — full Unicode fidelity, machine-readable text alongside visual layout, Leiden-convention support, and reproducible artefacts — have raised the bar for what a hieroglyph editor must produce.
Hieroglyphica is a web-native composition tool built against these constraints. The goal is not to replicate JSesh but to advance the toolkit in the directions that the modern epigraphic workflow demands: zero-install access, complete Unicode coverage, a linked editorial primitive that mirrors publication layout, and PDFs that are simultaneously print-faithful and machine-readable. This paper documents the design choices behind Hieroglyphica and contrasts them, point by point, with the established JSesh workflow.
2. Background: JSesh and the State of the Art
JSesh's design choices, made in the early 2000s and refined consistently since, reflect the desktop paradigm of its era:
- Local installation. JSesh ships as a Java application; runtime installation, font configuration, and version management are user-managed.
- MdC-centric input. The primary input mode is a textual MdC stream, with WYSIWYG cursor positioning over rendered glyphs.
- Native rendering. JSesh ships its own vector glyph set and rendering engine, decoupling it from operating-system font fallback.
- Export coverage. Output formats include EMF, SVG, PDF, and bitmap rasters.
These choices have served the community for two decades. JSesh's glyph repertoire is comprehensive, its MdC parser is mature, and its outputs are accepted in print publication workflows. Any successor tool must acknowledge that baseline. The desktop-and-MdC model, however, carries trade-offs that have become increasingly visible as the field has shifted toward collaborative editing, web publication, and machine-readable archives:
- Installation friction. Students, peer reviewers, and collaborators must each install the application and matching fonts before they can open, edit, or even view a file. This complicates seminar use, peer review, and quick consultation.
- Text-layer opacity. PDFs produced from native vector rendering contain glyphs as paths; they are not Unicode-searchable, copy-paste yields nothing useful, and accessibility tools cannot read them.
- Limited Unicode Extended coverage. Because rendering is decoupled from system fonts, support for newly-encoded Unicode ranges depends on JSesh's own glyph additions rather than the user's installed Unicode fonts.
- Editorial unit fragmentation. A typical epigraphic line — glyph row, transliteration, and translation — must be composed as three independent objects, with no enforced linkage between them.
Hieroglyphica was designed to address these specific gaps directly, while preserving the conventions JSesh established as community baseline.
3. Hieroglyphica: Design and Architecture
3.1 Platform
Hieroglyphica is a single-page web application served from hieroglyphica.org, requiring no installation. It is implemented in vanilla JavaScript with no build step; the canvas composition layer uses Fabric.js 5.2.4. Persistence is handled via JSON workspace files (load/save) and IndexedDB for dictionary caching. The application runs entirely client-side; no editorial content leaves the user's machine.
3.2 Glyph Repertoire and Fonts
Hieroglyphica renders glyphs using Noto Sans Egyptian Hieroglyphs (Apache-2.0 / SIL-OFL) for the core Unicode block (U+13000–U+1342F) and a custom Hieroglyphica Magna font — developed by M. Mazzon specifically for this project — for the Egyptian Hieroglyphs Extended-A Unicode block (U+13460–U+143FF), exposing approximately 4,897 additional signs. Because rendering uses standard Unicode codepoints with embedded fonts, the same glyph data passes unchanged through copy, paste, export, and reimport — a property that the proprietary glyph-set approach used by JSesh does not provide.
3.3 Input Modes
The editor exposes three input paradigms within a unified composition dialog:
- Single text run. A continuous glyph string for inline or simple uses.
- Individual signs. Each glyph as an independent canvas object, freely positioned.
- Three-line linked block. A glyph row paired with transliteration and translation rows, bound as a single editorial unit (see §3.5).
Glyphs are entered by drag-from-palette, click-to-select, MdC code (with Unicode admitted in the same stream), or by drag from the dictionary search panel. The palette reflows on an auto-fill CSS grid and is drag-resizable.
The dictionary search panel draws on two layers held in separate files. The first is a proprietary Hieroglyphica core compiled by the author (© M. Mazzon, all rights reserved). The second is the Wikidata-linked lemma list of the Thesaurus Linguae Aegyptiae (Berlin-Brandenburgische Akademie der Wissenschaften & Saxon Academy of Sciences and Humanities), reformatted for this tool and redistributed under CC BY-SA 4.0 (Thesaurus Linguae Aegyptiae 2024). The two are aggregated only at runtime, so the ShareAlike obligation is confined to the TLA file and does not reach the software, fonts, or the core. The TLA layer follows the Leiden Unified Transliteration, which the search bridges to the core's MdC-style transliteration so that a query in either scheme retrieves both.
3.4 Transliteration and Editorial Conventions
A dedicated transliteration keyboard surfaces the full Egyptological character set — ꜣ ꜥ ꞽ ḥ ḫ ẖ š ḳ ṯ ḏ — in lowercase and uppercase, with the combining diacritic H̱ (H + U+0331) handled inline. The full set of Leiden conventions ( [ ] ⌜ ⌝ ⟨ ⟩ { } … ) is available as a parallel palette inside the three-line dialog, addressing a long-standing pain point: epigraphers no longer leave the application to source editorial brackets from an external Unicode picker.
3.5 The Three-Line Linked Block
The three-line block is the central editorial primitive that distinguishes Hieroglyphica from glyph-only editors. A block is composed of up to three sibling text objects sharing a common identifier:
- Row 0 — Glyphs, rendered in Noto Sans Egyptian Hieroglyphs at the user-selected glyph size.
- Row 1 — Transliteration, rendered in Times italic at approximately 45% of glyph size.
- Row 2 — Translation, rendered in Arial at approximately 38% of glyph size.
Once created, siblings are linked: dragging any row repositions the block as a unit; deleting any row cascades the deletion across the block; and the linkage survives JSON save/load via custom-property persistence (blockId, blockRow). This primitive corresponds directly to the conventional epigraphic publication layout — interlinear gloss with translation underneath — and elevates that layout to a first-class object, rather than leaving it to emerge from the arrangement of free-floating elements as JSesh does.
3.6 Hybrid PDF Export
The PDF export pipeline is the feature in which Hieroglyphica most clearly advances the state of the art. Output is produced via jsPDF with addFileToVFS and addFont to embed the Noto Sans Egyptian Hieroglyphs TTF; a rasterised image of the composition is then placed as the visible layer; finally the original Unicode strings are re-emitted on top with renderingMode: 'invisible'.
The resulting PDF is:
- Visually identical to the on-screen composition;
- Searchable by Unicode codepoint in any standards-compliant PDF reader;
- Copy-pasteable — selecting glyphs in the PDF and pasting elsewhere yields the original Unicode string;
- Accessible to screen readers and to indexing tools that operate on text layers.
To the author's knowledge, no other hieroglyph editor — JSesh included — produces PDFs with all four of these properties simultaneously.
3.7 Additional Export
PNG and SVG export are also supported, with the editing grid programmatically hidden from all exports via a withGridHidden(fn) wrapper that temporarily nulls the canvas background image during rendering. Copy-to-clipboard (PNG via ClipboardItem) is wired into a custom right-click context menu, replacing the default browser handler that would otherwise capture only the transparent upper canvas.
4. Feature Comparison
Table 1 summarises the feature space along the axes most relevant to current epigraphic workflows. The table is not a claim of general superiority — JSesh's MdC parser, glyph-repertoire fidelity, and long-tested print pipeline remain reference quality, and the Hieroglyphica roadmap acknowledges those gaps in §6. The intent of the table is rather to make the comparison legible and to identify where Hieroglyphica offers a substantively different model: zero installation, complete Unicode coverage including Extended-A, a linked editorial block, and PDFs that carry a real text layer.
Table 1. Feature comparison: JSesh and Hieroglyphica.
| Capability | JSesh | Hieroglyphica |
|---|---|---|
| Installation required | Yes (Java runtime + font setup) | None — opens in any modern browser |
| Supported platforms | macOS, Windows, Linux desktop | Any device with a modern browser |
| MdC input | Yes | Yes, with mixed Unicode in the same stream |
| Drag-and-drop palette | Limited | Reflowing, resizable, click-or-drag |
| Unicode base block (U+13000–U+1342F) | Yes (proprietary glyphs) | Yes, via Noto Sans Egyptian Hieroglyphs |
| Egyptian Hieroglyphs Extended-A (U+13460–U+143FF) | Partial | Full (~4,897 signs) |
| Transliteration keyboard | External tool required | Built-in (lower, upper, H̱) |
| Leiden conventions palette | External tool required | Built-in |
| Linked three-line editorial block | No | Yes — glyph row, transliteration, translation |
| PDF export | Vector, no text layer | Hybrid: rasterised visual + invisible Unicode text layer |
| Searchable PDF output | No | Yes — Unicode codepoints retained |
| SVG export | Yes | Yes, with fonts embedded |
| PNG export | Yes | Yes, with grid suppressed at render time |
| Copy image to clipboard | Manual workaround | Built-in |
| JSON workspace persistence | Native binary format | Open JSON with custom-property persistence |
| Dictionary integration | No | Searchable, drag-to-canvas |
| Coptic support | No | Yes |
| Deployment model | Source-available desktop (GPL) | Client-side web application |
5. Use Cases for Epigraphers
Teaching. Instructors can direct students to a URL rather than walking them through installation. Worksheets become JSON files exchanged by email or learning-management system; students open them in-browser, on any device, with no licensing or runtime constraints.
Collaborative editing and peer review. Reviewers can open a colleague's JSON workspace, make changes, and return it without version-matching, licensing, or platform concerns. This contrasts sharply with the desktop model, where every collaborator must run a compatible installation.
Web publication. Compositions can be exported as SVG (with fonts embedded) for inclusion in HTML publications, or as PNG for blogs and figures, with the editing grid programmatically suppressed at render time.
Print and archival publication. PDFs carry both the visual rendering and a Unicode text layer, supporting faithful reproduction in print and downstream digital indexing — a combination not currently available in any other hieroglyphic editor.
Research workflows. Because the PDF text layer carries real Unicode, exported PDFs become first-class citizens in tools such as Zotero, in text-analysis pipelines, and in university repository indexers.
6. Limitations and Roadmap
Hieroglyphica is not feature-complete relative to JSesh on every axis, and honesty about the current gaps is essential to a fair comparison. The most significant present limitations are:
- MdC parser depth. Hieroglyphica accepts MdC input but does not yet implement the full grammar (complex insertions, ligatures, and shading conventions) at JSesh's level.
- Print typography. Fine-grained kerning, baseline shifting, and quadrat geometry adjustments are coarser than JSesh's mature typesetting model.
- Sign-list browsing. Gardiner-category navigation is functional but less polished than JSesh's category browser.
Planned work includes a deeper MdC parser pass; quadrat-aware spacing; expanded sign metadata (Gardiner number, MdC code, transliteration shown on hover); a citation export module producing BibTeX and CSL entries directly from workspace files; and integration with external sign-list lookup services. The aim is to close the typography gap while preserving the architectural advantages — web deployment, full Unicode, the linked block primitive, and the hybrid PDF — that Hieroglyphica introduces.
7. Conclusion
Hieroglyphica does not seek to displace JSesh but to move the toolkit forward in the directions that contemporary digital epigraphy actually requires. Zero-install access removes a structural barrier to teaching and peer review. Complete Unicode coverage, including the Extended-A block, brings the editor into alignment with the standards that the rest of the digital humanities ecosystem already assumes. The linked three-line block makes the conventional publication layout a first-class object rather than an emergent arrangement. And the hybrid PDF, with its embedded Unicode text layer, produces artefacts that are simultaneously print-faithful and machine-readable — a property no comparable tool currently offers.
Each of these advances is incremental on its own; together, they describe a meaningfully different editorial model. We offer Hieroglyphica as a complement to the JSesh tradition and as a contribution to the next decade of digital Egyptology.
References
Buurman, J., Grimal, N., van der Plas, D., Hallof, J., et al. (1988). Inventaire des signes hiéroglyphiques en vue de leur saisie informatique: Manuel de codage. Mémoires de l'Académie des Inscriptions et Belles-Lettres, 8. Paris: De Boccard.
Rosmorduc, S. (2014). "JSesh: a Hieroglyphic Text Editor." In Egyptology in the Present: Experiential and Experimental Methods in Archaeology, ed. C. Graves-Brown. Swansea: Classical Press of Wales.
Thesaurus Linguae Aegyptiae. (2024). Hieroglyphic/hieratic lemma list, Wikidata-linked export. Berlin-Brandenburgische Akademie der Wissenschaften & Saxon Academy of Sciences and Humanities in Leipzig. edoc.bbaw.de, docId 5402. Licensed under CC BY-SA 4.0.
Unicode Consortium. (2009). The Unicode Standard, Version 5.2.0. Egyptian Hieroglyphs block (U+13000–U+1342F). Mountain View, CA: The Unicode Consortium.
Unicode Consortium. (2022). The Unicode Standard, Version 15.0.0. Egyptian Hieroglyphs Extended-A block (U+13460–U+143FF). Mountain View, CA: The Unicode Consortium.
Leiden Conventions. Standard reference for epigraphic editorial sigla, as codified at the International Congress of Orientalists, Leiden, 1931, and revised in subsequent papyrological and epigraphic practice.