Skip to content

Commit

Permalink
(WIP) Document engine architecture, design, key concepts
Browse files Browse the repository at this point in the history
  • Loading branch information
eyelidlessness committed Jan 17, 2025
1 parent dd5ee03 commit c9cdd48
Showing 1 changed file with 111 additions and 0 deletions.
111 changes: 111 additions & 0 deletions packages/xforms-engine/ARCHITECTURE.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,111 @@
# `@getodk/xforms-engine`: architecture, design, and key concepts

## Guiding principles and assumptions

1. `@getodk/xforms-engine` (interchangeably referred to as "the engine") is conceived as a software implementation of the data and computational models defined by the [ODK XForms Specification](https://getodk.github.io/xforms-spec/).

2. The engine is conceptually similar to [JavaRosa](https://github.com/getodk/javarosa), which serves as an engine for [ODK Collect](https://github.com/getodk/collect). Alignment with JavaRosa's behavior and functionality is a primary goal, and that project is a primary point of reference, e.g.:

- for understanding, interpreting and disambiguating important details specified by ODK XForms

- for prioritization of feature support

- for identification of _bug-for-bug_ compatibility goals

In **most cases**, JavaRosa is assumed to be the "source of truth" when answering questions about spec and requirements. Cases where we deviate tend to involve increased spec support; in all cases, interoperability between the two ODK form implementations is of utmost importance.

3. ODK forms are typically authored in the [XLSForms](https://xlsform.org/en/) format, which simplifies many ODK XForms concepts for form designers. The format is an _abstraction_ over ODK XForms. For the purposes of the engine:

- The XLSForms format establishes requirements and priority of features. ODK XForms features which _can be expressed_ in an XLSForm are higher priority than those which cannot.

- The format provides guidance—_but is not a source of truth_—on questions terminology, feature semantics, user concepts, etc. As an abstraction, XLSForms ultimately defers to ODK XForms as its underlying specification, and the engine does in kind. Especially in terms of naming and spec references, the engine's internals tend to stick close to ODK XForms; XLSForms concepts are more appropriate at the package boundary (usually called the "client interface").

4. The engine is intentionally designed to be "client agnostic", in terms of:

- Presentation and interaction: the engine supports integration and presentation as a conventional form UI on the web (hence the name "ODK Web Forms"); programmatically (as is the case in the `@getodk/scenario` integration test client); hypothetically as a graphical interface for non-web (e.g. mobile or desktop native) platforms, or even as a command line other text-first interface.

- Rendering technology: the engine is not coupled to any particular UI library or framework (web or otherwise). It is a goal of the engine to support integration with a client's choice of component framework, or even its own bespoke presentation layer.

- Model of state over time: the engine provides a minimal interface for clients to integrate their own model for observing state changes as they occur. The behavior of this interface is treated as opaque within the engine. It's generally assumed that clients will use and integrate an implementation of reactivity to handle state changes. But the interface is fully optional (for instance it is unused by the vast majority of tests in `@getodk/scenario`).

5. The engine has a _synchronous, consistent computational model_:

- **Synchronous to read:** Once the engine has initialized a form's state, a client may access any of that state **by reading object properties**. These properties may be implemented internally by `get` accessors and/or `Proxy` traps, but they always produce a _synchronous value_ (i.e. not a `Promise` or other access mechanism which unblocks the main thread's event loop).

- **Synchronous to write:** APIs provided by the engine for clients to make any state change will perform that state change in the same event loop tick.

- **Consistent (on read, after every write):** The engine ensures that any computations which must be performed to produce state to a client are performed before the client accesses that state. When a client issues any state change to the engine, any state subsequently read by that client will be a product of the complete and consistent result of any computations dependent (directly or indirectly) on that state change.

The engine _may_ defer certain computations (e.g. to optimize performance), but any deferred computations **will** be performed before a client acccesses the computation's result (or any other computation which depends on it). The engine may perform deferred computations either on client demand (e.g. in a `get` accessor or `Proxy` trap) or in the background (e.g. asynchrony which is not observable by a client).

- (Likely to change) **Every write method returns the complete form state**: early in the engine's design, we established a _convention_ for client-facing write method signatures where any write method for any aspect of a form's state will return the complete state of the form. This was intended as an _implied contract_ of the engine's guarantees of synchrony and consistency. While this has some philosophical value, we've found it doesn't have much _practical value_. So we will probably eliminate this convention, probably in favor of a more idiomatic convention like write methods returning the directly written state.

## Engine flow (broad strokes)

1. **`initializeForm` (client entrypoint):** accepts a `FormResource` (XForms XML string, or URL reference to XForms XML resource), and a client's options configuring certain form init/runtime behavior.

2. **Retrieve form definition (via client-configured `fetchFormDefinition`):** a `fetch`-like function. If the client provides the `FormResource` as a URL, the engine will use this option to request the form definition.

3. **Initial/pre-parse (engine-internal `XFormDOM`):**

- Parses the form definition from its XForms XML string input into a traversable tree.
- Performs some minor pre-processing to normalize certain ambiguous structures specified by ODK XForms.
- Identifies key aspects of the form definition for reference in later stages of parsing.

4. **Resolve form attachments (via client-configured `fetchFormAttachment`, `FormAttachmentResource`):** a `fetch`-like function. Form attachment references are identified in the `XFormDOM` structure. These references are then retrieved by the client-provided `fetchFormAttachment` function. The retrieved resources are represented internally by a `FormAttachmentResource`, which may be referenced for further parsing or other form functionality.

5. **Parse secondary instances (`FormAttachmentResource`, `SecondaryInstanceDefinition`, etc):** Forms may reference secondary instances in computations, generally to populate `<itemset>` items (for `<select>`, `<select1>`, `<odk:rank>` controls). Each secondary instance is parsed into a `SecondaryInstanceDefinition`, which is a common internal representation suitable for evaluation by references in XPath expressions.

- [Internal secondary instances](https://getodk.github.io/xforms-spec/#secondary-instances---internal) are referenced and parsed from the `XFormDOM` structure.

- [External secondary instances](https://getodk.github.io/xforms-spec/#secondary-instances---external) are referenced from their respective `FormAttachmentResource`s, then parsed from their source format (XML, CSV, GeoJSON).

6. **Parse form body (`XFormDefinition` -> `BodyDefinition` -> `BodyElementDefinition`\*):** the body of a form definition (representing its "view", or user-facing structure, controls, supporting text labels/hints) parsed into an internal `BodyDefinition` representation, collecting key information about how a form should be presented to a user. The `BodyDefinition` contains references to parsed descendants of the body, as concrete implementations of the abstract `BodyElementDefinition` base class. Each `BodyElementDefinition` implementation provides details about the specific control or view structure which are used to direct behavior for nodes in the form's model which they reference.

While `BodyDefinition` is _parsed as a tree_, it is ultimately used to produce a `Map` where each `BodyElementDefinition` is keyed by its reference to a corresponding model node.

`BodyElementDefinition` is used to represent body elements which are either:

- Form controls (`<input>`, etc)
- Structural elements (`<group>`, `<repeat>`) containing those controls (or other sub-structures)

Each `BodyElementDefinition` may _also reference_ parsed representations of _supporting elements_:

- Representing text (`<label>`, `<hint>`)
- Representing a control's items (`<item>`, `<itemset>`)

These supporting elements are represented by different base classes associated with the semantics they provide to their associated control or body structure (e.g. as computational expressions).

7. **Parse a coherent model of the form definition (`XFormDefinition` -> `ModelDefinition`):**

- **`ModelDefinition` -> `ModelBindMap` -> `BindDefinition`\*:** Each of a form definition's `<bind>` elements is parsed into a `BindDefinition`, representing its type information and (most of) the computations associated with the node. As with `BodyDefinition`, the `BindDefinition`s are then collected into a `Map` keyed by their model node references.

- **`ModelDefinition` -> `NodeDefinition`\*:** A form definition's primary `<instance>` subtree is parsed into a tree of `NodeDefinition`s, defining the core structure of a form instance's state. `NodeDefinition` is also an abstract base class, with several more specific concrete implementations associated with key semantics about each node.

Among those key semantics, each `NodeDefinition` is associated with the `BindDefinition` referencing it (`NodeDefinition.bind`; if one does not exist, a definition will be constructed for it with default values).

If a form's model node is also referenced by a body element, its `NodeDefinition` representation will have a reference back to that body element's (as `NodeDefinition.bodyElement: BodyElementDefinition`; this property will be `null` if the node is model-only). This association with a body element also influences which concrete `NodeDefinition` implementation is used for its representation.

- `RootDefinition`: always represents the root node of the form's primary instance model. A form's `RootNode` it is not directly associated with a `BodyElementDefinition`, but it is conceptually linked to the body itself (being the "root" of the form's body/view)

- `SubtreeDefinition`: represents descendants of the `RootNode` which contain other descendants. Each `SubtreeDefinition` _may or may not_ be associated with a `BodyElementDefinition` corresponding to a `<group>` element. A subtree node's descendants are parsed recursively.

- `RepeatRangeDefinition`: represents _one or more contiguous nodes_ referenced by a `<repeat>` in the form definition's body, and always has:

- One `RepeatTemplateDefinition`: represents _either_ a model node explicitly designated as a `jr:template` (if one exists) _or_ the first node in contiguous sequence.

- Any number of `RepeatInstanceDefinition`\*: each representing one contiguous model node which is _NOT_ explicitly designated as a `jr:template`.

Descendants of each `RepeatTemplateDefinition` and `RepeatInstanceDefinition` are parsed recursively.

- `LeafNodeDefinition`: represents a form's primary instance leaf nodes (currently only elements with no element children; we expect to use the same to represent attributes as well). Each `LeafNodeDefinition` _may or may not_ be associated with a `BodyElementDefinition` (specifically a `ControlDefinition`, as leaf nodes may only be associated with form controls).

- `NoteNodeDefinition` (special case): represents nodes we treat as a special "note" type, where a `LeafNodeDefinition` is associated with an `<input>` (`bodyElement: InputControlDefinition`), and where its `BindDefinition` has a `readonly` expression which will _always produce a `true` value_.

- **`ModelDefinition` -> `ItextTranslationsDefinition`:** a form's translations are parsed into a set of subtree structures, which support evaluation of `jr:itext(itextId)` XPath expressions. (They share the same underlying implementation as `SecondaryInstanceDefinition`s.)

8. **Initialize a form's primary instance state:** This warrants an entirely new section. See below!

## Primary instance state (client interface / engine-internal implementation)

[...]

0 comments on commit c9cdd48

Please sign in to comment.