Semantic Hotkeys
Authored by Edwin Kofler
Published on 2022-7-5
Hotkeys are present in nearly all software. However, in my view, neither software users nor application developers realize the full potential of hotkeys.
I believe the full productivity of hotkeys can be unleashed through the concept of semantic hotkeys. First I’ll introduce some current problems with hotkeys, then I’ll explain how semantic hotkeys can provide a solution, and lastly, show what an implementation might look like.
Current Woes
The crux of my claims boils down to consistency. From one perspective, there are three costs of hotkeys:
- The cost of learning
- The cost of context-switching
- The cost of hotkey clobbering
The first cost is somewhat inevitable, especially for large enterprise software like Blender or Houdini. Often, this cost is barely a consideration because it is easily amortized by ample use of the hotkeys themselves. But with less frequently used or non-unique applications, it is often not worth learning the hotkeys. The cost of learning the hotkeys of a particular application is typically not proportional to the total time spent using that application.
The second cost deals with context switching among multiple applications (that have different sets of hotkeys). A high familiarity with each application’s hotkeys can mitigate this penalty somewhat, but it will always exist. As the number of applications with incongruent hotkeys increases, the cost of context switching will only remain high.
The third cost involves clobbering. This can occur in applications that provide an extension or plugin environment, such as VSCode or NeoVim. Hotkeys of extensions are often defined arbitrarily, usually just set for the sake of being set. This creates high potential for hotkey clobbering; such conflicts have to be resolved manually. Inconsistent conventions make learning new hotkeys, especially those defined within plugin-based environments, difficult.
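To make clobbering concrete, here is a minimal sketch of how conflicts between an editor and its plugins could be detected. The keymap shape (source name mapped to a dictionary of hotkey to command) and all the names in it are assumptions for illustration, not the format of any real editor.

```python
from collections import defaultdict


def normalize(hotkey):
    """Lower-case a chord and sort its modifiers, so 'Shift+Ctrl+P' equals 'ctrl+shift+p'.

    Assumes chords are written as 'Mod+Mod+Key' and that the key itself is not '+'.
    """
    *modifiers, key = hotkey.lower().split("+")
    return "+".join(sorted(modifiers) + [key])


def find_clobbered(keymaps):
    """Return {hotkey: [sources]} for every hotkey claimed by more than one source."""
    claims = defaultdict(list)
    for source, bindings in keymaps.items():
        for hotkey in bindings:
            claims[normalize(hotkey)].append(source)
    return {
        hotkey: sorted(sources)
        for hotkey, sources in claims.items()
        if len(sources) > 1
    }


# Hypothetical keymaps: one editor and two plugins.
example_keymaps = {
    "editor": {"Ctrl+P": "quick_open", "Ctrl+F": "find"},
    "plugin_a": {"ctrl+p": "print"},
    "plugin_b": {"Ctrl+Shift+P": "palette"},
}
conflicts = find_clobbered(example_keymaps)
```

A report like this only finds the conflicts; deciding which binding wins is still the manual step described above.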
Solution
An ideal solution involves constructing a set of shared keyboard shortcuts across applications. But, the meaning of those shortcuts mustn’t be domain specific; they must translate intuitively.
Introducing semantic hotkeys:
- Semantic Hotkey: A hotkey in which the keypresses that define it have semantics rooted in physicalities, geometries, or commonalities
  - Example: WASD and HJKL have semantics of orientation or directed movement based on the arrangement of said keys (physicality)
  - Example: Ctrl+Tab and Ctrl+Shift+Tab to move view right and left, respectively (commonality)
    - Tab is semantically equivalent to cycle (cycle through form fields, tabs, application windows, etc.)
    - But still slightly less semantic compared to the previous example because the modifier Shift does not consistently mean “backwards” or “opposite”
  - Example: Ctrl+[, Ctrl+(, Ctrl+{ to move to end of discrete element (ex. symbol, closed brace, etc.) (geometry)
    - The level of “curviness” of each brace may also be meaningful in some contexts
    - Still, somewhat unsemantic, as braces aren’t arranged in order on keyboard layouts
  - Example: Keys I and O may signal On and Off (geometry)
The “physicalities”, “geometries”, and “commonalities” part of semantic hotkeys ensures that the semantics are intuitive enough to mitigate the penalty of context switching.
In contrast,
- Syntactic Hotkey: A hotkey in which the keypresses that define it have semantics rooted in (partial) superficialities
  - Example: Ctrl+P means go to previous history item in many shells (rooted in English grammar)
  - Example: Ctrl+F means find item in current document (rooted in English grammar)
  - Example: Delete and PageUp
    - These are dedicated keys, so their semantic meaning is inherently superficial
    - Not all keyboards have these buttons so their utility is questionable
Confusingly, the shortcut Ctrl+P may mean print, previous, or project. Furthermore, users’ native tongue, keyboard, or keyboard layout may differ; these differences affect the intuitiveness and sensibility of the shortcut.
That is, more generally, semantic hotkeys are more resilient to any changes of the Human Computer Interface itself.
- Example: Changing the keyboard layout from QWERTY to Dvorak or changing the keyboard language from English to German will not change the meaning or position of the shortcut.
- Example: Replacing the keyboard with, say, gloves that have acceleration, orientation, etc. tracking could still yield similar “shortcuts” in whatever form they exist
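The layout example can be sketched in a few lines, assuming shortcuts are bound to physical key positions rather than to the characters printed on them. The position names (`KeyH` and so on, named after the QWERTY legend at that position) and the binding table are illustrative assumptions; the Dvorak characters listed are the ones that sit at those four home-row positions.

```python
# Physical positions, named after their QWERTY legends (assumed naming scheme).
QWERTY = {"KeyH": "h", "KeyJ": "j", "KeyK": "k", "KeyL": "l"}
DVORAK = {"KeyH": "d", "KeyJ": "h", "KeyK": "t", "KeyL": "n"}

# Hypothetical semantic binding: physical position -> direction.
POSITION_BINDINGS = {
    "KeyH": "left",
    "KeyJ": "down",
    "KeyK": "up",
    "KeyL": "right",
}


def legend_for(direction, layout):
    """Return the character printed on the key that is bound to `direction`."""
    for position, bound_direction in POSITION_BINDINGS.items():
        if bound_direction == direction:
            return layout[position]
    raise KeyError(direction)


def directions_left_to_right():
    """The bound directions in physical left-to-right order; independent of layout."""
    return [POSITION_BINDINGS[p] for p in ("KeyH", "KeyJ", "KeyK", "KeyL")]
```

Under this assumption, switching layouts changes only the legend returned by `legend_for`; the four keys stay in the same row, in the same order, with the same meaning.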
Lastly, incorporating these shortcuts encourages interface designers to think about making UI paths more explicit and cohesive, eventually improving UX.
Implementation
An implementation may start by creating a program that outputs application-specific hotkey configuration (binding key to action) from a common configuration file.
A reasonable implementation assumes that each applicable application supports:
- Custom mapping from hotkey to action
- Chorded hotkeys
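A minimal sketch of such a generator might look like the following. Everything named here is hypothetical: the action names, the two example applications, their command names, and both output formats are stand-ins, not the configuration syntax of any real program.

```python
import json

# Hypothetical common configuration: semantic action -> hotkey.
COMMON = {
    "navigate_left": "h",
    "navigate_down": "j",
    "navigate_up": "k",
    "navigate_right": "l",
}

# Hypothetical per-application tables: semantic action -> that application's command.
APP_COMMANDS = {
    "example_editor": {
        "navigate_left": "cursor.left",
        "navigate_down": "cursor.down",
        "navigate_up": "cursor.up",
        "navigate_right": "cursor.right",
    },
    "example_wm": {
        "navigate_left": "focus left",
        "navigate_down": "focus down",
        "navigate_up": "focus up",
        "navigate_right": "focus right",
    },
}


def bindings_for(app):
    """Pair each shared hotkey with the command `app` uses for that action."""
    commands = APP_COMMANDS[app]
    return [(COMMON[action], commands[action]) for action in COMMON if action in commands]


def render_plain(app):
    """Emit one 'bind <key> <command>' line per binding (made-up line format)."""
    return "\n".join(f"bind {key} {command}" for key, command in bindings_for(app))


def render_json(app):
    """Emit the same bindings as a JSON list of {key, command} objects (made-up schema)."""
    entries = [{"key": key, "command": command} for key, command in bindings_for(app)]
    return json.dumps(entries, indent=2)
```

The point of the split is that `COMMON` is edited once, while each renderer only knows how to spell a binding for its own application.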
The keys h, j, k, l are a great starting point - they correspond to directions. More specifically, they move a cursor, selection, or item directionally within a particular context. For example:
- in a Terminal Text Editor, it may move a Cursor within a Buffer
- in a Window Manager, it may move a Window within a Virtual Desktop
- in a Terminal Multiplexer, it may move a Pane within a Window
We can generalize to include the following actions:
- Selection navigate within context
- Selection move within context
- Selection move to new context
- Context navigate relatively
- Context move relatively
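One way to model these five actions is to pair each with a modifier prefix and combine it with a direction key. The modifier assignments below are purely an assumption for illustration; this post does not prescribe which modifier maps to which action.

```python
from enum import Enum


class Action(Enum):
    SELECTION_NAVIGATE = "selection navigate within context"
    SELECTION_MOVE = "selection move within context"
    SELECTION_MOVE_TO_CONTEXT = "selection move to new context"
    CONTEXT_NAVIGATE = "context navigate relatively"
    CONTEXT_MOVE = "context move relatively"


DIRECTIONS = {"h": "left", "j": "down", "k": "up", "l": "right"}

# ASSUMPTION: these modifier prefixes are illustrative only.
MODIFIERS = {
    Action.SELECTION_NAVIGATE: (),
    Action.SELECTION_MOVE: ("Shift",),
    Action.SELECTION_MOVE_TO_CONTEXT: ("Ctrl", "Shift"),
    Action.CONTEXT_NAVIGATE: ("Ctrl",),
    Action.CONTEXT_MOVE: ("Alt",),
}


def chord(action, direction):
    """Build the chord for an action in a direction, e.g. 'Ctrl+Shift+k'."""
    key = next(k for k, d in DIRECTIONS.items() if d == direction)
    return "+".join(MODIFIERS[action] + (key,))


def full_keymap():
    """Map every generated chord to its (action, direction) pair."""
    return {
        chord(action, direction): (action, direction)
        for action in Action
        for direction in DIRECTIONS.values()
    }
```

Because each action has a distinct modifier prefix, the five actions times four directions yield twenty chords with no collisions.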
There are some other challenges:
- Contexts can sometimes be treated as sections (sometimes simultaneously)
- Semantics may need to be more fine-tuned to fit into concrete categories
- Application shortcuts may nest
- A terminal app (raw mode) may be within the context of a graphical terminal emulator, a terminal multiplexer, and a TUI for a virtual machine manager (ex. QEMU Monitor)
- Applying this system to programs with slightly different behaviors
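The nesting challenge can be illustrated with a small sketch. It makes the simplifying assumption that layers are ordered from outermost to innermost and that the outermost layer binding a chord consumes it; the layer names and bindings are made up.

```python
def dispatch(chord, layers):
    """Return (layer name, action) for the outermost layer that binds `chord`.

    `layers` is a list of (name, keymap) pairs ordered outermost first.
    Returns None when no layer binds the chord.
    """
    for name, keymap in layers:
        if chord in keymap:
            return name, keymap[chord]
    return None


def shadowed(layers):
    """List (chord, inner layer, outer layer) for bindings an outer layer hides."""
    first_owner = {}
    hidden = []
    for name, keymap in layers:
        for chord in keymap:
            if chord in first_owner:
                hidden.append((chord, name, first_owner[chord]))
            else:
                first_owner[chord] = name
    return hidden


# Hypothetical nesting, outermost first.
example_layers = [
    ("terminal emulator", {"Ctrl+Shift+l": "next tab"}),
    ("terminal multiplexer", {"Ctrl+l": "pane right"}),
    ("terminal app", {"l": "cursor right", "Ctrl+l": "redraw"}),
]
```

Here `shadowed(example_layers)` reports that the terminal app's `Ctrl+l` binding is never reached, which is exactly the kind of conflict a semantic system would need to plan around.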
This blog post mainly exists to explain the problem and solution - I don’t have a full “semantic shortcut system”. But applying such a system will undoubtedly improve productivity, at least for me. I’m doing some preliminary work at github.com/semantic-hotkeys to apply this idea to everyday applications and websites. When it’s ready, you may find it useful, or employ a similar system in a program of your own.