p5.strands Overview

Shader programming is an area of creative coding that can feel like a dark art to many. People share lots of stunning visuals that are created with shaders, but shaders feel like a completely different way of coding, requiring you to learn a new language, pipeline, and paradigm.

p5.strands hopes to address all of those issues by letting you write shader snippets in JavaScript and compiling it to OpenGL Shading Language (GLSL) for you!

If you’re looking to start writing p5.strands shaders yourself, take a look at our p5.strands tutorial or the examples in the reference for the p5.js base shaders. The rest of this document will describe how p5.strands works behind the scenes. If you are interested in contributing to the p5.strands codebase, read on!

Code processing pipeline

At its core, p5.strands works in four steps:

The user writes a function in pseudo-JavaScript.
p5.strands transpiles that into actual JavaScript and rewrites aspects of your code.
The transpiled code is run. Variable modification function calls are tracked in a graph data structure.
p5.strands generates GLSL code from that graph.

Why pseudo-JavaScript?

The code the user writes when using p5.strands is mostly JavaScript, with some extensions. Shader code heavily encourages use of vectors, and the extensions all make this as easy in JavaScript as in GLSL.

In JavaScript, there is not a vector data type. In p5.strands, you create vectors by creating array, e.g. myVec = [1, 0, 0]. You can’t use actual arrays in p5.strands; all arrays are fixed-size vectors.
In JavaScript, you can only use mathematical operators like + between numbers and strings, not with vectors. In p5.strands, we allow use of these operators between vectors.
In GLSL, you can do something called swizzling, where you can create new vectors out of the components of an existing vector, e.g. myvec.xy, myvec.bgr, or even myvec.zzzz. p5.strands adds support for this on its vectors.

When we transpile the input code, we rewrite these into valid JavaScript. Array literals are turned into function calls like vec3(1, 0, 0) which return vector class instances. These instances are wrapped in a Proxy that handles property accesses that look like swizzles, and converts them into sub-vector references. Operators between vectors like a + b are rewritten into method calls, like a.add(b).

If a user writes something like this:

baseMaterialShader().modify(() => {
  const t = uniformFloat(() => millis())
  getWorldInputs((inputs) => {
    inputs.position += [20, 25, 20] * sin(inputs.position.y * 0.05 + t * 0.004)
    return inputs
  })
})

…it gets transpiled to something like this:

baseMaterialShader().modify(() => {
  const t = uniformFloat('t', () => millis())
  getWorldInputs((inputs) => {
    inputs.position = inputs.position.add(strandsNode([20, 25, 20]).mult(sin(inputs.position.y.mult(0.05).add(strandsNode(t).mult(0.004)))))
    return inputs
  })
})

The program graph

The overall structure of a shader program is represented by a control-flow graph (CFG). This divides up a program into chunks that need to be outputted in linear order based on control flow. A program like the one below would get chunked up around the if statement:

// Start chunk 1
let a = 0;
let b = 1;
// End chunk 1

// Start chunk 2
if (a < 2) {
  b = 10;
}
// End chunk 2

// Start chunk 3
b += 2;
return b;
// End chunk 3

flowchart TD

subgraph chunk1
  a0[let a = 0]
  b0[let b = 1]
end

subgraph chunk2
  ifstart[if a < 2]
  b1[b = 10]
  ifend[end if]
end

subgraph chunk3
  b2[b += 2]
  ret[return b]
end

chunk1-->chunk2
chunk2-->chunk3

We store the individual states that variables can be in as nodes in a directed acyclic graph (DAG). This is a fancy name that basically means each of these variable states may depend on previous variable states, and outputs can’t feed back into inputs. Each time you modify a variable, that represents a new state of that variable. For example, below, it is not sufficient to know that c depends on a and b; you also need to know which version of b it branched off from:

let a = 0;
let b = 1;
b += 1;
let c = a + b;
return c;

We can imagine giving each of these states a separate name to make it clearer. In fact, that’s what we do when we output GLSL, because we don’t need to preserve variable names.

let a_0 = 0;
let b_0 = 1;
let b_1 = b_0 + 1;
let c_0 = b_1 + a_0;
return c_0;

When we generate GLSL from the graph, we start from the variables we need to output, the return values of the function (e.g. c_0 in the example above.) From there, we can track dependencies through the DAG (in this case, b_1 and a_1). Each dependency has their own dependencies. We make sure we output the dependencies for a node before the node itself.

flowchart TB

c_0-->c_0_plus((+))
c_0_plus-->b_1
c_0_plus-->a_0
b_1-->b_1_plus((+))
b_1_plus-->b_0
b_1_plus-->n1_0[1]
b_0-->b1_1[1]
a_0-->n0[0]

Each node in the DAG belongs to a chunk in the CFG. This helps us keep track of key points in the code. If we need to, for example, generate a temporary variable at the end of an if statement, we can refer to that CFG chunk rather than whatever the last value node in the if statement happens to be.

Control flow

p5.strands has to convert any control flow that should show up in GLSL into function calls instead of JavaScript keywords. If we don’t, they run in JavaScript, and are invisible to GLSL generation. For example, if you had a loop that runs 10 times that adds 1 each time, it would output the add 1 line 10 times rather than outputting a for loop.

Input	Output without converting control flow
`let a = 0; for (let i = 0; i < 10; i++) { a += 2; } return a;`	`float a = 0.0; a += 2.0; a += 2.0; a += 2.0; a += 2.0; a += 2.0; a += 2.0; a += 2.0; a += 2.0; a += 2.0; a += 2.0; return a;`

However, once we have a function call instead of real control flow, we also need a way to make sure that when the users’ javascript subsequently references nodes that were updated in the control flow, they properly reference the modified value after the if or for and not the original value.

Input Transpiled without updating references States without updating references

Input	Transpiled without updating references	States without updating references
`let a = 0; for (let i = 0; i < 10; i++) { a += 2; } let b = a + 1; return b;`	`let a = 0; p5.strandsFor( () => 0, (i) => i.lessThan(10), (i) => i.add(1), () => { a = a.add(2); } ); let b = a.add(1); return b;`	`let a_0 = 0; p5.strandsFor( // ... ) // At this point, the final state of a is a_n // ...but since we didn't actually run the loop, // b still refers to the initial state of a! let b_0 = a_0.add(1); return b;`

let a = 0;
for (let i = 0; i < 10; i++) {
  a += 2;
}
let b = a + 1;
return b;

let a = 0;
p5.strandsFor(
  () => 0,
  (i) => i.lessThan(10),
  (i) => i.add(1),

  () => {
    a = a.add(2);
  }
);
let b = a.add(1);
return b;

let a_0 = 0;

p5.strandsFor(
  // ...
)
// At this point, the final state of a is a_n

// ...but since we didn't actually run the loop,
// b still refers to the initial state of a!
let b_0 = a_0.add(1);
return b;

For that, we make the function calls return updated values, and we generate JS code that assigns these updated values back to the original JS variables. So for loops end up transpiled to something like this, inspired by the JavaScript reduce function:

Input Transpiled with updated references

Input	Transpiled with updated references
`let a = 0; for (let i = 0; i < 10; i++) { a += 2; } let b = a + 1; return b;`	`let a = 0; const outputState = p5.strandsFor( () => 0, (i) => i.lessThan(10), (i) => i.add(1), // Explicitly output new state based on prev state (i, prevState) => { return { a: prevState.a.add(2) }; }, { a } // Pass in initial state ); a = outputState.a; // Update reference // b now correctly is based off of the final state of a let b = a.add(1); return b;`

let a = 0;
for (let i = 0; i < 10; i++) {
  a += 2;
}
let b = a + 1;
return b;

let a = 0;

const outputState = p5.strandsFor(
  () => 0,
  (i) => i.lessThan(10),
  (i) => i.add(1),

  // Explicitly output new state based on prev state
  (i, prevState) => {
    return { a: prevState.a.add(2) };
  },

  { a } // Pass in initial state
);
a = outputState.a; // Update reference

// b now correctly is based off of the final state of a
let b = a.add(1);
return b;

We use a special kind of node in the DAG called a phi node, something used in compilers to refer to the result of some conditional execution. In the example above, the state of a in the output state is represented by a phi node.

In the CFG, we surround chunks producing phi nodes by a BRANCH and a MERGE chunk. In the BRANCH chunk, we can initialize phi nodes, sometimes giving them initial values. In the MERGE chunk, the value of the phi node has stabilized, and other nodes can use them as a dependency.

GLSL generation

GLSL is currently the only output format we support, but p5.strands is designed to be able to generate multiple formats. Specifically, in WebGPU, they use the WebGPU Shading Language (WGSL). Our goal is that your same JavaScript p5.strands code can be used in WebGL or WebGPU without you having to do any modifications.

To support this, p5.strands separates out code generation into backends. A backend is responsible for converting each type of CFG chunk into a string of shader source code. We currently have a GLSL backend, but in the future we’ll have a WGSL backend too!