April 2023
server-reason-react
is an implementation of react-dom/server
and some of React's internals in OCaml. Its purpose is to render HTML markup from the server for a Reason React application natively.
This post provides an overview of some of the concepts of the library, what it means to render React in OCaml, how we use it at Ahrefs.com, a benchmark against a Node equivalent, and the future of all of this.
If you are not familiar with Reason or OCaml, it's fine. The purpose of this post is not to convince you to learn or try those languages but rather to share something I built that I'm passionate about.
I will try to explain most concepts for any developer with experience in React. Don't be scared by any niche language/technology mentioned here since I will introduce them as we go.
The first piece of context to follow this post is to introduce Reason and OCaml for those who are not familiar with them:
Reason is a language built on top of OCaml, they share the compiler, the type-checker, and most tooling around. It was created by Jordan Walke to allow JavaScript developers to enjoy OCaml. Back in the days (when TypeScript was not as popular) many members from the JavaScript community were interested in Reason.
OCaml is a type-safe and robust language that empowers a functional style with powerful inference. It provides a unique balance of performance, maintainability, and reliability.
Despite being a niche programming language, OCaml has made an impact on modern programming languages and language tooling. The first versions of Rust were implemented in OCaml and Meta’s Flow JavaScript type checker as well, also used as the basis for ReScript an offshoot programming language which offers compiled statically typed code into JavaScript. Nowadays, OCaml is a solid option for a general programming language backed by trade firms, blockchains and SaaS companies.
I often refer to Reason/OCaml interchangeably since they are the same for me. Reason has a different syntax that should feel familiar to JavaScript developers, but the “engine” is the same as OCaml.
At Ahrefs, I'm a Software engineer working on tooling. I work on maintaining the design system, styled-ppx, helping with Melange and now on server-reason-react.
Ahrefs is a comprehensive SEO toolset that provides data-driven insights and competitive analysis for digital marketers and businesses. With one of the best crawlers of the internet, we recently launched a search engine called Yep.com with the goal of providing a better revenue-sharing model.
It's written mainly in OCaml and Reason (there’s a bit of Rust and C++/D) with a monorepo containing more than 1M lines of code. We value type-safety, maintainability and performance.
The frontend parts of Ahrefs are the dashboard (app.ahrefs.com), the public website (ahrefs.com), a tiny website called wordcount.com and Yep.com.
One of the initial problems with our main React application (app.ahrefs.com) was the client-side rendering.
The canonical example of this problem happens on the Header. We had to load a bunch of contextual data such as user, permissions, billing, tokens and theme. Running all of those requests on the client creates a request waterfall and often our users use more than one tool at the same time, where navigating between those is not an optimal experience. Navigation would be slow, “flashing” the header (and the entire app) while mounting each page each time can be annoying and a waste of the same requests.
To address those issues, we implemented server-side rendering for the static parts: a “shell” app and injecting the data as regular scripts
with serialized JSON. We used Tyxml, a library that generates HTML from OCaml, for creating these static templates.
It improved the user experience by a lot, since the HTML served contains the static part of the app, so the browser can cache it locally. But, we quickly encountered a new problem: having two separate implementations of the same component. The client version, written in React and the server version, written with Tyxml.
The resulting tech was difficult to maintain, deterring developers from making significant changes. While this solution was adequate for some time, we sought a more scalable approach for our other frontends.
To address some of those problems, Javi (one of my coworkers) experimented to mix Tyxml and ReasonReact and published the experiment in his blog: javierchavarri.com/react-server-side-rendering-with-ocaml
In addition to server-side rendering (rendering in each request), we needed a different strategy for our static pages, such as Server-Side Generation (what Next.js calls Incremental Static Regeneration). Running a rendering step at build-time and populate state on each request.
Ultimately, our goal was to use the same client components from our design system in these server templates.
The common solution to this problem is to go with a node-based solution, such as Next.js, gatsby or Astro. After using gastby and react-snap for a few years and a proof of concept with Next, we weren’t satisfied with any approach.
Running the pre-rendering step during the build and serving them via our backend in OCaml.
This solution couples the build process (running in CI) with the runtime (the production and staging servers). Increased the complexity on how we were serving those static files, and how those static files became polluted with data (also known as hydration).
The other solution is to serve the application by Node and using OCaml as API.
Placing node in front is a very common architecture that works wonderful in some teams, but there are still some drawbacks when using Node in the backend for Ahrefs.
Most of our API consists of fetching from different data storages and shuffle this data to serve it. Some of those problems make it non ideal for us:
After discussing with my coworkers and specially Javi with his explorations on Tyxml and ReasonReact, an idea appeared: it might be possible to use the exact same code from the client in the server, if we re-implement or stub some stuff here and there.
We could have the same approach on how the JavaScript ecosystem do, running the exact same React components on the server but with the twist of running native code on the server while compiled JavaScript on the client.
How hard it would be?
The Reason parser comes with the JSX transformation out of the box, so JSX expressions are compiled down to function calls. No need to use babel, esbuild or vite in Reason.
reason-react
are a set of bindings to the JavaScript version of React. So, a tiny layer for the type-system to tell the Reason code the right interface of those hooks, createElement calls, and any React API, similar on what .d.ts
modules are in TypeScript or FFI in Rust.
The repository lives here https://github.com/reasonml/reason-react
server-reason-react
is a re-implementation of ReactDOMServer (react-dom/server
) and some parts of React itself to generate markup from a React component but written in OCaml.
More precisely ReactDOM.renderToString
and ReactDOM.renderToStaticMarkup
To make sure this ReactDOM can represent all kinds of nodes in React: components, elements, Fragments, Providers, Consumers. Here's the node
variant type that represents all of them:
type element =
| Empty
| Text(string)
| List(array(element))
| Fragment(element)
| Html_element(string, attributes, list(element))
| Component(unit => element)
| Provider(element)
| Consumer(element)
| Suspense({ children: element, fallback: element })
type element =
| Empty
| Text(string)
| List(array(element))
| Fragment(element)
| Html_element(string, attributes, list(element))
| Component(unit => element)
| Provider(element)
| Consumer(element)
| Suspense({ children: element, fallback: element })
It's a variant type (aka union or ADT) and it matches with what React internally uses with Symbols. It's a recursive type (rec
) since it references itself.
The actual string generation is done by traversing the component tree to generate specific HTML representation of each node. In each "node" takes into account a few details such as serialize DOM attributes, process inline styles, encode HTML and a few React particularities (such as dangerouslySetInnerHTML
or hydration hacks).
/* A pseudo implementation of renderToString to ilustrate the mapping between
components and HTML representation.
Note: `++` is a string concatenation operator */
let rec render_to_string = node =>
/* ... */
switch (node) {
/* ... */
| Html_element({ tag, attributes, _ }) when Html.is_self_closing_tag(tag) =>
"<" ++ tag ++ attributes_to_string(tag, attributes) ++ "/>"
| Html_element({ tag, attributes, children }) =>
"<" ++ tag ++ attributes_to_string(tag, attributes) ++ ">" ++
List.iter(render_to_string, children) ++
"</" ++ tag ++ ">"
| Component(component) => render_to_string(component())
| Text(text) => Html.encode(text)
| List(list) => Array.iter(render_to_string, list)
| Consumer(element) | Provider(element) => render_element(element)
| Suspense({ children: _, fallback }) => render_element(fallback)
}
/* A pseudo implementation of renderToString to ilustrate the mapping between
components and HTML representation.
Note: `++` is a string concatenation operator */
let rec render_to_string = node =>
/* ... */
switch (node) {
/* ... */
| Html_element({ tag, attributes, _ }) when Html.is_self_closing_tag(tag) =>
"<" ++ tag ++ attributes_to_string(tag, attributes) ++ "/>"
| Html_element({ tag, attributes, children }) =>
"<" ++ tag ++ attributes_to_string(tag, attributes) ++ ">" ++
List.iter(render_to_string, children) ++
"</" ++ tag ++ ">"
| Component(component) => render_to_string(component())
| Text(text) => Html.encode(text)
| List(list) => Array.iter(render_to_string, list)
| Consumer(element) | Provider(element) => render_element(element)
| Suspense({ children: _, fallback }) => render_element(fallback)
}
To be sure it supports full render and hydration the same way as React does, I migrated all tests from the ReactDOM's server test suite.
With all of this solved, on the server rest of React such as hooks, portals or any other APIs were trivial in comparision. Most hooks do nothing: useEffect don’t run. useState is only setting the initial value, all setStates are ignored. useCallback creates the function once (and probably never called). useMemo runs and gets the value.
One difference from my implementation is about doing a single pass on the React tree, while React.js does multiple passes, but it’s on them to change it: https://github.com/facebook/react/issues/25318
With all of this done, we can return those strings as HTTP responses, and we have a server-side rendering solution working!
The main question I got while I was implementing this was about performance.
The theory said that a compiled language (OCaml) should over-perform an interpreted one (JavaScript in node), even when v8 (the engine that node runs on top) is working tirelessly to optimise it. There have been many benchmarks to prove it, but does it holds true here?
I was curious too, but, as explained above it was not the impetus for server-reason-react creation. In fact the implementation isn’t very optimised, and not even profiled. It does try to minimise allocations and CPU cycles but there hasn’t been any performance work so far.
I made a small micro-benchmark before pushing to production to ensure there wasn’t any regression and prove how much gain are we talking about (against a Node and bun solutions):
We compared the performance of three stacks in terms of latency, requests per second (req/s), and transfer rate.
Req/s | Avg Latency | Transfer/s | |
---|---|---|---|
Node.js (with Express) | 7.2k | 30.98ms | 8.25MB |
Bun | 10.3k | 24.32ms | 17.5MB |
server-reason-react (with OCaml) | 64.8k | 6.21ms | 155MB |
It shows an approximate improvement of x10 over Node and x6 over Bun.
All tests are run on local under my MacBook Pro (13-inch, M1, 2020), and the benchmark data comes from a demo repository: https://github.com/ml-in-barcelona/fullstack-reason-react-demo/tree/main/benchmark.
Even thought the benchmark is not very scientific and it can lie in terms of micro-benchmarking it shows the potential of this approach and validates the theory.
It’s deployed to all users at app.ahrefs.com since February and planning to use it for all frontends. Although, it’s not ready for consumption. The lack of documentation, the shape of the libraries and some missing APIs makes it hard to use and I don’t recommend relying on it.
It's open-source and the repository lives in GitHub, check it out!
Even with all of this, if you are still interested feel free to contact me in Discord or Twitter, and open issues on GitHub.
I got a little deep into some of the details about how it works for the curious minds, but the implementation doesn’t matter much. What matters are the consequences:
The same code gets compiled to Native and JavaScript (thanks entirely to Melange). As far as I know, it can’t be done in other native languages such as Rust or Go, there are many of similar solutions but they don't cross-compile the same code.
It enables full-stack applications written in Reason. It’s not a matter of "one language to rule them all", but rather simplicity.
Simplicity on shared data types, one learning experience, one toolchain, one set of rules, etc. Even thought sharing code has some detractors (and with good reasons) once you face the troubles of maintaining big pieces of code between stacks, you appreciate a single language after-all.
Performance is critical for any SSR solution, not only the rendering engine itself but the platform you are running as well. Not only the number of requests per second it supports or the memory footprint. Among many performance issues, one that stands out is the slow start. The slow start of a Node application is a barrier for current solutions. Often this problems is addressed by changing the architecture of your application by utilizing Edge computing, or a blocker to run SSR in development.
Here is an example of a performance benefit from OCaml. Some OCaml-based frameworks are fast enough that it can boot for each request and tear down when the session finishes. On the other hand, this is much more challenging to do with Node.
Maybe with a more performant approach we can solve some of the problems from SSR
React has been influenced by OCaml and some functional programming concepts from the start. Such as immutability, purity and eventually algebraic effects.
Creates the base implementation for Server components, which are a deal breaker for server-reason-react. Allowing to only run components on the server and stream the output representation of it, without the client-side penalty of executing the JavaScript code, can change radically how we write our backends.
Recently OCaml 5.0 was released with the highly anticipated features Multicore and Effects. These new features enable the possibility of exploring some of React’s concepts to be written in OCaml.
Here are main reasons why I would not adopt server-reason-react today:
I intentionally didn’t mention how this is compiled, because I want to keep it short and explain Melange in future blog posts. For reference: Ahrefs is now built with Melange
This has been very fun and I hope I can keep pushing this stack further. If you are as excited as we are, come talk to us!
Thanks for reaching the end. Let me know if you have any feedback, corrections or questions. Always happy to chat about any topic mentioned in this post, feel free to reach out.