<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[Tiger's Place]]></title><description><![CDATA[Tiger's Place]]></description><link>https://tigerabrodi.blog</link><image><url>https://cdn.hashnode.com/res/hashnode/image/upload/v1662616452555/PLNLd_3Ma.png</url><title>Tiger&apos;s Place</title><link>https://tigerabrodi.blog</link></image><generator>RSS for Node</generator><lastBuildDate>Sat, 18 Apr 2026 09:44:38 GMT</lastBuildDate><atom:link href="https://tigerabrodi.blog/rss.xml" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><ttl>60</ttl><item><title><![CDATA[What is WebGPU and Why It's Huge.]]></title><description><![CDATA[So What Is A GPU In The First Place
A GPU is a second processor with its own memory, called VRAM. Your data has to live in VRAM before the GPU can use it. Getting it there costs time.
A CPU has a few ]]></description><link>https://tigerabrodi.blog/what-is-webgpu-and-why-it-s-huge</link><guid isPermaLink="true">https://tigerabrodi.blog/what-is-webgpu-and-why-it-s-huge</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Fri, 17 Apr 2026 23:51:06 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/8d3b44a3-7bee-484e-aa9d-b1713f7c255f.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>So What Is A GPU In The First Place</h1>
<p>A GPU is a second processor with its own memory, called VRAM. Your data has to live in VRAM before the GPU can use it. Getting it there costs time.</p>
<p>A CPU has a few smart cores. A GPU has thousands of simple ones, and they work in lockstep. Lockstep means they move together. Cores are grouped in sets, typically 32 to 64, and every core in a group runs the same line of code at the same tick. They only differ in the data they work on.</p>
<p>Here is the part to remember. If your shader has an if statement and half the cores take one path and half take the other, the group runs both paths one after the other. Double the time, same result. Splitting paths is slow. Doing the same work across all cores is fast.</p>
<p>Memory matters too. Each group has a small pool of fast memory right next to it. VRAM is the big pool but it is far away and slow. How you read memory decides how fast your shader runs.</p>
<h2><strong>Fixed Hardware</strong></h2>
<p>Some parts of the GPU are not programmable. They are wired in. They just run when you draw. Three worth knowing.</p>
<p><strong>Rasterizer:</strong> Turns triangles into pixels.</p>
<p><strong>Texture units:</strong> Read images from memory.</p>
<p><strong>ROPs:</strong> Write the final pixel to the screen. They also handle depth checks and blending for see-through things.</p>
<p>The catch is overdraw. If you stack lots of see-through things on top of each other, like smoke or particles, every layer goes through the ROPs. Read, mix, write back. This can slow your game down even when your shaders are simple.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/89ce134e-f092-4102-94e3-7de791c9612b.png" alt="" style="display:block;margin:0 auto" />

<hr />
<h1>How Games Actually Use The GPU</h1>
<p>Every frame, work flows through the GPU in stages. Five of them matter.</p>
<p>Before anything starts, the CPU hands the GPU a list of triangles. Often millions per frame. Each triangle has three corners, called vertices. Each vertex carries some data. Position in 3D space. Color. UV, which is where to read from a texture. A normal, which is the direction the surface faces.</p>
<p>Now the pipeline runs.</p>
<p><strong>Stage 1. Vertex shader.</strong> A small program you write. It runs once per vertex, in parallel across the GPU's cores. Its job is to take the vertex from the model's own coordinate space and move it into screen space. The vertex knows where it sits inside the model. The vertex shader figures out where it sits on your screen.</p>
<p><strong>Stage 2. Rasterizer.</strong> Fixed hardware. Takes a triangle. Figures out which pixels on the screen are inside it. One triangle in. Many pixels out.</p>
<p><strong>Stage 3. Fragment shader.</strong> Another small program you write. It runs once per pixel. It reads textures, applies lighting, factors in shadows, and picks the final color for that pixel.</p>
<p><strong>Stage 4. Depth test.</strong> Is this pixel closer to the camera than whatever is already there? If yes, keep it. If no, throw it away. This is how the GPU knows a wall hides what is behind it.</p>
<p><strong>Stage 5. ROPs.</strong> Write the final pixel to the framebuffer, which is the image that becomes your screen. Mix with the existing pixel if needed, for things like glass or smoke.</p>
<p>Millions of vertices and pixels flowing through thousands of cores, every frame.</p>
<p>For the pipeline to keep running, the GPU needs to be told what to draw. That is where the trouble starts.</p>
<h1>The Draw Call Problem</h1>
<p>The GPU cannot draw anything on its own. The CPU has to tell it what to do. "Use this shader. Use this texture. Draw 1200 triangles." That instruction is called a draw call.</p>
<p>Each draw call takes the CPU a tiny bit of time. Just a few microseconds. Sounds like nothing.</p>
<p>Now draw 5000 different objects. That is 5000 draw calls. Suddenly the CPU has spent 5 to 15 milliseconds just talking to the GPU. Your whole frame budget at 60 fps is 16 milliseconds. You blew it before drawing anything.</p>
<p>This is why the CPU is usually the bottleneck in games with lots of objects, not the GPU. The GPU sits there waiting. The CPU cannot send instructions fast enough.</p>
<p>Almost every big trick in game engines, going back decades, <em>is about sending fewer draw calls, or making each one do more work.</em></p>
<h1>Instancing. The First Big Trick.</h1>
<p>You often draw the same mesh many times. A forest with thousands of trees all from the same model. An army with hundreds of soldiers from the same character. A particle system with ten thousand identical quads.</p>
<p>Instancing lets you draw the same mesh many times in one draw call. You upload the mesh once. You upload a second buffer of per instance data (positions, rotations, colors, scales).</p>
<p>You tell the GPU "draw this mesh 10 000 times, here is the list." The vertex shader reads its instance index and pulls the right data from the buffer.</p>
<p><strong>One draw call instead of ten thousand. The CPU gets its time back. The GPU runs at full speed.</strong></p>
<p>When you hear people say GPU instancing, this is what they're talking about.</p>
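<p>In WebGPU's JavaScript API, which this post gets to later, the shape of it is roughly this. A minimal sketch, assuming a <code>device</code>, a render <code>pass</code>, and a pipeline whose vertex shader reads the instance index are already set up.</p>
<pre><code class="language-js">// Per instance data. One 2D offset per tree, packed into one buffer.
const instanceData = new Float32Array(10_000 * 2)
for (let i = 0; i &lt; 10_000; i++) {
  instanceData[i * 2] = Math.random() * 500     // x
  instanceData[i * 2 + 1] = Math.random() * 500 // z
}

const instanceBuffer = device.createBuffer({
  size: instanceData.byteLength,
  usage: GPUBufferUsage.VERTEX | GPUBufferUsage.COPY_DST,
})
device.queue.writeBuffer(instanceBuffer, 0, instanceData)

// One draw call. 6 vertices for a quad, 10 000 instances.
pass.setVertexBuffer(1, instanceBuffer)
pass.draw(6, 10_000)
</code></pre>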
<h1>Compute Shaders. GPU As General Parallel Processor.</h1>
<p>A compute shader is a program that runs on the GPU but is not tied to the rendering pipeline. It does not care about triangles. It does not output pixels. It just reads and writes arbitrary buffers in parallel across thousands of threads.</p>
<p>You dispatch a grid of threads. Thousands at a time. Each thread has an index. Each thread runs the same program on different data.</p>
<p>This turned the GPU into a general purpose parallel processor. Physics simulation. Particle updates. Image processing. Neural networks. Cloth. Water. Fluid dynamics. Audio effects. Anything that fits "do the same thing to a lot of data" now lives on the GPU.</p>
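<p>Dispatching one from JavaScript looks roughly like this in WebGPU. A sketch, assuming a <code>device</code>, a compute <code>pipeline</code>, and a <code>bindGroup</code> pointing at your particle buffers already exist.</p>
<pre><code class="language-js">const encoder = device.createCommandEncoder()
const pass = encoder.beginComputePass()
pass.setPipeline(pipeline)
pass.setBindGroup(0, bindGroup)

// One thread per particle, 64 threads per workgroup.
// Round up so the last partial group still runs.
pass.dispatchWorkgroups(Math.ceil(particleCount / 64))

pass.end()
device.queue.submit([encoder.finish()])
</code></pre>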
<h1>Indirect Draws. The GPU Drives Its Own Work.</h1>
<p>One more trick that mattered.</p>
<p>When you draw things on screen, the CPU normally tells the GPU what to draw and how many. Something like "draw 1000 trees." That number, 1000, is baked into the instruction before the GPU ever sees it.</p>
<p>Indirect draws flip this. The CPU says "draw however many trees this buffer tells you to." The actual count lives in GPU memory. The CPU does not know it. The CPU does not care.</p>
<p>Now here is where it clicks. A compute shader can write to that buffer. So a compute shader can decide the count, and the draw call just reads whatever the compute shader wrote. The CPU is cut out of the loop.</p>
<p>This unlocks real things.</p>
<p>A compute pass can cull objects hidden behind walls, then write the count of visible ones. It can count how many grass blades are close enough to matter. It can pick LOD levels. It can kill dead particles and report how many are still alive. The draw call consumes whatever number comes out. No CPU round trip. No waiting.</p>
<p>The GPU is driving its own work. It's so fucking smart. This took me a second to understand. Compute shaders are fucking cool, but with this, it's really sick.</p>
<p>To make it click for you: <strong>The compute shader runs on the GPU. It writes a number into a buffer that lives in GPU memory. Say the number is 347. That buffer never leaves the GPU. Then the draw call happens. The CPU sent this draw call earlier, but the draw call is basically a note that says "hey GPU, when you get to this, look at buffer X and draw that many things."</strong></p>
<p>The GPU reaches that instruction. The GPU itself reads buffer X. Sees 347. Draws 347 things.</p>
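<p>On the JavaScript side, the note is just a buffer with a special usage flag. A sketch; the compute pass that writes the count is omitted.</p>
<pre><code class="language-js">// Four u32 values: vertexCount, instanceCount, firstVertex, firstInstance.
// A compute shader writes instanceCount. The CPU never reads it back.
const indirectBuffer = device.createBuffer({
  size: 16,
  usage: GPUBufferUsage.INDIRECT | GPUBufferUsage.STORAGE,
})

// Later, inside the render pass.
pass.drawIndirect(indirectBuffer, 0)
</code></pre>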
<h1>WebGL days</h1>
<p>The web got WebGL in 2011, based on an older mobile graphics standard from the mid 2000s. It could run vertex and fragment shaders, do basic instancing, and render textures. It was enough to put 3D on the web for the first time.</p>
<p>But it was missing the big stuff. No compute shaders. No indirect draws. No flexible storage buffers.</p>
<p>So if you wanted 10000 animated grass blades, the wind math had to run in JavaScript on the CPU. Update every blade. Upload the buffer to the GPU. Draw. The CPU was doing work the GPU should have been doing.</p>
<h1>What WebGPU Actually Changes</h1>
<p>WebGPU is a new browser API that exposes modern GPU capabilities.</p>
<p>Real compute shaders. Arbitrary GPU programs that read and write buffers. Physics, AI, particles, image effects, all of it.</p>
<p>Storage buffers. Generic read write GPU memory. You lay data out however you want.</p>
<p>Indirect draws. The GPU decides what gets rendered.</p>
<h1>Where This Shines</h1>
<p>Instancing scaled up. A million grass blades drawn in one call, with per blade position, height, wind sample, and color all generated in a compute pass.</p>
<p>Particle systems. Fire, smoke, sparks, magic. Every particle advanced in parallel on the GPU. Tens of thousands of particles at 144 fps with the CPU doing nothing per frame.</p>
<p>Simulation. Cloth, water, fluid, flocking, crowds. Each element updated in a compute pass. The browser can do what Unity does.</p>
<p>Terrain and worlds. Streaming LOD, procedural detail, real game scenes. Not the scaled down browser version. The native version, running in a tab.</p>
]]></content:encoded></item><item><title><![CDATA[CPU Friendly JavaScript. A Visual Guide.]]></title><description><![CDATA[How A CPU Actually Runs Code
A CPU core has a clock. It ticks about 3 to 4 billion times per second. Each tick, the core tries to do one unit of work. That is where your frames come from.
Inside one c]]></description><link>https://tigerabrodi.blog/cpu-friendly-javascript-a-visual-guide</link><guid isPermaLink="true">https://tigerabrodi.blog/cpu-friendly-javascript-a-visual-guide</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Fri, 17 Apr 2026 21:35:37 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/b705ab74-2abe-4f0e-b3f2-0ab59723bd93.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>How A CPU Actually Runs Code</h1>
<p>A CPU core has a clock. It ticks about 3 to 4 billion times per second. Each tick, the core tries to do one unit of work. That is where your frames come from.</p>
<p>Inside one core there are four parts that matter. The Control Unit reads the next instruction. The ALU (Arithmetic Logic Unit) is the calculator that does the math. Registers are tiny slots right next to the ALU holding the numbers being worked on. Caches are fast memory nearby that feed the registers.</p>
<p>All math happens in the ALU. The ALU only touches registers. So every number has to travel from RAM through the caches into a register and out. The math is fast. The travel is slow.</p>
<p>When your data is not in the cache at the moment the ALU needs it, the core stalls. A stalled core still ticks. It just does not do anything on those ticks. Wasted work. That is where most slow code actually loses.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/39b8af03-7ee8-4bca-bed0-346c096eb740.png" alt="" style="display:block;margin:0 auto" />

<h1>The Memory Hierarchy</h1>
<p>Modern CPUs can run about a hundred math operations in the time it takes to fetch one value from RAM. So the thing that makes code fast is not the math. It is keeping your data close to the ALU.</p>
<p>Caches work in chunks called cache lines, usually 64 bytes. When you touch one byte, the CPU pulls 64 bytes around it into the cache. Touch the next 63 bytes right after. Free.</p>
<p>Touch memory randomly across the heap and every access is a new cache miss. Every miss is tens of wasted ticks while the ALU waits.</p>
<p>Sequential access is fast. Random access is slow. Most of this post is about making your JavaScript sequential.</p>
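<p>You can feel cache lines from JavaScript. A rough sketch; exact timings vary by machine and engine.</p>
<pre><code class="language-js">const data = new Float32Array(16_000_000)

// Sequential. Every byte of every 64 byte cache line gets used.
let a = 0
for (let i = 0; i &lt; data.length; i++) a += data[i]

// Strided by 16 floats, which is 64 bytes. Every read lands on a
// fresh cache line and the other 60 bytes go unused.
let b = 0
for (let i = 0; i &lt; data.length; i += 16) b += data[i]
</code></pre>
<p>The strided loop does one sixteenth of the additions, but it does not run anywhere near sixteen times faster. The misses eat the savings.</p>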
<h1>SIMD. One Instruction, Many Numbers.</h1>
<p>Your CPU has special instructions called SIMD, Single Instruction Multiple Data. They do the same math on a chunk of numbers in a single tick. Usually 4 or 8 at a time on a mainstream CPU.</p>
<p>Adding two arrays of 8 floats without SIMD takes 8 ticks, one pair per tick. The same work with SIMD takes 1 tick for all 8.</p>
<p>A lane is one parallel slot inside a SIMD register. A modern CPU can pack 8 floats side by side in one register. Each float slot is a lane. One SIMD add touches all 8 lanes at once. Scalar JS math uses only lane 0. The other 7 are physically there but sitting idle.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/a596125d-3672-497a-b72c-46b7ae850496.png" alt="" style="display:block;margin:0 auto" />

<p>Same CPU. Same clock. Same silicon. The SIMD lanes are there either way. Without SIMD you leave 7 of the 8 lanes idle. You pay for the whole chip and use one eighth of it.</p>
<p><strong>JavaScript does not expose SIMD directly. Modern engines sometimes auto-vectorize tight loops over typed arrays, but you cannot force it.</strong></p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/d3895060-2f84-4472-a514-3bc3c2772004.png" alt="" style="display:block;margin-left:auto" />

<h1>Branch Prediction</h1>
<p>Modern CPUs run a pipeline. The next instruction starts before the previous one finishes. Several are always in flight at the same time.</p>
<p>The problem comes with if statements. The CPU does not know which branch to take until the condition is evaluated. Rather than stall, it guesses. This is branch prediction. Guess right, free. Guess wrong, the CPU throws away the work it did on the wrong path and restarts. A mispredict costs around 15 ticks.</p>
<p>A condition that takes the same path almost every time is basically free. The predictor learns it. A condition that flips 50/50 every iteration forces a mispredict half the time and turns a tight loop into a stumble.</p>
<p>Fix it by removing data dependent branches from inner loops, or by sorting your data so the branches become predictable.</p>
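<p>A sketch of the classic example, summing only the large values in an array. Whether the engine emits truly branchless machine code is up to the JIT, but giving every iteration identical work is the point.</p>
<pre><code class="language-js">const data = new Uint8Array(1_000_000).map(() =&gt; (Math.random() * 256) | 0)

// Branchy. With random data the condition flips 50/50
// and mispredicts half the time.
let branchy = 0
for (let i = 0; i &lt; data.length; i++) {
  if (data[i] &gt;= 128) branchy += data[i]
}

// Branchless. The comparison coerces to 0 or 1, so every
// iteration does the same work. Nothing to predict.
let branchless = 0
for (let i = 0; i &lt; data.length; i++) {
  branchless += data[i] * (data[i] &gt;= 128)
}
</code></pre>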
<h1>The Big Trick. Structure of Arrays.</h1>
<p>This is the one that matters most. Every serious JS game engine, physics library, and particle system does it. Most regular JS does not. The gap in speed is an order of magnitude.</p>
<p>You have an array of objects. Each object has many fields. You loop over them doing math on a few fields.</p>
<pre><code class="language-js">type Particle = { x, y, vx, vy, color, age, life }
const particles: Array&lt;Particle&gt; = [...]

for (let i = 0; i &lt; particles.length; i++) {
  particles[i].x += particles[i].vx
  particles[i].y += particles[i].vy
}
</code></pre>
<p>This looks clean. It is also cache hostile. Each particle is a scattered object on the JS heap. When you read <code>particles[i].x</code>, the cache line you get contains x, y, vx, vy, color, age, life, and chunks of V8 metadata. You use two of those. The other 60 bytes of the cache line are dead weight for this loop.</p>
<p>Flip the layout. Same data, different memory arrangement.</p>
<pre><code class="language-js">const x = new Float32Array(count)
const y = new Float32Array(count)
const vx = new Float32Array(count)
const vy = new Float32Array(count)

for (let i = 0; i &lt; count; i++) {
  x[i] += vx[i]
  y[i] += vy[i]
}
</code></pre>
<p>Now every cache line is full of numbers you actually read. No metadata. No waste. You blaze through memory in order and the prefetcher pulls the next 64 bytes before you need them.</p>
<p>This pattern has a name. Structure of Arrays, or SoA. The opposite is Array of Structures, AoS. AoS is how most JavaScript is written. SoA is how fast JavaScript is written.</p>
<p>Every serious engine does this. Unity DOTS and ECS. Bevy. Unreal Mass. Box2D and Rapier physics. Three.js instanced meshes. They all store components in parallel typed arrays under the hood.</p>
<p>The Entity Component System pattern game engines ship with is largely a way to enforce SoA automatically. You write code that looks like "for every entity with a Position and Velocity," and the engine guarantees those components live in parallel arrays you never see. You think in terms of "the data I read together should live together," and the engine does the layout for you.</p>
<p><strong>When people say data oriented design, this is what they mean.</strong></p>
<h1>JavaScript Specifics</h1>
<p>Two things V8 cares about on top of the CPU rules.</p>
<p>Typed arrays hold raw numbers. A regular Array stores values as tagged pointers. Every element could be anything, so every read checks the type first. A Float32Array or Int32Array is a raw packed block of numbers. No type checks. Cache friendly. Use them for every hot numeric buffer.</p>
<p>Stable object shapes. V8 watches every object. <a href="https://tigerabrodi.blog/what-actually-happens-when-you-run-javascript">If you always set the same fields in the same order at construction, V8 assigns a fast hidden class and field access becomes a direct memory read.</a> Add fields later or delete them and you drop to the slow path.</p>
<pre><code class="language-js">// slow. Shape changes.
const p = { x: 0, y: 0 }
p.vx = 0
p.vy = 0

// fast. Shape is final at birth.
const p = { x: 0, y: 0, vx: 0, vy: 0 }
</code></pre>
<p>For hot code, combine these two. SoA layout, typed arrays for the numeric columns.</p>
<h1>Before And After</h1>
<p>Add two vectors of 100 000 3D positions per frame.</p>
<p>Naive AoS version.</p>
<pre><code class="language-js">type Vec3 = { x: number, y: number, z: number }
const a: Array&lt;Vec3&gt; = [...]
const b: Array&lt;Vec3&gt; = [...]
const out: Array&lt;Vec3&gt; = a.map(() =&gt; ({ x: 0, y: 0, z: 0 }))

for (let i = 0; i &lt; a.length; i++) {
  out[i].x = a[i].x + b[i].x
  out[i].y = a[i].y + b[i].y
  out[i].z = a[i].z + b[i].z
}
</code></pre>
<p>300 000 object allocations. Random cache misses every iteration. 8 to 15 ms per frame on a modern laptop.</p>
<p>SoA version with typed arrays.</p>
<pre><code class="language-js">const ax = new Float32Array(100_000)
const ay = new Float32Array(100_000)
const az = new Float32Array(100_000)
const bx = new Float32Array(100_000)
const by = new Float32Array(100_000)
const bz = new Float32Array(100_000)
const ox = new Float32Array(100_000)
const oy = new Float32Array(100_000)
const oz = new Float32Array(100_000)

for (let i = 0; i &lt; 100_000; i++) {
  ox[i] = ax[i] + bx[i]
  oy[i] = ay[i] + by[i]
  oz[i] = az[i] + bz[i]
}
</code></pre>
<p>Zero allocation in the hot loop. Six sequential read streams and three sequential write streams. 0.3 to 0.8 ms. More than ten times faster for the same math.</p>
]]></content:encoded></item><item><title><![CDATA[Particle systems in games with threejs and tricks to make them look good]]></title><description><![CDATA[Particles Are Just Textured Quads
Particle systems drive fire, smoke, sparks, rain, dust, magic. They look complex. They are not. A particle system is a flat image drawn over and over with small varia]]></description><link>https://tigerabrodi.blog/particle-systems-in-games-with-threejs-and-tricks-to-make-them-look-good</link><guid isPermaLink="true">https://tigerabrodi.blog/particle-systems-in-games-with-threejs-and-tricks-to-make-them-look-good</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Fri, 17 Apr 2026 12:16:38 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/3b555ebb-18c5-48f4-85f9-0b4787d284d5.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Particles Are Just Textured Quads</h1>
<p>Particle systems drive fire, smoke, sparks, rain, dust, magic. They look complex. They are not. A particle system is a flat image drawn over and over with small variations. That is it.</p>
<h1>What A Particle Actually Is</h1>
<p>A particle is a <strong>quad</strong>. Quad means a rectangle made of two triangles.</p>
<p>The quad has a texture on it. Usually small. Usually with transparent edges.</p>
<p>The quad always faces the camera. So when you move the camera around, the quad rotates to face you. This is called a <strong>billboard</strong>. Like a highway billboard that turns to stay readable from the road.</p>
<p>That is the whole thing. A particle is one textured billboard.</p>
<p>Now imagine 10 000 of them. Different positions. Different sizes. Different colors. Some fading out. Some rotating. That is a particle system.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/f7903c19-d8e5-4608-8c1d-0f33efa200c1.png" alt="" style="display:block;margin:0 auto" />

<h1>What A Particle System Is</h1>
<p>A particle system does three jobs. Emit. Simulate. Render.</p>
<p>Three jobs. That is the whole system.</p>
<h2>Emit</h2>
<p>You pick a <strong>shape</strong> for the emitter. Point. Box. Sphere. Cone. Mesh surface. New particles spawn from that shape.</p>
<p>You pick a <strong>rate</strong>. 100 per second. Bursts of 50 at a time. Whatever fits the effect.</p>
<p>Initial state is usually random within bounds. Velocity random in a cone. Size random in a range. Color maybe random. Life random.</p>
<p>Random with bounds is what makes particles look organic instead of robotic. Pure random is noise. Pure fixed values are robotic. The middle is life.</p>
<h2>Simulate</h2>
<p>Every frame you loop over every living particle. You update.</p>
<ul>
<li><p><strong>Position</strong>. Add velocity times deltaTime. DeltaTime means the time since the last frame, so your particle moves the same distance regardless of frame rate.</p>
</li>
<li><p><strong>Velocity</strong>. Apply gravity, drag, wind, any force you want.</p>
</li>
<li><p><strong>Age</strong>. Increase by deltaTime.</p>
</li>
<li><p>If age exceeds lifetime, kill the particle.</p>
</li>
<li><p><strong>Attributes over lifetime</strong>. Size, color, opacity often follow a curve across the particle's life.</p>
</li>
</ul>
<p>That last part is where the magic lives. A fire particle is yellow at birth. Red in the middle. Black smoke at death. Driven by a curve with three key points.</p>
<h2>Render</h2>
<p>Each living particle becomes a quad. Each quad gets the texture. Each quad gets tinted by the particle's color. Each quad scales to the particle's size.</p>
<p>Draw them all. Move on to the next frame.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/125bbf84-d668-4a15-9c5a-9e27bf557823.png" alt="" style="display:block;margin:0 auto" />

<h1>Blend Modes. The Most Important Choice</h1>
<p>A particle is transparent where its texture is transparent. How the transparent parts combine with what is behind them matters a lot. This is the <strong>blend mode</strong>.</p>
<p>Two blend modes matter for games. Additive and alpha.</p>
<h2>Additive Blending</h2>
<p><strong>Additive</strong> means the particle's color is added to whatever is behind it. The math is <code>final = src + dst</code>. Source is the particle color. Destination is what is already there.</p>
<p>Black plus anything is that anything. So black reads as fully transparent with no alpha channel needed.</p>
<p>Bright colors stacked on bright colors push toward white. Fire looks like fire because the overlaps get brighter, not darker.</p>
<p>Use additive for fire. Sparks. Magic. Lightning. Lasers. Anything that emits light.</p>
<h2>Alpha Blending</h2>
<p><strong>Alpha</strong> means the particle has an alpha channel. Alpha is a fourth value per pixel that says how opaque that pixel is. 0 is fully transparent. 1 is fully opaque.</p>
<p>The math is <code>final = src.rgb * src.a + dst.rgb * (1 - src.a)</code>. Mix between particle color and scene color, weighted by alpha.</p>
<p>Alpha blended particles can be any color including black. Overlaps look like overlaps. No brightness push.</p>
<p>Use alpha for smoke. Fog. Dust. Clouds. Rain. Anything solid-ish that does not emit light.</p>
<h2>Why This Matters</h2>
<p>Pick the wrong blend mode and your fire looks like grey paper. Your smoke looks like fire. Particle feel lives in the blend mode.</p>
<h1>The Sorting Problem</h1>
<p>Alpha blended particles have a painful trick.</p>
<p>If you draw a near particle first, then a far particle behind it, you get one of two failures. If particles write depth, the far one fails the depth test and disappears. If they do not write depth, the blend runs in the wrong order and the far one draws on top. Either way looks broken.</p>
<p>Solution. Sort particles by distance to the camera every frame. Draw furthest first. Nearest last. This is called <strong>back to front sorting</strong>.</p>
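<p>On the CPU the sort is a one liner. A sketch, assuming each particle carries a Three.js style position vector and <code>camera</code> is your camera.</p>
<pre><code class="language-js">// Back to front. Furthest first, nearest last.
particles.sort((a, b) =&gt; {
  const da = camera.position.distanceToSquared(a.position)
  const db = camera.position.distanceToSquared(b.position)
  return db - da
})
</code></pre>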
<p>Additive particles do not need sorting. The math is commutative. <code>a + b + c = c + b + a</code>. Order does not change the pile of brightness.</p>
<p>In a game with tens of thousands of alpha particles the sort has to happen every frame. On the CPU that chokes above a few thousand particles. On the GPU you use a <strong>bitonic sort</strong>, a parallel sort algorithm that fits GPU threads well, and the cost stays flat.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/a412ebe1-152a-4bfb-bd11-7b33d03d2623.png" alt="" style="display:block;margin:0 auto" />

<h1>The Intersection Problem</h1>
<p>Here is where particle systems win or lose.</p>
<p>A particle quad moves through the world. Sometimes the quad passes through solid geometry. A smoke quad crosses the floor. A fire quad crosses a wall.</p>
<p>The GPU's depth test cuts the quad at the intersection. You see a hard straight line where the quad meets the surface. Looks like a flat sticker pressed into the wall. Reveals that the quad is flat. Kills the effect.</p>
<p>This is the single biggest tell that particles are fake.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/996e3439-7cd7-4ef6-86dd-74b9de61f4a2.png" alt="" style="display:block;margin:0 auto" />

<h1>Soft Particles. The Fix.</h1>
<p><strong>Soft particles</strong> solve the intersection problem. You fade the particle's alpha toward zero as it gets close to the geometry behind it. No more hard cut. Smooth fade instead.</p>
<p>The illusion. A particle cannot pass through a wall anymore because it fades out before it touches the wall.</p>
<h2>How It Works</h2>
<p>In the particle's fragment shader you need two values.</p>
<ol>
<li><p><strong>Particle depth</strong>. The distance from the camera to this particle pixel.</p>
</li>
<li><p><strong>Scene depth</strong>. The distance from the camera to whatever solid geometry is behind this same pixel, read from the depth buffer.</p>
</li>
</ol>
<p>Subtract the two. The result is how far the particle sits in front of the wall.</p>
<p>If the distance is big, keep full alpha. If the distance is small, fade the alpha toward zero. A <strong>smoothstep</strong> function handles the curve. Smoothstep is a function that smoothly ramps from 0 to 1 between two edges with an S shape, no sharp transition.</p>
<pre><code class="language-plaintext">delta = sceneDepth - particleDepth
softAlpha = smoothstep(0, falloffRange, delta)
finalAlpha = particle.alpha * softAlpha
</code></pre>
<p><code>falloffRange</code> is a tunable number. Small values mean the fade happens right at the wall. Big values mean particles fade out even when kind of far from the wall. You tune per effect.</p>
<h2>The Depth Buffer Catch</h2>
<p>Reading from the depth buffer does not give you real world distance. It gives you a non-linear value between 0 and 1 optimized for precision near the camera. You have to <strong>linearize</strong> it. Convert back to actual view space distance using the camera's near and far planes.</p>
<p>The formula is algebra. Not scary. Three.js has a helper. In raw WebGPU you do it yourself.</p>
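<p>For a standard perspective projection writing depth in the 0 to 1 range, the conversion looks roughly like this. A sketch, not Three.js's exact helper.</p>
<pre><code class="language-js">// Convert a raw 0..1 depth buffer value back to view space distance.
function linearizeDepth(d, near, far) {
  return (near * far) / (far - d * (far - near))
}
</code></pre>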
<p>Once you have linear depth, the math above just works.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/0453c791-6f96-420e-a9c9-5c867d295e0d.png" alt="" style="display:block;margin:0 auto" />

<h1>Particles In Three.js</h1>
<p>Three.js gives you two paths.</p>
<h2>THREE.Points</h2>
<p>A built in. Each particle is a <strong>point primitive</strong>, a GPU feature where one vertex becomes a square sprite in screen space automatically.</p>
<p>Fast. Simple. No quad geometry to manage. You set positions in a buffer and Three.js draws them.</p>
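<p>A minimal sketch of this path, assuming a scene and renderer already exist and <code>spriteTexture</code> is a texture you loaded yourself.</p>
<pre><code class="language-js">import * as THREE from 'three'

const count = 10_000
const positions = new Float32Array(count * 3)
for (let i = 0; i &lt; count * 3; i++) {
  positions[i] = (Math.random() - 0.5) * 100
}

const geometry = new THREE.BufferGeometry()
geometry.setAttribute('position', new THREE.BufferAttribute(positions, 3))

const material = new THREE.PointsMaterial({
  size: 4,
  map: spriteTexture,
  transparent: true,
  depthWrite: false,
  blending: THREE.AdditiveBlending, // fire-like. use NormalBlending for smoke.
})

scene.add(new THREE.Points(geometry, material))
</code></pre>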
<p>The downsides. Size is in pixel units, not world units, which is awkward for 3D effects that need to feel scaled. You get one size per particle. And you cannot use arbitrary geometry, only square sprites.</p>
<h2>Instanced Quad Meshes</h2>
<p>You build a quad geometry once. You instance it, drawing the same quad thousands of times with per instance data for position, size, color, rotation. Each instance is a real quad in world space.</p>
<p>More work to set up. More control. Works for arbitrary shapes, ribbons, trails, mesh particles. Better for real VFX systems.</p>
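<p>A minimal sketch of the instanced path, with <code>particleTexture</code> loaded elsewhere. Per instance transforms go through a matrix.</p>
<pre><code class="language-js">import * as THREE from 'three'

const count = 10_000
const quad = new THREE.PlaneGeometry(1, 1)
const material = new THREE.MeshBasicMaterial({
  map: particleTexture,
  transparent: true,
  depthWrite: false,
})

const mesh = new THREE.InstancedMesh(quad, material, count)

const matrix = new THREE.Matrix4()
for (let i = 0; i &lt; count; i++) {
  matrix.setPosition(
    (Math.random() - 0.5) * 100,
    Math.random() * 50,
    (Math.random() - 0.5) * 100
  )
  mesh.setMatrixAt(i, matrix)
}
mesh.instanceMatrix.needsUpdate = true
scene.add(mesh)
</code></pre>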
<h2>WebGPU Changes Everything</h2>
<p>Traditional Three.js particle code was CPU driven. JavaScript looped over every particle every frame, updated its position, uploaded fresh data to the GPU. Fine below 1000 particles. Painful above 2000. Dead above 5000.</p>
<p><strong>WebGPU compute shaders</strong> change the story. Compute shaders are GPU programs that run in parallel over buffers of data. Move the particle simulation into a compute shader and JavaScript never touches per particle state again. The GPU advances 100 000 particles each frame in one dispatch with no perf drop.</p>
<p>The bottleneck shifts from simulation to sorting. And sorting moves to a bitonic compute pass too.</p>
<p>This is the path a real WebGPU VFX library takes. Compute shader simulation. Instanced quad rendering. GPU sort for alpha blending. Soft particles via depth buffer fade.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/522a50c3-7390-422e-b06b-1aefcd38509c.png" alt="" style="display:block;margin:0 auto" />]]></content:encoded></item><item><title><![CDATA[What We Can Learn From Grass in Ghost of Tsushima Renders]]></title><description><![CDATA[Introduction
Ghost of Tsushima renders huge fields of grass that sway in the wind. Each blade animates on its own. Roughly 83,000 blades on screen at once. In about 2.5 milliseconds per frame.
This po]]></description><link>https://tigerabrodi.blog/what-we-can-learn-from-grass-in-ghost-of-tsushima-renders</link><guid isPermaLink="true">https://tigerabrodi.blog/what-we-can-learn-from-grass-in-ghost-of-tsushima-renders</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Thu, 16 Apr 2026 18:55:46 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/221584e0-dd2a-48ad-a43b-aa26a79b9510.jpg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1>
<p>Ghost of Tsushima renders huge fields of grass that sway in the wind. Each blade animates on its own. Roughly 83,000 blades on screen at once. In about 2.5 milliseconds per frame.</p>
<p>This post is not about copying their exact code. It is about the core ideas behind how they did it. Principles you can steal and reuse whenever you need to render a lot of something.</p>
<p>Here's the original talk: <a href="https://www.youtube.com/watch?v=Ibe1JBF5i5Y">Procedural Grass in Ghost of Tsushima</a>.</p>
<h1>The First Big Decision. No Grass Cards.</h1>
<p>Most games use <strong>grass cards</strong> for grass. A grass card is a flat rectangle with a grass texture painted on it. Like a photo of grass taped to cardboard. You place thousands of these around the world.</p>
<p>Grass cards have two big problems.</p>
<p>One. The wind animation is stuck to the whole card. The whole card wiggles as one piece. Individual blades cannot sway independently. It looks fake up close.</p>
<p>Two. <strong>Overdraw</strong>. Overdraw is when the GPU paints the same pixel more than once in a frame. Cards overlap a lot. Most of each card is transparent. The GPU still has to process all those transparent pixels. Wasted work.</p>
<p>Sucker Punch threw cards out. Instead. They build each blade of grass from scratch using math. Every frame. On the GPU.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/c4db86f3-ea9d-446e-9a32-c7dae5bccefe.png" alt="" style="display:block;margin:0 auto" />

<h1>Principle One. Build It From Math. Not From Assets.</h1>
<p>Each blade in Ghost of Tsushima is a <strong>cubic bezier curve</strong>. A cubic bezier curve is a math curve defined by 4 control points. You move those points. The curve changes shape.</p>
<ul>
<li><p>Point 1. Base of the blade. Where it meets the ground.</p>
</li>
<li><p>Point 4. Tip of the blade.</p>
</li>
<li><p>Points 2 and 3. In the middle. They control how the blade bends.</p>
</li>
</ul>
<p>Move the tip. The blade leans. Push the middle points up. The blade arches. That is it. Whole blade shape controlled by 4 positions.</p>
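<p>A cubic bezier is cheap to evaluate. A sketch in plain JavaScript, one axis at a time; a shader runs the same math for x, y, and z.</p>
<pre><code class="language-js">// Evaluate a cubic bezier at t between 0 and 1.
// p1 is the base of the blade. p4 is the tip. p2 and p3 bend it.
function cubicBezier(p1, p2, p3, p4, t) {
  const u = 1 - t
  return u * u * u * p1
    + 3 * u * u * t * p2
    + 3 * u * t * t * p3
    + t * t * t * p4
}
// Sample a handful of t values and you have the blade's spine.
</code></pre>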
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/83ff007b-8482-44f0-a2cb-6c7b7ebf2296.png" alt="" style="display:block;margin:0 auto" />

<p><strong>Why this is a huge deal.</strong></p>
<p>Cards are static. You made them once. You are stuck with them.</p>
<p>Bezier curves are live. You build the blade fresh every frame from 4 points. So.</p>
<ul>
<li><p>Want the blade to sway. Move the tip a little.</p>
</li>
<li><p>Want the player to push it. Push a control point.</p>
</li>
<li><p>Want wind to curve it. Move the middle points.</p>
</li>
<li><p>Want 50 different blade shapes. Just change the point positions.</p>
</li>
</ul>
<p>You are not animating a model. You are rebuilding the blade every frame with slightly different numbers.</p>
<h1>Principle Two. The CPU Is the Bottleneck. Talk to the GPU Less.</h1>
<p>Say you have 83,000 blades of grass. If the CPU tells the GPU "draw this blade" 83,000 separate times. The CPU dies. Not because drawing is slow. Because <strong>talking</strong> is slow.</p>
<p>Each instruction to the GPU is called a <strong>draw call</strong>. Draw calls are expensive on the CPU side. Setting up state. Sending data. Validation. All that overhead adds up fast.</p>
<p>The fix is <strong>GPU instancing</strong>. You tell the GPU once. "Draw this blade shape 83,000 times. Here is the list of positions. Go." One draw call. GPU handles the rest in parallel.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/c1efcb0a-b909-488e-808d-6c53c3a3b310.png" alt="" style="display:block;margin:0 auto" />

<p><strong>Wait. Is this not just parallelism.</strong></p>
<p>No. GPUs are parallel by default. That is just what they do. The thing instancing fixes is different. It cuts down <strong>draw call overhead</strong>. The CPU saying the same thing 83,000 times vs saying it once.</p>
<p><strong>Take away principle.</strong> When you have many copies of something. One mesh. Many positions. Instance it. Particles. Trees. Rocks. Crowds. Bullets. Tiles. All the same pattern.</p>
<h1>Principle Three. Do Not Compute What You Cannot See.</h1>
<p>Even with instancing. You do not want to render a million blades when only 83,000 are visible. That is 12x more work than needed.</p>
<p>Ghost of Tsushima throws blades away in stages. Before they ever reach the vertex shader.</p>
<ul>
<li><p><strong>Distance culling.</strong> Too far from the camera. Drop it.</p>
</li>
<li><p><strong>Frustum culling.</strong> Outside what the camera can see. Drop it. The <strong>frustum</strong> is the pyramid shape of what the camera is looking at.</p>
</li>
<li><p><strong>Occlusion culling.</strong> Hidden behind a hill or a wall. Drop it.</p>
</li>
<li><p><strong>Type culling.</strong> This spot has no grass type assigned. Drop it.</p>
</li>
<li><p><strong>Height culling.</strong> This spot has zero-height grass. Drop it.</p>
</li>
</ul>
<p>Each stage is cheaper than the next. Cheap tests first. Expensive tests only on survivors. By the time you start actually building blades. You have the real working set.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/efa697d4-fce6-4db1-a977-e2fd64b83779.png" alt="" style="display:block;margin:0 auto" />

<p><strong>Take away principle.</strong> Culling is layered. Cheap tests kill most of the work. Expensive tests only run on what survives. This pattern applies everywhere. Physics. AI. Rendering. Audio. Pathfinding.</p>
<h1>Principle Four. Clumps. Controlled Randomness Beats Pure Randomness.</h1>
<p>Their first grass looked boring. Like a golf course. They tried adding more randomness. It just looked messy. Not natural.</p>
<p>Real fields are not random. They are <strong>clumpy</strong>. This patch got more sunlight so the grass is taller here. That spot has different soil so the grass is darker there. The variation has structure.</p>
<p>To fake this. They use a <strong>voronoi algorithm</strong>. Voronoi means dividing space into regions around points. Each blade checks which clump point it is closest to. That clump controls the blade's height. Direction. Color. Bend.</p>
<p>So instead of each blade being random by itself. Blades in the same clump share traits. The field has patches of tall grass. Patches of short grass. Patches pointing slightly different directions. It looks alive.</p>
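<p>The lookup itself is simple. A sketch; the clump fields here are invented for illustration.</p>
<pre><code class="language-js">// Find the clump point closest to this blade. That clump's
// traits drive the blade's height, color, and lean.
function nearestClump(bladeX, bladeZ, clumps) {
  let best = clumps[0]
  let bestDist = Infinity
  for (const clump of clumps) {
    const dx = clump.x - bladeX
    const dz = clump.z - bladeZ
    const d = dx * dx + dz * dz
    if (d &lt; bestDist) {
      bestDist = d
      best = clump
    }
  }
  return best
}

const clump = nearestClump(blade.x, blade.z, clumps)
blade.height = clump.baseHeight + (Math.random() - 0.5) * 0.2 // vary within the clump
blade.color = clump.color
</code></pre>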
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/4523057b-8909-49ef-bdf1-0168ac600c48.png" alt="" style="display:block;margin:0 auto" />

<p><strong>Take away principle.</strong> Randomness alone looks fake. Structured variation looks real. If you need something to feel organic. Group it into clusters first. Then vary within and between clusters.</p>
<h1>Principle Five. Two Levels of Detail. Far Stuff Gets Cheaper.</h1>
<p>Close grass has 15 vertices per blade. Far grass has 7. That is less than half.</p>
<p>A <strong>vertex</strong> is a corner point on a piece of geometry. More vertices means smoother curves but more work. Far away you cannot see the difference. So use less.</p>
<p>Transitioning between the two is tricky. If you just snap from 15 to 7 verts. The blade shape pops. Visible. Ugly. So the high-detail version slowly <strong>blends toward</strong> the low-detail shape as it gets close to the transition distance. By the time the swap happens. Both versions look nearly identical. No pop.</p>
<p>Also. Far away tiles are twice as big but have the same number of blades. So far grass is spread out twice as far apart. To hide this. Near grass drops 3 out of every 4 blades as it gets close to the far distance. Thins out gradually. So when you cross the line. The density already matches.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/fc4315b1-5e87-47e0-83ad-347767e556a9.png" alt="" style="display:block;margin:0 auto" />

<p><strong>Take away principle.</strong> Not everything deserves full detail. Pay for quality where the player will see it. Skimp where they will not. And always blend between detail levels. Never snap.</p>
<h1>Principle Six. One Trick For Short Grass. Fold The Verts.</h1>
<p>Here is a clever one. If the grass is short. The blade does not need all 15 vertices. A short blade looks fine with 7. So they reuse the other 8 vertices to build a <strong>second blade</strong> right next to the first one.</p>
<p>Same draw call. Same vertex budget. Twice the blades. Free density.</p>
<p>You only get this trick because the geometry is procedural. If you were using cards or premade meshes. You could not do this. The vertices are flexible. So you use them however is most useful.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/a25ab310-c4f0-4a5c-a315-3f4af2016e04.png" alt="" style="display:block;margin:0 auto" />

<p><strong>Take away principle.</strong> When you have a fixed budget. Ask if you can split it. Sometimes the thing you are building does not need the whole budget. So the leftover becomes another thing.</p>
]]></content:encoded></item><item><title><![CDATA[What Actually Happens When You Run JavaScript]]></title><description><![CDATA[The Pipeline. From Source To Running Code.
Your JavaScript does not run directly. The engine transforms it first. V8. SpiderMonkey. JavaScriptCore. They all follow roughly the same steps.
Parse. The e]]></description><link>https://tigerabrodi.blog/what-actually-happens-when-you-run-javascript</link><guid isPermaLink="true">https://tigerabrodi.blog/what-actually-happens-when-you-run-javascript</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Thu, 16 Apr 2026 16:38:03 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/9011a21d-2a8f-4b79-b09e-31eebde7fc25.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>The Pipeline. From Source To Running Code.</h1>
<p>Your JavaScript does not run directly. The engine transforms it first. V8. SpiderMonkey. JavaScriptCore. They all follow roughly the same steps.</p>
<p><strong>Parse.</strong> The engine reads your source code and builds an AST. AST means Abstract Syntax Tree. A tree structure that represents the code.</p>
<p><strong>Bytecode.</strong> The AST gets compiled to bytecode. Bytecode is a simpler instruction set that the engine can execute quickly. Not machine code yet. Just a middle step.</p>
<p><strong>Interpret.</strong> The engine runs the bytecode in an interpreter. This is fast to start but slow to execute long-term.</p>
<p><strong>Optimize.</strong> If a piece of code runs often. The engine compiles it to real machine code using a JIT compiler. Machine code is the actual instructions your CPU runs. Fast.</p>
<p><strong>JIT</strong> means Just-In-Time. The compiler runs while the program is running. Not ahead of time.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/73cb46dd-fead-4393-b716-dc1b73271fb1.png" alt="" style="display:block;margin:0 auto" />

<h1>Hidden Classes. Why Object Shape Matters.</h1>
<p>JavaScript lets you add properties to objects whenever you want. Convenient. But it makes objects hard to optimize.</p>
<p>Engines solve this with hidden classes. Also called shapes or maps.</p>
<p>A hidden class is an internal structure that tracks what properties an object has and in what order. Every object gets one.</p>
<p>Same properties in the same order. Engine reuses the hidden class. Fast. Different orders. Different hidden classes. Slow.</p>
<p>Example.</p>
<pre><code class="language-javascript">const a = { x: 1, y: 2 };
const b = { x: 1, y: 2 };
// Same hidden class. Engine can optimize both together.

const c = { x: 1 };
c.y = 2;
const d = { y: 2 };
d.x = 1;
// Different hidden classes. Even though they end up with same properties.
</code></pre>
<p>The lesson. Initialize objects with the same shape in the same order. Every time.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/0434ffb0-d10f-48ca-8d19-aa0d5e61c167.png" alt="" style="display:block;margin:0 auto" />

<h1>Inline Caches. Remembering Shapes.</h1>
<p>The hidden class lets the engine skip property lookups.</p>
<p>First time you access <code>obj.name</code>. The engine finds where <code>name</code> lives inside that hidden class. Then caches that location right next to the instruction.</p>
<p>Next time that instruction runs. If the object has the same hidden class. The engine reads directly from the cached location. No lookup.</p>
<p>This cache is called an <strong>inline cache</strong> or IC. It is one of the biggest reasons modern JavaScript is fast.</p>
<p>Every spot in your code where you access a property has its own inline cache. The cache tracks which hidden classes have shown up at that exact spot. Three possible states.</p>
<p><strong>Monomorphic.</strong> Only one hidden class has ever been seen there. The engine just checks "same shape as before? yes. grab from the known location." One check. Fastest.</p>
<p><strong>Polymorphic.</strong> A handful of different hidden classes have been seen. Usually 2 to 4. The engine now has to check each one in turn. "Is it shape A? No. Shape B? Yes. Grab from there." More checks. Slower.</p>
<p><strong>Megamorphic.</strong> Too many different hidden classes have shown up. Past the engine's limit. The cache gives up and falls back to a full generic property lookup every time. Slowest.</p>
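<p>The three states at one property access. A sketch; exact thresholds vary by engine.</p>
<pre><code class="language-javascript">function getName(obj) {
  return obj.name // this access has its own inline cache
}

// Monomorphic. Every call sees the same shape. Fastest.
getName({ name: "a", id: 1 })
getName({ name: "b", id: 2 })

// Polymorphic. A second shape shows up at the same spot.
getName({ name: "c" })

// Megamorphic. A new shape every call. The cache gives up.
for (let i = 0; i &lt; 100; i++) {
  getName({ name: "x", ["key" + i]: i })
}
</code></pre>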
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/3968e18c-95eb-4af2-a2df-56afe1a9ba07.png" alt="" style="display:block;margin:0 auto" />

<h1>JIT Compilation. From Warm To Hot.</h1>
<p>The engine does not optimize everything. It watches your code and tracks which functions run often. A function that runs many times is called <strong>hot</strong>.</p>
<p>When a function gets hot. The engine sends it to the optimizing compiler. The compiler makes assumptions based on what it has seen.</p>
<p>"This function always receives two numbers." "This object always has these properties in this order." "This array always contains integers."</p>
<p>Under those assumptions. It generates fast machine code. Much faster than bytecode.</p>
<p>The key thing. The optimized code is not general. It is specialized for the patterns observed.</p>
<h1>Deoptimization. When Assumptions Break.</h1>
<p>Assumptions can be wrong. You pass a string where you used to pass a number. You add a new property to an object. You mix types in an array.</p>
<p>When that happens. The optimized code is no longer valid. The engine throws it away and falls back to the bytecode interpreter. This is called <strong>deoptimization</strong>.</p>
<p>Deoptimization is expensive. The optimization work is wasted. Your code runs slower until the engine decides to reoptimize. If it does.</p>
<p>Common causes.</p>
<ul>
<li><p>Passing different types to a function that used to be consistent.</p>
</li>
<li><p>Adding or deleting properties on hot objects.</p>
</li>
<li><p>Mixing integers and floats in an array.</p>
</li>
<li><p>Using constructs the optimizer does not handle well.</p>
</li>
</ul>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/c4648bdf-f701-486a-9b37-35e70ddf6256.png" alt="" style="display:block;margin:0 auto" />

<h1>Garbage Collection. How The Engine Frees Memory.</h1>
<p>You do not manually free memory in JavaScript. The engine handles it. The system that does this is the <strong>garbage collector</strong>. Or <strong>GC</strong>.</p>
<p>The GC walks a graph. It starts from things called <strong>roots</strong> and follows every reference it finds. Anything reachable from a root is live. Anything it cannot reach is garbage.</p>
<h2>What Counts As A Root</h2>
<p>Roots are the anchor points. Things the engine always treats as live.</p>
<ul>
<li><p><strong>Global and module-level variables.</strong> Things on <code>window</code>, <code>globalThis</code>, or declared at the top level of a module. Live for the lifetime of the program.</p>
</li>
<li><p><strong>Active stack variables.</strong> Local variables inside functions currently running on the call stack. Live only while their function is executing.</p>
</li>
<li><p><strong>Closure-captured variables.</strong> Values an outer scope captured that a still-reachable closure holds on to.</p>
</li>
</ul>
<p>Roots themselves are not collected. But what they point to can still become garbage. Set a global variable to null. The old value it used to point to is now unreachable. Free to collect. The root slot itself is untouched.</p>
<p>A local variable stops being a root the moment its function returns. Whatever it pointed to becomes garbage unless something else still holds a reference.</p>
<h2>Generational Collection</h2>
<p>Modern engines use <strong>generational garbage collection</strong>. Based on one observation. Most objects die young. A function creates objects. Uses them. Returns. Those objects are garbage almost immediately.</p>
<p>The GC splits memory into two zones.</p>
<p><strong>Young generation.</strong> Small. Where new objects are born. Most die here.</p>
<p><strong>Old generation.</strong> Larger. Where objects that survived a few young collections get promoted.</p>
<h2>Minor GC vs Major GC</h2>
<p><strong>Minor GC.</strong> Collects only the young generation. Happens often. Fast. Pauses are usually a few milliseconds or less.</p>
<p><strong>Major GC.</strong> Collects the old generation. Sometimes scans both generations together. Happens rarely. Slower because there is more memory to scan and more references to trace.</p>
<p>The split keeps most pauses short. You pay the long pause only when the old generation gets full.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/9731cd92-31f1-41aa-873b-2569a652e03e.png" alt="" style="display:block;margin:0 auto" />

<h1>Tips For Writing Performant JavaScript</h1>
<p>You do not need to hand-tune everything. But these habits help the engine help you.</p>
<h2>Keep Object Shapes Consistent</h2>
<p>Same properties. Same order. Every time.</p>
<pre><code class="language-javascript">// Good
const user1 = { name: "Alice", age: 30, role: "admin" };
const user2 = { name: "Bob", age: 25, role: "user" };

// Bad
const user3 = { name: "Alice", age: 30 };
user3.role = "admin"; // new hidden class
</code></pre>
<h2>Declare All Properties Upfront</h2>
<p>If a property might be missing at first. Initialize it to null. Do not add it later.</p>
<pre><code class="language-javascript">// Good
const user = { name: "Alice", age: null };
user.age = 30; // same hidden class. just a value change.

// Bad
const user2 = { name: "Alice" };
user2.age = 30; // new hidden class
</code></pre>
<h2>Do Not Delete Properties</h2>
<p>Set them to null instead. Delete changes the hidden class. Setting to null does not.</p>
<pre><code class="language-javascript">// Good
user.age = null;

// Bad
delete user.age;
</code></pre>
<h2>Keep Argument Types Consistent</h2>
<p>If a function takes a number. Always pass a number. Mixing types causes deoptimization.</p>
<pre><code class="language-javascript">function add(a, b) {
  return a + b;
}

add(5, 10); // engine optimizes for numbers
add("hi", "bye"); // deopt. now reoptimizes for strings
</code></pre>
<h2>Do Not Mix Integers And Floats In Arrays</h2>
<p>The engine uses a different internal format for integer arrays vs float arrays. Mixing them forces a slower general format.</p>
<pre><code class="language-javascript">// Good
const ints = [1, 2, 3, 4];
const floats = [1.5, 2.5, 3.5];

// Bad
const mixed = [1, 2.5, 3, 4.5];
</code></pre>
<h2>Use Set For Membership Checks</h2>
<p><code>includes</code> on an array is O(n). <code>has</code> on a Set is O(1). Huge difference when checking membership often.</p>
<pre><code class="language-javascript">// Slow. O(n * m)
const result = list1.filter((x) =&gt; list2.includes(x));

// Fast. O(n + m)
const set = new Set(list2);
const result = list1.filter((x) =&gt; set.has(x));
</code></pre>
<h2>Use Map For Dynamic Keys</h2>
<p>If you are adding and removing keys at runtime. Use a Map. Regular objects end up with constant hidden class changes. Maps do not.</p>
<pre><code class="language-javascript">// Bad
const cache = {};
cache[userId] = data;

// Good
const cache = new Map();
cache.set(userId, data);
</code></pre>
<h2>Use === Not ==</h2>
<p><code>==</code> does type coercion behind the scenes. <code>===</code> compares directly. Faster and more predictable. There is no reason to use <code>==</code>.</p>
<h2>Watch Out For Closures Holding Large Data</h2>
<p>A closure keeps alive everything it captures. Even if you only use a small piece. The GC cannot free the rest.</p>
<pre><code class="language-javascript">// Bad. keeps all of hugeData alive
function makeHandler() {
  const hugeData = loadHugeData();
  return () =&gt; console.log(hugeData.name);
}

// Good. only keeps the name
function makeHandler() {
  const hugeData = loadHugeData();
  const name = hugeData.name;
  return () =&gt; console.log(name);
}
</code></pre>
]]></content:encoded></item><item><title><![CDATA[A Friendly Introduction to Linear Algebra For Game Devs]]></title><description><![CDATA[What Is Linear Algebra Even?
Linear algebra is the math of vectors and matrices. That is it. The whole field.
Vectors describe things like position and direction and movement. Matrices describe transf]]></description><link>https://tigerabrodi.blog/a-friendly-introduction-to-linear-algebra-for-game-devs</link><guid isPermaLink="true">https://tigerabrodi.blog/a-friendly-introduction-to-linear-algebra-for-game-devs</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Thu, 16 Apr 2026 13:02:17 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/81b06f03-137a-4f80-a621-3b9a1204c733.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>What Is Linear Algebra Even?</h1>
<p>Linear algebra is the math of <strong>vectors</strong> and <strong>matrices</strong>. That is it. The whole field.</p>
<p>Vectors describe things like position and direction and movement. Matrices describe transformations. Like moving things. Rotating things. Scaling things.</p>
<p>Games are made of 3D points that need to be positioned and moved and rotated and projected onto your screen. All of that is vectors and matrices. That is why this matters.</p>
<h1>Part 1. Vectors.</h1>
<p>A vector is a list of numbers. That is the simplest definition.</p>
<p>A 2D vector has two numbers. Like <code>(3, 4)</code>. A 3D vector has three numbers. Like <code>(2, 5, 7)</code>. You can have 4D vectors and beyond but games mostly use 2D and 3D and sometimes 4D.</p>
<p>But what do those numbers mean? It depends on what you are using the vector for.</p>
<h2>Position Vectors</h2>
<p>A position vector tells you where something is in space. If your player is at <code>(10, 0, 5)</code> that means they are 10 units along the X axis. 0 units on the Y axis. And 5 units on the Z axis.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/56ee8bd2-7a51-4e31-8a40-9c5195e350fd.png" alt="" style="display:block;margin:0 auto" />

<h2>Direction Vectors</h2>
<p>A direction vector tells you which way something is pointing. Not where it is. Just which way.</p>
<p>If your character is facing forward. Their direction vector might be <code>(0, 0, 1)</code>. That means zero on X. Zero on Y. One on Z. So they are pointing down the Z axis.</p>
<p>The same numbers can be a position or a direction depending on how you use them. That is important to remember.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/0882d39d-9f71-4302-93de-36051338ca0f.png" alt="" style="display:block;margin:0 auto" />

<h1>Part 2. Basic Vector Operations.</h1>
<p>Vectors are useful because you can do math with them. Here are the operations you will use constantly.</p>
<h2>Adding Vectors</h2>
<p>You add vectors by adding each of their numbers together.</p>
<p><code>(1, 2, 3) + (4, 5, 6) = (5, 7, 9)</code></p>
<p>Why is this useful? Because adding a direction vector to a position vector moves the position in that direction.</p>
<p>If your player is at <code>(10, 0, 5)</code> and they move forward by <code>(0, 0, 1)</code> their new position is <code>(10, 0, 6)</code>. That is literally how movement works in games.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/b16c1681-5e8f-471f-b9eb-abb5ca92bb4f.png" alt="" style="display:block;margin:0 auto" />

<h2>Scaling Vectors</h2>
<p>You can multiply a vector by a single number. This is called <strong>scalar multiplication</strong>. A scalar is just a regular number. Not a vector.</p>
<p><code>2 × (1, 2, 3) = (2, 4, 6)</code></p>
<p>This makes the vector bigger or smaller while keeping its direction the same. Multiply by 2 and it is twice as long. Multiply by 0.5 and it is half as long. Multiply by -1 and it flips around pointing the opposite way.</p>
<p>This is useful for things like speed. A direction vector tells you where you are going. Multiply it by how fast you want to go. Now you have velocity.</p>
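<p>In code, reusing the <code>Vec3</code> sketch from above:</p>
<pre><code class="language-ts">// Scalar multiplication. Multiply every component by the same number.
function scale(v: Vec3, s: number): Vec3 {
  return { x: v.x * s, y: v.y * s, z: v.z * s }
}

const direction = { x: 0, y: 0, z: 1 } // which way, length 1
const speed = 5
const velocity = scale(direction, speed) // { x: 0, y: 0, z: 5 }
</code></pre>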
<h1>Part 3. Magnitude and Normalization.</h1>
<h2>Magnitude</h2>
<p>The magnitude of a vector is its length. How long is the arrow.</p>
<p>For a 2D vector <code>(3, 4)</code> the magnitude is 5. For a 3D vector you use the same idea. Square each number. Add them up. Take the square root. You do not need to memorize the formula. Just know that magnitude means length.</p>
<p>If a vector represents velocity. Its magnitude is the speed. If it represents a distance from A to B. Its magnitude is how far apart they are.</p>
<h2>Normalization</h2>
<p>A <strong>normalized vector</strong> is one whose length is exactly 1. Also called a <strong>unit vector</strong>.</p>
<p>You normalize a vector by dividing it by its own magnitude. The direction stays the same. Only the length changes to 1.</p>
<p>Why do this? Because when you only care about direction not distance. You want length 1. It makes math cleaner and faster.</p>
<p>Every time a game needs "which way is this thing facing" or "which direction is the light coming from" it uses normalized vectors.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/704eb00c-3809-4ec2-b0bd-df96287d754f.png" alt="" style="display:block;margin:0 auto" />
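<p>Both operations in code. A rough sketch using the same <code>Vec3</code> object shape as before.</p>
<pre><code class="language-ts">// Magnitude. Square each number. Add them up. Take the square root.
function magnitude(v: Vec3): number {
  return Math.sqrt(v.x * v.x + v.y * v.y + v.z * v.z)
}

// Normalization. Divide the vector by its own magnitude.
function normalize(v: Vec3): Vec3 {
  const len = magnitude(v)
  return { x: v.x / len, y: v.y / len, z: v.z / len }
}

magnitude({ x: 3, y: 4, z: 0 }) // 5
normalize({ x: 3, y: 4, z: 0 }) // { x: 0.6, y: 0.8, z: 0 }
</code></pre>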

<h1>Part 4. The Dot Product.</h1>
<p>This one is huge. The dot product shows up everywhere in games.</p>
<p>The dot product takes two vectors and gives you back a single number. Not a vector. Just a number.</p>
<p>Here is what matters. That number tells you how aligned the two vectors are.</p>
<ul>
<li><p>If they point in the exact same direction. The dot product is 1.</p>
</li>
<li><p>If they are perpendicular. Ninety degrees apart. The dot product is 0.</p>
</li>
<li><p>If they point in opposite directions. The dot product is -1.</p>
</li>
<li><p>Anything in between gives you a value between -1 and 1.</p>
</li>
</ul>
<p>This only works cleanly when both vectors are normalized. Remember normalization from before? This is one of the reasons it matters.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/da7e4c65-8e7e-48ce-b04a-81838faf3866.png" alt="" style="display:block;margin:0 auto" />
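<p>The dot product is tiny in code. Multiply matching components. Add them up. Same <code>Vec3</code> shape as before.</p>
<pre><code class="language-ts">function dot(a: Vec3, b: Vec3): number {
  return a.x * b.x + a.y * b.y + a.z * b.z
}

// With normalized inputs:
dot({ x: 0, y: 0, z: 1 }, { x: 0, y: 0, z: 1 })  // 1, same direction
dot({ x: 0, y: 0, z: 1 }, { x: 1, y: 0, z: 0 })  // 0, perpendicular
dot({ x: 0, y: 0, z: 1 }, { x: 0, y: 0, z: -1 }) // -1, opposite
</code></pre>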

<h2>Why You Care About The Dot Product</h2>
<p><strong>Lighting.</strong> To figure out how bright a surface is. You take the dot product of the surface normal and the light direction. A surface normal is a vector that points straight out from a surface. If it faces the light the dot is close to 1. Bright. If it faces away the dot is close to 0 or negative. Dark. That is literally how games calculate basic lighting.</p>
<p><strong>Is something in front of you?</strong> Take the dot product of your forward direction and the direction to the other object. If it is positive. They are in front of you. If negative. They are behind you. Super useful for AI and camera logic.</p>
<p><strong>What angle are two things at?</strong> The dot product tells you how aligned two vectors are. You can use it to check if two objects are roughly facing the same way. Or for a guard that should only see you if you are in their cone of vision.</p>
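<p>Here is a rough sketch of that guard check, reusing the <code>dot</code> and <code>normalize</code> helpers from above. The cone threshold is made up for illustration.</p>
<pre><code class="language-ts">// cos(45 degrees) is about 0.707. A dot above that means the player
// is inside a 90 degree cone in front of the guard.
const VISION_CONE = 0.707

function guardCanSee(guardForward: Vec3, guardPos: Vec3, playerPos: Vec3): boolean {
  const toPlayer = normalize({
    x: playerPos.x - guardPos.x,
    y: playerPos.y - guardPos.y,
    z: playerPos.z - guardPos.z,
  })
  return dot(guardForward, toPlayer) &gt; VISION_CONE
}
</code></pre>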
<hr />
<h1>Part 5. The Cross Product.</h1>
<p>The cross product takes two vectors and gives you back a new vector. Not a number like the dot product. An actual vector.</p>
<p>The new vector is perpendicular to both of the input vectors. Ninety degrees from both.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/8d3a2cf0-1f6d-487d-b764-d4ba1e3b542b.png" alt="" style="display:block;margin:0 auto" />

<h2>Why You Care About The Cross Product</h2>
<p><strong>Calculating normal vectors.</strong> A normal vector points straight out from a surface. Games need them for lighting and physics. If you have a triangle with three corners. You can take two of its edges as vectors. Cross product them. The result is the normal vector for that triangle. Every single 3D model uses this.</p>
<p><strong>Defining a 3D coordinate system.</strong> If you know which way is forward and which way is up for a character. You can cross product them to find which way is right. This is how character controllers and cameras work.</p>
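<p>In code the cross product looks like this. The output is perpendicular to both inputs.</p>
<pre><code class="language-ts">function cross(a: Vec3, b: Vec3): Vec3 {
  return {
    x: a.y * b.z - a.z * b.y,
    y: a.z * b.x - a.x * b.z,
    z: a.x * b.y - a.y * b.x,
  }
}

const up = { x: 0, y: 1, z: 0 }
const forward = { x: 0, y: 0, z: 1 }
cross(up, forward) // { x: 1, y: 0, z: 0 }, perpendicular to both
</code></pre>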
<hr />
<h1>Part 6. Matrices.</h1>
<p>A matrix is a grid of numbers. Rows and columns.</p>
<p>A 4x4 matrix has 16 numbers arranged in 4 rows and 4 columns. That is the most common one in games.</p>
<p>Matrices look scary but they do one main thing. They transform vectors.</p>
<p>Transform means change. Move. Rotate. Scale. Any of that.</p>
<p>When you multiply a matrix by a vector you get a new vector. The matrix is the transformation. The original vector is the input. The result is the transformed vector.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/a0a6e638-e945-46a1-9213-c2ddced0ceaf.png" alt="" style="display:block;margin:0 auto" />

<h2>The Three Transformations You Need</h2>
<p>There are three main kinds of transformations in games. Each one is just a specific matrix.</p>
<p><strong>Translation.</strong> Moves a point from one place to another. Adds some offset to the position. If you want to move something 5 units to the right. There is a translation matrix for that.</p>
<p><strong>Rotation.</strong> Spins a point around an axis. Rotating around the Y axis spins things like they are on a turntable. Rotating around the X axis tilts things up and down. Each rotation is its own matrix.</p>
<p><strong>Scale.</strong> Makes things bigger or smaller. A scale of 2 doubles the size. A scale of 0.5 halves it. You can even scale differently on each axis. Make something twice as tall without changing its width.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/c9881069-e1a4-4fd9-b511-d907a0808ef6.png" alt="" style="display:block;margin:0 auto" />
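<p>Here is a rough sketch of a 4x4 matrix transforming a point. The matrix is stored as a flat array of 16 numbers, rows first. The fourth component of the point is 1, which is what lets translation work.</p>
<pre><code class="language-ts">type Vec4 = [number, number, number, number]
type Mat4 = number[] // 16 numbers, row by row

function transform(m: Mat4, v: Vec4): Vec4 {
  const out: Vec4 = [0, 0, 0, 0]
  for (let row = 0; row &lt; 4; row++) {
    for (let col = 0; col &lt; 4; col++) {
      out[row] += m[row * 4 + col] * v[col]
    }
  }
  return out
}

// A translation matrix. Moves things 5 units along X.
const translate5X: Mat4 = [
  1, 0, 0, 5,
  0, 1, 0, 0,
  0, 0, 1, 0,
  0, 0, 0, 1,
]

transform(translate5X, [10, 0, 5, 1]) // [15, 0, 5, 1]
</code></pre>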

<h1>Part 7. Combining Transformations.</h1>
<p>Here is the really cool part. You can multiply matrices together to combine transformations.</p>
<p>If you have a rotation matrix and a translation matrix. Multiply them together. Now you have a single matrix that rotates AND translates in one step.</p>
<p>This is how every 3D object in a game gets placed into the world. The game has a matrix for each object that contains all the position and rotation and scale info combined. One matrix holds everything.</p>
<h2>Order Matters</h2>
<p>One thing to watch out for. The order you multiply matrices in changes the result. Rotating then moving is different from moving then rotating.</p>
<p>Think of it like directions. Walk 10 steps forward then turn right. You end up in a different place than if you turn right first and then walk 10 steps forward. Same operations. Different order. Different result.</p>
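<p>You can see it with the <code>transform</code> and <code>translate5X</code> sketches from above plus a 90 degree rotation around Y.</p>
<pre><code class="language-ts">// 90 degree rotation around the Y axis, row major.
const rotY90: Mat4 = [
   0, 0, 1, 0,
   0, 1, 0, 0,
  -1, 0, 0, 0,
   0, 0, 0, 1,
]

const point: Vec4 = [1, 0, 0, 1]

// Rotate first. Then translate.
transform(translate5X, transform(rotY90, point)) // [5, 0, -1, 1]

// Translate first. Then rotate.
transform(rotY90, transform(translate5X, point)) // [0, 0, -6, 1]
</code></pre>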
<h1>Part 8. The Big Picture. How It All Fits In A 3D Engine.</h1>
<p>Every 3D point you see on screen goes through a chain of matrix transformations before it gets drawn. This chain is called the <strong>MVP transformation</strong>. Model. View. Projection.</p>
<p><strong>Model Matrix.</strong> Takes a 3D object and places it in the world. Position. Rotation. Scale. All in one matrix per object.</p>
<p><strong>View Matrix.</strong> Takes the world and positions it relative to the camera. Like moving the whole world so the camera is at the origin looking forward.</p>
<p><strong>Projection Matrix.</strong> Takes the 3D world and squishes it down onto your 2D screen. This is where perspective happens. Things farther away look smaller.</p>
<p>Every vertex of every 3D model gets multiplied by these three matrices. Millions of times per frame. The GPU does this incredibly fast. That is why modern games can render huge detailed worlds.</p>
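<p>In code the chain is one matrix multiply after another. A sketch, assuming the <code>Mat4</code> and <code>transform</code> sketches from before. The <code>model</code>, <code>view</code>, <code>projection</code> and <code>vertexPosition</code> names stand in for whatever your engine built.</p>
<pre><code class="language-ts">// Compose two 4x4 matrices. multiply(a, b) applies b first, then a.
function multiply(a: Mat4, b: Mat4): Mat4 {
  const out: Mat4 = new Array(16).fill(0)
  for (let row = 0; row &lt; 4; row++) {
    for (let col = 0; col &lt; 4; col++) {
      for (let k = 0; k &lt; 4; k++) {
        out[row * 4 + col] += a[row * 4 + k] * b[k * 4 + col]
      }
    }
  }
  return out
}

// Model first. Then view. Then projection. Read right to left.
const mvp = multiply(projection, multiply(view, model))
const screenPosition = transform(mvp, vertexPosition)
</code></pre>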
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/a4a9eebb-d906-4dd3-95aa-5b52b84633db.png" alt="" style="display:block;margin:0 auto" />]]></content:encoded></item><item><title><![CDATA[Water in Games is Not Real]]></title><description><![CDATA[Introduction
Every ocean you have sailed. Every river you crossed. Every pool you dove into. None of it was real water. It was all a trick. A very clever trick built from math and textures and a lot o]]></description><link>https://tigerabrodi.blog/water-in-games-is-not-real</link><guid isPermaLink="true">https://tigerabrodi.blog/water-in-games-is-not-real</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Thu, 16 Apr 2026 03:14:30 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/84ecaa1e-a459-403b-96b5-223ae37b9d6a.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1>
<p>Every ocean you have sailed. Every river you crossed. Every pool you dove into. None of it was real water. It was all a trick. A very clever trick built from math and textures and a lot of creative cheating.</p>
<h1>Why Not Just Simulate Real Water?</h1>
<p>Real water consists of billions of molecules interacting. Simulating even a small portion in real time is too costly. Games must achieve 30 to 60 frames per second, and a true fluid simulation would exceed the frame budget.</p>
<p>So instead of simulating water. Games fake it. And they have gotten incredibly good at faking it.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/603ea4a2-a68a-4fef-a3a7-51d8e1c02249.png" alt="" style="display:block;margin:0 auto" />

<h1>The Surface. It Starts With Waves.</h1>
<p>The foundation of almost all game water is a flat plane. A simple grid of triangles. To make it look like water you need to make it move. You do that with waves.</p>
<h2>Sine Waves</h2>
<p>A sine wave is a smooth curve that goes up and down forever. It has two properties you care about.</p>
<p><strong>Amplitude</strong> is how tall the wave is. A bigger amplitude means taller peaks and deeper valleys.</p>
<p><strong>Wavelength</strong> is the distance between two peaks. A shorter wavelength means the wave repeats more often. Which means more detail.</p>
<p>You feed a position on your plane into a sine function and it gives you back a height. That height moves the surface up or down. Add time to the equation and the wave starts moving.</p>
<p>One sine wave looks boring. But if you add multiple sine waves together. Each with different amplitudes and wavelengths and directions. You start getting something that looks like water. This technique is called <strong>Sum of Sines</strong>.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/e7872e46-33df-4212-8aee-f89e4dec90da.png" alt="" style="display:block;margin:0 auto" />
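<p>A minimal sum of sines in code. The wave values here are made up. Real water tweaks them until it looks right.</p>
<pre><code class="language-ts">type Wave = { amplitude: number; wavelength: number; speed: number; dirX: number; dirZ: number }

const waves: Wave[] = [
  { amplitude: 1.0, wavelength: 20, speed: 2.0, dirX: 1.0, dirZ: 0.0 },
  { amplitude: 0.5, wavelength: 9,  speed: 1.3, dirX: 0.7, dirZ: 0.7 },
  { amplitude: 0.2, wavelength: 4,  speed: 2.8, dirX: 0.0, dirZ: 1.0 },
]

// Feed in a position on the plane plus time. Get back a height.
function waterHeight(x: number, z: number, time: number): number {
  let height = 0
  for (const w of waves) {
    const frequency = (2 * Math.PI) / w.wavelength
    const phase = (x * w.dirX + z * w.dirZ) * frequency + time * w.speed
    height += w.amplitude * Math.sin(phase)
  }
  return height
}
</code></pre>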

<h2>Sharper Peaks</h2>
<p>Basic sine waves are too smooth and round. Real ocean waves have sharper peaks and wider flat areas between them. There are modified wave equations that give you this sharper shape. One famous one is called the <strong>Gerstner wave</strong>. Another simpler approach uses an exponential function to sharpen the peaks. The idea is the same. You take your basic sine wave and reshape it so the tops are pointy and the bottoms are wide.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/d87f1071-9832-422a-a6fe-058bca34fbfe.png" alt="" style="display:block;margin:0 auto" />

<h2>Fractional Brownian Motion (FBM)</h2>
<p>This is a big one. FBM is a technique where you layer waves on top of each other in a structured way. Here is how it works.</p>
<p>You start with one big wave. Low frequency. High amplitude. This is your base shape. Then you add another wave on top. This one has higher frequency but lower amplitude. So it adds smaller detail on top of the big shape. Then you do it again. Even higher frequency. Even smaller amplitude. Finer detail.</p>
<p>You keep doing this. Each layer is called an <strong>octave</strong>. Each octave adds finer and finer detail but with less and less impact. The result is a surface that has large rolling shapes AND tiny ripples at the same time. Just like real water.</p>
<p>FBM is not just used for water. It is one of the most important algorithms in graphics. It is used to generate clouds. Terrain. Fire. Anything that needs natural-looking layered detail.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/6abd6c2e-c76c-4863-aca9-e7b6bbc0f147.png" alt="" style="display:block;margin:0 auto" />
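<p>The FBM loop is short. This sketch layers sine octaves for clarity. Real implementations usually layer noise instead, but the structure is identical.</p>
<pre><code class="language-ts">function fbm(x: number, time: number, octaves: number): number {
  let height = 0
  let frequency = 0.1 // start low. Big rolling shape.
  let amplitude = 1.0 // start high. Strong impact.
  for (let i = 0; i &lt; octaves; i++) {
    height += amplitude * Math.sin(x * frequency + time)
    frequency *= 2.0 // each octave: finer detail
    amplitude *= 0.5 // each octave: less impact
  }
  return height
}
</code></pre>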

<h2>Vertex Displacement</h2>
<p>All of these waves need to actually move the surface. The technique for this is called <strong>vertex displacement</strong>. Your water plane is a grid made of many vertices. Points in 3D space. A shader runs on the GPU and moves each vertex up or down based on the wave math. The GPU does this for every vertex every frame. It is extremely fast.</p>
<p><strong>Vertex</strong> means a corner point of a triangle in your 3D mesh. <strong>Shader</strong> means a small program that runs on the GPU. It controls how geometry is positioned and how pixels are colored. <strong>GPU</strong> means Graphics Processing Unit. The chip in your computer that is designed to do millions of small math operations at the same time.</p>
<p>The wave math runs inside a <strong>vertex shader</strong>. That is the part of the rendering pipeline where you can change the position of each vertex before it gets drawn on screen.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/203adc25-4eb3-49cb-adbc-808842c668ab.png" alt="" style="display:block;margin:0 auto" />
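<p>On the CPU the idea looks like this, using the <code>waterHeight</code> sketch from above. In a real engine this loop is the vertex shader and the GPU runs it for every vertex in parallel.</p>
<pre><code class="language-ts">type Vertex = { x: number; y: number; z: number }

function displaceWaterPlane(vertices: Vertex[], time: number): void {
  for (const v of vertices) {
    v.y = waterHeight(v.x, v.z, time) // move each vertex up or down
  }
}
</code></pre>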

<hr />
<h1>Making It Look Like Water. Lighting and Shading.</h1>
<p>Moving geometry alone looks like a white blob. The magic that makes it look like water is all in the lighting.</p>
<h2>Normal Vectors</h2>
<p>To calculate how light hits a surface you need to know which direction the surface is facing at every point. That direction is called a <strong>normal vector</strong>. On a flat floor every normal points straight up. On a wavy water surface the normals tilt in all sorts of directions following the bumps and dips.</p>
<p>The cool thing about the wave math approach is that you can calculate the exact normals using calculus. You take the derivative of your wave function. That gives you precise normals without any guessing. This matters because accurate normals mean accurate lighting.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/fc20fff3-ba3d-4b3f-b4a3-1d1922fd1d6c.png" alt="" style="display:block;margin:0 auto" />

<h2>Diffuse Lighting</h2>
<p>The most basic lighting calculation. You compare the normal vector to the direction of the light. If they face each other. The surface is bright. If they face away. The surface is dark. This is called <strong>Lambertian diffuse</strong>. It gives the water its basic form and shading.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/5dfe4c09-836e-4f79-81f3-a46a6972f3ea.png" alt="" style="display:block;margin:0 auto" />

<h2>Specular Highlights</h2>
<p>These are the bright sparkly spots you see where sunlight bounces off the water directly into your eyes. The calculation checks if your viewing angle lines up with the reflected light direction. If it does. You get a bright spot. This is what makes water look shiny and alive.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/e87133f8-99ea-41fd-8fd7-850a1624f7a0.png" alt="" style="display:block;margin:0 auto" />

<hr />
<h1>Reflections</h1>
<p>Water without reflections does not look like water. There are three main approaches games use.</p>
<p><strong>Ray Tracing</strong> traces light rays to calculate perfect reflections. Looks amazing. Very expensive. Only works on modern hardware.</p>
<p><strong>Screen Space Reflections (SSR)</strong> only reflects what is already visible on your screen. Cheap and common. But if the thing being reflected moves off screen the reflection just disappears. Easy to break. Most games use this anyway because it is fast.</p>
<p><strong>Cube Map Reflections</strong> use a pre-captured image of the environment stored on the six faces of a cube. You calculate the reflection direction from the camera and sample the cube map. This gives decent reflections cheaply. Most skybox reflections work this way.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/c8fbb40f-36fa-47f5-9a90-b06a6e318c53.png" alt="" style="display:block;margin:0 auto" />

<h2>The Fresnel Effect</h2>
<p>This is one of the most important visual tricks for water. Fresnel describes how the reflection strength changes based on your viewing angle.</p>
<p>Look straight down into water. You see through it clearly. Look across a lake at a shallow angle. It looks like a mirror.</p>
<p>Games blend between showing the reflection and showing what is under the water based on this angle. This single effect makes a huge difference in how real the water looks.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/507afee5-dd76-4241-a1f4-a0528dd24b99.png" alt="" style="display:block;margin:0 auto" />
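<p>A common way to approximate this is Schlick's formula. <code>cosTheta</code> is the dot product of the view direction and the surface normal. <code>f0</code> is how reflective the surface is when you look straight down. Roughly 0.02 for water.</p>
<pre><code class="language-ts">function fresnel(cosTheta: number, f0 = 0.02): number {
  return f0 + (1 - f0) * Math.pow(1 - cosTheta, 5)
}

fresnel(1.0) // looking straight down: about 0.02. See through.
fresnel(0.1) // grazing angle: about 0.6. Mostly mirror.
</code></pre>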

<hr />
<h1>Normal Maps. Faking the Small Stuff.</h1>
<p>You cannot add infinite wave detail through vertex displacement. At some point the triangles in your mesh are too big to show tiny ripples. This is where <strong>normal maps</strong> come in.</p>
<p>A normal map is a texture. But instead of storing color it stores direction information. Each pixel tells the shader "pretend the surface is tilted this way." This lets you add fine surface detail like small ripples and distortions without adding any actual geometry.</p>
<p>The classic water trick is to take two normal map textures and scroll them across the surface in different directions. They overlap and blend. Your brain sees complex moving ripples. In reality it is just two images sliding over a flat surface. This technique is everywhere. From Half-Life 2 to Cyberpunk 2077.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/33d1d3ec-74db-43eb-9e7c-ce4224ec783d.png" alt="" style="display:block;margin:0 auto" />
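<p>The scrolling itself is just UV offsets. A sketch. The speeds are made up.</p>
<pre><code class="language-ts">// Sample the same normal map twice with UVs sliding in different
// directions. Blend the two samples. The eye sees moving ripples.
function scrollingUVs(u: number, v: number, time: number) {
  const layerA = { u: u + time * 0.03, v: v + time * 0.010 }
  const layerB = { u: u - time * 0.02, v: v + time * 0.025 }
  return { layerA, layerB }
}
</code></pre>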

<hr />
<h1>Caustics. Light Patterns Underwater.</h1>
<p>Caustics are those dancing bright patterns you see on the bottom of a swimming pool. They happen because the wavy water surface bends light like a bunch of tiny magnifying glasses. Some spots on the floor get extra light focused on them. Other spots get less. This creates the signature wiggly web of bright lines.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/3cefabd5-d2fc-4d9d-bc2f-0528d84a7ac9.png" alt="" style="display:block;margin:0 auto" />

<p>In games the cheapest approach is just a <strong>scrolling texture</strong>. An animated image of caustic patterns projected onto surfaces under the water. Add some distortion and it looks convincing. Subnautica uses this approach heavily across its entire ocean floor.</p>
<p>More advanced methods actually trace where the light goes after hitting the water surface and calculate where the bright spots should be. This matches the wave shapes but costs more.</p>
<hr />
<h1>Interaction. Making It Feel Real.</h1>
<p>What really sells water is how it reacts to the world.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/bcebea6a-d137-42c7-b473-3d5555016e6e.png" alt="" style="display:block;margin:0 auto" />

<p><strong>Ripples from objects.</strong> When a character walks into water or a bullet hits the surface the game uses vertex displacement at that spot to create a ripple that spreads outward and fades.</p>
<p><strong>Splash particles.</strong> Sprite-based particle effects for splashes. Combined with ripples this gives you something like the bullet impacts in Red Dead Redemption 2.</p>
<p><strong>Foam.</strong> White foam that appears on wave crests and near shorelines. Usually a texture that is blended in based on wave height or proximity to objects.</p>
<p><strong>Buoyancy.</strong> Objects floating. The simplest version just pushes objects up when they are below the water line. More advanced versions calculate how much of the object is submerged and apply force proportionally. This gives natural bobbing behavior.</p>
<p><strong>Underwater effects.</strong> When the camera goes below the surface everything changes. Colors shift blue. Sound gets muffled. A fog effect limits visibility. The surface looks different from below. Each of these is a separate system.</p>
<hr />
<h1>The Recipe. Putting It All Together.</h1>
<p>Here is the full stack of what makes game water work.</p>
<ul>
<li><p>A flat plane mesh with enough vertices to deform.</p>
</li>
<li><p>Wave math using sum of sines or FBM to displace those vertices.</p>
</li>
<li><p>Normal calculation for accurate lighting.</p>
</li>
<li><p>Scrolling normal maps for fine surface detail.</p>
</li>
<li><p>Diffuse and specular lighting.</p>
</li>
<li><p>Fresnel-based blending between reflection and refraction.</p>
</li>
<li><p>Reflections via cube maps or SSR or ray tracing.</p>
</li>
<li><p>Caustic textures projected on underwater surfaces.</p>
</li>
<li><p>Particle effects for splashes and foam.</p>
</li>
<li><p>Interaction systems for ripples and buoyancy.</p>
</li>
</ul>
<p>Every game picks and chooses from this list based on their budget and hardware targets. A mobile game might only use scrolling normal maps on a flat plane. A game like Sea of Thieves uses almost everything on this list and then some.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/f11042dd-a20a-4a40-995e-9500b552e6ec.png" alt="" style="display:block;margin:0 auto" />]]></content:encoded></item><item><title><![CDATA[Ray Marching From the Ground Up]]></title><description><![CDATA[What Is Distance.
Before anything else. You need to understand one thing. Distance.
You have two dots on a piece of paper. The distance between them is just how far apart they are. You could measure i]]></description><link>https://tigerabrodi.blog/ray-marching-from-the-ground-up</link><guid isPermaLink="true">https://tigerabrodi.blog/ray-marching-from-the-ground-up</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Wed, 15 Apr 2026 20:04:12 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/ad8a68d6-3014-47f1-9e26-12b84ea5f69c.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>What Is Distance.</h1>
<p>Before anything else. You need to understand one thing. Distance.</p>
<p>You have two dots on a piece of paper. The distance between them is just how far apart they are. You could measure it with a ruler. That is it. Nothing fancy.</p>
<p>A computer can calculate this distance with a simple formula. You do not need to understand the formula. Just know that the computer can take any two points and instantly tell you how far apart they are.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/83574af0-61b4-4947-97ed-85320794fba6.png" alt="" style="display:block;margin:0 auto" />
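<p>If you are curious anyway, here is the formula in code. Pythagoras. Nothing more.</p>
<pre><code class="language-ts">function distance(ax: number, ay: number, bx: number, by: number): number {
  const dx = bx - ax
  const dy = by - ay
  return Math.sqrt(dx * dx + dy * dy)
}

distance(0, 0, 3, 4) // 5
</code></pre>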

<h1>What Is a Distance Field.</h1>
<p>Now imagine this. You have a circle sitting in the middle of a blank space. Pick any random point in that space. The computer can tell you how far that point is from the edge of the circle. Not the center. The edge.</p>
<p>Now imagine doing that for every single point in the space. Every point gets a number. That number is its distance to the nearest surface.</p>
<p>If you color those numbers. Close to the surface is dark. Far from the surface is bright. You get something that looks like a glowing ring. Bright far away. Dark near the circle.</p>
<p>That is a distance field. Every point in space knows how far it is from the nearest thing.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/8aa02ead-374f-4bb1-a8e7-dbfd7fcb7eb1.png" alt="" style="display:block;margin:0 auto" />

<h1>What Does Signed Mean.</h1>
<p>Here is where people get confused. But it is simple.</p>
<p>Signed just means the number can be positive or negative. Like a bank account. Positive means you have money. Negative means you owe money. The sign tells you which side you are on.</p>
<p>For a distance field it works the same way.</p>
<p>If you are outside the circle. The distance is positive. You are away from the surface.</p>
<p>If you are inside the circle. The distance is negative. You have gone past the surface.</p>
<p>If you are exactly on the edge. The distance is zero. You are right on the surface.</p>
<p>That is it. That is what signed means. Positive is outside. Negative is inside. Zero is the surface. The sign tells you which side you are on.</p>
<p>This is called a Signed Distance Function. SDF for short. A function that tells you how far you are from the nearest surface and whether you are inside or outside.</p>
<img src="https://v3b.fal.media/files/b/0a966427/106_YXguk_Cgyy0YrTnds_bbcc3NDW.png" alt="106_YXguk_Cgyy0YrTnds_bbcc3NDW.png" style="display:block;margin:0 auto" />
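<p>For a circle the whole SDF is one line, reusing the <code>distance</code> function from earlier.</p>
<pre><code class="language-ts">// Distance from the point to the center, minus the radius.
// Positive outside. Zero on the edge. Negative inside.
function circleSDF(px: number, py: number, cx: number, cy: number, radius: number): number {
  return distance(px, py, cx, cy) - radius
}
</code></pre>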

<h1>The Key Idea. You Only Know Distance. Nothing Else.</h1>
<p>Here is the weird part. The signed distance function only tells you one thing. How far away is the nearest surface.</p>
<p>It does not tell you where the surface is. It does not tell you what direction it is in. It does not tell you what shape it is.</p>
<p>Just the distance. That is all.</p>
<h1>How Ray Marching Works.</h1>
<p>Imagine you are standing in a dark room. You cannot see anything. But you have a special device. When you press a button it tells you how far away the nearest wall is. It does not tell you which direction the wall is. Just the distance.</p>
<p>You want to walk forward without hitting anything. Here is what you do.</p>
<p>Step 1. Press the button. It says "the nearest wall is 5 meters away." Great. You know it is safe to take 5 steps forward. No matter what direction the wall is in. You cannot hit anything within 5 meters.</p>
<p>Step 2. You walk 5 meters forward. Press the button again. Now it says "the nearest wall is 2 meters away." So you walk 2 meters forward.</p>
<p>Step 3. Press again. "0.3 meters." Walk 0.3 meters.</p>
<p>Step 4. Press again. "0.01 meters." You are basically touching the wall. You found a surface.</p>
<p>That is ray marching. You keep checking the distance. You keep stepping forward by exactly that distance. Each step is guaranteed to be safe because you cannot overshoot. The steps get smaller and smaller as you get closer. Until you hit something.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/a827153e-6fb2-4343-9e3e-00705c4e3637.png" alt="" style="display:block;margin:0 auto" />
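<p>The whole algorithm fits in a small loop. A sketch. <code>sceneSDF</code> is whatever signed distance function describes your scene, like the one built in the shapes section below.</p>
<pre><code class="language-ts">function rayMarch(
  startX: number, startY: number, // where the ray begins
  dirX: number, dirY: number,     // normalized ray direction
): number | null {
  let traveled = 0
  for (let step = 0; step &lt; 100; step++) {
    const x = startX + dirX * traveled
    const y = startY + dirY * traveled
    const dist = sceneSDF(x, y)
    if (dist &lt; 0.01) return traveled // basically touching. Surface found.
    traveled += dist                 // safe to step exactly this far
    if (traveled &gt; 1000) break       // ray escaped. Nothing out there.
  }
  return null // hit nothing
}
</code></pre>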

<h1>Why the Steps Are Circles.</h1>
<p>The distance is the same in every direction. So the safe zone at each step is a circle. Each circle gets smaller as you get closer to the surface.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/18610eae-d076-43d2-8527-ee198a65b739.png" alt="" style="display:block;margin:0 auto" />

<h1>Now Do That for Every Pixel.</h1>
<p>One ray finds one surface point. That gives you one pixel on your screen.</p>
<p>To render a full image. You shoot one ray for every pixel. A screen might have over two million pixels. So you shoot over two million rays. Each one marches forward step by step until it hits something or gives up.</p>
<p>The computer does all of these at the same time. GPUs are built for this. Millions of tiny tasks running in parallel.</p>
<p>When a ray hits a surface. You color that pixel. When a ray hits nothing. You color it as background. Do this for every pixel and you get a full image.</p>
<img src="https://v3b.fal.media/files/b/0a966441/xX1B4helEiPuZljtYOZPo_8WLy2X5H.png" alt="xX1B4helEiPuZljtYOZPo_8WLy2X5H.png" style="display:block;margin:0 auto" />

<h1>Building a Scene. Combining Shapes.</h1>
<p>You know how to render one shape now. But what about a scene with many shapes.</p>
<p>Here is the beautiful part. Remember. The distance function tells you how far the nearest surface is. If you have two shapes. You just check the distance to both. <strong>And take the smaller number. The smaller number is the nearest surface. That's the one which matters. It's the one you would hit first.</strong></p>
<p>In code this is just the min function. Take the minimum of two distances. Done. You now have two shapes in your scene.</p>
<p>Want 10 shapes. Take the min of all 10 distances.</p>
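<p>In code a scene is just the min over every shape. <code>boxSDF</code> here is a stand-in for any other shape's distance function.</p>
<pre><code class="language-ts">function sceneSDF(x: number, y: number): number {
  const circle = circleSDF(x, y, 0, 0, 1) // circle at origin, radius 1
  const box = boxSDF(x, y, 3, 0, 1, 1)    // some other shape
  return Math.min(circle, box)            // nearest surface wins
}
</code></pre>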
]]></content:encoded></item><item><title><![CDATA[Why The L System Trees Looked Wrong]]></title><description><![CDATA[Introduction
We wanted procedural trees. We kept getting trunks with leaf puffs. Pine looked acceptable. Oak. Birch. Maple. And sakura did not. They read like poles with crowns.
The important lesson i]]></description><link>https://tigerabrodi.blog/why-the-l-system-trees-looked-wrong</link><guid isPermaLink="true">https://tigerabrodi.blog/why-the-l-system-trees-looked-wrong</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Wed, 15 Apr 2026 18:56:45 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/c0bfbebc-d3fb-4394-9f1d-ef5644c07ecd.jpg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1>
<p>We wanted procedural trees. We kept getting trunks with leaf puffs. Pine looked acceptable. Oak, birch, maple, and sakura did not. They read like poles with crowns.</p>
<p>The important lesson is simple. The problem was not bark. The problem was not leaf textures. The problem was grammar.</p>
<h1>The Real Problem</h1>
<p>Our broadleaf presets were barely creating side branches in world space. We measured the generated segments instead of guessing.</p>
<pre><code class="language-ts">oak    { segments: 568, maxRadius: 0.29, branchishSegments: 0 }
pine   { segments: 54,  maxRadius: 0.76, branchishSegments: 40 }
birch  { segments: 406, maxRadius: 0.48, branchishSegments: 0 }
maple  { segments: 460, maxRadius: 0.48, branchishSegments: 0 }
sakura { segments: 414, maxRadius: 0.09, branchishSegments: 0 }
</code></pre>
<p>This told us the truth fast. The broadleaf trees were almost entirely vertical. They had leaves. They had bark. They did not have a real branch scaffold.</p>
<p>Pine looked better because it already had actual lateral structure. The other species were mostly pretending.</p>
<h1>The Broken Idea</h1>
<p>The old oak preset looked tree-like on paper.</p>
<pre><code class="language-ts">export const OAK = {
  axiom: 'FA',
  rules: [
    { predecessor: 'A', successor: '!F[&amp;FL!A]////[&amp;FL!A]////[&amp;FL!A]' },
    { predecessor: 'F', successor: 'S//F' },
    { predecessor: 'S', successor: 'F' },
    { predecessor: 'L', successor: '[^^-F+F+F-|-F+F+F]' },
  ],
}
</code></pre>
<p>It feels expressive. It is not structurally honest. The same symbols were trying to be trunk growth, branch growth, recursive continuation, and leaf volume all at the same time. The result was vertical motion with decorative foliage.</p>
<p>The renderer could not reveal branches that were never really generated.</p>
<h1>The Fix</h1>
<p>We split responsibilities. We introduced a real trunk symbol <code>T</code>. We introduced a real branch symbol <code>B</code>. Then we made trunk growth continue while branch growth peeled off to the sides.</p>
<pre><code class="language-ts">export const OAK = {
  axiom: 'T',
  rules: [
    { predecessor: 'T', successor: 'FF[+!B][-!B][//+!B][//-!B]T' },
    { predecessor: 'B', successor: 'F[+!B][-!B]F' },
  ],
}
</code></pre>
<p>This did three important things.</p>
<p>First. <code>T</code> keeps the main scaffold alive.</p>
<p>Second. <code>B</code> creates side limbs that can recurse without collapsing back into the trunk.</p>
<p>Third. The extra <code>FF</code> at the start gives each branch system some travel distance before more splitting. That makes limbs readable instead of glued into one top puff.</p>
<p>We applied the same idea to birch, maple, and sakura with different angles and decay values.</p>
<h1>The Supporting Fixes</h1>
<p>We also kept real taper per segment instead of faking radius only at major rule points.</p>
<pre><code class="language-ts">const nextRadius = state.radius * config.segmentTaper
segments.push({
  startRadius: state.radius,
  endRadius: nextRadius,
})
state.radius = nextRadius
</code></pre>
<p>That made branches feel more branch-like once the scaffold existed.</p>
<p>We also added a test that checks broadleaf trees actually spread outward.</p>
<pre><code class="language-ts">expect(maxRadius).toBeGreaterThan(1.5)
expect(branchCount).toBeGreaterThan(30)
</code></pre>
<p>That matters because the bug was visual but the cause was structural. We wanted a test that protects structure.</p>
<h1>The Result</h1>
<p>After the rewrite the numbers changed hard.</p>
<pre><code class="language-ts">oak    { segments: 218, maxRadius: 2.73, branchCount: 149 }
pine   { segments: 54,  maxRadius: 0.76, branchCount: 27 }
birch  { segments: 114, maxRadius: 1.90, branchCount: 48 }
maple  { segments: 218, maxRadius: 2.72, branchCount: 152 }
sakura { segments: 218, maxRadius: 2.66, branchCount: 171 }
</code></pre>
<p>That is why the trees finally started reading like trees with branches instead of sticks with toppings.</p>
<p>Oak got scaffold limbs.</p>
<p>Maple got a real canopy.</p>
<p>Sakura got visible structure under the blossom mass.</p>
<p>Birch stayed lighter and more delicate.</p>
<p>Pine stayed good because it was already the closest to a real branching grammar.</p>
<h1>What I Want To Remember</h1>
<p>If a procedural tree looks fake then do not start by tuning leaf textures or bark maps.</p>
<p>Measure branch spread first.</p>
<p>If horizontal spread is near zero then the grammar is lying to you.</p>
<p>Fix the scaffold first.</p>
<p>Then style it.</p>
]]></content:encoded></item><item><title><![CDATA[Nanite Explained: How Modern Game Engines Render Billions of Triangles Without Exploding]]></title><description><![CDATA[Introduction
I went into a bit of a rabbithole studying how Nanite works. Here are some of the things I learned and the write-up I wish existed.
The Four Problems
Every triangle you render costs somet]]></description><link>https://tigerabrodi.blog/nanite-explained-how-modern-game-engines-render-billions-of-triangles-without-exploding</link><guid isPermaLink="true">https://tigerabrodi.blog/nanite-explained-how-modern-game-engines-render-billions-of-triangles-without-exploding</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Wed, 15 Apr 2026 13:26:11 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/9bf07f82-cab8-4823-a27a-2ad5947e9d0b.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1>
<p>I went into a bit of a rabbit hole studying how Nanite works. Here are some of the things I learned and the write-up I wish existed.</p>
<h1>The Four Problems</h1>
<p>Every triangle you render costs something. When you have millions of them four problems start screaming at you.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/403c10fe-248a-4acc-b8f0-ced9aa8089ab.png" alt="" style="display:block;margin:0 auto" />

<h2>Draw Calls</h2>
<p>A draw call is a command. The CPU tells the GPU "draw this object". One object. One draw call. Sounds simple.</p>
<p>The problem is the coordination. Every draw call requires the CPU and GPU to sync up. The CPU prepares the command. The GPU receives it. They handshake. This takes time. Not because the drawing is slow. But because the communication is slow.</p>
<p>1000 objects means 1000 draw calls. The CPU spends all its time just talking to the GPU instead of doing useful work. The GPU sits there waiting. This is a bottleneck. The slowest part that holds everything else back.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/74199c28-e2e3-48b5-8329-1687d204f06c.png" alt="" style="display:block;margin:0 auto" />

<h2>Overhead</h2>
<p>Overhead is everything that is NOT drawing pixels. Setting up shaders. Binding textures. Switching materials. State changes. Every time the GPU switches from one material to another there is a cost. All this extra work adds up.</p>
<p>More objects with different materials means more overhead. The GPU spends time preparing instead of rendering.</p>
<h2>Memory</h2>
<p>Triangles take space. A single vertex needs about 32 to 44 bytes. That includes its position in 3D space. Its normal direction. Its texture coordinates. Each triangle also needs 12 bytes for index data.</p>
<p>A 1 million triangle mesh uses roughly 30 to 140 megabytes depending on how it is structured. That lives in VRAM. "VRAM" is the GPU's own memory. It is fast but limited. Fill it up and performance falls off a cliff.</p>
<h2>Computation</h2>
<p>Every triangle goes through a pipeline. The vertex shader transforms it into screen space. The rasterizer figures out which pixels it covers. The fragment shader colors those pixels. Multiply that by millions of triangles and you have an enormous amount of math every single frame.</p>
<p>If you want 60 frames per second you have about 16 milliseconds to do ALL of this. For the entire scene. Every object. Every triangle. Every pixel. 16 milliseconds. That is not a lot.</p>
<h1>The Techniques We Had Before Nanite</h1>
<p>People have been fighting these four problems for decades. Here are the main tools they built.</p>
<h2>Normal Baking</h2>
<p>You start with two models. A high detail version with millions of triangles. And a low detail version with a few thousand. You "bake" the surface detail from the high model into a texture called a normal map. This texture stores the direction each tiny surface patch faces. When light hits the low model it reads the normal map and PRETENDS the surface has all that detail.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/9b511253-f7b6-4be2-bb39-986ef8dde678.png" alt="" style="display:block;margin:0 auto" />

<p>The result looks almost the same. But the GPU only processes a few thousand triangles instead of millions.</p>
<p><strong>The good:</strong> Huge geometry reduction. Low computation. Low overhead.</p>
<p><strong>The bad:</strong> Medium memory cost because you need to store the normal map textures. Medium manual labor because an artist has to create both the high and low poly versions and bake the map carefully. Also the silhouette of the object is still simple. The edges of the shape look low poly because normal maps only fake surface detail. They cannot change the actual outline.</p>
<h2>LOD. Level of Detail</h2>
<p>You create multiple versions of the same model. LOD 0 is the full detail version. LOD 1 has fewer triangles. LOD 2 even fewer. LOD 3 is very simple. The engine swaps between them based on distance from the camera.</p>
<p>Close up you see LOD 0. Far away you see LOD 3. The player never notices.</p>
<p><strong>The good:</strong> High geometry reduction. Saves a lot of GPU work for distant objects.</p>
<p><strong>The bad:</strong> HIGH manual labor. Someone has to create 3 or 4 versions of every single model. That is a lot of work across hundreds of assets. Also "popping". When the engine swaps LOD levels the model visibly changes for a frame. Players can see it. It breaks immersion. And you need to store all versions in memory.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/89b96676-f99f-4b18-9cf0-7d7154cc039c.png" alt="" style="display:block;margin:0 auto" />

<h2>Subdivision Modeling</h2>
<p>The opposite approach. You start with a simple low poly cage. The computer subdivides it. "Subdivide" means split each face into smaller faces and smooth the result. You control how many times it subdivides. More subdivisions means more detail -&gt; more triangles!</p>
<p><strong>The good:</strong> Low manual labor. You build one simple model and the computer generates the detail.</p>
<p><strong>The bad:</strong> Low geometry reduction. It ADDS triangles. It does not remove them. High computation because subdividing at runtime is expensive. It can only add detail. It cannot reduce it based on distance.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/2a5a4abb-fe73-4a3a-906d-d1fffc4052c4.png" alt="" style="display:block;margin:0 auto" />

<h2>Voxels</h2>
<p>Instead of triangles you represent the world as a 3D grid of cubes. Like Minecraft. Each cell in the grid is either filled or empty. You can vary the grid resolution. Coarse grid for far away. Fine grid for close up.</p>
<p><strong>The good:</strong> Low manual labor. Voxel worlds can be generated by code. Dynamic detail adjustment by changing grid resolution.</p>
<p><strong>The bad:</strong> High memory because you store a big 3D grid. High computation to convert voxels into something renderable. Blocky look. Sharp edges are hard to represent. Texturing is problematic because voxels do not naturally support UV coordinates.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/6a24454e-1daa-40f8-8834-df4c27da01f1.png" alt="" style="display:block;margin:0 auto" />

<h2>The Summary</h2>
<p>The first two techniques handle most problems but require a lot of manual work. The second two are more automatic but have other costs. None of them solve everything.</p>
<p>What if there was a system that handled all four problems. Automatically. No manual LOD creation. No popping. No wasted triangles.</p>
<h1>The Fastest Triangle to Render</h1>
<p>Before diving into Nanite there is one idea you need to accept.</p>
<p><strong>The fastest triangle to render is the one you never send to the GPU.</strong></p>
<p>Not "the smallest triangle". Not "the simplest triangle". The one you skip entirely. If you can figure out which triangles the player will never see and throw them away before the GPU even touches them you win. Every triangle you skip saves draw call time. Overhead. Memory. Computation. All four problems at once.</p>
<p>Think about a statue with 33 million triangles. Now put 500 of those statues in a room. That is over 16 billion triangles. You cannot render all of them. But you do not need to. Most of those triangles are either too far away to matter. Or facing away from the camera. Or hidden behind other objects.</p>
<p><strong>The entire job is figuring out which ones to skip.</strong></p>
<h1>Clustering. Organizing the Chaos</h1>
<p>Checking 33 million triangles one by one is way too slow. You need to group them.</p>
<p>A cluster is a small group of triangles. Usually around 128 of them. Instead of asking "should I render this triangle" 33 million times you ask "should I render this cluster" about 250,000 times. Much better.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/53f360b7-0f5d-4604-a8a4-5f030ef5eebf.png" alt="" style="display:block;margin:0 auto" />

<h2>Bounding Volumes</h2>
<p>Each cluster gets wrapped in a simple shape. A sphere or a box. This is called a bounding volume. It is a quick approximation of where the cluster is in space.</p>
<p>Testing "does this simple box intersect the camera view" is way faster than testing 128 individual triangles. If the box is not visible you skip all 128 triangles inside it in one check.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/ee8a1669-d5b4-42fb-809f-39aaccb8e406.png" alt="" style="display:block;margin:0 auto" />

<h2>Bounding Volume Hierarchy. BVH</h2>
<p>You can nest these bounding volumes. Wrap two clusters in a bigger box. Wrap two of those bigger boxes in an even bigger box. Keep going until you have one box that contains the whole object.</p>
<p>This creates a tree structure. At the top is the whole object. At the bottom are individual clusters.</p>
<p>To check visibility you start at the top. <strong>If the big box is not visible you skip EVERYTHING inside it.</strong> One check eliminates thousands of clusters. If it IS visible you go one level deeper and check the two medium boxes. And so on.</p>
<p>This reduces the number of checks from N to log(N). In a scene with 1000 clusters that means roughly 10 checks instead of 1000. Massive speedup.</p>
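<p>A sketch of the traversal. The <code>Box</code> type and <code>isVisible</code> test are hypothetical. The point is that one failed check skips an entire subtree.</p>
<pre><code class="language-ts">type Box = { minX: number; minY: number; minZ: number; maxX: number; maxY: number; maxZ: number }

type BVHNode = { bounds: Box; children: BVHNode[]; clusterId?: number }

declare function isVisible(bounds: Box): boolean // e.g. a frustum test

function collectVisible(node: BVHNode, out: number[]): void {
  if (!isVisible(node.bounds)) return // skip EVERYTHING inside
  if (node.clusterId !== undefined) {
    out.push(node.clusterId) // leaf: a cluster worth considering
    return
  }
  for (const child of node.children) {
    collectVisible(child, out)
  }
}
</code></pre>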
<h1>Automated LOD With Clusters</h1>
<p>Take your clusters of 128 triangles. That is LOD 0. Full detail.</p>
<p>Now take two neighboring clusters. 256 triangles total. Merge them into one group. Simplify down to 128 triangles. Split into two new clusters.</p>
<p>You went from 256 to 128. Half the detail. One LOD level up. No artist involved. Fully automatic.</p>
<p>Repeat at every level. Each level roughly halves the triangle count until you have a very coarse version of the whole mesh at the top.</p>
<p>The order matters. Merge first. Then simplify. <strong>If you simplify each cluster alone the shared edges between neighbors change independently.</strong> They no longer line up. You get cracks. Visible gaps in the mesh.</p>
<p>By merging first the shared edge is no longer a boundary. It is in the middle of the merged group. Just regular geometry. The simplification cleans it up like any other edge. No cracks.</p>
<p>But here is the problem. Two clusters can only be merged if something can reach both of them. In a tree each cluster has one parent. If two clusters have different parents nobody can merge them. Their shared boundary is stuck forever. Level after level these stuck edges pile up. The mesh fills with dense geometry that cannot be simplified. That is mesh cruft.</p>
<p>The fix is simple. <strong>Let a cluster have two parents.</strong> Both sides of any boundary can reach the shared cluster. Both sides can merge across it. Every boundary gets cleaned up. Nothing is stuck.</p>
<p>That is a DAG. A Directed Acyclic Graph. "Directed" means connections flow one way. Parent to child. "Acyclic" means no loops. The only difference from a tree is that one child can have two parents.</p>
<p>Nanite does not use a DAG because it is fancy. Merging clusters naturally creates shared children. The DAG is just what you end up with when you allow that.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/a47c03c3-020e-4e4a-b463-7f0d0f1749d4.png" alt="" style="display:block;margin:0 auto" />

<h2>Screen Space Error</h2>
<p>Every simplified mesh is slightly different from the original. That difference is the error. A fixed number measured in world units. It never changes.</p>
<p>What changes is how big that error looks on screen. 0.5 centimeters of error up close might cover 8 pixels. The player sees it. 200 meters away the same 0.5 centimeters covers less than one pixel. The player cannot see it. The monitor cannot even display it.</p>
<p>The rule. If the error is smaller than one pixel use the cheaper version.</p>
<p>Each cluster has multiple LOD levels. Each level has a known error. The engine picks the cheapest level where the error is below one pixel. Done.</p>
<p><strong>This happens per cluster. Not per object. Front of a rock might be at LOD 0. Back of the same rock at LOD 3.</strong> Each cluster picks independently every frame. Changes are always smaller than one pixel. That is why there is no popping. The switches are invisible by definition.</p>
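<p>The selection logic is small. A sketch. The screen scale constant is made up; the real one falls out of the camera's field of view and resolution.</p>
<pre><code class="language-ts">// How big a fixed world space error looks at a given distance.
function errorInPixels(worldError: number, distance: number): number {
  const pixelsPerUnitAtOneMeter = 1000 // hypothetical screen scale
  return (worldError / distance) * pixelsPerUnitAtOneMeter
}

// lodErrors[0] is full detail. Higher index means coarser and cheaper.
function pickLOD(lodErrors: number[], distance: number): number {
  for (let lod = lodErrors.length - 1; lod &gt;= 0; lod--) {
    if (errorInPixels(lodErrors[lod], distance) &lt; 1) return lod // invisible switch
  }
  return 0 // nothing is under a pixel. Use full detail.
}
</code></pre>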
<h1>One Giant Draw Call</h1>
<p>Traditional rendering. The CPU tells the GPU "draw this rock". Then "draw this wall". Then "draw this floor". One command per object. 1000 objects means 1000 commands. The CPU spends all its time talking.</p>
<p>Nanite packs all surviving cluster triangles into one big block of data in VRAM. Sends one command. "Draw all of this". Done.</p>
<p>The GPU does the rest. It runs the culling. Picks the LOD levels. Decides what to render. All on its own. The CPU barely participates. This works because GPUs have thousands of cores that run in parallel. Checking 250,000 clusters is 250,000 tiny tasks. Perfect GPU work. A CPU with 16 cores would choke on that. A GPU with thousands of cores eats it.</p>
<p>The only split is materials. Metal needs different shader code than wood. So triangles with different materials still need separate commands. But all the triangles within one material get batched together. Way fewer commands than one per object.</p>
<h1>Culling. Throwing Away What You Cannot See</h1>
<p>Even with automated LODs and batching you still have too many clusters. The next step is figuring out which ones to skip entirely. This is culling.</p>
<p>Culling means removing. Deciding what NOT to render. Three types. Each one catches different things.</p>
<h2>Frustum Culling</h2>
<p>The frustum is the shape of what your camera can see. It looks like a pyramid with the tip cut off. The near end is small and close to the camera. The far end is wide and far away.</p>
<p>Anything outside this shape is off screen. Do not render it. A building behind you. Gone. A tree far to the left. Gone.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/8d4d3111-88c9-4206-8396-cbc5407381d1.png" alt="" style="display:block;margin:0 auto" />

<p>You test each cluster's bounding volume against the frustum. If the bounding box does not intersect the frustum the whole cluster is skipped. Fast and simple.</p>
<p>But frustum culling has a blind spot. It keeps everything INSIDE the frustum. Even objects hidden behind a wall. You can see the wall but not the building behind it. Yet both pass the frustum test.</p>
<h2>Backface Culling</h2>
<p>Every triangle has two sides. A front face and a back face. If a triangle faces away from the camera you are looking at its back. You cannot see it. Skip it.</p>
<p>How does the GPU know which side faces you. Triangle vertices are listed in a specific order. If they appear clockwise on screen the triangle faces you. Counter clockwise means it faces away. The GPU checks this with very fast math.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/65372e19-2bf9-48ab-808f-d4f71d01f62f.png" alt="" style="display:block;margin:0 auto" />

<p>The other way to check is using the dot product. Each triangle has a normal vector pointing outward. If the dot product of the view direction and the normal is positive the triangle faces away. "Dot product" is a math operation that tells you how much two directions agree. Positive means same direction. Negative means opposite directions.</p>
<p>About half of all triangles in any scene face away from you at any given time. So backface culling alone removes roughly 50 percent of the work.</p>
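<p>The dot product version of the test is one line. A sketch with a plain vector type.</p>
<pre><code class="language-ts">type Vec3 = { x: number; y: number; z: number }

// normal points out of the triangle. viewDir points from the camera
// toward the triangle. Positive dot means it faces away. Cut it.
function facesAway(normal: Vec3, viewDir: Vec3): boolean {
  return normal.x * viewDir.x + normal.y * viewDir.y + normal.z * viewDir.z &gt; 0
}
</code></pre>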
<h2>Occlusion Culling</h2>
<p>The hardest one. A cluster passes the frustum test. Its triangles face the camera. But there is a wall in front of it. The player cannot see the cluster because something else blocks it. That is occlusion.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/f5a84573-829f-4e96-9a05-62c9af6d2d7f.png" alt="" style="display:block;margin:0 auto" />

<p>To figure this out you need to know what is in front of what. That requires the depth buffer.</p>
<h2>The Depth Buffer</h2>
<p>The depth buffer is an image the GPU creates while rendering. Every pixel stores how far away the closest surface is. It is a grayscale image. Close things are dark. Far things are bright.</p>
<p>If you want to check whether a cluster is hidden you compare its depth against the depth buffer. If every pixel where the cluster would appear already has something closer then the cluster is fully hidden. Skip it.</p>
<p>But here is a problem. You need the depth buffer to decide what to cull. But you need to render things to build the depth buffer. Chicken and egg.</p>
<h2>Hierarchical Z Buffer. Hi-Z</h2>
<p>This is the trick that makes occlusion culling fast.</p>
<p>First a quick reminder. The depth buffer is an image the GPU builds while rendering. <strong>Every pixel stores how far away the closest visible surface is at that spot on screen. Keyword is visible. Only things that were actually drawn get recorded. If something was culled it is not in there.</strong> -&gt; This took me a while to grasp. This is the key thing. Visible!</p>
<p>Now the problem. Say you want to check if a cluster is hidden behind something. That cluster might cover 10,000 pixels on screen. To check properly you would have to read 10,000 depth values and compare each one. That is slow.</p>
<p>Hi-Z fixes this. You take the depth buffer and build a mipmap chain. A mipmap is a series of smaller copies of the image. Full resolution. Half. Quarter. Eighth. All the way down to one pixel.</p>
<p>But here is the key difference from normal mipmaps. Normal texture mipmaps average the pixels together. Hi-Z takes the MAXIMUM depth value. The furthest point.</p>
<p>Why maximum. This is the part that matters for occlusion.</p>
<p>Say four pixels in the depth buffer have values 5. 8. 3. 10. Those numbers are distances. Something visible was rendered at distance 5 at one pixel. Something at distance 8 at another. And so on. The max is 10. That means the FURTHEST visible thing in that group of four pixels is at distance 10. Everything else there is even closer.</p>
<p><strong>Now you have a cluster at distance 12. It is further away than 10. That means it is further away than EVERYTHING rendered in that group. Even the furthest thing there is still closer than the cluster. So the cluster is behind all of it. Fully hidden. That is occlusion. One check covered four pixels.</strong></p>
<p>Go up another mipmap level. One pixel now covers 16 original pixels. Same logic. One check covers 16 pixels. Next level. 64 pixels. Then 256. Then 1024. A cluster that covers a large area on screen can be tested with just one or two reads at a coarse mipmap level instead of thousands of reads.</p>
<p>To test a cluster you figure out how big it would be on screen. Pick the mipmap level where one pixel roughly covers that area. Read the max depth value. Compare it to the cluster's distance. Cluster is further away. It is behind everything there. Hidden. Cut it. Cluster is closer. It might be visible. Keep it.</p>
<p>This is conservative. It plays it safe. Sometimes it says "maybe visible" when the cluster is actually hidden. That is fine. You render a little extra. But it NEVER says "hidden" when the cluster is actually visible. That would mean missing objects on screen. That would be a bug. So it errs on the side of doing a little extra work rather than skipping something it should not.</p>
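<p>Building one Hi-Z level is a small loop. A sketch with the depth buffer as a flat array. Each output pixel takes the MAX of a 2x2 block. The furthest visible thing. Not the average.</p>
<pre><code class="language-ts">function buildHiZLevel(depth: number[], width: number, height: number): number[] {
  const outW = width / 2
  const outH = height / 2
  const out: number[] = new Array(outW * outH)
  for (let y = 0; y &lt; outH; y++) {
    for (let x = 0; x &lt; outW; x++) {
      const a = depth[2 * y * width + 2 * x]
      const b = depth[2 * y * width + 2 * x + 1]
      const c = depth[(2 * y + 1) * width + 2 * x]
      const d = depth[(2 * y + 1) * width + 2 * x + 1]
      out[y * outW + x] = Math.max(a, b, c, d) // furthest visible depth
    }
  }
  return out
}

// The test: a cluster whose depth is greater than the stored max is
// behind everything rendered there. Hidden. Cut it.
</code></pre>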
<h2>Nanite's Two Pass Occlusion</h2>
<p>Nanite takes this further with two passes.</p>
<p><strong>Pass 1:</strong> Use last frame's depth buffer. Test all clusters against it. Most things that were hidden last frame are still hidden this frame. This catches the majority of occluded clusters immediately.</p>
<p><strong>Pass 2:</strong> Render the clusters that survived pass 1. Build a new depth buffer. Now test any remaining uncertain clusters against this fresh depth buffer. This catches things that just moved into view or edge cases the old depth buffer missed.</p>
<p>Two passes sounds like more work. But it means almost nothing gets rendered unnecessarily. The total work is less than doing it in one pass with guesswork.</p>
<h1>The Small Triangle Problem</h1>
<p>There is one more thing Nanite had to solve.</p>
<p>The GPU processes pixels in groups of 2x2. These groups are called quads. <strong>When a triangle is so small it only covers one pixel in that quad the GPU still runs the fragment shader for all four pixels.</strong> Three of those four pixels are wasted work. Up to 75 percent waste.</p>
<p>With millions of small triangles this waste adds up fast.</p>
<p>Nanite's solution. For clusters where triangle edges are smaller than 32 pixels it uses a software rasterizer. "Software rasterizer" means custom code running on the GPU that replaces the normal hardware rasterization pipeline. This custom code knows how to handle tiny triangles efficiently. No wasted quads.</p>
<p>For larger triangles the normal hardware rasterizer runs as usual. It is already fast for big triangles.</p>
<p>This split made Nanite's rasterization three times faster.</p>
<h1>The Full Picture</h1>
<p>Nanite is not the GPU. Nanite is software built by Epic Games. A system of code that runs on the GPU and tells it exactly what to do. The GPU is the muscle. Nanite is the brain.</p>
<p>It works in two phases.</p>
<h2>Phase 1. Before The Game Runs</h2>
<p>This happens once. When the artist imports a mesh into Unreal Engine.</p>
<h3>Clustering</h3>
<p>An artist imports a 33 million triangle statue. Nanite splits it into clusters. Small groups of 128 triangles each.</p>
<h3>Building The DAG</h3>
<p>It builds the DAG. The structure where clusters can share boundaries with multiple parents. This is what allows clean simplification without cracks.</p>
<h3>Creating The LOD Levels</h3>
<p>Merge neighboring clusters. Simplify. Split into new clusters. Repeat. Each level has roughly half the triangles of the level below it. Old boundaries get cleaned up because they become interior edges after merging. No manual work. Fully automatic.</p>
<h3>Calculating The Error</h3>
<p>For each LOD level it measures how many centimeters the simplified version differs from the original. This number is baked in. It never changes.</p>
<h3>Storing The Data</h3>
<p>Two things get stored. The metadata. Bounding boxes. Error values. Facing directions. DAG connections. This is lightweight. A few megabytes. And the actual triangle data for every LOD level. This is heavy. Sits on disk.</p>
<p>Done. Never repeated.</p>
<hr />
<h2>Phase 2. Every Frame</h2>
<p>The metadata sits in RAM. Always available. The GPU runs through it and filters the clusters down step by step.</p>
<h3>Frustum Culling</h3>
<p>Is this cluster on screen. Test the bounding box against the camera frustum. Off screen. Cut it.</p>
<h3>Backface Culling</h3>
<p>Is this cluster facing the camera. Check the stored normal direction against the view direction. Facing away. Cut it.</p>
<h3>Screen Space Error</h3>
<p>Which LOD level does this cluster need. Convert the error value to pixels based on distance from camera. Pick the cheapest level where the error is below one pixel. Invisible to the player.</p>
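<p>The conversion itself is ordinary perspective projection. A rough sketch, assuming a vertical field of view and a viewport height in pixels:</p>
<pre><code class="language-ts">// Project a world space error (in meters) into screen pixels.
function getErrorInPixels({
  worldError,
  distance,
  fovYRadians,
  viewportHeight,
}: {
  worldError: number
  distance: number
  fovYRadians: number
  viewportHeight: number
}): number {
  // Height in meters of the view frustum at this distance.
  const frustumHeight = 2 * distance * Math.tan(fovYRadians / 2)
  return (worldError / frustumHeight) * viewportHeight
}

// Pick the coarsest LOD whose projected error stays below one pixel.
</code></pre>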
<h3>Occlusion Culling Pass 1</h3>
<p>Was this cluster hidden last frame. Check against last frame's Hi-Z depth buffer. Further away than everything at that spot. Cut it.</p>
<h3>Occlusion Culling Pass 2</h3>
<p>Render everything that made it this far. Build a fresh depth buffer. Test uncertain clusters again. Still hidden. Cut it.</p>
<h3>Streaming</h3>
<p>Now the GPU has a small list of clusters that passed every check. Only their triangle data gets streamed from disk into VRAM. Everything else stays on disk untouched.</p>
<h3>Rasterization</h3>
<p>Clusters with tiny triangles where edges are smaller than 32 pixels go through the software rasterizer. Custom code. No wasted pixels. Bigger triangles go through the normal hardware rasterizer.</p>
<h3>The Draw Call</h3>
<p>Here is the important part. By the time the draw call happens the GPU already threw away most of the scene. The culling. The LOD selection. All of that happened before this step. The buffer only contains the clusters that passed every single check.</p>
<p>So the CPU sends one command per material. "Draw this". But "this" is not 33 million triangles. It is maybe 50,000. The GPU is not drawing everything. It is drawing the tiny fraction that actually matters. One command. Small amount of real work.</p>
<p>Frame done. 16 milliseconds. Next frame. Do it all again.</p>
<h1>When to Use Nanite. And When Not To</h1>
<p>Nanite is not the answer to everything.</p>
<p><strong>Use it for:</strong> Scenes with extremely high poly static assets. Photogrammetry scans. Rocks. Walls. Architectural models. Anything that is large. Opaque. And does not move.</p>
<p><strong>Do not use it for:</strong> Dynamic or animated objects like characters. Transparent materials. Masked materials like foliage and leaves. Very small detailed objects. In these cases traditional LOD and culling techniques still work better.</p>
]]></content:encoded></item><item><title><![CDATA[Next.js use cache: remote: A Distributed Cache in One Line]]></title><description><![CDATA[Introduction
I discovered this while digging through the Vercel docs and the Next.js cache components skill. I was trying to understand how caching actually works when you deploy to Vercel. What I fou]]></description><link>https://tigerabrodi.blog/next-js-use-cache-remote-a-distributed-cache-in-one-line</link><guid isPermaLink="true">https://tigerabrodi.blog/next-js-use-cache-remote-a-distributed-cache-in-one-line</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Wed, 15 Apr 2026 00:23:37 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/d45c0679-6498-4d1d-9b1d-ca6e20410644.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1>
<p>I discovered this while digging through the Vercel docs and the Next.js cache components skill. I was trying to understand how caching actually works when you deploy to Vercel. What I found is one of the most powerful features in Next.js 16 that not enough people are talking about.</p>
<p>You can write <code>'use cache: remote'</code> inside any async function and get a shared distributed cache across all your serverless instances. No Redis. No config. No infrastructure. Just a directive.</p>
<h1>What Cache Components Are</h1>
<p>Next.js 16 introduced Cache Components. You enable them with one line in your config:</p>
<pre><code class="language-ts">// next.config.ts
const nextConfig = {
  cacheComponents: true,
};
</code></pre>
<p>This unlocks three directives:</p>
<ul>
<li><p><code>'use cache'</code>: in-memory cache on the server</p>
</li>
<li><p><code>'use cache: remote'</code>: shared remote cache across all instances</p>
</li>
<li><p><code>'use cache: private'</code>: per-user browser cache</p>
</li>
</ul>
<p>You put these directives inside async functions. Next.js caches the return value. That is it.</p>
<h1>The Problem with Plain <code>use cache</code></h1>
<p>Plain <code>'use cache'</code> stores data in memory. On Vercel, your app runs as serverless functions. Each function instance has its own memory. When the instance shuts down, the cache is gone.</p>
<p>So this happens:</p>
<ol>
<li><p>User A hits your site. Instance A spins up. Fetches data. Caches it in memory.</p>
</li>
<li><p>User B hits your site. Instance B spins up. It has no idea about Instance A's cache. Fetches data again.</p>
</li>
<li><p>Instance A gets recycled. Cache gone.</p>
</li>
</ol>
<p>For build-time prerendered content this is fine. The cached result is baked into the static shell and served from the CDN. But for runtime data, anything that runs at request time, the in-memory cache is basically useless in a serverless environment.</p>
<h1>What <code>use cache: remote</code> Does Differently</h1>
<p><code>'use cache: remote'</code> stores the cached data in a shared key-value store that all serverless instances can access. On Vercel, this is called Runtime Cache. It lives in the same region as your function. You do not set it up. Vercel provides it automatically.</p>
<p>Now:</p>
<ol>
<li><p>User A hits your site. Instance A fetches data. Stores it in the remote cache.</p>
</li>
<li><p>User B hits your site. Instance B checks the remote cache. Finds the data. Skips the database.</p>
</li>
<li><p>Instance A gets recycled. Does not matter. The cache is in the remote store, not in memory.</p>
</li>
</ol>
<p>Every user. Every serverless instance. Same region. Same cache.</p>
<h1>How It Works in Practice</h1>
<p>Say you have a homepage with featured products from Supabase.</p>
<pre><code class="language-tsx">import { createClient } from "@/utils/supabase/server";
import { cacheTag, cacheLife } from "next/cache";

export async function getFeaturedProducts() {
  "use cache: remote";
  cacheTag("products");
  cacheLife("max");

  const supabase = await createClient();
  const { data } = await supabase
    .from("products")
    .select("*")
    .eq("featured", true);

  return data;
}
</code></pre>
<p>First request hits Supabase. Result goes into the remote cache. Every request after that skips Supabase entirely. Your database sees one request instead of thousands.</p>
<p>Use it in your page like any normal function:</p>
<pre><code class="language-tsx">export default async function HomePage() {
  const products = await getFeaturedProducts();
  return (
    &lt;section&gt;
      {products.map((p) =&gt; (
        &lt;ProductCard key={p.id} product={p} /&gt;
      ))}
    &lt;/section&gt;
  );
}
</code></pre>
<p>It does not matter what data source you use. Supabase. Prisma. Drizzle. Raw fetch. A GraphQL client. <code>'use cache: remote'</code> caches the return value of the function. It does not care how the data was fetched.</p>
<h1>Cache Forever Until You Invalidate</h1>
<p>You can cache data indefinitely. Use <code>cacheLife('max')</code> or set a custom long expiration:</p>
<pre><code class="language-tsx">cacheLife({
  stale: 31536000,
  revalidate: 31536000,
  expire: 31536000,
});
</code></pre>
<p>That is roughly one year. The cache stays until you explicitly kill it. Which brings us to the best part.</p>
<h1>Invalidation with Tags</h1>
<p>Every cached function can be tagged with <code>cacheTag()</code>. When your data changes, call <code>revalidateTag()</code> to drop the cache.</p>
<pre><code class="language-tsx">// Tag your cached functions
async function getHeroContent() {
  "use cache: remote";
  cacheTag("homepage", "hero");
  cacheLife("max");
  return db.hero.findFirst();
}

async function getFeaturedProducts() {
  "use cache: remote";
  cacheTag("homepage", "products");
  cacheLife("max");
  return db.products.findMany({ where: { featured: true } });
}

async function getTestimonials() {
  "use cache: remote";
  cacheTag("homepage", "testimonials");
  cacheLife("max");
  return db.testimonials.findMany();
}
</code></pre>
<p>Notice the trick. Each function has a shared <code>'homepage'</code> tag and a specific tag. So you can invalidate just products. Or nuke the entire homepage cache at once.</p>
<pre><code class="language-tsx">"use server";

import { revalidateTag } from "next/cache";

// Invalidate just products
export async function onProductUpdate() {
  revalidateTag("products");
}

// Invalidate the entire homepage
export async function refreshHomepage() {
  revalidateTag("homepage");
}
</code></pre>
<p>Call these from a server action, a webhook, an admin panel. The cache entry drops across every region. Next request rebuilds it fresh.</p>
<p>There are two invalidation functions:</p>
<ul>
<li><p><code>revalidateTag()</code>: background revalidation. Current request might still see stale data. Next request sees fresh data.</p>
</li>
<li><p><code>updateTag()</code>: immediate. Same request sees fresh data.</p>
</li>
</ul>
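<p>A rough sketch of where <code>updateTag()</code> fits, using the same placeholder <code>db</code> as above. The difference only matters when the same request needs to read its own write, like a form that re-renders with fresh data:</p>
<pre><code class="language-tsx">"use server";

import { updateTag } from "next/cache";

export async function saveHero(formData: FormData) {
  await db.hero.update({ title: String(formData.get("title")) });
  // Immediate invalidation. The page this action re-renders
  // already sees the new hero content.
  updateTag("hero");
}
</code></pre>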
<h1>Time-Based Invalidation</h1>
<p>If you do not want to manage tags manually, use time-based expiration:</p>
<pre><code class="language-tsx">async function getTrendingPosts() {
  "use cache: remote";
  cacheLife("hours"); // revalidates every few hours
  return db.posts.findMany({ orderBy: { views: "desc" } });
}
</code></pre>
<p>Built-in profiles: <code>'minutes'</code>, <code>'hours'</code>, <code>'days'</code>, <code>'weeks'</code>, <code>'max'</code>.</p>
<p>Or go custom:</p>
<pre><code class="language-tsx">cacheLife({
  stale: 300, // 5 min. serve old data while refreshing
  revalidate: 600, // 10 min. background refresh interval
  expire: 3600, // 1 hour. hard expiration
});
</code></pre>
<p><code>stale</code> means the data is old but still served while fresh data is fetched in the background. <code>revalidate</code> is how often Next.js checks for new data behind the scenes. <code>expire</code> is the hard cutoff where the cache is completely gone.</p>
<h1>Where the Cache Actually Lives</h1>
<p>On Vercel, <code>use cache: remote</code> uses Vercel's Runtime Cache. It is a key-value store in each region where your functions run. You do not configure it.</p>
<ul>
<li><p>Shared across all users in the same region. Yes.</p>
</li>
<li><p>Shared across all serverless instances. Yes.</p>
</li>
<li><p>Shared across regions. No. Each region has its own cache.</p>
</li>
<li><p>Survives new deployments. Yes.</p>
</li>
<li><p>Shared between preview and production. No. Separate environments.</p>
</li>
</ul>
<p>The word "non-durable" in the docs means Vercel can evict entries under memory pressure. It is a cache, not a database. Your database is still the source of truth. If the cache misses, your function just re-fetches and caches again.</p>
<p>If you are self-hosting (not on Vercel), you need to configure your own cache handler via <code>cacheHandlers</code> in <code>next.config.ts</code>. You can use Redis, Memcached, DynamoDB, whatever. But on Vercel it is zero config.</p>
<h1>One Rule: No Runtime APIs Inside</h1>
<p>You cannot call <code>cookies()</code>, <code>headers()</code>, or read <code>searchParams</code> inside a <code>'use cache: remote'</code> function. These change per request. Caching them makes no sense.</p>
<p>The fix: read them outside, pass the value in as an argument.</p>
<pre><code class="language-tsx">async function ProductPrice({ productId }: { productId: string }) {
  const currency = (await cookies()).get("currency")?.value ?? "USD";
  const price = await getPrice(productId, currency);
  return (
    &lt;span&gt;
      {price} {currency}
    &lt;/span&gt;
  );
}

async function getPrice(productId: string, currency: string) {
  "use cache: remote";
  cacheTag(`price-${productId}`);
  cacheLife({ expire: 3600 });

  // Cache key = productId + currency
  // All users with same currency share this entry
  return db.products.getPrice(productId, currency);
}
</code></pre>
<p>The arguments automatically become part of the cache key. Different arguments create different cache entries. So all USD users share one entry. All EUR users share another.</p>
<h1>Smart Cache Key Design</h1>
<p>Cache on dimensions with few unique values. Filter the rest in memory.</p>
<pre><code class="language-tsx">// BAD: caching per price filter creates thousands of entries
async function getProducts(category: string, minPrice: number) {
  "use cache: remote";
  return db.products.find({ category, minPrice });
}

// GOOD: cache per category, filter price in memory
async function getProducts(category: string) {
  "use cache: remote";
  return db.products.findByCategory(category);
}

// Then in your component
const products = await getProducts("electronics");
const filtered = products.filter((p) =&gt; p.price &gt;= minPrice);
</code></pre>
<p>Same idea with user data. Do not cache per user ID. Cache per language or role or region. Fewer entries. Higher hit rate.</p>
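<p>As a sketch, with hypothetical table helpers in the same style as above:</p>
<pre><code class="language-tsx">// GOOD: a handful of locales means a handful of entries, high hit rate
async function getNavigation(locale: string) {
  "use cache: remote";
  cacheTag("navigation");
  return db.navigation.findByLocale(locale);
}

// BAD: one entry per user means almost every request is a cache miss
async function getDashboard(userId: string) {
  "use cache: remote";
  return db.dashboards.findByUser(userId);
}
</code></pre>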
<h1>The Three Directives Compared</h1>
<table>
<thead>
<tr>
<th></th>
<th><code>use cache</code></th>
<th><code>use cache: remote</code></th>
<th><code>use cache: private</code></th>
</tr>
</thead>
<tbody><tr>
<td>Storage</td>
<td>In-memory</td>
<td>Remote KV store</td>
<td>Browser only</td>
</tr>
<tr>
<td>Shared across users</td>
<td>Yes</td>
<td>Yes</td>
<td>No</td>
</tr>
<tr>
<td>Shared across instances</td>
<td>No</td>
<td>Yes</td>
<td>N/A</td>
</tr>
<tr>
<td>Access cookies/headers</td>
<td>No</td>
<td>No</td>
<td>Yes</td>
</tr>
<tr>
<td>Extra cost</td>
<td>None</td>
<td>Infrastructure</td>
<td>None</td>
</tr>
<tr>
<td>Best for</td>
<td>Static shell content</td>
<td>Runtime shared data</td>
<td>Per-user compliance</td>
</tr>
</tbody></table>
<h1>Why This Matters</h1>
<p>Before this, if you wanted a distributed cache in front of your database, you had to set up Redis or Memcached, write cache keys manually, handle invalidation yourself, and manage infrastructure.</p>
<p>Now you write <code>'use cache: remote'</code> and <code>cacheTag('products')</code> inside a function. Call <code>revalidateTag('products')</code> when data changes. Done. A distributed cache with declarative invalidation built into your component tree.</p>
<p>Your database goes from handling every single request to handling one request per cache lifetime per region. That is a massive reduction in load, cost, and latency.</p>
<p>It is one of the most practical features in modern web development and I think more people should know about it.</p>
]]></content:encoded></item><item><title><![CDATA[How To Implement A Level And XP System With Convex]]></title><description><![CDATA[The goal
A good level and XP system should do three things well. It should be easy to reason about. It should be hard to exploit. It should be flexible enough to support more than one reward source. T]]></description><link>https://tigerabrodi.blog/how-to-implement-a-level-and-xp-system-with-convex</link><guid isPermaLink="true">https://tigerabrodi.blog/how-to-implement-a-level-and-xp-system-with-convex</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Wed, 15 Apr 2026 00:19:15 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/29d86bd4-ad85-4e76-b1bd-21b830569e3a.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>The goal</h1>
<p>A good level and XP system should do three things well. It should be easy to reason about. It should be hard to exploit. It should be flexible enough to support more than one reward source. That means the math should be pure, and the write path should be centralized.</p>
<h1>What to store</h1>
<p>Keep the long lived progression state on the profile.</p>
<pre><code class="language-ts">profiles: defineTable({
  xp: v.number(),
  level: v.number(),
  streakCount: v.number(),
})
</code></pre>
<p><code>xp</code> is the real source of truth for progression. <code>level</code> is a cached convenience field. <code>streakCount</code> matters if your XP system includes a streak multiplier.</p>
<h1>Keep level math pure</h1>
<p>Do not calculate level progression ad hoc inside mutations. Keep the math in one pure module. This is the pattern.</p>
<pre><code class="language-ts">function getXpForNextLevel({ currentLevel }: { currentLevel: number }): number {
  return Math.floor(100 * Math.pow(1.3, currentLevel - 1))
}

function getLevelFromXp({ totalXp }: { totalXp: number }): number {
  let level = 1
  let cumulativeXp = 0

  while (true) {
    const xpForNextLevel = getXpForNextLevel({ currentLevel: level })
    if (cumulativeXp + xpForNextLevel &gt; totalXp) {
      return level
    }

    cumulativeXp += xpForNextLevel
    level += 1
  }
}

function getCurrentLevelProgress({ totalXp }: { totalXp: number }) {
  const currentLevel = getLevelFromXp({ totalXp })
  let cumulativeXpToCurrentLevel = 0

  for (let level = 1; level &lt; currentLevel; level += 1) {
    cumulativeXpToCurrentLevel += getXpForNextLevel({ currentLevel: level })
  }

  const xpIntoLevel = totalXp - cumulativeXpToCurrentLevel
  const xpForNextLevel = getXpForNextLevel({ currentLevel })

  return {
    currentLevel,
    xpIntoLevel,
    xpForNextLevel,
    percentComplete: xpForNextLevel === 0 ? 0 : xpIntoLevel / xpForNextLevel,
  }
}
</code></pre>
<p>This gives you one clear level curve and one clear progress calculation.</p>
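<p>A quick worked example with this curve. Level 1 to 2 costs 100 XP and level 2 to 3 costs 130 XP, so a player with 150 total XP is level 2, 50 XP into the level:</p>
<pre><code class="language-ts">const progress = getCurrentLevelProgress({ totalXp: 150 })
// {
//   currentLevel: 2,
//   xpIntoLevel: 50,
//   xpForNextLevel: 130,
//   percentComplete: 0.3846...
// }
</code></pre>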
<h1>Decide your reward sources explicitly</h1>
<p>Do not just sprinkle XP everywhere. Name the reward sources. That makes analytics, balancing, and future refactors much easier. This is the pattern.</p>
<pre><code class="language-ts">type XpAwardSource = 'word' | 'world' | 'chapter'

type XpAward = {
  source: XpAwardSource
  baseXp: number
}

const LEVEL_COMPLETION_XP = 10
const WORLD_COMPLETION_BONUS_XP = 50
const CHAPTER_COMPLETION_BONUS_XP = 150
</code></pre>
<p>Now your system is not a pile of anonymous numbers. It is a list of named rewards.</p>
<h1>Add the streak multiplier as a pure function</h1>
<p>If you want streaks to matter, keep that logic pure too.</p>
<pre><code class="language-ts">function getStreakMultiplier({ streakDays }: { streakDays: number }): number {
  if (streakDays &lt;= 2) return 1
  if (streakDays &lt;= 6) return 1.2
  if (streakDays &lt;= 29) return 1.5
  if (streakDays &lt;= 99) return 2
  if (streakDays &lt;= 364) return 2.5

  const yearsAfterFirst = Math.floor((streakDays - 365) / 365)
  return 3 + yearsAfterFirst * 0.5
}

function calculateXpEarned({
  baseXp,
  streakMultiplier,
}: {
  baseXp: number
  streakMultiplier: number
}) {
  return Math.floor(baseXp * streakMultiplier)
}
</code></pre>
<p>This makes the multiplier easy to test and easy to rebalance later.</p>
<h1>Support multiple XP awards in one completion</h1>
<p>This is the part many systems get wrong. A single completion can trigger more than one reward. For example. A level completion reward. A world completion bonus. A chapter completion bonus. If you model XP as one number only, the logic becomes messy fast. Instead, accept an array of award items.</p>
<pre><code class="language-ts">function getXpBatchAwardOutcome({
  awards,
  currentXp,
  streakMultiplier,
}: {
  awards: ReadonlyArray&lt;XpAward&gt;
  currentXp: number
  streakMultiplier: number
}) {
  const awardsBreakdown = awards.map((award) =&gt; ({
    ...award,
    xpEarned: calculateXpEarned({
      baseXp: award.baseXp,
      streakMultiplier,
    }),
  }))

  const totalXpEarned = awardsBreakdown.reduce(
    (total, award) =&gt; total + award.xpEarned,
    0
  )

  const newXp = currentXp + totalXpEarned
  const previousLevel = getLevelFromXp({ totalXp: currentXp })
  const newLevel = getLevelFromXp({ totalXp: newXp })

  return {
    awardsBreakdown,
    previousXp: currentXp,
    newXp,
    previousLevel,
    newLevel,
    didLevelUp: newLevel &gt; previousLevel,
    levelsGained: newLevel - previousLevel,
    totalXpEarned,
  }
}
</code></pre>
<p>This is much easier to understand. It also gives the frontend a clean payload for XP animations.</p>
<h1>Centralize the write path in one internal mutation</h1>
<p>This is the most important implementation detail. Do not let every gameplay mutation patch XP by itself. Create one internal XP mutation that applies awards. This is the pattern.</p>
<pre><code class="language-ts">export const awardXp = internalMutation({
  args: {
    profileId: v.id('profiles'),
    awards: v.array(
      v.object({
        baseXp: v.number(),
        source: v.union(
          v.literal('word'),
          v.literal('world'),
          v.literal('chapter')
        ),
      })
    ),
  },
  handler: async (ctx, args) =&gt; {
    const profile = await ctx.db.get(args.profileId)
    if (!profile || profile.deletedAt) {
      throw appError({
        code: 'NOT_FOUND',
        message: 'Profile not found',
      })
    }

    const streakMultiplier = getStreakMultiplier({
      streakDays: profile.streakCount,
    })

    const outcome = getXpBatchAwardOutcome({
      awards: args.awards,
      currentXp: profile.xp,
      streakMultiplier,
    })

    await ctx.db.patch(profile._id, {
      xp: outcome.newXp,
      level: outcome.newLevel,
    })

    return outcome
  },
})
</code></pre>
<p>That keeps the write path consistent. It also makes it very hard to award XP twice by accident in different places.</p>
<h1>Why this shape is good</h1>
<p>This shape gives you a few strong properties. The math is pure. The mutation is simple. Every reward is explicit. Every reward source is trackable. The frontend gets a clean response with <code>didLevelUp</code>, <code>levelsGained</code>, and <code>totalXpEarned</code>. That is exactly what a dynamic UI needs.</p>
<h1>How to think about the algorithm</h1>
<p>The algorithm is not just math. It is product design. Here are the practical questions that matter. How fast should level one feel. How long should level growth take later. How much should streaks matter. Should world and chapter bonuses feel rare and meaningful. Should replay give zero XP. The code should make those decisions easy to change. That is another reason pure helpers are so valuable.</p>
<h1>A practical reward model</h1>
<p>A simple model that works well is this. <code>10</code> XP for a normal level. <code>50</code> bonus XP for finishing a world. <code>150</code> bonus XP for finishing a chapter. Then apply the streak multiplier once to each award item. That gives you good momentum without making the math hard to follow.</p>
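<p>Here is that model in numbers, using the helpers above. Finishing a level that also completes a world, on a 7 day streak (multiplier 1.5):</p>
<pre><code class="language-ts">const outcome = getXpBatchAwardOutcome({
  awards: [
    { source: 'word', baseXp: LEVEL_COMPLETION_XP }, // floor(10 * 1.5) = 15
    { source: 'world', baseXp: WORLD_COMPLETION_BONUS_XP }, // floor(50 * 1.5) = 75
  ],
  currentXp: 80,
  streakMultiplier: getStreakMultiplier({ streakDays: 7 }), // 1.5
})
// outcome.totalXpEarned === 90
// outcome.newXp === 170, which crosses the 100 XP mark,
// so outcome.didLevelUp === true
</code></pre>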
<h1>Replay should not touch progression</h1>
<p>This rule should be strict. Replay is for practice. Not for farming XP. So in replay mode. No progress writes. No XP writes. No unlock changes. That keeps the system fair and keeps the mental model clean.</p>
<h1>What to test</h1>
<p>The best part of this system is that most of the hard parts are pure. Test these first: <code>getXpForNextLevel</code>, <code>getLevelFromXp</code>, <code>getCurrentLevelProgress</code>, <code>getStreakMultiplier</code>, <code>calculateXpEarned</code>, and <code>getXpBatchAwardOutcome</code>. Those tests give you confidence before you ever touch a real mutation.</p>
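<p>A minimal example of what those tests look like, assuming Vitest and the pure helpers above in scope:</p>
<pre><code class="language-ts">import { describe, expect, test } from 'vitest'

describe('getLevelFromXp', () =&gt; {
  test('level 1 to 2 costs 100 XP on this curve', () =&gt; {
    expect(getLevelFromXp({ totalXp: 0 })).toBe(1)
    expect(getLevelFromXp({ totalXp: 99 })).toBe(1)
    expect(getLevelFromXp({ totalXp: 100 })).toBe(2)
  })
})
</code></pre>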
<h1>The main lesson</h1>
<p>A good XP system is not one mutation. It is a small progression engine. If you keep the engine pure and keep the write path centralized, the rest of the app gets much easier to build.</p>
]]></content:encoded></item><item><title><![CDATA[How I Fixed Slow Convex Storage Assets With Convex]]></title><description><![CDATA[The problem
We had a simple problem. Query prefetch was working. Images and audio still felt cold. That meant the next screen could have its data ready, but still show an image pop in late, or start a]]></description><link>https://tigerabrodi.blog/how-i-fixed-slow-convex-storage-assets-with-convex</link><guid isPermaLink="true">https://tigerabrodi.blog/how-i-fixed-slow-convex-storage-assets-with-convex</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Wed, 15 Apr 2026 00:17:16 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/c0066818-c956-4131-a8b1-3fc6b524cbd5.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>The problem</h1>
<p>We had a simple problem. Query prefetch was working. Images and audio still felt cold. That meant the next screen could have its data ready, but still show an image pop in late, or start audio late. That feels bad. It makes the app feel slower than it really is.</p>
<h1>The first thing we checked</h1>
<p>We traced the actual asset requests. The built in Convex storage URLs looked like this.</p>
<pre><code class="language-txt">https://&lt;deployment&gt;.convex.cloud/api/storage/&lt;storageId&gt;
</code></pre>
<p>Then we checked the response headers. The important detail was the cache policy. The built in storage response was not behaving like a strong public immutable asset. That was the root issue. The browser could cache it, but not in the strongest way we wanted for repeated image and audio usage.</p>
<h1>The constraint</h1>
<p>We did not want to move assets out of Convex. That is important. The goal was not to replace Convex storage. The goal was to keep assets in Convex, and serve them better.</p>
<h1>The fix in one sentence</h1>
<p>We kept assets in Convex storage, added our own cached asset route on <code>convex.site</code>, rewrote storage URLs to that route on the client, and made preloading much stricter for both images and audio.</p>
<h1>Step 1, create a cached asset route in Convex</h1>
<p>The first fix was backend. We added a custom HTTP route that reads from Convex storage and serves the file with stronger caching headers. This is the important pattern.</p>
<pre><code class="language-ts">import { httpRouter } from 'convex/server'
import type { Id } from './_generated/dataModel'
import { httpAction } from './_generated/server'

const http = httpRouter()

const ONE_YEAR_IN_SECONDS = 60 * 60 * 24 * 365
const CACHE_CONTROL = `public, max-age=${ONE_YEAR_IN_SECONDS}, s-maxage=${ONE_YEAR_IN_SECONDS}, immutable`

http.route({
  pathPrefix: '/cached-assets/',
  method: 'GET',
  handler: httpAction(async (ctx, request) =&gt; {
    const url = new URL(request.url)
    const storageId = decodeURIComponent(
      url.pathname.slice('/cached-assets/'.length)
    ) as Id&lt;'_storage'&gt;

    const metadata = await ctx.storage.getMetadata(storageId)
    if (!metadata) {
      return new Response('Asset not found', { status: 404 })
    }

    const etag = `"${metadata.sha256}"`
    const ifNoneMatch = request.headers.get('if-none-match')

    if (ifNoneMatch?.includes(etag)) {
      return new Response(null, {
        status: 304,
        headers: {
          'Cache-Control': CACHE_CONTROL,
          'CDN-Cache-Control': CACHE_CONTROL,
          ETag: etag,
        },
      })
    }

    const blob = await ctx.storage.get(storageId)
    if (!blob) {
      return new Response('Asset not found', { status: 404 })
    }

    return new Response(blob, {
      headers: {
        'Access-Control-Allow-Origin': '*',
        'Cache-Control': CACHE_CONTROL,
        'CDN-Cache-Control': CACHE_CONTROL,
        'Content-Type':
          metadata.contentType ?? blob.type ?? 'application/octet-stream',
        'Cross-Origin-Resource-Policy': 'cross-origin',
        ETag: etag,
      },
    })
  }),
})

export default http
</code></pre>
<p>This gives you a stable URL shape and proper immutable caching, while still reading from Convex storage.</p>
<h1>Step 2, normalize every Convex storage URL on the client</h1>
<p>Once we had a better route, we needed to make the app use it everywhere. The clean way is one helper.</p>
<pre><code class="language-ts">const BUILT_IN_STORAGE_PATH_PREFIX = '/api/storage/'
const CACHED_ASSET_PATH_PREFIX = '/cached-assets/'

function getCachedAssetUrl(url: string | null | undefined) {
  if (!url) {
    return null
  }

  if (
    url.startsWith('/') ||
    url.startsWith('data:') ||
    url.startsWith('blob:')
  ) {
    return url
  }

  const assetUrl = new URL(url)

  if (!assetUrl.pathname.startsWith(BUILT_IN_STORAGE_PATH_PREFIX)) {
    return url
  }

  const storageId = assetUrl.pathname.slice(BUILT_IN_STORAGE_PATH_PREFIX.length)
  const convexSiteUrl = (
    import.meta.env.VITE_CONVEX_SITE_URL as string
  ).replace(/\/$/, '')

  return `${convexSiteUrl}${CACHED_ASSET_PATH_PREFIX}${encodeURIComponent(storageId)}`
}
</code></pre>
<p>That helper matters a lot. Without it, you end up fixing one component at a time forever.</p>
<h1>Step 3, make image preloading strict</h1>
<p>Our old image preloader was too optimistic. It marked assets as preloaded too early. That is not enough. You want to dedupe in flight work, wait for real load, and wait for decode if possible. This is the pattern.</p>
<pre><code class="language-ts">const preloadedImagePaths = new Set&lt;string&gt;()
const inFlightImagePreloads = new Map&lt;string, Promise&lt;void&gt;&gt;()

function preloadImage(path: string) {
  const normalizedPath = getCachedAssetUrl(path)
  if (!normalizedPath) {
    return Promise.resolve()
  }

  if (preloadedImagePaths.has(normalizedPath)) {
    return Promise.resolve()
  }

  const existing = inFlightImagePreloads.get(normalizedPath)
  if (existing) {
    return existing
  }

  const preloadPromise = new Promise&lt;void&gt;((resolve) =&gt; {
    const image = new Image()
    let hasSettled = false

    function settle(didLoad: boolean) {
      if (hasSettled) return
      hasSettled = true
      inFlightImagePreloads.delete(normalizedPath)
      if (didLoad) {
        preloadedImagePaths.add(normalizedPath)
      }
      resolve()
    }

    image.decoding = 'async'
    image.addEventListener(
      'load',
      () =&gt; {
        if (typeof image.decode === 'function') {
          void image
            .decode()
            .catch(() =&gt; {})
            .finally(() =&gt; settle(true))
          return
        }

        settle(true)
      },
      { once: true }
    )

    image.addEventListener('error', () =&gt; settle(false), { once: true })
    image.src = normalizedPath
  })

  inFlightImagePreloads.set(normalizedPath, preloadPromise)
  return preloadPromise
}
</code></pre>
<h1>Step 4, preload audio as a first class thing</h1>
<p>Audio needs the same treatment. If you only solve images, the app still feels cold. This is the audio version of the same idea.</p>
<pre><code class="language-ts">const preloadedAudioUrls = new Set&lt;string&gt;()
const inFlightAudioPreloads = new Map&lt;string, Promise&lt;void&gt;&gt;()

function preloadAudioUrl(url: string) {
  const normalizedUrl = getCachedAssetUrl(url)
  if (!normalizedUrl) {
    return Promise.resolve()
  }

  if (preloadedAudioUrls.has(normalizedUrl)) {
    return Promise.resolve()
  }

  const existing = inFlightAudioPreloads.get(normalizedUrl)
  if (existing) {
    return existing
  }

  const preloadPromise = new Promise&lt;void&gt;((resolve) =&gt; {
    const audio = new Audio()
    let hasSettled = false

    function settle(didLoad: boolean) {
      if (hasSettled) return
      hasSettled = true
      inFlightAudioPreloads.delete(normalizedUrl)
      if (didLoad) {
        preloadedAudioUrls.add(normalizedUrl)
      }
      resolve()
    }

    audio.preload = 'auto'
    audio.addEventListener('canplaythrough', () =&gt; settle(true), { once: true })
    audio.addEventListener('loadeddata', () =&gt; settle(true), { once: true })
    audio.addEventListener('error', () =&gt; settle(false), { once: true })
    audio.src = normalizedUrl
    audio.load()
  })

  inFlightAudioPreloads.set(normalizedUrl, preloadPromise)
  return preloadPromise
}
</code></pre>
<h1>Step 5, make runtime playback use the same normalized URLs</h1>
<p>This part is easy to miss. If preload warms one URL and playback uses another URL, you lose the benefit. So your playback layer should normalize URLs too.</p>
<pre><code class="language-ts">function getOrCreateAudio(path: string): HTMLAudioElement {
  const cachedPath = getCachedAssetUrl(path) ?? path
  const cached = state.cache.get(cachedPath)
  if (cached) return cached

  const audio = new Audio(cachedPath)
  state.cache.set(cachedPath, audio)
  return audio
}
</code></pre>
<p>We applied the same rule in our sound manager, entry voice manager, route music controller, and celebration audio flow.</p>
<h1>Step 6, warm the network early</h1>
<p>Even with perfect preload logic, the first request still pays DNS and TLS if the browser has not warmed those origins yet. So we warmed both Convex origins early.</p>
<pre><code class="language-html">&lt;link rel="dns-prefetch" href="%VITE_CONVEX_URL%" /&gt;
&lt;link rel="preconnect" href="%VITE_CONVEX_URL%" crossorigin /&gt;
&lt;link rel="dns-prefetch" href="%VITE_CONVEX_SITE_URL%" /&gt;
&lt;link rel="preconnect" href="%VITE_CONVEX_SITE_URL%" crossorigin /&gt;
</code></pre>
<p>That does not solve everything by itself. But it helps the first useful request.</p>
<h1>Step 7, prefetch media, not just queries</h1>
<p>This was the final important lesson. Query prefetch is not enough for a media heavy app. You also need to preload the exact image and audio for the next screen. This is the pattern we used on hover.</p>
<pre><code class="language-ts">prefetchGameplayScreen(
  {
    chapterId,
    worldId,
    replayMode: false,
  },
  {
    onResult: (result) =&gt; {
      if (!result) {
        return
      }

      void preloadImages(
        [result.level.imageUrl, result.world.imageUrl].filter(
          (url): url is string =&gt; Boolean(url)
        )
      )

      void preloadAudioUrls(
        [
          result.level.wordAudioUrl,
          result.world.musicUrl,
          result.world.completionVoiceUrl,
        ].filter((url): url is string =&gt; Boolean(url))
      )
    },
  }
)
</code></pre>
<p>This is what makes the next screen actually feel warm.</p>
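<p>One note on that snippet. <code>preloadImages</code> and <code>preloadAudioUrls</code> are not shown above. Presumably they are thin wrappers that fan out over the single asset helpers and wait for all of them:</p>
<pre><code class="language-ts">function preloadImages(paths: Array&lt;string&gt;) {
  return Promise.all(paths.map((path) =&gt; preloadImage(path)))
}

function preloadAudioUrls(urls: Array&lt;string&gt;) {
  return Promise.all(urls.map((url) =&gt; preloadAudioUrl(url)))
}
</code></pre>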
<h1>What changed after the fix</h1>
<p>After this change, the app started doing four things together. It warmed the connection earlier. It rewrote Convex storage URLs to a better cached Convex route. It deduped in flight image and audio preloads. It preloaded the next screen media before navigation. That combination is what mattered.</p>
<h1>What this does not solve</h1>
<p>The very first uncached request for a brand new asset can still cost real network time. That is normal. You cannot remove physics. What this fix does is remove the repeated cold feeling.</p>
<h1>The main lesson</h1>
<p>If your next screen is media heavy, query prefetch is only half the job. You need asset delivery, asset caching, and asset preload to be part of the architecture too. If you are using Convex, you can solve that cleanly without moving assets out of Convex.</p>
]]></content:encoded></item><item><title><![CDATA[How To Implement A Daily Streak System With Convex]]></title><description><![CDATA[The goal
A daily streak system sounds simple. It is not. The tricky part is not counting. The tricky part is deciding what a day means. If someone played at 10pm, then again at 2am, that is only four ]]></description><link>https://tigerabrodi.blog/how-to-implement-a-daily-streak-system-with-convex</link><guid isPermaLink="true">https://tigerabrodi.blog/how-to-implement-a-daily-streak-system-with-convex</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Wed, 15 Apr 2026 00:15:52 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/baa7087a-90f5-4a1b-ae3e-8b765b4e58dd.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>The goal</h1>
<p>A daily streak system sounds simple. It is not. The tricky part is not counting. The tricky part is deciding what a day means. If someone played at 10pm, then again at 2am, that is only four hours later. But it is still a new local day. That should count as a streak extension. So the system needs to be local day based, not elapsed hours based.</p>
<h1>What to store</h1>
<p>Keep the streak state on the profile. That is enough for a solid version one.</p>
<pre><code class="language-ts">profiles: defineTable({
  streakCount: v.number(),
  lastStreakDate: v.optional(v.number()),
  longestStreak: v.number(),
})
</code></pre>
<p>These three fields are enough to drive the whole feature. <code>streakCount</code>, the current streak. <code>lastStreakDate</code>, when you last counted a streak day. <code>longestStreak</code>, the best streak this profile has ever had.</p>
<h1>The rule set</h1>
<p>You need clear rules. These are the rules we use. Same local day, no update. Next local day, extend by one. Miss more than one local day, mark the streak as broken, then start again at one. The return to one is important. The person came back today, so today still counts.</p>
<h1>Why local time zone matters</h1>
<p>Do not compare raw UTC dates. Do not compare elapsed hours. Do not use the server time zone. Pass the user time zone from the client and calculate the day boundary on the backend. We used <code>date-fns</code> and <code>@date-fns/tz</code> for that.</p>
<pre><code class="language-ts">import { TZDate } from '@date-fns/tz'
import { differenceInDays, startOfDay } from 'date-fns'

function getLocalDayDifference({
  lastStreakDate,
  nowMs,
  timeZone,
}: {
  lastStreakDate: number
  nowMs: number
  timeZone: string
}) {
  const lastUpdateStart = startOfDay(new TZDate(lastStreakDate, timeZone))
  const todayStart = startOfDay(new TZDate(nowMs, timeZone))
  return differenceInDays(todayStart, lastUpdateStart)
}
</code></pre>
<p>That one helper is the core of the feature.</p>
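<p>To see why, take the 10pm to 2am case from the intro. Four hours apart, but one local day apart. A sketch, assuming the <code>TZDate</code> constructor with a trailing time zone argument:</p>
<pre><code class="language-ts">const timeZone = 'America/New_York'

// April 14 at 10pm, then April 15 at 2am, both local time.
const tenPm = new TZDate(2026, 3, 14, 22, 0, timeZone).getTime()
const twoAm = new TZDate(2026, 3, 15, 2, 0, timeZone).getTime()

getLocalDayDifference({
  lastStreakDate: tenPm,
  nowMs: twoAm,
  timeZone,
}) // 1, so the streak extends
</code></pre>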
<h1>Keep the streak math pure</h1>
<p>Do not put the streak decision logic straight inside the mutation. Make it a pure helper first. That makes it easy to test and easy to trust. This is the pattern.</p>
<pre><code class="language-ts">type DailyStreakUpdateStatus = 'unchanged' | 'extended' | 'broken'

function getDailyStreakUpdate({
  currentStreakCount,
  lastStreakDate,
  longestStreak,
  nowMs,
  timeZone,
}: {
  currentStreakCount: number
  lastStreakDate: number | null | undefined
  longestStreak: number
  nowMs: number
  timeZone: string
}) {
  if (!lastStreakDate) {
    return {
      status: 'extended' as DailyStreakUpdateStatus,
      shouldUpdate: true,
      wasFirstUpdate: true,
      previousCount: currentStreakCount,
      newCount: 1,
      newLongestStreak: Math.max(longestStreak, 1),
    }
  }

  const dayDifference = getLocalDayDifference({
    lastStreakDate,
    nowMs,
    timeZone,
  })

  if (dayDifference &lt;= 0) {
    return {
      status: 'unchanged' as DailyStreakUpdateStatus,
      shouldUpdate: false,
      wasFirstUpdate: false,
      previousCount: currentStreakCount,
      newCount: currentStreakCount,
      newLongestStreak: longestStreak,
    }
  }

  const status = dayDifference === 1 ? 'extended' : 'broken'
  const newCount = status === 'extended' ? currentStreakCount + 1 : 1

  return {
    status,
    shouldUpdate: true,
    wasFirstUpdate: false,
    previousCount: currentStreakCount,
    newCount,
    newLongestStreak: Math.max(longestStreak, newCount),
  }
}
</code></pre>
<h1>The mutation</h1>
<p>Once the pure helper is right, the Convex mutation gets simple.</p>
<pre><code class="language-ts">export const syncDailyStreak = mutation({
  args: {
    timeZone: v.string(),
  },
  handler: async (ctx, args) =&gt; {
    const userId = await getAuthUserId(ctx)
    if (!userId) {
      throw appError({
        code: 'NOT_AUTHENTICATED',
        message: 'You must be signed in to update the daily streak',
      })
    }

    const user = await ctx.db.get(userId)
    const profile = user?.activeProfileId
      ? await ctx.db.get(user.activeProfileId)
      : null

    if (!profile || profile.deletedAt) {
      throw appError({
        code: 'NOT_FOUND',
        message: 'Active profile not found',
      })
    }

    const nowMs = Date.now()
    const update = getDailyStreakUpdate({
      currentStreakCount: profile.streakCount,
      lastStreakDate: profile.lastStreakDate ?? null,
      longestStreak: profile.longestStreak,
      nowMs,
      timeZone: args.timeZone,
    })

    if (!update.shouldUpdate) {
      return {
        didUpdate: false,
        status: update.status,
        previousStreakCount: update.previousCount,
        newStreakCount: update.newCount,
      }
    }

    await ctx.db.patch(profile._id, {
      streakCount: update.newCount,
      lastStreakDate: nowMs,
      longestStreak: update.newLongestStreak,
    })

    return {
      didUpdate: true,
      status: update.status,
      previousStreakCount: update.previousCount,
      newStreakCount: update.newCount,
    }
  },
})
</code></pre>
<h1>The frontend trigger</h1>
<p>You do not want to run this on every render. A good pattern is this. Wait for the first user interaction. Then sync once. Then sync again when the tab becomes visible later. That keeps the system responsive without spamming writes.</p>
<pre><code class="language-ts">useEffect(() =&gt; {
  if (hasInteracted) return

  function handleFirstInteraction() {
    setHasInteracted(true)
    void syncDailyStreak({ timeZone })
  }

  window.addEventListener('pointerdown', handleFirstInteraction, {
    once: true,
  })
  window.addEventListener('keydown', handleFirstInteraction, {
    once: true,
  })

  return () =&gt; {
    window.removeEventListener('pointerdown', handleFirstInteraction)
    window.removeEventListener('keydown', handleFirstInteraction)
  }
}, [hasInteracted, syncDailyStreak, timeZone])
</code></pre>
<h1>When to show the streak dialog</h1>
<p>Only show the dialog if the mutation says <code>didUpdate</code>. That means. No dialog on the same local day. Dialog on the first ever day. Dialog on a proper next day extension. Dialog on a broken streak that restarts at one. That part keeps the UX honest.</p>
<h1>One bug that is easy to create</h1>
<p>Do not wipe streak fields when you only meant to reset gameplay progress. This is easy to do in dev tools and admin reset helpers. If you clear <code>streakCount</code> and <code>lastStreakDate</code>, the next interaction looks like day one again. Then the dialog opens again, even on the same day. That is not a streak math bug. That is a reset helper bug.</p>
<h1>What to test</h1>
<p>The streak helper is pure, so test it first. These cases matter. First ever update returns one. Same local day returns unchanged. Next local day extends by one. Missing more than one local day returns broken and restarts at one. The 10pm to 2am case across local midnight extends correctly. That gives you confidence in the hard part before you wire UI on top.</p>
<h1>The main lesson</h1>
<p>A good daily streak system is really a local calendar system. Once you treat it that way, the implementation gets much cleaner.</p>
]]></content:encoded></item><item><title><![CDATA[Why EXR Is the Real Deal for Skyboxes]]></title><description><![CDATA[Regular images are broken for this
A PNG stores colors as numbers from 0 to 255. That is it. The brightest white and the sun are both 255. But in reality the sun is 10,000x brighter than a white wall.]]></description><link>https://tigerabrodi.blog/why-exr-is-the-real-deal-for-skyboxes</link><guid isPermaLink="true">https://tigerabrodi.blog/why-exr-is-the-real-deal-for-skyboxes</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Tue, 14 Apr 2026 21:06:45 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/9d586672-f3db-4f89-9a19-12206fc5e29e.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Regular images are broken for this</h1>
<p>A PNG stores colors as numbers from 0 to 255. That is it. The brightest white and the sun are both 255. But in reality the sun is 10,000x brighter than a white wall. PNG cannot store that. Everything above "white" gets crushed to the same value.</p>
<p>When a 3D engine uses a PNG skybox for lighting, everything looks flat. The sun is not actually bright. The shadows are not actually dark. The light has no real range.</p>
<h1>What EXR does differently</h1>
<p>EXR stores each pixel as a real floating point number. The sun can be 50000.0. A cloud can be 2.0. A shadow can be 0.003. That is the actual spread of light in the real world, stored in the file.</p>
<p>HDR (.hdr) files do this too, but with a trick. They share one exponent across all three color channels. That means if red is very bright and blue is very dim in the same pixel, blue loses precision. It is a compromise.</p>
<p>EXR gives each channel its own full float. No sharing. No compromise. No precision loss anywhere.</p>
<h1>Why this matters</h1>
<p>In PBR rendering, the skybox is not just a background picture. It IS the light source. Every material in the scene samples it to figure out ambient lighting and reflections. Metals reflect the sky. Rough surfaces pick up its color. The skybox drives everything.</p>
<p>If the skybox has no real brightness range (PNG), all reflections and lighting look the same. A metal surface reflecting the sun looks identical to one reflecting a cloud. Wrong.</p>
<p>If the skybox has banding in subtle gradients (HDR's shared exponent), those bands show up in every reflection across the scene.</p>
<p>EXR has the full range and full precision. The lighting just works.</p>
<h1>How the GPU actually uses it</h1>
<p>Three.js takes the EXR skybox and generates blurred versions of it at different levels. Mirror-smooth surfaces sample the sharp version. Rough surfaces sample the blurry version.</p>
<p>This only works when the source has real brightness values. Blurring a PNG averages numbers between 0 and 1. The sun disappears into the blur. Blurring an EXR averages numbers between 0 and 50000. The sun still pumps light into the scene even when heavily blurred. That is what makes rough surfaces glow softly under bright sky. PNG cannot do this.</p>
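<p>You can see those blurred levels being built explicitly with <code>PMREMGenerator</code>, which is what Three.js uses under the hood when you assign an equirectangular texture to <code>scene.environment</code>. A sketch, assuming an existing <code>renderer</code> and <code>scene</code>:</p>
<pre><code class="language-ts">import * as THREE from 'three';
import { EXRLoader } from 'three/addons/loaders/EXRLoader.js';

const pmrem = new THREE.PMREMGenerator(renderer);

new EXRLoader().load('sky.exr', (texture) =&gt; {
  // Prefilter: one texture containing progressively blurrier mip levels.
  const envMap = pmrem.fromEquirectangular(texture).texture;
  scene.environment = envMap;
  texture.dispose();
  pmrem.dispose();
});
</code></pre>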
<h1>EXR files are also smaller</h1>
<p>HDR files are basically uncompressed. EXR supports lossless compression (ZIP, PIZ). Better quality AND smaller files.</p>
<h2>How to use it in Three.js</h2>
<pre><code class="language-ts">import { EXRLoader } from 'three/addons/loaders/EXRLoader.js';

const loader = new EXRLoader();
loader.load('sky.exr', (texture) =&gt; {
  texture.mapping = THREE.EquirectangularReflectionMapping;
  scene.background = texture;    // The visible sky.
  scene.environment = texture;   // Lights every PBR material in the scene.
});
</code></pre>
<p>Two lines. Every material in the scene now gets correct ambient lighting and reflections from the sky. Automatic.</p>
]]></content:encoded></item><item><title><![CDATA[Building a Procedural Planet from Scratch: The Full Pipeline]]></title><description><![CDATA[Introduction
This is a walkthrough of every layer in a procedural 3D world generation system. Each section solves one problem. They build on each other in order. By the end you have a fully textured, ]]></description><link>https://tigerabrodi.blog/building-a-procedural-planet-from-scratch-the-full-pipeline</link><guid isPermaLink="true">https://tigerabrodi.blog/building-a-procedural-planet-from-scratch-the-full-pipeline</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Tue, 14 Apr 2026 05:08:49 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/637a8cd1-167d-4a0b-9310-ae5bc8af9ac6.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1>
<p>This is a walkthrough of every layer in a procedural 3D world generation system. Each section solves one problem. They build on each other in order. By the end you have a fully textured, threaded, planet-scale terrain with proper LOD, stitching, and depth precision.</p>
<hr />
<h1>1. Heightmap: The Starting Point</h1>
<p><strong>What:</strong> A flat grid of vertices where each vertex's height is set by sampling some height function.</p>
<p><strong>Why:</strong> Every terrain system starts here. Before noise, before LOD, before planets, you need to turn a flat plane into bumpy ground.</p>
<p><strong>How:</strong> Create a <code>PlaneGeometry</code> with a resolution (say 128x128 vertices). Loop through every vertex. For each one, sample a height function at its XZ position. Set the Y (or Z depending on orientation) to that value.</p>
<pre><code class="language-js">for (let v of plane.geometry.vertices) {
  const [height, weight] = heightGenerator.Get(v.x + offset.x, v.y + offset.y);
  v.z = height * weight;
}
plane.geometry.verticesNeedUpdate = true;
plane.geometry.computeVertexNormals();
</code></pre>
<p>The height function can be anything. An image (read pixel brightness as height), a math function, or noise. The code supports multiple height generators blended together by weight, so you can layer a heightmap image with procedural noise.</p>
<p>The world is split into chunks. Each chunk is one <code>PlaneGeometry</code> at some offset. This lets you load/unload terrain as the player moves.</p>
<hr />
<h1>2. Perlin/Simplex Noise</h1>
<p><strong>What:</strong> Replace the heightmap image with a procedural noise function that generates infinite, non-repeating terrain.</p>
<p><strong>Why:</strong> An image heightmap has fixed resolution and fixed extent. Noise is infinite. You can sample it at any coordinate and get a consistent, deterministic value. The terrain goes on forever.</p>
<p><strong>How:</strong> The noise class wraps either Perlin or Simplex noise with octaves. "Octaves" means you sample the noise multiple times at increasing frequencies and decreasing amplitudes, then add them together. This is called fractal Brownian motion (fBm).</p>
<pre><code class="language-js">Get(x, y) {
  const xs = x / this._params.scale;
  const ys = y / this._params.scale;
  let amplitude = 1.0;
  let frequency = 1.0;
  let total = 0;
  let normalization = 0;

  for (let o = 0; o &lt; this._params.octaves; o++) {
    total += amplitude * this._noise.Get(xs * frequency, ys * frequency);
    normalization += amplitude;
    amplitude *= this._params.persistence;
    frequency *= this._params.lacunarity;
  }

  total /= normalization;
  return Math.pow(total, this._params.exponentiation) * this._params.height;
}
</code></pre>
<p>The key parameters:</p>
<ul>
<li><p><strong>scale</strong>: how zoomed in/out the noise is. Larger = smoother hills.</p>
</li>
<li><p><strong>octaves</strong>: how many layers. More = more fine detail.</p>
</li>
<li><p><strong>persistence</strong>: how much each octave's amplitude shrinks. Lower = smoother.</p>
</li>
<li><p><strong>lacunarity</strong>: how much each octave's frequency increases. Higher = more detail per octave.</p>
</li>
<li><p><strong>exponentiation</strong>: raise the final value to a power. Values &gt; 1 flatten valleys and sharpen peaks. Makes terrain look more natural.</p>
</li>
</ul>
<hr />
<h1>3. Quadtree and Level of Detail (LOD)</h1>
<p><strong>What:</strong> A quadtree that subdivides space near the camera into small chunks and keeps distant areas as large chunks.</p>
<p><strong>Why:</strong> You cannot render the entire world at full resolution. A planet might need millions of chunks at max detail. LOD means: high detail near the camera, low detail far away. The quadtree decides where to split.</p>
<p><strong>How:</strong> Start with one big node covering the whole terrain. Insert the camera position. If the camera is close enough to a node and the node is bigger than the minimum size, split it into four children. Recurse. Nodes near the camera get split many times (small, detailed). Nodes far away stay large (coarse).</p>
<pre><code class="language-js">_Insert(child, pos) {
  const dist = child.center.distanceTo(pos);

  if (dist &lt; child.size.x &amp;&amp; child.size.x &gt; MIN_NODE_SIZE) {
    child.children = this._CreateChildren(child);
    for (let c of child.children) {
      this._Insert(c, pos);
    }
  }
}
</code></pre>
<p>The leaf nodes of the quadtree become terrain chunks. Each leaf gets the same vertex resolution (say 64x64), but covers a different world-space area. A small leaf near the camera covers 500m at 64x64, giving fine detail. A large leaf far away covers 8000m at 64x64, giving coarse detail. Same vertex count, different density. That is LOD.</p>
<hr />
<h1>4. Planetary LOD: From Flat to Spherical</h1>
<p><strong>What:</strong> Wrap the flat quadtree terrain onto a sphere to make a planet.</p>
<p><strong>Why:</strong> A flat terrain has edges. A planet does not. For a space game or planet-scale world, you need the terrain to wrap around a sphere.</p>
<p><strong>How:</strong> Instead of one quadtree, use six. Each one maps to one face of a cube. Each face has a transform matrix that positions and rotates it to form the six sides of a cube. This is called a "cube sphere".</p>
<pre><code class="language-js">// 6 faces: +Y, -Y, +X, -X, +Z, -Z
for (let i = 0; i &lt; 6; i++) {
  sides.push({
    transform: transforms[i],
    quadtree: new QuadTree({
      side: i,
      size: radius,
      localToWorld: transforms[i],
    }),
  });
}
</code></pre>
<p>When generating vertices, each vertex position starts as a point on the cube face, gets projected onto a sphere by normalizing it (making it unit length) and multiplying by the planet radius. Then height is added along the radial direction (outward from the planet center).</p>
<pre><code class="language-js">_P.set(xp - half, yp - half, radius);
_P.add(offset);
_P.normalize(); // Project onto unit sphere.
_D.copy(_P); // Save the radial direction.
_P.multiplyScalar(radius);

const height = generateHeight(worldPos);
_H.copy(_D);
_H.multiplyScalar(height);
_P.add(_H); // Push vertex outward by height.
</code></pre>
<p>The terrain is now spherical. The quadtree still works the same, it just operates on each cube face.</p>
<hr />
<h1>5. Texturing: Triplanar Mapping, Splatting, Blending</h1>
<p><strong>What:</strong> Apply multiple terrain textures (grass, rock, sand, snow) based on height, slope, and biome, without visible tiling.</p>
<p><strong>Why:</strong> Vertex colors look flat. Real terrain needs texture detail. But naively mapping a texture onto terrain causes visible repetition (tiling). And you need different textures at different altitudes and slopes.</p>
<p><strong>How:</strong> Three techniques combined.</p>
<p><strong>Texture splatting:</strong> Each vertex stores weights for up to 4 textures. The CPU decides which textures based on height and surface angle (slope). Steep surfaces get rock. Flat high areas get snow. Low flat areas get grass. These weights are passed to the shader as vertex attributes.</p>
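<p>A simplified sketch of that CPU side, with made-up height and slope thresholds:</p>
<pre><code class="language-ts">// Per-vertex splat weights for [grass, rock, snow, unused].
// normalY is the up component of the normal: 1 = flat, 0 = vertical wall.
function getSplatWeights({ height, normalY }: { height: number; normalY: number }) {
  const slope = 1 - normalY
  const rock = Math.min(1, slope * 2) // steep surfaces go to rock
  const snow = height &gt; 800 ? 1 - rock : 0 // flat and high goes to snow
  const grass = height &lt;= 800 ? 1 - rock : 0 // flat and low goes to grass

  // Normalize so the shader can blend the texture samples directly.
  const total = rock + snow + grass || 1
  return [grass / total, rock / total, snow / total, 0]
}
</code></pre>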
<p><strong>Triplanar mapping:</strong> Instead of using UV coordinates (which stretch badly on steep surfaces), sample the texture three times, once for each axis: XY, XZ, YZ. Blend based on the surface normal direction. Steep walls use the XZ projection. Flat ground uses the XY projection. No stretching.</p>
<pre><code class="language-glsl">vec4 dx = texture(tex, pos.zy / scale);  // X-facing
vec4 dy = texture(tex, pos.xz / scale);  // Y-facing (top-down)
vec4 dz = texture(tex, pos.xy / scale);  // Z-facing

vec3 weights = abs(normal.xyz);
weights /= (weights.x + weights.y + weights.z);

return dx * weights.x + dy * weights.y + dz * weights.z;
</code></pre>
<p><strong>Texture bombing:</strong> To hide tiling repetition, the shader randomly offsets and blends the texture samples using a noise lookup. Two slightly offset samples of the same texture are blended with a smooth transition. The pattern never visibly repeats.</p>
<p>All three combine in the fragment shader: splat weights pick which textures, triplanar removes stretch, bombing hides tiling.</p>
<hr />
<h1>6. Atmospheric Scattering</h1>
<p><strong>What:</strong> A sky shader that simulates light scattering through the atmosphere, giving realistic sunsets, blue skies, and haze at the horizon.</p>
<p><strong>Why:</strong> A skybox texture is static. If the sun moves, the sky does not respond. Atmospheric scattering computes sky color physically based on the sun angle.</p>
<p><strong>How:</strong> The shader casts a ray from the camera through each pixel into the atmosphere (a shell around the planet). It marches along the ray, accumulating Rayleigh scattering (which makes the sky blue) and Mie scattering (which creates the bright glow around the sun). The result depends on: sun position, altitude, and the angle between the view direction and the sun.</p>
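<p>Stripped to its skeleton, the march is an accumulation loop. A heavily simplified sketch: exponential density only, no optical depth or Rayleigh/Mie phase terms, and the constants are assumptions:</p>
<pre><code class="language-typescript">function marchDensity(startAltitude: number, upComponent: number): number {
  const shellHeight = 100_000; // Assumed atmosphere thickness.
  const scaleHeight = 8_500; // Assumed Rayleigh scale height.
  const steps = 16;
  const stepSize = shellHeight / steps;
  let accumulated = 0;
  for (let i = 0; i &lt; steps; i++) {
    // Altitude at the midpoint of this step along the ray.
    const altitude = startAltitude + upComponent * (i + 0.5) * stepSize;
    accumulated += Math.exp(-altitude / scaleHeight) * stepSize; // Denser air scatters more.
  }
  return accumulated;
}
</code></pre>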
<p>This also adds distance fog naturally. Objects further away pick up more atmospheric haze. It makes the planet feel massive because distant terrain fades into the sky color instead of just clipping at the far plane.</p>
<hr />
<h1>7. Threading with Web Workers</h1>
<p><strong>What:</strong> Move all terrain mesh generation off the main thread into a pool of Web Workers.</p>
<p><strong>Why:</strong> Generating terrain vertices involves thousands of noise samples, normal calculations, texture weight computations. This is heavy math. Doing it on the main thread freezes the game.</p>
<p><strong>How:</strong> A <code>WorkerPool</code> manages 7 workers (one per spare CPU core). When a new chunk is needed, the terrain manager serializes the parameters into plain data (no class instances, workers cannot receive those) and enqueues it. The pool assigns it to the next free worker.</p>
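<p>The pool itself is a small amount of bookkeeping. A minimal sketch, not the project's actual <code>WorkerPool</code>, assuming a worker script that posts one result message per job:</p>
<pre><code class="language-typescript">class WorkerPool {
  private idle: Worker[] = [];
  private queue: Array&lt;{ params: unknown; resolve: (r: unknown) =&gt; void }&gt; = [];

  constructor(size: number, script: string) {
    for (let i = 0; i &lt; size; i++) this.idle.push(new Worker(script));
  }

  enqueue(params: unknown): Promise&lt;unknown&gt; {
    return new Promise((resolve) =&gt; {
      this.queue.push({ params, resolve });
      this.pump();
    });
  }

  private pump() {
    if (this.idle.length === 0 || this.queue.length === 0) return;
    const worker = this.idle.pop()!;
    const job = this.queue.shift()!;
    worker.onmessage = (e) =&gt; {
      job.resolve(e.data); // Hand the finished chunk back.
      this.idle.push(worker); // Worker is free again.
      this.pump(); // Start the next queued job, if any.
    };
    worker.postMessage(job.params); // Plain data only. No class instances.
  }
}
</code></pre>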
<p>The worker reconstructs noise generators from the params, runs all the mesh generation math, packs the results into <code>SharedArrayBuffer</code> instances (zero-copy shared memory), and posts a "done" message. The main thread reads the shared buffers directly into Three.js geometry attributes. No copy, no stall.</p>
<pre><code class="language-js">// Worker side: pack into shared memory.
const posBuf = new Float32Array(new SharedArrayBuffer(4 * positions.length));
posBuf.set(positions);
self.postMessage({ positions: posBuf });

// Main thread: read directly.
chunk.geometry.setAttribute(
  "position",
  new THREE.Float32BufferAttribute(result.positions, 3),
);
</code></pre>
<p>The main thread never waits for workers. It renders what it has. When a chunk finishes in the background, it appears. Completely smooth.</p>
<hr />
<h1>8. Floating Origin</h1>
<p><strong>What:</strong> Periodically re-center the world around the camera so GPU coordinates stay near zero.</p>
<p><strong>Why:</strong> GPUs use 32-bit floats. At large distances from the origin (100,000+ units), precision drops. Vertices jitter, triangles flicker (z-fighting), and the scene breaks visually.</p>
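<p>You can see the problem in three lines. Near 100,000 units, float32 spacing is about 0.008 units, so a millimeter-scale offset simply vanishes:</p>
<pre><code class="language-typescript">const a = new Float32Array([100_000]);
const b = new Float32Array([100_000 + 0.001]);
console.log(a[0] === b[0]); // true. The 0.001 nudge rounds away entirely.
</code></pre>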
<p><strong>How:</strong> Vertex positions are computed relative to the camera position, not in absolute world space. The noise function still samples at the true world coordinate (so terrain is consistent everywhere), but the actual mesh data the GPU receives is always near zero.</p>
<pre><code class="language-js">// Keep absolute position for noise sampling.
_W.copy(_P);

// Subtract camera position for GPU coordinates.
_P.sub(origin);

// Add height along the radial direction (_D, _H from the sphere snippet above).
const height = generateHeight(_W);
_H.copy(_D).multiplyScalar(height);
_P.add(_H);
</code></pre>
<p>This is visible in the <code>Rebuild</code> method where <code>origin</code> is the camera position passed in as a parameter. Every vertex subtracts it.</p>
<hr />
<h1>9. Logarithmic Depth Buffer</h1>
<p><strong>What:</strong> Replace the standard depth buffer encoding with a logarithmic one.</p>
<p><strong>Why:</strong> The default depth buffer distributes most of its precision near the camera's near plane. For a planet-scale world where near=1 and far=1,000,000, almost all depth precision is in the first few meters. Everything beyond is z-fighting.</p>
<p><strong>How:</strong> In the vertex shader, compute a logarithmic depth value:</p>
<pre><code class="language-glsl">vFragDepth = 1.0 + gl_Position.w;
</code></pre>
<p>In the fragment shader, write it out:</p>
<pre><code class="language-glsl">gl_FragDepth = log2(vFragDepth) * logDepthBufFC * 0.5;
</code></pre>
<p>Where <code>logDepthBufFC = 2.0 / log2(farPlane + 1.0)</code>. This spreads depth precision evenly across the entire range on a logarithmic scale. Close objects still get fine precision, but distant objects also get usable precision instead of nearly zero.</p>
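<p>A quick worked example with far = 1,000,000, mirroring the shader formula in plain math:</p>
<pre><code class="language-typescript">const farPlane = 1_000_000;
const logDepthBufFC = 2.0 / Math.log2(farPlane + 1.0);
const depthAt = (w: number) =&gt; Math.log2(1.0 + w) * logDepthBufFC * 0.5;

console.log(depthAt(1)); // ~0.05 at one unit away.
console.log(depthAt(1_000)); // ~0.50, mid range still gets half the buffer.
console.log(depthAt(farPlane)); // ~1.00 at the far plane.
</code></pre>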
<p>The cost: writing <code>gl_FragDepth</code> in the fragment shader disables the GPU's early-Z optimization. Every fragment runs the shader even if it would be occluded. This is a performance tradeoff worth making at planetary scale.</p>
<hr />
<h1>10. Mesh Stitching: Gaps, Skirts, and Edge Matching</h1>
<p><strong>What:</strong> Fix the visible cracks that appear where terrain chunks of different LOD levels meet.</p>
<p><strong>Why:</strong> A high-res chunk has 64 vertices along its edge. Its lower-res neighbour has 32. The vertices do not line up. You get cracks you can see through.</p>
<p><strong>How:</strong> Three techniques.</p>
<p><strong>Edge skirts:</strong> Each chunk generates one extra row of vertices on every side (resolution + 2 instead of resolution). These outer vertices are pushed to match their inner neighbour's position. They form a "flap" that tucks under the adjacent chunk. If there is a crack, the skirt geometry fills it.</p>
<p><strong>Edge snapping:</strong> Each chunk knows the size ratio of its neighbours (stored in a <code>neighbours</code> array). If a neighbour is half the resolution, the high-res edge vertices are lerped to match the low-res grid. Vertex 1 between vertex 0 and vertex 2 gets interpolated to lie exactly on the line between them. Both chunks now agree on the same edge geometry.</p>
<pre><code class="language-js">if (neighbours[side] &gt; 1) {
  const stride = neighbours[side];
  // For each edge vertex, find which two low-res vertices it falls between.
  // Lerp its position to match.
}
</code></pre>
<p><strong>Edge normal recomputation:</strong> Face-averaged normals at chunk borders are wrong because they only see faces on one side. The code recomputes edge normals using central differences: sample the height function at two nearby points, compute the cross product. This gives correct normals independent of chunk boundaries.</p>
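<p>A sketch of the central-difference normal. The signature of <code>generateHeight</code> here is a stand-in, and the epsilon is illustrative:</p>
<pre><code class="language-typescript">import * as THREE from "three";

declare function generateHeight(x: number, z: number): number; // Height noise stand-in.

// Normal of the heightfield y = h(x, z). Depends only on the height
// function, not on which chunk the vertex belongs to.
function edgeNormal(x: number, z: number, eps = 0.5): THREE.Vector3 {
  const dhdx = (generateHeight(x + eps, z) - generateHeight(x - eps, z)) / (2 * eps);
  const dhdz = (generateHeight(x, z + eps) - generateHeight(x, z - eps)) / (2 * eps);
  return new THREE.Vector3(-dhdx, 1, -dhdz).normalize();
}
</code></pre>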
<p>When a chunk's neighbour changes LOD level, only the edges need updating. That is what <code>QuickRebuild</code> does. It recomputes edge positions and normals without regenerating the entire mesh.</p>
<hr />
<h1>11. Biome Generation</h1>
<p><strong>What:</strong> Use a separate noise function to divide the world into biomes (desert, forest, tundra, etc.) that affect terrain color and texture selection.</p>
<p><strong>Why:</strong> Without biomes, the entire planet uses the same height-to-texture mapping everywhere. With biomes, different regions of the world look distinct.</p>
<p><strong>How:</strong> A second noise generator produces a biome value at each world position. This biome value is separate from the height noise. It uses different parameters (fewer octaves, larger scale) so biomes change gradually over large distances.</p>
<p>The texture splatter reads both the height and the biome value to decide which textures to apply. Low altitude in a desert biome gets sand. Low altitude in a temperate biome gets grass. The biome noise smoothly transitions between regions, and the texture blending handles the crossover.</p>
<pre><code class="language-js">const biomeGenerator = new Noise({
  octaves: 2,
  persistence: 0.5,
  lacunarity: 2.0,
  scale: 2048.0, // Very large scale. Biomes are big regions.
  seed: 2, // Different seed from terrain noise.
});

// In the texture splatter:
const biome = biomeGenerator.Get(worldX, worldY);
// Use biome + height + slope to pick texture weights.
</code></pre>
<p>The biome noise is cheap (only 2 octaves) because it does not need fine detail. It just needs to vary smoothly across the planet.</p>
<hr />
<h1>How It All Fits Together</h1>
<p>The full pipeline for rendering one frame:</p>
<ol>
<li><p><strong>Quadtree</strong> subdivides the six cube faces based on camera distance. Produces a set of leaf nodes (chunks to render).</p>
</li>
<li><p><strong>Diff</strong> against the previous frame's chunks. New chunks get sent to the <strong>worker pool</strong>.</p>
</li>
<li><p>Each <strong>worker</strong> receives plain parameters, reconstructs noise generators, generates vertex positions on the <strong>sphere</strong> surface using noise + height.</p>
</li>
<li><p>Positions are offset by the <strong>camera origin</strong> (floating origin). Heights are sampled from the <strong>biome-aware texture splatter</strong>.</p>
</li>
<li><p><strong>Edge stitching</strong> snaps borders to match lower-res neighbours. Skirts fill any remaining gaps.</p>
</li>
<li><p>Results are packed into <strong>SharedArrayBuffer</strong> and sent back zero-copy.</p>
</li>
<li><p>Main thread sets the geometry attributes and makes the chunk visible.</p>
</li>
<li><p>The <strong>terrain shader</strong> applies triplanar texturing with bombing, computes lighting, and writes <strong>logarithmic depth</strong>.</p>
</li>
<li><p><strong>Atmospheric scattering</strong> renders the sky and distance haze.</p>
</li>
</ol>
<p>Each part solves one specific problem. Together they produce a planet you can walk on, fly over, and zoom out from to see from space, all running smoothly in a browser.</p>
]]></content:encoded></item><item><title><![CDATA[PBR: What It Actually Is and How Each Map Works]]></title><description><![CDATA[Introduction
PBR stands for physically based rendering. It is a lighting model that simulates how light actually behaves in the real world.
All examples are from Patina. A moss

How light works in rea]]></description><link>https://tigerabrodi.blog/pbr-what-it-actually-is-and-how-each-map-works</link><guid isPermaLink="true">https://tigerabrodi.blog/pbr-what-it-actually-is-and-how-each-map-works</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Mon, 13 Apr 2026 15:36:24 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/0a6365e0-7d17-407a-abe8-0c74d8e69647.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1>
<p>PBR stands for physically based rendering. It is a lighting model that simulates how light actually behaves in the real world.</p>
<p>All examples are from <a href="https://fal.ai/models/fal-ai/patina/material/playground">Patina</a>. A moss material.</p>
<hr />
<h1>How light works in reality</h1>
<p>Light hits a surface. Two things happen. Some light bounces off the surface immediately. That is reflection. Some light penetrates the surface, bounces around inside the material, and comes back out. That is diffuse color.</p>
<p>The ratio between these two depends on what the material is made of. That is the entire foundation of PBR.</p>
<p>PBR describes materials using a set of texture maps. Each map controls exactly one physical property. The shader combines them all into realistic lighting.</p>
<hr />
<h1>Basecolor (albedo)</h1>
<p>The color of the material itself. Nothing else. No shadows, no highlights, no reflections. Just: what color are the molecules of this thing.</p>
<p>Dirt is brown. Concrete is grey. Rust is orange-brown.</p>
<p>If you took a material into a perfectly evenly lit white room with zero shadows and zero reflections, what you see is the basecolor.</p>
<p>Under the hood: the shader multiplies this color by the diffuse lighting contribution. Light hits the surface, some penetrates, bounces around inside, comes out tinted by this color.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/67f12202-ffa3-49d9-89fb-588b753453e1.png" alt="" style="display:block;margin:0 auto" />

<hr />
<h1>Normal map</h1>
<p>A texture where each pixel stores a direction instead of a color. The RGB values encode an XYZ vector. This vector tells the shader "pretend the surface is pointing this direction" even though the actual geometry is flat.</p>
<p>Why it matters: a flat wall with a normal map of bricks will catch light as if every brick edge and groove were actual geometry. Light hits the "grooves" at a different angle than the "faces" so you see shadows and highlights that look 3D. But the mesh is still a flat quad. Zero extra triangles.</p>
<p>Under the hood: the shader reads the normal map and replaces the geometric normal with the one from the texture. All lighting calculations use this new normal. The dot product between the light direction and the normal gives different values per pixel. The surface looks bumpy even though it is flat.</p>
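<p>Decoding and using the map is two small steps. A sketch with plain-number vectors; the helper names are mine, not a real API:</p>
<pre><code class="language-typescript">type Vec3 = [number, number, number];
const dot = (a: Vec3, b: Vec3) =&gt; a[0] * b[0] + a[1] * b[1] + a[2] * b[2];

// RGB in 0..1 encodes a direction in -1..1: n = rgb * 2 - 1.
const decodeNormal = (rgb: Vec3): Vec3 =&gt; [rgb[0] * 2 - 1, rgb[1] * 2 - 1, rgb[2] * 2 - 1];

// The mapped normal replaces the geometric one in the lighting dot product.
function diffuseFactor(mappedNormal: Vec3, lightDir: Vec3): number {
  return Math.max(dot(mappedNormal, lightDir), 0); // Same flat quad, different per-pixel shading.
}
</code></pre>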
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/fd1eae4a-38cc-4518-86f1-9de83b8fc141.png" alt="" style="display:block;margin:0 auto" />

<hr />
<h1>Roughness</h1>
<p>Think about two surfaces. A mirror and a chalkboard. Throw a laser beam at a mirror. It bounces in one tight direction. Sharp, focused reflection. Now throw that same laser at a chalkboard. The light scatters everywhere. No clear reflection. Just a soft bright spot.</p>
<p>That is roughness. It controls how scattered or focused the reflected light is.</p>
<ul>
<li><p><strong>Roughness = 0</strong>: mirror. Light bounces in one direction. Sharp reflections. Wet surfaces, polished metal, glass.</p>
</li>
<li><p><strong>Roughness = 0.5</strong>: somewhere in between. A soft, blurry highlight. Like plastic or worn leather.</p>
</li>
<li><p><strong>Roughness = 1</strong>: chalk. Light scatters in all directions. No visible reflection. Dry concrete, cloth, raw wood.</p>
</li>
</ul>
<p>Under the hood: the shader uses roughness to control the width of the specular highlight. The way to think about it is: imagine the surface at a microscopic level as millions of tiny mirrors angled randomly. Low roughness means all the tiny mirrors face the same direction (smooth surface). High roughness means the mirrors face random directions (rough surface).</p>
<p>The math (usually GGX distribution) computes how many of those microfacets happen to reflect light toward the camera. More aligned = tight bright highlight. More scattered = wide dim highlight.</p>
<pre><code class="language-glsl">// Simplified concept:
float highlight = pow(max(dot(normal, halfVector), 0.0), sharpness);
// sharpness comes from roughness.
// Low roughness = high sharpness = tight highlight.
// High roughness = low sharpness = wide dim highlight.
</code></pre>
<p>This is also why wet things look shiny. Water fills the microscopic grooves, making the surface smoother at the micro level. Roughness goes down. Reflections appear. Dry the surface out, the grooves are exposed again, roughness goes back up.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/e44bfe91-c816-40df-8f87-87cd37322bf6.png" alt="" style="display:block;margin:0 auto" />

<hr />
<h1>Metalness</h1>
<p>This answers one question: does this surface conduct electricity?</p>
<p>Metals (steel, gold, copper, aluminum): yes. Non-metals (wood, dirt, plastic, skin): no.</p>
<p>Why does electrical conductivity matter for rendering? Because metals and non-metals reflect light in fundamentally different ways.</p>
<p><strong>Non-metals</strong> reflect a small amount of light (about 4%) as white or grey. The rest penetrates the surface and comes back out tinted by the basecolor. So you see the diffuse color strongly and a subtle white highlight on top.</p>
<p><strong>Metals</strong> reflect almost all light (60-90%) and that reflection is tinted by the basecolor. No light penetrates. No diffuse contribution at all. The "color" you see on a gold bar IS its reflection color, not diffuse color. Gold reflects yellow light. Copper reflects orange light.</p>
<p>So metalness switches how the shader interprets the basecolor map:</p>
<ul>
<li><p><strong>Metalness = 0</strong>: basecolor is used for diffuse. Reflections are white/grey.</p>
</li>
<li><p><strong>Metalness = 1</strong>: basecolor is used for reflection tint. No diffuse at all.</p>
</li>
</ul>
<p>Under the hood:</p>
<pre><code class="language-glsl">// Simplified:
vec3 diffuseColor = basecolor * (1.0 - metalness);
vec3 reflectionColor = mix(vec3(0.04), basecolor, metalness);

// metalness = 0: diffuse is full basecolor, reflection is 4% white.
// metalness = 1: diffuse is zero, reflection is basecolor.
</code></pre>
<p>The <code>0.04</code> is not arbitrary. It is the measured reflectance of most non-metal surfaces (glass, plastic, water all hover around 4% reflectance at direct viewing angles). This is called the Fresnel reflectance at normal incidence, or F0.</p>
<p>In practice, most natural materials are metalness 0. Dirt, rock, grass, wood, skin, cloth. The only things with metalness are actual metals. And metalness is almost always 0 or 1, rarely in between. A value of 0.5 does not mean "kinda metal." It usually means there is a transition zone between metal and non-metal at that pixel, like rust on steel.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/6338f7c9-fd6b-403c-9986-2179d3fa62bd.png" alt="" style="display:block;margin:0 auto" />

<hr />
<h1>Height map</h1>
<p>Stores how raised or sunken each point on the surface is. White = high. Black = low.</p>
<p>Two uses:</p>
<p><strong>Parallax mapping.</strong> The shader offsets texture coordinates based on the view angle and the height value. This fakes depth without moving any geometry. Cracks look like they go into the surface. Bricks look like they stick out. It is an optical illusion computed per pixel.</p>
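<p>The classic offset is one line of math. A sketch, assuming the view direction is already in tangent space; <code>heightScale</code> and the names are illustrative, and steeper variants march several samples instead:</p>
<pre><code class="language-typescript">type Vec2 = [number, number];
type Vec3 = [number, number, number];

// viewTS: view direction in tangent space, z pointing out of the surface.
function parallaxUv(uv: Vec2, viewTS: Vec3, height: number, heightScale = 0.05): Vec2 {
  const offU = (viewTS[0] / viewTS[2]) * height * heightScale;
  const offV = (viewTS[1] / viewTS[2]) * height * heightScale;
  return [uv[0] - offU, uv[1] - offV]; // Shift the lookup toward the viewer.
}
</code></pre>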
<p><strong>Displacement mapping.</strong> Actually move the vertices based on the height map. This requires enough geometry to displace. More triangles = more detail. This is the real deal but it is expensive.</p>
<p>For terrain: the noise function handles large-scale height (mountains and valleys). A material's height map adds small-scale detail like individual stones sticking up from rocky ground.</p>
<img src="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/c24cc4eb-d7bd-4e08-8da6-3994057f0eb6.png" alt="" style="display:block;margin:0 auto" />

<hr />
<h1>How they all combine in the shader</h1>
<p>One lighting pass, all maps working together:</p>
<pre><code class="language-plaintext">1. Read basecolor, normal, roughness, metalness from textures.

2. Compute material properties from metalness:
   diffuseColor  = basecolor * (1.0 - metalness)
   specularColor = mix(0.04, basecolor, metalness)

3. Perturb the surface normal using the normal map.

4. Compute diffuse lighting:
   diffuse = light * diffuseColor * max(dot(N, L), 0.0)

5. Compute specular lighting using GGX:
   specular = light * specularColor * GGX(roughness, N, H, V)
   (N = normal, H = halfway vector, V = view direction)

6. Final color = diffuse + specular
</code></pre>
<p>Each map controls exactly one variable in this pipeline. Basecolor provides color. Normal adjusts the surface direction. Roughness shapes the specular highlight. Metalness routes the basecolor to either diffuse or specular. Height adds geometric detail.</p>
<p>That is PBR. Describe the physics, let the math do the rendering.</p>
<hr />
<h1>Quick reference</h1>
<table>
<thead>
<tr>
<th>Map</th>
<th>What it controls</th>
<th>Black means</th>
<th>White means</th>
</tr>
</thead>
<tbody><tr>
<td>Basecolor</td>
<td>Surface color</td>
<td>Dark material</td>
<td>Bright material</td>
</tr>
<tr>
<td>Normal</td>
<td>Surface direction per pixel</td>
<td>Flat</td>
<td>Bumpy</td>
</tr>
<tr>
<td>Roughness</td>
<td>Reflection sharpness</td>
<td>Mirror smooth</td>
<td>Completely matte</td>
</tr>
<tr>
<td>Metalness</td>
<td>Metal or not</td>
<td>Non-metal (dirt, wood)</td>
<td>Metal (steel, gold)</td>
</tr>
<tr>
<td>Height</td>
<td>Surface displacement</td>
<td>Sunken/low</td>
<td>Raised/high</td>
</tr>
</tbody></table>
]]></content:encoded></item><item><title><![CDATA[Dirty Flag. Skip Work That Does Not Matter Yet.]]></title><description><![CDATA[Introduction
Sometimes your program does expensive calculations over and over for no reason. The input changed three times but you only needed the output once. You are throwing away work. The dirty fl]]></description><link>https://tigerabrodi.blog/dirty-flag-skip-work-that-does-not-matter-yet</link><guid isPermaLink="true">https://tigerabrodi.blog/dirty-flag-skip-work-that-does-not-matter-yet</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Sun, 12 Apr 2026 14:21:17 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/0087393b-022f-4064-9c6e-c53daf4d99c9.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1>
<p>Sometimes your program does expensive calculations over and over for no reason. The input changed three times but you only needed the output once. You are throwing away work. The dirty flag pattern fixes this by deferring the expensive part until someone actually needs the result.</p>
<h1>The Problem.</h1>
<p>You have some primary data. You have some derived data that is computed from it. The computation is expensive. The primary data changes often. But the derived data is not always read immediately after every change.</p>
<p>If you recompute every time the primary data changes, you waste cycles on results that get thrown away before anyone uses them.</p>
<h1>A Concrete Example. Scene Graphs.</h1>
<p>In a game, objects are often arranged in a hierarchy. A ship has a crow's nest. The crow's nest has a pirate. The pirate has a parrot. Each object stores a local transform, its position relative to its parent.</p>
<p>To render any object, you need its world transform. That means multiplying all the local transforms up the parent chain. Ship times nest times pirate times parrot.</p>
<p>Now imagine in a single frame the ship moves, the nest rocks, the pirate leans, and the parrot hops. If you recalculate world transforms eagerly after each change, the parrot's world transform gets recalculated four times. Only the last result matters. The other three are wasted.</p>
<h1>The Solution.</h1>
<p>Do not recalculate when the primary data changes. Just set a flag that says "this is out of date." When something actually needs the derived data, check the flag. If dirty, recalculate and clear it. If clean, use the cached value.</p>
<p>Setting a transform becomes two assignments. The expensive math only happens once, right when you need the result.</p>
<h1>The Code.</h1>
<pre><code class="language-typescript">class GraphNode {
  private local!: Transform; // Set via setTransform before first render.
  private world!: Transform; // Cached result. Only valid when not dirty.
  private dirty = true;
  private children: GraphNode[] = [];
  private mesh: Mesh | null = null;

  setTransform(local: Transform) {
    this.local = local;
    this.dirty = true;
    // no recalculation. just mark it.
  }

  render(parentWorld: Transform, parentDirty: boolean) {
    const dirty = this.dirty || parentDirty;

    if (dirty) {
      this.world = this.local.combine(parentWorld);
      this.dirty = false;
    }

    if (this.mesh) renderMesh(this.mesh, this.world);

    for (const child of this.children) {
      child.render(this.world, dirty);
    }
  }
}
</code></pre>
<h1>The Clever Part.</h1>
<p>When a parent moves, all its children need recalculation too. The naive approach is to recursively mark every child as dirty when the parent moves. That is slow.</p>
<p>Instead, pass <code>parentDirty</code> down the tree during the render traversal. If any ancestor was dirty, the children know they need to recalculate. No recursive marking needed. Setting a transform stays fast no matter how deep the hierarchy is.</p>
<h1>When To Use It.</h1>
<p>Two conditions must be true.</p>
<p>The primary data changes more often than the derived data is read. The pattern works by batching multiple changes into a single recalculation. If you always need the result immediately after every change, the flag adds overhead for no benefit.</p>
<p>The computation cannot be updated incrementally. If you can adjust the derived data cheaply when the primary data changes, like adding or subtracting from a running total, just do that instead. Dirty flags are for cases where you have to recompute from scratch.</p>
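<p>For contrast, the incremental case looks like this. When each change can be absorbed cheaply, there is nothing to defer:</p>
<pre><code class="language-typescript">// Incremental update: no dirty flag needed. The total is always current.
class Inventory {
  private totalWeight = 0;

  add(item: { weight: number }) {
    this.totalWeight += item.weight;
  }

  remove(item: { weight: number }) {
    this.totalWeight -= item.weight;
  }

  weight() {
    return this.totalWeight; // Reads are free. Nothing to recompute.
  }
}
</code></pre>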
<h1>When To Clean The Flag.</h1>
<p>Three options depending on your situation.</p>
<p>When the result is needed. This is the most common approach. You defer until something reads the derived data. Simple and avoids all unnecessary work. The risk is that if the computation is heavy, it can cause a visible pause at the moment you need the result.</p>
<p>At a checkpoint. Save the work for a loading screen or a scene transition. The player does not notice. The risk is that you cannot guarantee the player reaches the checkpoint in time.</p>
<p>On a timer in the background. Process changes at a fixed interval. You can tune how often it runs. The risk is you might need threading or concurrency to avoid blocking the main loop.</p>
<h1>Where You See This Everywhere.</h1>
<p>Scene graphs in game engines. The example above.</p>
<p>Text editors. The "unsaved changes" dot in your title bar is a dirty flag. The primary data is the document in memory. The derived data is the file on disk. It only writes when you save.</p>
<p>Web frameworks. Angular and similar frameworks use dirty flags to track what changed in the browser and needs to be synced to the server.</p>
<p>Physics engines. A resting object gets a flag that says "nothing has touched me." It skips physics processing entirely until a force is applied. Then the flag flips and it re enters the simulation.</p>
<h1>The Risk.</h1>
<p>You have to set the flag every single time the primary data changes. Miss it in one place and you get stale data. The derived output looks correct but it is not. These bugs are hard to find.</p>
<p>The best defense is to funnel all modifications through a single function. If there is only one way to change the primary data, there is only one place you need to set the flag.</p>
<h1>One Sentence Summary.</h1>
<p>Do not redo expensive work every time something changes. Mark it dirty, and only recompute when someone actually asks for the result.</p>
]]></content:encoded></item><item><title><![CDATA[Spatial Partition. Stop Checking Everything Against Everything.]]></title><description><![CDATA[Introduction
Your game has hundreds of units on a battlefield. Each one needs to know which enemies are nearby. The naive approach: compare every unit to every other unit. That is O(n²). Double the un]]></description><link>https://tigerabrodi.blog/spatial-partition-stop-checking-everything-against-everything</link><guid isPermaLink="true">https://tigerabrodi.blog/spatial-partition-stop-checking-everything-against-everything</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Sun, 12 Apr 2026 14:18:57 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/732ad222-b9c6-4c76-9a3a-38a013237a77.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1>
<p>Your game has hundreds of units on a battlefield. Each one needs to know which enemies are nearby. The naive approach: compare every unit to every other unit. That is O(n²). Double the units and you quadruple the work. It does not scale.</p>
<p>The fix: organize objects by where they are in space. Then when you ask "what is near this point" you only look at a small slice of the world instead of the entire thing.</p>
<h1>The Problem.</h1>
<pre><code class="language-typescript">function handleMelee(units: Unit[]) {
  for (let a = 0; a &lt; units.length; a++) {
    for (let b = a + 1; b &lt; units.length; b++) {
      if (distance(units[a], units[b]) &lt; ATTACK_RANGE) {
        handleAttack(units[a], units[b]);
      }
    }
  }
}
</code></pre>
<p>Every unit checks every other unit. 100 units means ~5,000 checks. 1,000 units means ~500,000 checks. Per frame. Most of those checks are pointless because most units are nowhere near each other.</p>
<h1>The Core Idea.</h1>
<p>Divide your world into regions. Put each object in the region that matches its position. When you need to find nearby objects, only check the region the object is in and its neighbors. Skip everything else.</p>
<p>The simplest version of this: a flat grid.</p>
<h1>The Grid.</h1>
<p>Overlay a grid on your world. Each cell holds a list of the objects inside it. When checking for combat, only compare objects within the same cell.</p>
<pre><code class="language-typescript">class Grid {
  private cells: Map&lt;string, Unit[]&gt; = new Map();
  private cellSize: number;

  constructor(cellSize: number) {
    this.cellSize = cellSize;
  }

  private key(x: number, y: number): string {
    const cx = Math.floor(x / this.cellSize);
    const cy = Math.floor(y / this.cellSize);
    return `${cx},${cy}`;
  }

  add(unit: Unit) {
    const k = this.key(unit.x, unit.y);
    if (!this.cells.has(k)) this.cells.set(k, []);
    this.cells.get(k)!.push(unit);
  }

  remove(unit: Unit) {
    const k = this.key(unit.x, unit.y);
    const cell = this.cells.get(k);
    if (!cell) return;
    const i = cell.indexOf(unit);
    if (i !== -1) cell.splice(i, 1);
  }

  move(unit: Unit, newX: number, newY: number) {
    const oldKey = this.key(unit.x, unit.y);
    const newKey = this.key(newX, newY);
    if (oldKey !== newKey) {
      this.remove(unit); // Remove while the coords still map to the old cell.
      unit.x = newX;
      unit.y = newY;
      this.add(unit);
    } else {
      unit.x = newX;
      unit.y = newY;
    }
  }

  getNearby(x: number, y: number): Unit[] {
    const cx = Math.floor(x / this.cellSize);
    const cy = Math.floor(y / this.cellSize);
    const result: Unit[] = [];
    // Check this cell and all 8 neighbors.
    for (let dx = -1; dx &lt;= 1; dx++) {
      for (let dy = -1; dy &lt;= 1; dy++) {
        const cell = this.cells.get(`${cx + dx},${cy + dy}`);
        if (cell) result.push(...cell);
      }
    }
    return result;
  }
}
</code></pre>
<p>Now combat only compares units in the same neighborhood:</p>
<pre><code class="language-typescript">function handleMelee(grid: Grid, units: Unit[]) {
  for (const unit of units) {
    const nearby = grid.getNearby(unit.x, unit.y);
    for (const other of nearby) {
      if (other === unit) continue;
      if (distance(unit, other) &lt; ATTACK_RANGE) {
        handleAttack(unit, other);
      }
    }
  }
}
</code></pre>
<p>Instead of checking against every unit in the game, you check against the handful that are actually close. The rest do not exist as far as this code is concerned.</p>
<h1>Moving Objects.</h1>
<p>When an object moves, you need to check if it crossed a cell boundary. If it did, remove it from the old cell and add it to the new one. If it stayed in the same cell, just update its position. This is cheap: a couple of index calculations and a pointer swap.</p>
<h1>Cell Size Matters.</h1>
<p>Too large: many objects per cell, you are back to checking too many pairs.</p>
<p>Too small: lots of empty cells wasting memory, and you have to check many neighboring cells for range queries.</p>
<p>A good starting point: make cells roughly the size of your largest interaction range. If your attack distance is 20 units, make cells 20x20. That way nearby objects are always in the same cell or direct neighbors.</p>
<h1>Flat Grid vs Hierarchical.</h1>
<p>A flat grid is the simplest option. Fixed cells, fixed memory, easy to update when objects move. Works great when objects are spread fairly evenly across the world.</p>
<p>When objects clump together, a flat grid breaks down. One cell gets overloaded while most sit empty. Hierarchical structures like quadtrees solve this. They subdivide crowded areas into smaller regions and leave empty areas as one big region. They adapt to where the objects actually are.</p>
<p>The trade off: hierarchical structures are more complex to implement and more expensive to update when objects move. If your objects are spread reasonably well, a flat grid is usually enough.</p>
<h1>Common Spatial Partition Structures.</h1>
<p>Grid: simplest. Fixed cells. Good for evenly distributed objects. Basically a bucket sort extended to 2D.</p>
<p>Quadtree: subdivides crowded areas recursively into four squares. Good balance between adaptability and simplicity. Its 3D version is called an octree.</p>
<p>BSP (binary space partition): splits space with planes. Often used for static level geometry. Basically a binary search tree in multiple dimensions.</p>
<p>K-d tree: similar to BSP but alternates which axis it splits on. Good for point queries.</p>
<p>Bounding volume hierarchy: groups objects into bounding boxes, then groups those boxes into bigger boxes. Good when objects have varying sizes. Like the others, it is a tree you descend to narrow the search.</p>
<h1>When To Use It.</h1>
<p>You have many objects with positions. You frequently ask "what is near X." The naive approach is too slow. That is it. If your object count is small enough that brute force works, do not bother.</p>
<p>Also consider: if your objects move a lot, you pay a cost to keep the partition updated. Make sure the savings from faster queries outweigh the cost of maintenance.</p>
<h1>One Sentence Summary.</h1>
<p>Do not search the entire world to find what is right next to you.</p>
]]></content:encoded></item><item><title><![CDATA[Object Pool. Stop Allocating. Start Reusing.]]></title><description><![CDATA[Introduction
Allocating and freeing memory is slow. On consoles and mobile devices it is even worse because it fragments the heap. Over time your free memory turns into a scattered mess of tiny gaps t]]></description><link>https://tigerabrodi.blog/object-pool-stop-allocating-start-reusing</link><guid isPermaLink="true">https://tigerabrodi.blog/object-pool-stop-allocating-start-reusing</guid><dc:creator><![CDATA[Tiger Abrodi]]></dc:creator><pubDate>Sun, 12 Apr 2026 13:39:25 GMT</pubDate><enclosure url="https://cdn.hashnode.com/uploads/covers/60bb3fa2fffc4c13b9cd5935/d11f2a11-c62d-40f0-a676-510dd27f2657.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Introduction</h1>
<p>Allocating and freeing memory is slow. On consoles and mobile devices it is even worse because it fragments the heap. Over time your free memory turns into a scattered mess of tiny gaps that cannot fit anything useful. Your game crashes. Not because you ran out of memory, but because the free memory is in the wrong shape.</p>
<p>Object pools fix this. Allocate everything up front. Reuse it forever. No fragmentation. No allocation cost during gameplay.</p>
<h1>How It Works.</h1>
<p>At startup, create a fixed array of objects. All of them sit in memory, initialized to "not in use." When you need a new object, grab one from the pool and mark it as active. When you are done with it, mark it inactive. It goes back into the pool, ready to be reused.</p>
<p>No malloc. No free. No garbage collector. Just flipping a flag.</p>
<pre><code class="language-typescript">class Particle {
  framesLeft = 0;
  x = 0;
  y = 0;
  xVel = 0;
  yVel = 0;

  inUse() {
    return this.framesLeft &gt; 0;
  }

  init(x: number, y: number, xVel: number, yVel: number, lifetime: number) {
    this.x = x;
    this.y = y;
    this.xVel = xVel;
    this.yVel = yVel;
    this.framesLeft = lifetime;
  }

  update() {
    if (!this.inUse()) return;
    this.framesLeft--;
    this.x += this.xVel;
    this.y += this.yVel;
  }
}

class ParticlePool {
  private particles = Array.from({ length: 1000 }, () =&gt; new Particle());

  create(x: number, y: number, xVel: number, yVel: number, lifetime: number) {
    for (const p of this.particles) {
      if (!p.inUse()) {
        p.init(x, y, xVel, yVel, lifetime);
        return;
      }
    }
    // Pool full. No particle created. Player won't notice.
  }

  update() {
    for (const p of this.particles) {
      p.update();
    }
  }
}
</code></pre>
<p>All 1000 particles exist from the start. Creating one is just finding an inactive slot and writing values into it. No memory allocation happens at runtime.</p>
<h1>The Problem With Linear Search.</h1>
<p>The code above scans the entire array to find a free slot. If the pool is large and mostly full, that scan gets slow.</p>
<p>Fix. Use a free list. When a particle is inactive, its memory is not doing anything useful. Reuse that memory to store a pointer to the next free particle. This chains all free particles into a linked list, built directly inside the pool's own memory.</p>
<pre><code class="language-typescript">class ParticlePool {
  private particles = Array.from({ length: 1000 }, () =&gt; new Particle());
  private firstAvailable = 0;

  constructor() {
    // Chain every particle to the next one.
    for (let i = 0; i &lt; 999; i++) {
      this.particles[i].nextFree = i + 1;
    }
    this.particles[999].nextFree = -1; // end of list
  }

  create(x: number, y: number, xVel: number, yVel: number, lifetime: number) {
    if (this.firstAvailable === -1) return; // pool full

    const p = this.particles[this.firstAvailable];
    this.firstAvailable = p.nextFree;
    p.init(x, y, xVel, yVel, lifetime);
  }

  free(index: number) {
    this.particles[index].framesLeft = 0;
    this.particles[index].nextFree = this.firstAvailable;
    this.firstAvailable = index;
  }
}
</code></pre>
<p>Now creating and freeing are both O(1). No scanning. Just follow one pointer and you have your slot.</p>
<h1>What Happens When The Pool Is Full.</h1>
<p>Four options.</p>
<p>Make sure it never happens. Tune the pool size so it is always big enough. Best for critical objects like enemies or bosses where failing to create one would break the game.</p>
<p>Do nothing. Skip the creation. Works for particles and cosmetic effects. The screen is already full of stuff. One less sparkle is invisible.</p>
<p>Kill the least important active object. Good for sound effects. If all sound channels are in use, cut the quietest one. The new sound will mask the loss.</p>
<p>Grow the pool. Allocate more slots at runtime. Only works if your platform can afford the memory hit. Consider shrinking back later when demand drops.</p>
<h1>Watch Out For Stale State.</h1>
<p>Pooled objects are not freshly allocated. They still hold data from their previous life. If your init function misses a field, you get ghost state from a dead object leaking into a live one. These bugs are hard to find because the stale data often looks almost correct.</p>
<p>Always fully initialize every field when reusing an object. In debug builds, consider zeroing out the memory when an object is freed so stale reads are obvious.</p>
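<p>A sketch of that debug scrub, using the <code>Particle</code> from earlier. NaN is a loud poison value in JavaScript:</p>
<pre><code class="language-typescript">// Debug builds only: poison freed particles so stale reads are obvious.
function scrubParticle(p: Particle) {
  p.x = p.y = p.xVel = p.yVel = Number.NaN; // NaN propagates through any math.
  p.framesLeft = 0; // Still reads as "not in use".
}
</code></pre>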
<h1>When To Use It.</h1>
<p>You create and destroy objects frequently during gameplay. Particles, bullets, sound effects, enemies.</p>
<p>Objects are the same size or close to it. Pools work best when every slot is the same size. If objects vary wildly, you waste memory padding small objects to fit the largest slot.</p>
<p>You are on a platform where fragmentation or allocation speed matters. Consoles, mobile, embedded. On a PC with a modern allocator and garbage collector, pools are less critical but still useful for hot paths.</p>
<h1>One Sentence Summary.</h1>
<p>Allocate once at startup, reuse forever, never ask the memory manager for anything during gameplay.</p>
]]></content:encoded></item></channel></rss>