scalablemodeling

Scalable Modeling – An Event-centric Approach

This repository offers a modeling technique for designing scalable systems. Every new system should be designed with scalability in mind — scalability is essential for success. With Scalable Modeling, you can embed scalability in your system design from the start. Influenced by Clean Architecture, Event-Driven Architecture (EDA), Domain-Driven Design (DDD), EventStorming, and CQRS, this approach remains flexible and doesn’t rigidly adhere to any single methodology. That is why this event-centric architectural approach has different name: CEQS - Command-Event-Query Separation.

Justification for the red arrows in sections: Queries & Time Travel.

Webinar recording: Software engineering for the future: Fast, scalable and built to last (November 26, 2024).

Table of Content

High Level Overview

There are three opportunities and three challenges in scalability and understanding them takes you far already. Combining the opportunities & challenges with an upfront and iterative modeling technique provides a solid foundations for modeling reliable and scalable systems.

Three Opportunities

  1. Decomposition - scale by splitting different things
  2. Duplication - scale by cloning
  3. Partition - scale by splitting similar things

Immutability (of messages/events) plays key role in each aspect.

Software Engineering Flow

Software is ultimately a model — a conceptual solution that, while invisible, solves real-world challenges. In software engineering, three aspects are critical:

  1. Understanding: WHY the software is needed and by who (understanding the purpose and the problem it should address).
  2. Designing: WHAT is the conceptual model for the solution.
  3. Developing: HOW the solution is implemented.

Scalable Modeling serves as an opinionated bridge from WHY to HOW, with a primary focus on the WHAT.

Iteration is naturally much cheaper when it is done on the conceptual model rather than on the implementation level.

Shift Left

“There is nothing so useless as doing efficiently that which should not be done at all.”
Peter Drucker

Scalable Modeling is a method for shifting left in the software engineering process. It helps crystallize the ‘why’ by focusing on the ‘what,’ allowing the creation of a result (the model) that serves as an opinionated bridge to the ‘how’.

Start From Events (…after you think you understand the ‘why’)

To design reliable scalable systems, we need to start from temporal thinking (the flow of time and how things evolve) and gradually move into spatial thinking (the arrangement of things). In essence, we design spatial systems to handle temporal matters – events.

Services are anchored in space but act in time – At any given moment, a service’s location is tied to a physical or virtual environment (e.g. a server, container or cluster). Over time, services evolve as they process events, make decisions and produce outputs.

Events are anchored in time but act in space – Events are fixed to the moment they are created, carrying an immutable snapshot of information. As they traverse the system, they move between services and may even be replicated.

So, instead of focusing too much on the space between services, consider the flow of events and how they evolve over time.

The immutability of events contributes significantly to scalability, particularly in event-driven architectures. Additionally, events play a central role in uncovering domain insights and fostering a shared language. Thus, we adopt an event-centric approach.

Three Challenges

  1. Deduplication - as exactly-once delivery is impossible in distributed systems
  2. Tailoring Consistency - as strong consistency is the wrong default
  3. Time Travel - as distribution causes eventual consistency

Why to Concentrate Domain Knowledge & Scalability

Domain Knowledge is the Most Underrated Key to High Development Velocity and Quality

“It’s developer (mis)understanding that’s released in production, not the experts’ knowledge.”
Alberto Brandolini

Without a proper understanding of the domain, it’s impossible to implement a conceptual model that accurately reflects it. Effective collaboration with domain experts is essential to bridge this gap. Events play a central role in uncovering domain insights and fostering a shared language. Additionally, the immutability of events contributes significantly to scalability, particularly in event-driven architectures.

1_domain_knowledge_is_key.png

“A complex system that works is invariably found to have evolved from a simple system that worked.” John Gall

Design flaws in the simple system tend to compound and lead to exponentially increasing complexity in the complex system.

“Doing the wrong thing right is not nearly as good as doing the right thing wrong.” Russel L. Ackoff

Without a proper understanding of the domain, it’s easy to prioritize technical correctness over solving the actual problem.

“Organizations which design systems are constrained to produce designs which are copies of the communication structures of these organizations.” Melvin E. Conway

Conceptual models derived from the domain often lead to software structures that should mirror the communication patterns within the organization.

“Civilization advances by extending the number of important operations which we can perform without thinking of them.” Alfred North Whitehead

When we can focus solely on implementing conceptual models from the domain without worrying about the technical details we can abstract away, we achieve the highest velocity.

Scalability is Prerequisite of Success

2_need_for_scalability.png

Building a scalable foundation allows businesses to adapt to increasing demands and capitalize on opportunities without being limited by technical constraints. As these influential leaders have noted, scalability is not an afterthought; it is integral to achieve sustainable success.

Without a scalable system from day one, even the best ideas can be slowed down by bottlenecks and inefficiencies as they grow. Designing with scalability in mind ensures that as demand increases, systems can grow exponentially to meet those demands without sacrificing performance or creating technical debt.

“The faster you scale, the more wealth you create.” Reid Hoffman (Co-founder of LinkedIn)

“In order to win, you must be able to scale exponentially.” Marc Andreessen (Founder of Andreessen Horowitz)

“The most scalable businesses in the world are software businesses.” Bill Gates (Founder of Microsoft)

Success is not just about growing fast — it’s about building the right infrastructure from the outset, so that growth becomes an advantage, not a challenge.

Scalable Modeling

0_scalable_modeling_components.png

Theory

Event Centrism

In Event Centrism, everything we experience is either an event or a trace of one. Reality is not made up of static objects but a continuous flow of occurrences, where everything, from a falling leaf to a mountain, is part of an ongoing process. Even seemingly permanent things are temporary outcomes of past events, always subject to change.

Our lives are driven by events — every thought, action, and emotion is triggered by something, and memories are a collection of past events that define who we are. Traces of past events — like an old building or a weathered book — remind us of what once occurred, continuing to shape the present.

Events can also be potential, waiting to happen, or hidden, unfolding beyond our perception, like biological processes or distant cosmic phenomena. Every event is part of a cause-and-effect continuum, influencing future occurrences. Time, in this view, is meaningful only as the medium through which events unfold.

Even the self is an ongoing event, constantly shaped by experiences and interactions. In Event Centrism, everything in life is fluid, dynamic, and interconnected, emphasizing that our world and our identities are in constant motion, driven by the events we experience.

Event Centrism is not an absolute truth but rather one way to understand the world — a very useful way to model systems.

Event-Driven Architecture (EDA)

Event-Driven Architecture (EDA) is a design pattern where systems react to events in near real-time. Components communicate by producing, detecting, and responding to events, enabling asynchronous processing and loose coupling between services, which allows for more scalable and resilient systems.

Command Query Responsibility Segregation (CQRS)

CQRS is a pattern that separates the responsibilities of updating data (commands) and reading data (queries). By dividing these operations, CQRS improves performance, scalability, and security, allowing for more efficient handling of complex, high-demand systems.

Scalable Modeling does not go into purism in CQRS - in Scalable Modeling queries can (when well justified) also query command models for improved consistency where it does not jeopardise the scalability. Commands can also return simple data like sequence number of the produced events.

Vertical & Horizontal Scalability

3_vertical_horizontal_scalability.png Universal Scalability Law, Wikipedia

Limiting contention is the key for higher scalability.

Event Sourcing

4_event_sourcing.png

The Three Dimensions to Scalability

In this section we are in the context of:

Scale Cube

5_0_scale_cube.png

Decomposition

5_1_decomposition.png

Duplication

5_2_duplication.png

Partition

5_3_partition.png

CEQS: Command-Event-Query Separation

The diagram below illustrates a novel software architecture concept called CEQSCommand-Event-Query Separation. It is an enhancement of the traditional CQS (Command Query Separation) pattern highlighting the importance of events in scalable systems.

In scalable systems, events should be treated as first-class citizens, as their immutability is fundamental to achieving scalability. Their asynchronous nature enables loose coupling between services, while their immutability allows events to be queried or streamed as an accurate record of what has occurred within the system.

Immutability ensures that events are append-only, allowing distributed systems to replicate and process data consistently across services without conflicts.

Commands, Events and Queries each serve as distinct interfaces to a service:

This clear separation of concerns ensures that systems remain scalable, decoupled, and maintainable.

In CEQS, the separation of models (command vs. query) is not strictly enforced. However, it very often makes sense to separate these models to enhance scalability or usability. The use of immutable private events facilitates this separation, even in later stages of development, when it becomes necessary.

Architectural Benefits

  1. Clean Domain Logic
    • Separating commands and queries results in clearer, more focused domain logic. Commands handle state changes, while queries handle data retrieval, ensuring that business logic remains concise and purpose-driven.
  2. Separation of Concerns
    • Commands are responsible for writing data (changing state), while queries are responsible for reading data. This distinction enhances maintainability by providing cleaner code and reducing the risk of unintended side effects.
  3. Loose Coupling
    • Within Services: Commands and queries are decoupled through immutable, private events, allowing each to evolve independently without interference.
    • Between Services: Public events decouple microservices, enabling them to operate and scale independently, reducing dependencies between service boundaries.
  4. Resiliency
    • The system is more resilient because failures in one component (e.g. command processing) do not cascade to others (e.g. queries). This design ensures fault isolation and minimizes downtime.
  5. Near-Real-Time Integrations
    • Events enable near-real-time communication between services, allowing for fast and seamless integration. This improves responsiveness to changes in the system and facilitates use cases like live updates.
  6. Low Latency at Any Scale
    • By separating read and write responsibilities, the system can optimize availability and scalability. For example:
      • High Availability: Read-heavy workloads can be handled by scaling replicas.
      • Scalability: This architecture supports all three dimensions of scalability, ensuring low latency even under high demand.

Components

  1. Events
  2. Ubiquitous Language
  3. Commands & State
  4. Queries
  5. Policies
  6. Hotspots & Descriptions
  7. Consistency Boundaries

Events

6_1_events_1.png

6_1_events_2.png

Ubiquitous Language

Ubiquitous language is more than connections and stickies in the picture above. At best, it is a shared conceptual framework that allows both technical and non-technical stakeholders to collaborate effectively, ensuring that all parties have a common understanding of the domain and its logic. It helps avoid ambiguity, enabling precise communication and design decisions, fostering a deep connection between the code and the business model it represents.

This language allows everyone involved to speak the same language, preventing misunderstandings and ensuring that the system’s design remains consistent with the business goals and domain requirements.

Commands & State

Queries

I understand that querying the command model (red arrow in the picture above) is a topic that often sparks strong opinions. In my view, the command model serves as a special type of model that validates commands, and while it is not designed for querying, there are cases where it can be read from if necessary. For example, in in-memory command models Akka-style, where the model is clustered, a valid use case might involve querying the state of an entity immediately after its creation or update. Since the entity is already in memory, fetching it quickly from there can be justified. However, querying the command model should be avoided wherever possible, as there are many ways this can lead to misuse, such as introducing performance bottlenecks or inconsistencies. For this reason, querying the command model requires strong justification.

Policies

Hotspots & Descriptions

Consistency Boundaries

Interestingly, even if this is done as a last step in the Scalable Modeling, this often requires business-driven decisions. For instance, while financial transactions demand strict consistency, less critical processes like reporting can tolerate delays.

Dynamic Consistency Boundaries (DCB) is an emerging term and as we leave defining the consistency boundaries as a last step we make Scalable Modeling compatible with DCBs.

Challenges

Deduplication

7_1_deduplication.png

Since exactly-once delivery is impossible in distributed systems, we use effectively-once or idempotent processing to ensure duplicate messages don’t affect the outcome.

Tailoring Consistency

In distributed systems, defining consistency boundaries is crucial for balancing performance and correctness. These boundaries determine where strong consistency is enforced and where eventual consistency is acceptable. Interestingly, tailoring these boundaries often requires business-driven decisions. For instance, while financial transactions demand strict consistency, less critical processes like reporting can tolerate delays.

By explicitly defining consistency boundaries, you ensure the system aligns with business needs while optimizing scalability. This approach prevents over-engineering and allows the system to scale efficiently without unnecessary constraints.

7_2_tailoring_consistency.png

Time Travel

7_3_time_travel.png

When using CQRS with eventually consistent read models, the system’s read models may reflect different points in time due to the delay in propagating updates. This allows for a form of “time travel,” where users can observe data at various stages of consistency. As new events are processed, the read model gradually “catches up” with the latest state. However, during this period, the system might expose outdated or future-looking states, effectively letting users or systems “travel” between these states.

This is particularly significant when querying or presenting data that spans multiple sources or services across consistency boundaries. For example, one source might reflect an older version of the data, while another presents the most recent state.

As a result, combining data from these sources may yield conflicting or incoherent views of the overall system. This inconsistency poses a challenge when trying to form a unified or accurate perspective of the current state. To mitigate these issues, designers need to carefully consider the timing and aggregation of data to avoid misleading or inconsistent results, especially in cases where decisions are being made based on these read models.

Summary of Components

6_components_of_scalable_modeling.png

Credits

The following persons have had a lot of influence on what this repository describes:

Alberto Brandolini

Creator of Event Storming, a collaborative workshop technique used to explore and model complex business processes through domain events. His approach helps teams rapidly gain insights into business domains by focusing on key events that drive processes.

Eric Evans

Known for pioneering Domain-Driven Design (DDD), a software development philosophy that emphasizes aligning the software model closely with the business domain. His work focuses on creating a shared understanding between technical teams and domain experts to ensure the software reflects real-world complexity.

Gregory Young

Renowned for developing and promoting Command Query Responsibility Segregation (CQRS) and Event Sourcing. His work centers on separating read and write operations in systems, improving scalability, and using event sourcing to maintain the history of all changes in a system, offering resilience and insights into past system states.

Robert C. Martin

Author of Clean Architecture and a key figure in the promotion of software craftsmanship. His principles focus on building flexible, maintainable, and scalable systems by adhering to the separation of concerns and reducing dependencies between different layers of the system. Martin advocates for architecture that allows software to evolve over time, ensuring it remains easy to understand, extend, and refactor, even as requirements change. His work is centered on creating systems that prioritize independence from frameworks, databases, and UI, ensuring longevity and adaptability in software design.

Background

About Simo Roikonen

The Trigger that Led to This

The trigger that led to the creation of this page came from a LinkedIn post by Allard Buijze.
The post was about aggregates and their necessity. I studied the post and ended up watching Sara Pellegrini’s and Milan Savic’s talk: The Aggregate is dead. Long live the Aggregate!. After overcoming my cognitive dissonance, I needed to try out the new ideas by drawing them with Excalidraw, which led to new ideas that I have collected in this repository.

Thanks for triggering the ideas!

License For Using the Pictures

copyright.png

See License.