n8n Queue Mode vs Regular Mode (& When to Switch)

n8n Queue Mode vs Regular Mode (& When to Switch)

n8n Queue Mode vs Regular Mode (& When to Switch) blog

Your n8n setup worked great last month. Now your team added five new workflows, traffic doubled, and suddenly everything breaks during business hours. Before you throw more RAM at the problem, there’s something you need to understand.

The execution engine you choose determines everything. Regular mode keeps things simple. Queue mode unlocks serious scalability. Knowing when to make that switch separates teams that struggle from teams that scale.

Choosing between queue mode and regular mode in n8n depends on workflow complexity and scaling requirements. The comparison table below highlights VPS hosting providers that support both lightweight and distributed automation deployments. These environments make it easier to scale workflows efficiently while maintaining stable execution performance. Explore our recommended VPS hosting options.

Best VPS Solutions for Scaling n8n Beyond Standard Workflow Execution

ProviderUser RatingRecommended For 
Kamatera Logo4.8ScalabilityVisit Kamatera
4.6AffordabilityVisit Hostinger
4.7DevelopersVisit IONOS

Takeaways
  • Regular mode runs all workflows in a single process, hitting performance ceilings around 100 concurrent users.
  • Queue mode distributes execution across multiple worker processes using Redis as a message broker.
  • Switching to queue mode requires PostgreSQL database and Redis instance setup.
  • Worker instances can scale horizontally to handle heavy load without bottlenecks.
  • Binary data storage needs external solutions like S3 when running queue mode.
  • The encryption key must match across all nodes for proper credential access.
  • Most teams should stick with regular mode until they consistently need more than 10 simultaneous workflow executions.

Understanding n8n Execution Modes: A High-Level Overview

Before diving into the technical weeds, let’s establish what we’re actually comparing. Both modes accomplish the same goal. They execute workflows. But they approach the task with fundamentally different architectures.

What is Regular Mode in n8n?

Regular mode is n8n’s default execution architecture designed for simplicity and rapid deployment. Think of it as the “just works” option that ships out of the box.

Here’s what defines it:

  • Operates as a single-instance setup where all operations happen within a unified Node.js process
  • Handles the complete workflow lifecycle from receiving triggers to generating execution requests to performing computations
  • Perfect for initial development, testing, and small-scale deployments without external dependencies

The beauty of regular mode? It easily runs on basic hosting plans. If you’re a startup or small team watching your budget, this keeps costs low while you prove out your automation strategy.

What is n8n Queue Mode?

Queue mode is the enterprise-grade distributed architecture built for horizontal scaling and high-volume automation. It’s what you graduate to when regular mode can’t keep up.

The key characteristics set it apart:

  • Separates workflow dispatching from actual execution using a centralized message broker through Redis
  • Distributes execution tasks asynchronously across multiple independent worker instances
  • Provides high availability, massive throughput, and fault tolerance for mission-critical business processes

When you enable queue mode, you’re fundamentally restructuring how n8n approaches work. Instead of one process doing everything, you’re building a system where specialized components handle specific responsibilities.

Architectural Differences: Single-Instance vs. Distributed

Monolithic single-instance versus scalable distributed multi-node execution design.

Understanding the architecture helps you predict when you’ll hit limitations. Let’s break down what’s actually happening under the hood.

The Monolithic Design of the Main Process

In regular mode, the main process handles everything. It’s a single-threaded monolith that does the heavy lifting alone.

All workflow execution competes for the exact same system resources. CPU and RAM on a single node get shared across every running automation. This design works great until it doesn’t.

Here’s the problem: a single crashed execution or one highly resource-intensive workflow can bottleneck the entire system. One runaway process affects everything else waiting in line.

How the Event Loop Impacts Performance

Because regular mode relies on a single Node.js process, it’s bound by a single event loop. This creates real constraints for busy systems.

The input output operations bottleneck shows up in practical ways. When you’re waiting 5 seconds for an external API response, that wait blocks the thread. While waiting, the system cannot process other workflow executions efficiently.

Even on multi-core servers, regular mode can only effectively utilize one CPU core for execution. All that extra processing power? Sitting idle while your single process struggles.

The Distributed Architecture of Queue Mode

Queue mode fundamentally restructures the platform by decoupling trigger reception from the execution phase. This split responsibilities approach unlocks serious scalability.

The multi-component stack includes:

  • Redis: Acts as the message broker maintaining the persistent job queue
  • PostgreSQL: Serves as the shared database accessible by all nodes
  • Worker Nodes: Independent instances that claim and process jobs from Redis

This architecture means you can scale workers independently based on workload. Need more capacity? Add workers. Simple as that.

The Role of the Main Instance and Workers

With queue mode, the main instance receives webhook calls and timer events, generates an execution ID, but does not run the workflow itself.

The execution IDs pass to Redis, allowing one or more worker instances to pull jobs asynchronously. Each worker operates as an independent Node.js instance. This enables true parallel processing across separate machines or containers.

Redis notifies workers when new jobs arrive. Workers claim tasks, process them, and report back. The main instance coordinates. Workers execute workflows. Clean separation.

Ultahost

Launch, Scale, and Manage your website with high-performance Web Hosting and VPS.
Visit Site Coupons6

Performance and Scalability Benchmarks

Numbers tell the real story. Let’s look at actual stress test data comparing both modes.

Single-Instance Performance Ceilings

Regular mode performs exceptionally well for small loads but hits a hard ceiling under high concurrency.

Stress test data from a C5.large AWS instance (1 vCPU, 4GB RAM) reveals the pattern:

  • 100 virtual users: Handles traffic with reasonable response times
  • 200 virtual users: Response times spike to 12 seconds with a 1% failure rate
  • 200 users + 10 distinct workflows: System collapses with 34-second response times and a 38% failure rate

Maximum theoretical capacity under ideal conditions? Around 220 workflow executions per second. But real-world performance degrades well before that ceiling.

Achieving High Throughput With Queue Mode

Queue mode distributes load across workers enabling high throughput scaling.

Queue mode solves these limitations through horizontal distribution. The benchmark improvements are dramatic.

Stress test data with queue mode active:

  • C5.large instance: Sustained 74 requests per second with a 0% failure rate, even with 10 concurrent workflows
  • C5.4xlarge instance (16 vCPUs, 32GB RAM): Achieved 162 requests per second across 200 virtual users
  • Latency: Remained under 1.2 seconds with zero failures under maximum load

That’s the best scalability you’ll find for high-volume automation needs.

Managing Queue Depth and Concurrency Limits

Both modes offer concurrency limits to prevent system overload and manage Redis queue depth effectively.

Regular mode uses N8N_CONCURRENCY_PRODUCTION_LIMIT to queue executions locally in FIFO order. Simple and predictable.

Queue mode takes a multi-layered approach balancing global and per-worker limits:

  • Individual workers default to 10 concurrent jobs, configured via the –concurrency flag
  • Global limits override worker concurrency settings to ensure the shared database and Redis aren’t overwhelmed
  • Queue depth monitoring helps prevent pending executions from piling up

Input Output and CPU Processing Models Compared

The performance differences become clear when you examine how each mode handles different operation types.

For input output operations, queue mode distributes waits across more workers. A 5-second API wait on Worker A does not block Worker B. They operate independently.

For CPU operations, queue mode executes intensive tasks in true parallel across multiple CPU cores. Regular mode remains strictly limited to one core regardless of your hardware.

Infrastructure and Resource Requirements

Scaling comes with costs. Let’s examine what each mode actually demands from your infrastructure.

Regular Mode Resource Consumption Profile

RAM usage graph.

Highly efficient for simple setups, regular mode requires minimal infrastructure footprint. You can run queue mode on modest hardware when you’re starting out. Wait, that’s not right. Regular mode runs on modest hardware.

Baseline RAM consumption sits at approximately 500MB when idle. Memory recommended increases with workflow complexity, but growth remains predictable for simple automations. Complex data transformations can quickly consume available memory though.

Queue Mode Infrastructure Overheads

Achieving distributed scalability requires a larger initial resource investment. Here’s the baseline RAM consumption breakdown totaling approximately 2.5GB:

  • Redis + PostgreSQL: ~1GB
  • Main n8n process: ~500MB
  • Worker process instances: 200-500MB per worker

During active execution, queue mode may use slightly more RAM per workflow, around 60MB vs 40MB. That’s the distributed coordination overhead at work. Database performance matters more in this configuration.

Database Requirements: SQLite vs. PostgreSQL

The database choice changes significantly between modes.

FeatureRegular ModeQueue Mode
Default DatabaseSQLite (Supported)PostgreSQL (Mandatory)
ArchitectureLocal file-basedShared network database
Connection PoolingNot requiredHighly recommended (e.g., pgBouncer)
MaintenanceSimple, single-ownerComplex, multi-process access

The SQLite database works fine for single-instance deployments. But queue mode needs PostgreSQL to support queue mode operations with multiple worker processes hitting the same data. Database queries must be optimized for concurrent access.

Managing Binary Data in Your Workflows

File handling changes dramatically between modes. This catches many teams off guard during migration.

Filesystem Constraints in Regular Mode

Filesystem storage capacity on a laptop.

In regular mode, binary data like images, PDFs, and file uploads can be safely stored directly on the local filesystem.

Since all operations occur within a single instance, the process always has reliable access to the local disk. This keeps infrastructure costs low for file-heavy workflows. Design workflows around local storage when it makes sense.

Why You Need External Binary Data Storage

Queue mode does not support storing binary data on the local filesystem. Here’s why.

Distributed workers run on separate machines or containers. They cannot reliably share local volumes. One worker might save a file. Another worker processing the next step can’t find it.

The requirement: you must configure external binary data storage like Amazon S3 to persist binary data. All workers can then access the same files. It adds slight architectural complexity but ensures data consistency across your worker node fleet.

Configuration and Setup Complexity

Let’s get practical about what setup actually looks like for each mode.

Setting Up Regular Mode for Quick Deployment

Deployment takes seconds. Download n8n, set basic credentials, and start the service.

No external dependencies. No message broker configuration. No complex orchestration required. It’s ideal for developers wanting to iterate quickly on workflow logic without DevOps overhead.

Check our guide on best n8n hosting providers for options that make this even easier.

Build Your App Now with Hostinger Horizons
Turn your idea into a powerful app in minutes with Hostinger Horizons. No coding, no hassle, just AI-powered building that brings your vision to life.
Visit Hostinger

How to Enable Queue Mode in Production

Queue mode requires a sequential, dependency-ordered startup process. Getting this wrong means failed workers and broken triggers.

The startup sequence:

  1. Start Redis (workers will fail if this is missing)
  2. Start PostgreSQL
  3. Launch the main n8n instance
  4. Launch worker instances

This setup is best managed via multi-container orchestration platforms like Docker Compose or Kubernetes. Check our guide on high-availability setup for n8n on VPS for detailed deployment strategies.

Essential Environment Variables

Proper configuration relies on specific environment variables applied across all nodes.

Key variables you must set:

  • EXECUTIONS_MODE=queue (Must be set on main and workers)
  • QUEUE_BULL_REDIS_HOST & QUEUE_BULL_REDIS_PORT (Redis connection)
  • DB_TYPE=postgresdb (Plus associated host, user, and password variables)

Redis configuration details matter. Get the connection strings wrong, and workers can’t pull jobs from the queue.

Configuring a Load Balancer for Multi-Main Setup

For cloud enterprise deployments needing high availability, you can deploy multiple main instances behind a load balancer.

This requires setting N8N_MULTI_MAIN_SETUP_ENABLED=true.

The crucial step: the load balancer must be configured with session persistence through sticky sessions to maintain consistent user sessions. Webhook triggers should route specifically to dedicated webhook processor instances if you’re running manual workflow executions alongside automated ones.

Securing Your Setup and Managing the Encryption Key

Security becomes paramount when moving to a distributed architecture. More components mean more attack surface.

Ensure Redis and PostgreSQL connections are secured and not exposed to the public internet. The n8n encryption key (N8N_ENCRYPTION_KEY) must be identical across the main instance and all worker nodes to properly decrypt stored credentials and sensitive workflow data.

Mismatched encryption keys? Workers can’t read credential configuration details. Workflows fail silently on authentication steps.

Choosing the Right VPS for Your n8n Deployment

VPS servers in a server room.

Whether you run queue mode or stick with regular mode, your hosting infrastructure matters enormously. The wrong VPS can bottleneck even well-architected deployments.

For regular mode, look for providers offering high IOPS storage and solid single-core performance. Queue mode deployments need VPS options that support multiple containers and offer good network connectivity between instances.

Explore VPS hosting options to find providers matching your scalability requirements. The right foundation makes adding workers and scaling horizontally much smoother down the road.

Real-World Use Cases: Queue Mode vs Regular Mode in n8n

Theory is great. Practical guidance is better. Here’s when each mode makes sense.

When to Stick with Regular Mode

Regular mode fits several common scenarios perfectly:

  • Development and Testing: Rapid prototyping without DevOps overhead
  • Small Businesses: Modest automation needs where peak concurrency stays under 10 multiple simultaneous workflow executions
  • Proof of Concept: Demonstrating value quickly before investing in enterprise hosting infrastructure
  • File-Heavy Local Work: Workflows relying heavily on local filesystem binary data without an S3 budget

Most teams fall into these categories. Don’t overcomplicate things if regular mode handles your workload comfortably.

When to Switch to n8n Queue Mode

Queue mode becomes essential under specific conditions:

  • Enterprise Scale: Processing more than 100 concurrent workflow executions regularly
  • Mission-Critical Reliability: Requires high availability where a single node failure won’t stop business operations
  • Spiky Traffic: Unpredictable concurrency patterns requiring elastic horizontal scaling with adding workers on-demand
  • Multi-Team Environments: Needing resource isolation so one team’s batch jobs don’t starve another team’s automations

Compare these requirements against alternatives like n8n vs Airflow for complex orchestration or n8n vs Windmill for high-throughput scenarios. Sometimes a different tool fits better than scaling n8n.

Scaling Strategy Considerations

Horizontal vs Vertical scaling illustration.

Understanding horizontal vs vertical scaling for n8n helps you plan your growth path intelligently.

Vertical scaling means bigger servers. More RAM, more CPU cores. It works until it doesn’t. There’s a ceiling on how powerful a single machine can get.

Horizontal scaling means scale horizontally by adding more workers. Queue mode enables this approach. Need twice the capacity? Deploy twice the workers. No single-machine limits.

For performance optimization details, see our guide on performance tuning n8n for large workflow volumes.

Pros and Cons Summary

Let’s consolidate the tradeoffs into clear comparisons.

Advantages and Disadvantages of Regular Mode

Pros:

  • Incredibly easy to set up and maintain
  • Low base memory footprint around 500MB
  • Supports local filesystem storage for binary data

Cons:

  • Hard performance ceiling that degrades rapidly past 100 concurrent users
  • Single point of failure with no high availability
  • Input output operations block the single event loop thread

Advantages and Disadvantages of Queue Mode

Pros:

  • Massive horizontal scalability exceeding 160 requests per second on good hardware
  • True parallel processing across multiple CPU cores
  • High availability and fault tolerance

Cons:

  • Steeper learning curve requiring DevOps knowledge for Docker, Redis, and Postgres
  • Higher baseline resource requirements around 2.5GB RAM
  • Requires external S3 storage for binary files
VPS
Cheap VPS
best option

Making the Transition

Switching from regular mode to queue mode isn’t flipping a switch. It’s a migration project.

Start by auditing your current workflow definitions. Identify any that rely on local filesystem access for binary data. These need refactoring first.

Next, provision your infrastructure. You need a Redis instance and PostgreSQL database before anything else works. Test connectivity between components before attempting to run queue mode.

Deploy gradually. Start with one worker process and validate execution works correctly. Then scale workers based on observed high load patterns.

Monitor Redis queue depth and pending executions closely during the transition. Bottlenecks reveal themselves quickly under real workload.

Next Steps: What Now?

  1. Audit your current workflow execution patterns and measure peak concurrency.
  2. Calculate whether your single-instance ceiling meets actual demand.
  3. Provision Redis and PostgreSQL in a test environment to practice setup.
  4. Migrate binary data storage to S3 before enabling queue mode.
  5. Deploy queue mode with one worker initially, then scale workers based on load.
  6. Configure monitoring for database performance and queue depth metrics.

Frequently Asked Questions

Can I switch between modes without losing workflow data?

Yes. Workflow definitions live in the database. Switching modes changes how n8n processes them, not where they’re stored.

How many workers should I start with?

Start with two workers. Monitor performance, then add more workers as needed based on queue depth and execution times.

Does queue mode require Kubernetes?

No. Docker Compose works fine for smaller deployments. Kubernetes adds benefits for larger-scale operations.

What happens if Redis goes down?

Workers cannot pull jobs from the queue. Executions pause until Redis recovers. Consider Redis clustering for high availability.

Can regular mode handle webhook triggers effectively?

Yes, for moderate volumes. High-volume webhook scenarios benefit from queue mode’s distributed processing.

Is PostgreSQL required for queue mode?

Yes. SQLite cannot handle concurrent access from multiple worker processes safely.

How do I monitor queue performance?

Check Redis queue depth, worker concurrency utilization, and database query times. n8n exposes metrics for monitoring systems.

Handling Webhook Traffic at Scale in n8n

N8n webhook scaling breaks down faster than you'd expect. When request volumes spike, concurrency pressure builds, and executions start backin...
8 min read
Christi Gorbett
Christi Gorbett
Content Marketing Specialist

Running n8n in Production - Stability Checklist

Getting workflows live is only half the battle. n8n production stability is what keeps your automations running reliably when it actually matt...
8 min read
Christi Gorbett
Christi Gorbett
Content Marketing Specialist

CI/CD Pipelines for Deploying n8n Updates

Manually pushing n8n updates across environments is error-prone and time-consuming. A well-configured n8n CI/CD pipeline changes that. It auto...
8 min read
Christi Gorbett
Christi Gorbett
Content Marketing Specialist

Running n8n with Docker Compose vs Bare-Metal VPS

Choosing between n8n Docker Compose vs bare metal VPS comes down to more than personal preference. It affects how you deploy, scale, and maint...
8 min read
Christi Gorbett
Christi Gorbett
Content Marketing Specialist
Click to go to the top of the page
Go To Top
HostAdvice.com provides professional web hosting reviews fully independent of any other entity. Our reviews are unbiased, honest, and apply the same evaluation standards to all those reviewed. While monetary compensation is received from a few of the companies listed on this site, compensation of services and products have no influence on the direction or conclusions of our reviews. Nor does the compensation influence our rankings for certain host companies. This compensation covers account purchasing costs, testing costs and royalties paid to reviewers.