
Your n8n setup worked great last month. Now your team added five new workflows, traffic doubled, and suddenly everything breaks during business hours. Before you throw more RAM at the problem, there’s something you need to understand.
The execution engine you choose determines everything. Regular mode keeps things simple. Queue mode unlocks serious scalability. Knowing when to make that switch separates teams that struggle from teams that scale.
Choosing between queue mode and regular mode in n8n depends on workflow complexity and scaling requirements. The comparison table below highlights VPS hosting providers that support both lightweight and distributed automation deployments. These environments make it easier to scale workflows efficiently while maintaining stable execution performance. Explore our recommended VPS hosting options.
Best VPS Solutions for Scaling n8n Beyond Standard Workflow Execution
| Provider | User Rating | Recommended For | |
|---|---|---|---|
![]() | 4.8 | Scalability | Visit Kamatera |
![]() | 4.6 | Affordability | Visit Hostinger |
![]() | 4.7 | Developers | Visit IONOS |
Understanding n8n Execution Modes: A High-Level Overview
Before diving into the technical weeds, let’s establish what we’re actually comparing. Both modes accomplish the same goal. They execute workflows. But they approach the task with fundamentally different architectures.
What is Regular Mode in n8n?
Regular mode is n8n’s default execution architecture designed for simplicity and rapid deployment. Think of it as the “just works” option that ships out of the box.
Here’s what defines it:
- Operates as a single-instance setup where all operations happen within a unified Node.js process
- Handles the complete workflow lifecycle from receiving triggers to generating execution requests to performing computations
- Perfect for initial development, testing, and small-scale deployments without external dependencies
The beauty of regular mode? It easily runs on basic hosting plans. If you’re a startup or small team watching your budget, this keeps costs low while you prove out your automation strategy.
What is n8n Queue Mode?
Queue mode is the enterprise-grade distributed architecture built for horizontal scaling and high-volume automation. It’s what you graduate to when regular mode can’t keep up.
The key characteristics set it apart:
- Separates workflow dispatching from actual execution using a centralized message broker through Redis
- Distributes execution tasks asynchronously across multiple independent worker instances
- Provides high availability, massive throughput, and fault tolerance for mission-critical business processes
When you enable queue mode, you’re fundamentally restructuring how n8n approaches work. Instead of one process doing everything, you’re building a system where specialized components handle specific responsibilities.
Architectural Differences: Single-Instance vs. Distributed

Understanding the architecture helps you predict when you’ll hit limitations. Let’s break down what’s actually happening under the hood.
The Monolithic Design of the Main Process
In regular mode, the main process handles everything. It’s a single-threaded monolith that does the heavy lifting alone.
All workflow execution competes for the exact same system resources. CPU and RAM on a single node get shared across every running automation. This design works great until it doesn’t.
Here’s the problem: a single crashed execution or one highly resource-intensive workflow can bottleneck the entire system. One runaway process affects everything else waiting in line.
How the Event Loop Impacts Performance
Because regular mode relies on a single Node.js process, it’s bound by a single event loop. This creates real constraints for busy systems.
The input output operations bottleneck shows up in practical ways. When you’re waiting 5 seconds for an external API response, that wait blocks the thread. While waiting, the system cannot process other workflow executions efficiently.
Even on multi-core servers, regular mode can only effectively utilize one CPU core for execution. All that extra processing power? Sitting idle while your single process struggles.
The Distributed Architecture of Queue Mode
Queue mode fundamentally restructures the platform by decoupling trigger reception from the execution phase. This split responsibilities approach unlocks serious scalability.
The multi-component stack includes:
- Redis: Acts as the message broker maintaining the persistent job queue
- PostgreSQL: Serves as the shared database accessible by all nodes
- Worker Nodes: Independent instances that claim and process jobs from Redis
This architecture means you can scale workers independently based on workload. Need more capacity? Add workers. Simple as that.
The Role of the Main Instance and Workers
With queue mode, the main instance receives webhook calls and timer events, generates an execution ID, but does not run the workflow itself.
The execution IDs pass to Redis, allowing one or more worker instances to pull jobs asynchronously. Each worker operates as an independent Node.js instance. This enables true parallel processing across separate machines or containers.
Redis notifies workers when new jobs arrive. Workers claim tasks, process them, and report back. The main instance coordinates. Workers execute workflows. Clean separation.
Performance and Scalability Benchmarks
Numbers tell the real story. Let’s look at actual stress test data comparing both modes.
Single-Instance Performance Ceilings
Regular mode performs exceptionally well for small loads but hits a hard ceiling under high concurrency.
Stress test data from a C5.large AWS instance (1 vCPU, 4GB RAM) reveals the pattern:
- 100 virtual users: Handles traffic with reasonable response times
- 200 virtual users: Response times spike to 12 seconds with a 1% failure rate
- 200 users + 10 distinct workflows: System collapses with 34-second response times and a 38% failure rate
Maximum theoretical capacity under ideal conditions? Around 220 workflow executions per second. But real-world performance degrades well before that ceiling.
Achieving High Throughput With Queue Mode

Queue mode solves these limitations through horizontal distribution. The benchmark improvements are dramatic.
Stress test data with queue mode active:
- C5.large instance: Sustained 74 requests per second with a 0% failure rate, even with 10 concurrent workflows
- C5.4xlarge instance (16 vCPUs, 32GB RAM): Achieved 162 requests per second across 200 virtual users
- Latency: Remained under 1.2 seconds with zero failures under maximum load
That’s the best scalability you’ll find for high-volume automation needs.
Managing Queue Depth and Concurrency Limits
Both modes offer concurrency limits to prevent system overload and manage Redis queue depth effectively.
Regular mode uses N8N_CONCURRENCY_PRODUCTION_LIMIT to queue executions locally in FIFO order. Simple and predictable.
Queue mode takes a multi-layered approach balancing global and per-worker limits:
- Individual workers default to 10 concurrent jobs, configured via the –concurrency flag
- Global limits override worker concurrency settings to ensure the shared database and Redis aren’t overwhelmed
- Queue depth monitoring helps prevent pending executions from piling up
Input Output and CPU Processing Models Compared
The performance differences become clear when you examine how each mode handles different operation types.
For input output operations, queue mode distributes waits across more workers. A 5-second API wait on Worker A does not block Worker B. They operate independently.
For CPU operations, queue mode executes intensive tasks in true parallel across multiple CPU cores. Regular mode remains strictly limited to one core regardless of your hardware.
Infrastructure and Resource Requirements
Scaling comes with costs. Let’s examine what each mode actually demands from your infrastructure.
Regular Mode Resource Consumption Profile

Highly efficient for simple setups, regular mode requires minimal infrastructure footprint. You can run queue mode on modest hardware when you’re starting out. Wait, that’s not right. Regular mode runs on modest hardware.
Baseline RAM consumption sits at approximately 500MB when idle. Memory recommended increases with workflow complexity, but growth remains predictable for simple automations. Complex data transformations can quickly consume available memory though.
Queue Mode Infrastructure Overheads
Achieving distributed scalability requires a larger initial resource investment. Here’s the baseline RAM consumption breakdown totaling approximately 2.5GB:
- Redis + PostgreSQL: ~1GB
- Main n8n process: ~500MB
- Worker process instances: 200-500MB per worker
During active execution, queue mode may use slightly more RAM per workflow, around 60MB vs 40MB. That’s the distributed coordination overhead at work. Database performance matters more in this configuration.
Database Requirements: SQLite vs. PostgreSQL
The database choice changes significantly between modes.
| Feature | Regular Mode | Queue Mode |
|---|---|---|
| Default Database | SQLite (Supported) | PostgreSQL (Mandatory) |
| Architecture | Local file-based | Shared network database |
| Connection Pooling | Not required | Highly recommended (e.g., pgBouncer) |
| Maintenance | Simple, single-owner | Complex, multi-process access |
The SQLite database works fine for single-instance deployments. But queue mode needs PostgreSQL to support queue mode operations with multiple worker processes hitting the same data. Database queries must be optimized for concurrent access.
Managing Binary Data in Your Workflows
File handling changes dramatically between modes. This catches many teams off guard during migration.
Filesystem Constraints in Regular Mode

In regular mode, binary data like images, PDFs, and file uploads can be safely stored directly on the local filesystem.
Since all operations occur within a single instance, the process always has reliable access to the local disk. This keeps infrastructure costs low for file-heavy workflows. Design workflows around local storage when it makes sense.
Why You Need External Binary Data Storage
Queue mode does not support storing binary data on the local filesystem. Here’s why.
Distributed workers run on separate machines or containers. They cannot reliably share local volumes. One worker might save a file. Another worker processing the next step can’t find it.
The requirement: you must configure external binary data storage like Amazon S3 to persist binary data. All workers can then access the same files. It adds slight architectural complexity but ensures data consistency across your worker node fleet.
Configuration and Setup Complexity
Let’s get practical about what setup actually looks like for each mode.
Setting Up Regular Mode for Quick Deployment
Deployment takes seconds. Download n8n, set basic credentials, and start the service.
No external dependencies. No message broker configuration. No complex orchestration required. It’s ideal for developers wanting to iterate quickly on workflow logic without DevOps overhead.
Check our guide on best n8n hosting providers for options that make this even easier.
How to Enable Queue Mode in Production
Queue mode requires a sequential, dependency-ordered startup process. Getting this wrong means failed workers and broken triggers.
The startup sequence:
- Start Redis (workers will fail if this is missing)
- Start PostgreSQL
- Launch the main n8n instance
- Launch worker instances
This setup is best managed via multi-container orchestration platforms like Docker Compose or Kubernetes. Check our guide on high-availability setup for n8n on VPS for detailed deployment strategies.
Essential Environment Variables
Proper configuration relies on specific environment variables applied across all nodes.
Key variables you must set:
- EXECUTIONS_MODE=queue (Must be set on main and workers)
- QUEUE_BULL_REDIS_HOST & QUEUE_BULL_REDIS_PORT (Redis connection)
- DB_TYPE=postgresdb (Plus associated host, user, and password variables)
Redis configuration details matter. Get the connection strings wrong, and workers can’t pull jobs from the queue.
Configuring a Load Balancer for Multi-Main Setup
For cloud enterprise deployments needing high availability, you can deploy multiple main instances behind a load balancer.
This requires setting N8N_MULTI_MAIN_SETUP_ENABLED=true.
The crucial step: the load balancer must be configured with session persistence through sticky sessions to maintain consistent user sessions. Webhook triggers should route specifically to dedicated webhook processor instances if you’re running manual workflow executions alongside automated ones.
Securing Your Setup and Managing the Encryption Key
Security becomes paramount when moving to a distributed architecture. More components mean more attack surface.
Ensure Redis and PostgreSQL connections are secured and not exposed to the public internet. The n8n encryption key (N8N_ENCRYPTION_KEY) must be identical across the main instance and all worker nodes to properly decrypt stored credentials and sensitive workflow data.
Mismatched encryption keys? Workers can’t read credential configuration details. Workflows fail silently on authentication steps.
Choosing the Right VPS for Your n8n Deployment

Whether you run queue mode or stick with regular mode, your hosting infrastructure matters enormously. The wrong VPS can bottleneck even well-architected deployments.
For regular mode, look for providers offering high IOPS storage and solid single-core performance. Queue mode deployments need VPS options that support multiple containers and offer good network connectivity between instances.
Explore VPS hosting options to find providers matching your scalability requirements. The right foundation makes adding workers and scaling horizontally much smoother down the road.
Real-World Use Cases: Queue Mode vs Regular Mode in n8n
Theory is great. Practical guidance is better. Here’s when each mode makes sense.
When to Stick with Regular Mode
Regular mode fits several common scenarios perfectly:
- Development and Testing: Rapid prototyping without DevOps overhead
- Small Businesses: Modest automation needs where peak concurrency stays under 10 multiple simultaneous workflow executions
- Proof of Concept: Demonstrating value quickly before investing in enterprise hosting infrastructure
- File-Heavy Local Work: Workflows relying heavily on local filesystem binary data without an S3 budget
Most teams fall into these categories. Don’t overcomplicate things if regular mode handles your workload comfortably.
When to Switch to n8n Queue Mode
Queue mode becomes essential under specific conditions:
- Enterprise Scale: Processing more than 100 concurrent workflow executions regularly
- Mission-Critical Reliability: Requires high availability where a single node failure won’t stop business operations
- Spiky Traffic: Unpredictable concurrency patterns requiring elastic horizontal scaling with adding workers on-demand
- Multi-Team Environments: Needing resource isolation so one team’s batch jobs don’t starve another team’s automations
Compare these requirements against alternatives like n8n vs Airflow for complex orchestration or n8n vs Windmill for high-throughput scenarios. Sometimes a different tool fits better than scaling n8n.
Scaling Strategy Considerations

Understanding horizontal vs vertical scaling for n8n helps you plan your growth path intelligently.
Vertical scaling means bigger servers. More RAM, more CPU cores. It works until it doesn’t. There’s a ceiling on how powerful a single machine can get.
Horizontal scaling means scale horizontally by adding more workers. Queue mode enables this approach. Need twice the capacity? Deploy twice the workers. No single-machine limits.
For performance optimization details, see our guide on performance tuning n8n for large workflow volumes.
Pros and Cons Summary
Let’s consolidate the tradeoffs into clear comparisons.
Advantages and Disadvantages of Regular Mode
Pros:
- Incredibly easy to set up and maintain
- Low base memory footprint around 500MB
- Supports local filesystem storage for binary data
Cons:
- Hard performance ceiling that degrades rapidly past 100 concurrent users
- Single point of failure with no high availability
- Input output operations block the single event loop thread
Advantages and Disadvantages of Queue Mode
Pros:
- Massive horizontal scalability exceeding 160 requests per second on good hardware
- True parallel processing across multiple CPU cores
- High availability and fault tolerance
Cons:
- Steeper learning curve requiring DevOps knowledge for Docker, Redis, and Postgres
- Higher baseline resource requirements around 2.5GB RAM
- Requires external S3 storage for binary files
Making the Transition
Switching from regular mode to queue mode isn’t flipping a switch. It’s a migration project.
Start by auditing your current workflow definitions. Identify any that rely on local filesystem access for binary data. These need refactoring first.
Next, provision your infrastructure. You need a Redis instance and PostgreSQL database before anything else works. Test connectivity between components before attempting to run queue mode.
Deploy gradually. Start with one worker process and validate execution works correctly. Then scale workers based on observed high load patterns.
Monitor Redis queue depth and pending executions closely during the transition. Bottlenecks reveal themselves quickly under real workload.
Next Steps: What Now?
- Audit your current workflow execution patterns and measure peak concurrency.
- Calculate whether your single-instance ceiling meets actual demand.
- Provision Redis and PostgreSQL in a test environment to practice setup.
- Migrate binary data storage to S3 before enabling queue mode.
- Deploy queue mode with one worker initially, then scale workers based on load.
- Configure monitoring for database performance and queue depth metrics.



