What happens when I exceed my concurrency limit?

Excess executions enter a queue and process in FIFO order as concurrency capacity frees up.

Do sub workflow calls count toward limits?

No. Sub workflow executions don't increment your concurrency counter.

Can I limit concurrency for one workflow only?

Not natively. Use external message brokers like RabbitMQ for workflow-specific limits.

What's the default concurrency limit for self hosted?

Self hosted instances default to unlimited. Hardware resources determine your practical ceiling.

How does queue mode differ from regular mode?

Queue mode uses Redis and worker instances for distributed processing. Regular mode runs everything on one instance.

Limiting Concurrency in n8n (& Why It Matters)

Limiting Concurrency in n8n (& Why It Matters) blog

You’re running a growing automation business. Suddenly, your n8n workflows start failing with cryptic errors. Too many concurrent executions are overwhelming your system. Your carefully built automation empire is crashing, and you don’t know why.

Sound familiar? Here’s the thing: limiting concurrency in n8n isn’t just a technical checkbox. It’s the difference between workflows that scale gracefully and systems that buckle under pressure.

Managing concurrency in n8n is essential for preventing resource exhaustion and maintaining stable workflow execution. The comparison table below highlights VPS hosting providers that deliver predictable performance under varying automation loads. These environments help ensure workflows run efficiently without overwhelming server resources. To explore our recommended VPS hosting options.

Reliable VPS Hosting Options for Controlled n8n Workflow Concurrency

User Rating	Recommended For
4.8	Scalability	Visit Kamatera
4.6	Affordability	Visit Hostinger
4.7	Developers	Visit IONOS

Takeaways

Concurrency measures how many workflow instances run simultaneously at any moment.
Sub workflow executions and manual executions don’t count toward limits.
n8n Cloud plans enforce strict concurrency limits while self hosted deployments offer flexibility.
Queue mode with Redis enables massive horizontal scaling for enterprise workloads.
Environment variables let you configure instance-level concurrency control easily.
External message brokers like RabbitMQ solve workflow-level concurrency challenges.
Wait nodes longer than 65 seconds don’t consume concurrency capacity.

Understanding Concurrency in n8n

How n8n Measures Concurrent Executions

Concurrency is the exact number of workflow instances running simultaneously in your system. Think of it as counting how many workers are actively processing tasks right now.

The measurement rules are straightforward. An active workflow triggered by a webhook or trigger node counts as exactly one instance. If five triggers fire simultaneously, that’s a concurrency count of five. Simple math.

Each concurrency instance lasts as long as the workflow runs. Once it reaches success or error, that slot frees up. Workflows with automatic retries consume a separate concurrent instance for each retry attempt. This matters when planning your concurrency capacity.

The contents table of your workflow’s executions tab shows exactly what’s happening. Active executions appear in real-time, letting you monitor your system’s current load.

What Doesn’t Count Toward Concurrency Limits

Here’s where things get interesting. n8n intentionally excludes certain execution types to preserve your production capacity.

Test webhook URLs used for development don’t count. Sub workflow invocations called by parent workflows don’t increment the counter either. Manual executions by administrators stay separate. Triggering deactivated workflows? Also excluded.

This document discusses concurrency from a production perspective. n8n keeps development activities in a separate mechanism so your testing never drains resources meant for business-critical automation.

Comparing Concurrency Limits Across n8n Deployment Models

n8n Cloud Concurrency Limits and Plans

Cloud dashboard shows execution limits and usage under current subscription plan.

Cloud instances enforce strict, plan-specific limits. This prevents excessive resource consumption on shared infrastructure and ensures fair usage across customers.

When executions exceed your plan’s concurrency limit, they’re placed in a queue. Processing follows FIFO order. First in, first out. No favorites.

Test evaluation concurrency limit operates on a completely separate pool. Your testing won’t interfere with production workflows.

Plan Tier	Pricing	Deployment	Concurrency Limit	Use Case
Starter	€20/month	n8n Cloud	5 concurrent	Individual users and learning
Pro	€50/month	n8n Cloud	Plan-dependent	Solo builders and small teams
Business	€667/month	Self-hosted	Plan-dependent	Companies < 100 employees
Enterprise	Contact Sales	Hosted or Self-hosted	Scalable	Organizations with compliance needs
Community Edition	Free	Self-hosted	Unlimited (hardware-dependent)	Open-source enthusiasts

Cloud enterprise plans offer scalable options for organizations needing more. The plan’s regular concurrency limit scales with your subscription level.

When comparing n8n Cloud to self hosted options, understanding these limits helps you choose wisely.

Ultahost

Launch, Scale, and Manage your website with high-performance Web Hosting and VPS.

Visit Site Coupons6

Mastering Concurrency Control in Self Hosted Deployments

Unlike cloud plans, Community Edition and self hosted deployments impose no default concurrent execution limits. Your instance is constrained only by hardware. CPU, memory, storage, and bandwidth limitations from tunneling services dictate your ceiling.

This flexibility sounds great. But here’s the catch: administrators must actively configure limits to prevent severe performance degradation during traffic spikes. Unlimited doesn’t mean invincible.

Without proper concurrency settings, your database can become overwhelmed. Error executions pile up. System responsiveness tanks. The whole instance suffers.

Finding the best n8n hosting providers ensures your infrastructure matches your concurrency needs. The right VPS delivers the resources your workflows demand.

Implementing Concurrency Control in Regular Mode

Using Environment Variables for Instance-Level Control

Laptop displays terminal commands setting concurrency limit to twenty.

Self-hosted administrators achieve production-grade control through the N8N_CONCURRENCY_PRODUCTION_LIMIT environment variable. No external systems required.

By default, this variable equals -1, which disables concurrency control entirely. Running export N8N_CONCURRENCY_PRODUCTION_LIMIT=20 restricts your system to 20 maximum production executions.

This limit applies exclusively to production executions from webhooks and trigger node activations. The concurrency control applies uniformly across all workflows on that specific workflow instance. One setting rules them all.

Concurrency settings determine how your entire instance behaves under load. Choose wisely based on your hardware capabilities.

Handling Queued Executions and Recovery

When your plan’s concurrency limit is reached, n8n automatically queues excess executions for later processing. The limits queue system handles overflow gracefully.

Queued executions cannot be retried while waiting. Deleting or cancelling removes them permanently. There’s no bringing them back.

If service interrupts, n8n resumes queued executions up to the limit upon instance startup. Remaining executions are re-enqueued immediately. Zero data loss. Data integrity maintained across reboots.

You cannot retry queued executions manually. Once concurrency capacity frees up, they process automatically. Executions remain in queue until slots open.

Scaling With n8n Queue Mode

How Queue Mode Architecture Works

Bright room displays n8n Redis workflow and queue architecture on screen.

Queue mode is a distributed architecture designed for massive scalability. Multiple n8n instances coordinate through a Redis message broker.

The main instance handles timers and webhooks. It generates the execution ID and passes it to Redis without running the workflow itself. Worker instances pull pending executions from the Redis queue, process the workflow, and report results back.

This setup requires robust infrastructure. A supported database backend like PostgreSQL or MySQL is mandatory. SQLite doesn’t support availability queue mode.

Check our guide on queue mode vs regular mode for a deeper technical dive into these architectural differences.

Managing Worker Concurrency in Queue Mode

Horizontal scaling happens by adding or removing worker instances based on workload. Elastic and cost-effective operations become possible.

Administrators use the –concurrency flag to control how many jobs each worker runs in parallel. The system defaults to 10 concurrent jobs per worker. Adjust via command line: n8n worker –concurrency=5.

n8n recommends keeping worker concurrency at 5 or higher. Setting it too low across many workers exhausts your database connection pool rapidly.

Workarounds for Workflow-Level Concurrency Control

Using External Message Brokers Like RabbitMQ

Currently, n8n lacks native mechanisms to limit concurrency for individual workflows. Limits apply to the whole instance only.

Developers solve this using queue-based architecture. Workflows push requests into an external message broker like RabbitMQ. A separate worker workflow consumes that queue one item at a time. Strictly sequential processing for sensitive tasks guaranteed.

This workaround introduces operational complexity. You’re maintaining an external system alongside n8n. But for specific workflow concurrency needs, it works beautifully.

Our upcoming guide on integrating n8n with external queues provides detailed implementation steps.

RabbitMQ website.

Leveraging Wait Nodes and HTTP Request Batching

Wait nodes with durations longer than 65 seconds get offloaded to the database. They don’t count towards your concurrency limits. Highly efficient for time-based workflow sequencing.

HTTP Request batch processing offers built-in options. The node features controlled parallelism for multiple API calls. These native workarounds preserve concurrency slots without sending data off-platform.

Best Practices for High-Volume n8n Workflows

Preventing Database Deadlocks at Scale

Workflows processing 10,000+ records per minute open independent database connections. This causes deadlocks and connection pool exhaustion. Your system grinds to a halt.

Optimization strategies include slowing down instance concurrency by lowering N8N_CONCURRENCY_PRODUCTION_LIMIT. Process smaller batches to reduce transaction scope. Implement idempotent operations and add explicit retries for deadlock errors.

For comprehensive optimization techniques, see our guide on performance tuning for large workflows.

Managing Execution Timeouts

Workflows exceeding execution limits return a 500 status code or code 0. The message reads: “The execution was cancelled because it timed out.”

Default maximum execution timeout is 3600 seconds. Administrators set global timeouts using EXECUTIONS_TIMEOUT. Individual users override settings up to the maximum threshold defined by EXECUTIONS_TIMEOUT_MAX.

Setting Up Your Automation Infrastructure

Before diving deep into n8n configuration, you need solid hosting infrastructure. A reliable VPS provides the foundation for stable workflow execution.

Whether you’re running a business website alongside n8n or deploying dedicated automation servers, proper hosting matters. Explore VPS hosting options to find infrastructure matching your concurrency and performance requirements.

Conclusion

Limiting concurrency in n8n transforms unpredictable automation into reliable systems. Whether using cloud plans with built-in limits or configuring self hosted deployments manually, understanding how concurrency works matters.

Start with appropriate limits, monitor your workflows, and scale infrastructure as needed. Your automation will thank you.

Next Steps: What Now?

Audit your current concurrency settings and document active limits.
Monitor the workflow’s executions tab to identify bottlenecks.
Configure N8N_CONCURRENCY_PRODUCTION_LIMIT for self hosted instances.
Evaluate queue mode if you need horizontal scaling capabilities.
Implement external message brokers for workflow-specific concurrency control.
Test your configuration under simulated load before production deployment.