Sim is an open-source visual canvas for building AI agent workflows — no orchestration boilerplate, 100+ integrations, self-hostable in one command.
Last quarter I burned three days wiring together a multi-step AI pipeline in Python. There was a research agent that pulled from Exa, a summarization step using Claude, a router that decided which Slack channel to post to based on urgency, and a fallback that emailed the team if Slack was down. The logic itself took maybe an afternoon. The other two and a half days? Retry logic, state passing between steps, debugging why the router's output wasn't being serialized correctly into the next agent's context, and writing the deployment boilerplate to run the whole thing on a schedule.
I remember thinking: this orchestration layer is not the hard part. The hard part should be the AI behavior, the tool selection, the routing logic. Instead I'm writing plumbing.
That's the pain point Sim is designed to eliminate — and after spending a week with it, I have a lot of thoughts.
Sim (previously Sim Studio, available at sim.ai and open-sourced at github.com/simstudioai/sim) is a visual, canvas-based workflow builder for AI agent pipelines. Think ReactFlow meets LangChain, but with a no-code interface bolted on top and a deployment story that doesn't require you to write a Dockerfile from scratch.
The elevator pitch: you drag blocks onto a canvas, connect them, configure each block with prompts and tool bindings, and then run the whole thing via the UI, a REST API call, a webhook, or a cron schedule. The pipeline executes, logs are retained, and you can inspect every node's input and output inline.
But calling it "no-code" undersells what it actually supports. The architecture is genuinely layered: non-technical users can build simple chains without touching a line of code, while engineers can drop into a Function block that executes arbitrary JavaScript, wire environment variables directly into prompts, or expose workflows as MCP endpoints. It's a spectrum, not a ceiling.
The repo has accumulated over 21,000 GitHub stars and 2,800 forks at time of writing, which is not nothing for a project that's still pre-1.0 by version number. Their latest release is v0.5.29. The community Discord is active. 34 contributors have touched the code. This isn't a weekend project someone abandoned — it has genuine momentum.
Before you open the canvas, it helps to understand the mental model. Sim organizes everything into three conceptual layers: triggers, blocks, and tools.
Triggers are how a workflow starts. You can kick off a workflow through a chat interface, an inbound webhook (great for GitHub Actions or Stripe events), a scheduled cron, an RSS feed, or direct REST API calls via their SDK. The webhook trigger alone unlocks a ton of real-world automation patterns — pipe a Typeform submission into a workflow, have it research the respondent's company, draft a personalized reply, and push to HubSpot, all without a backend.
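The webhook pattern is simple enough to sketch. The helper below is hypothetical, not Sim's API: the /api/webhooks/<id> path and payload shape are my assumptions, so check the actual URL your workflow's webhook trigger generates.

```typescript
// Hypothetical sketch of triggering a workflow via its inbound webhook.
// The /api/webhooks/<id> path is an assumption, not a documented route.
function buildWebhookRequest(baseUrl: string, webhookId: string, payload: object) {
  return {
    url: `${baseUrl}/api/webhooks/${webhookId}`,
    init: {
      method: 'POST',
      headers: { 'Content-Type': 'application/json' },
      body: JSON.stringify(payload),
    },
  };
}

// e.g. forward a Typeform submission from any backend or edge function:
const req = buildWebhookRequest('https://your-sim-instance.com', 'whk_123', {
  email: 'respondent@example.com',
  company: 'Acme Corp',
});
// fetch(req.url, req.init);  // uncomment to actually send
```

The point is that the calling side needs nothing beyond an HTTP POST; all the research-and-reply logic lives in the workflow.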
Blocks are the nodes you connect on the canvas. Sim ships with a rich set out of the box: Agent (an LLM call with tool access), API (raw HTTP), Condition (branching logic), Evaluator (scoring/grading a prior step's output), Function (custom JavaScript), Guardrails (content filtering), Human in the Loop (pause and wait for approval), Loop, Parallel, Router (LLM-decided routing), Response, Variables, Wait, Webhook, and the ability to embed entire sub-workflows as reusable blocks. This last one — the Workflow block — is underrated. It means you can build modular agent components and compose them like functions.
Tools are what Agent blocks can call. And here Sim gets genuinely impressive in scope. The tool catalog covers over 100 integrations: the usual suspects like Gmail, Slack, Notion, GitHub, Jira, HubSpot, Salesforce, and Stripe are there, but so are specialized tools like Browser Use (browser automation), Stagehand, Firecrawl, Exa, Perplexity, Tavily for web research, ElevenLabs for TTS, MistralParser for document extraction, and vector databases like Pinecone, Qdrant, and Supabase. You can also connect any MCP (Model Context Protocol) server, which opens the door to custom tool servers you control.
Model support is provider-agnostic in the best way. You can target OpenAI (GPT-4o, o3, o1), Anthropic (Claude 3.5 Sonnet, Claude 3 Opus), Google Gemini (2.0 Flash, 1.5 Pro), Groq, Cerebras, Mistral, DeepSeek, and local models via Ollama or vLLM. Switching models is a dropdown. Comparing how the same workflow behaves with Claude versus GPT-4o is a two-minute experiment rather than a refactoring exercise.
The self-hosted path is genuinely straightforward. The fastest approach for kicking the tires is their NPX package:
npx simstudio
That's it, as long as Docker is already running: it spins up a local instance at http://localhost:3000. For production self-hosting, Docker Compose is the intended path:
git clone https://github.com/simstudioai/sim.git
cd sim
docker compose -f docker-compose.prod.yml up -d
If you want to use local models with Ollama and you have a GPU available:
docker compose -f docker-compose.ollama.yml --profile setup up -d
This automatically pulls gemma3:4b to get you started. You can add any Ollama-compatible model afterward:
docker compose -f docker-compose.ollama.yml exec ollama ollama pull llama3.1:8b
For engineers who want a proper development environment, the manual setup requires Bun (not Node), PostgreSQL 12+ with the pgvector extension (required for the knowledge base and semantic search features), and a handful of environment variables. The full .env is documented in apps/sim/.env.example, but the required minimum looks like this (generate the secret values up front with openssl rand -hex 32 and paste the results in, since .env loaders read values literally and won't evaluate shell substitutions):
DATABASE_URL="postgresql://postgres:your_password@localhost:5432/simstudio"
BETTER_AUTH_SECRET="$(openssl rand -hex 32)"
BETTER_AUTH_URL="http://localhost:3000"
NEXT_PUBLIC_APP_URL="http://localhost:3000"
ENCRYPTION_KEY="$(openssl rand -hex 32)"
After that, run migrations and start the dev servers:
# From project root
bun install
cd apps/sim && bunx drizzle-kit migrate --config=./drizzle.config.ts
cd ../..
bun run dev:full
The dev:full command starts both the Next.js app and the Socket.io realtime server simultaneously. The realtime server is what powers the live execution view — you watch each block light up as the workflow runs, see intermediate outputs, and catch errors at the exact node they occur.
For engineers who want to understand what they're running, Sim's stack is worth examining because it's well-chosen and not over-engineered. The framework is Next.js 15 with the App Router. The runtime is Bun for performance. ORM is Drizzle against PostgreSQL, with pgvector for embeddings. Auth is handled by Better Auth. The canvas is built on ReactFlow. State management is Zustand. Real-time collaboration uses Socket.io. Background job scheduling (for cron-triggered workflows) runs through Trigger.dev. Remote code execution for the Function block runs through E2B's sandboxed environments, which is a smart choice — it means user-authored JavaScript runs in an isolated container, not your server.
The monorepo is organized with Turborepo, and the language breakdown is 68% TypeScript, 31% MDX (docs), with a sliver of Python and CSS. This is a codebase you can actually navigate as a TypeScript engineer.
One thing I appreciated: the CLAUDE.md file in the repo root (the rules file consumed by Claude Code and similar AI coding assistants) tells you a lot about how they build. It documents code conventions, naming patterns, and architecture constraints. The fact that it exists and is maintained says something about the team's engineering culture.
Copilot is Sim's AI assistant for building workflows. I was skeptical before I used it. AI-assisted UI manipulation usually means autocomplete that gets in the way. This is different.
In "Agent mode," Copilot can directly manipulate the canvas. You describe what you want in natural language — "add a loop that checks the API response for an error code and retries up to three times with exponential backoff" — and it proposes the structural change, wires the connections, and configures the block properties. You review and accept or reject. It's closer to pair programming than autocomplete.
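For a sense of what that prompt is asking for, here's a hand-written sketch of retry with exponential backoff, the kind of logic you could also drop into a Function block yourself. The names and structure are mine, not Copilot's output.

```typescript
// Minimal retry-with-exponential-backoff sketch (my names, not Sim's
// generated output). Delays double each attempt: 500ms, 1000ms, 2000ms...
async function retryWithBackoff<T>(
  fn: () => Promise<T>,
  maxAttempts = 3,
  baseDelayMs = 500,
  sleep: (ms: number) => Promise<void> = (ms) => new Promise<void>((r) => setTimeout(r, ms)),
): Promise<T> {
  let lastError: unknown;
  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    try {
      return await fn();
    } catch (err) {
      lastError = err;
      if (attempt < maxAttempts - 1) {
        await sleep(baseDelayMs * 2 ** attempt);
      }
    }
  }
  throw lastError;
}
```

The sleep function is injectable so the delay can be stubbed out in tests; the same shape works whether fn is an API block retry or an LLM call.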
The @ symbol lets you reference context inline: @workflow, @block, @logs, @knowledge. When debugging a run that failed on the Router block, I typed "why did @RouterBlock send traffic to the wrong branch?" and Copilot pulled the execution log and gave me a coherent explanation. It doesn't always get things right on the first pass — complex routing logic sometimes needed a revision — but the feedback loop is fast.
Copilot runs as a managed service even on self-hosted instances. You generate a Copilot API key from sim.ai's settings and set COPILOT_API_KEY in your environment. This is the one meaningful "call home" in the self-hosted version.
Sim exposes a REST API for triggering workflows externally. The Python and TypeScript SDKs wrap this cleanly. A basic execution call from TypeScript looks like:
import { SimClient } from '@simstudio/sdk';

const client = new SimClient({
  apiKey: process.env.SIM_API_KEY,
  baseUrl: 'https://your-sim-instance.com', // or https://api.sim.ai
});

const result = await client.workflows.execute({
  workflowId: 'wf_abc123',
  input: {
    query: 'Summarize the latest quarterly earnings for NVDA',
    outputFormat: 'markdown',
  },
});

console.log(result.output);
Workflows can also be deployed as MCP servers, which means other AI tools (Cursor, Claude Desktop, any MCP-compatible client) can call your Sim workflows as tools. This is a genuinely interesting composability story — build a complex research pipeline in Sim's visual editor, expose it as an MCP endpoint, and have your IDE call it via Cursor as part of a code review workflow.
Sim has a built-in knowledge base that lets you upload documents, index them with OpenAI embeddings into pgvector, and give Agent blocks access to them via a semantic retrieval tool. The implementation is pragmatic: upload PDFs or text, tag them, and reference them with the Knowledge tool inside any agent. Filtering by tag lets you scope what a given agent can see — useful when you have multiple agents in a workflow that should only access domain-specific data.
This isn't a replacement for a full-featured RAG pipeline with chunking strategies, hybrid search, or re-ranking. But for 80% of use cases — "let this agent answer questions grounded in this document set" — it works without requiring you to stand up a separate vector DB and embedding pipeline.
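Conceptually, tag-scoped retrieval is just a filter applied before the similarity ranking. The sketch below is illustrative only; it is not Sim's implementation, which runs through pgvector in Postgres.

```typescript
// Illustrative in-memory version of tag-scoped semantic retrieval:
// filter by tag first, then rank the survivors by embedding similarity.
type Doc = { id: string; tags: string[]; embedding: number[] };

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

function retrieve(docs: Doc[], query: number[], tag: string, k = 3): Doc[] {
  return docs
    .filter((d) => d.tags.includes(tag))               // scope by tag
    .map((d) => ({ d, score: cosine(d.embedding, query) }))
    .sort((a, b) => b.score - a.score)                 // best match first
    .slice(0, k)
    .map((x) => x.d);
}
```

In production this filter-then-rank typically happens inside Postgres as a WHERE clause plus a pgvector distance ordering; the in-memory version just makes the semantics visible.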
I want to be direct about the limitations because they're real.
The version number (v0.5.29) tells the truth: this is a fast-moving project that hasn't hit a stable API contract yet. Block configurations and schema formats have changed between minor versions. If you self-host and auto-update, verify that migrations ran correctly before assuming your existing workflows still execute as expected.
Copilot requires an internet call to sim.ai's servers even in self-hosted mode. If you're operating in an air-gapped environment or have strict data residency requirements, Copilot is effectively unavailable. The visual builder itself works offline, but you lose the AI assistance layer.
Complex workflow debugging can still be frustrating. When an Agent block fails mid-chain because the model returned an unexpected JSON shape, the error surface isn't always as precise as you'd want. You're still doing some console-log-style debugging via the execution log viewer. The Human in the Loop block helps with validating intermediate outputs, but it adds manual overhead during development.
The parallel execution model has nuances. Running the Parallel block with many simultaneous branches burns through rate limits on your LLM provider quickly, and Sim doesn't currently have built-in rate limit management per provider. You'll want to handle this at the workflow design level or through the Wait block.
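Until per-provider rate limiting lands, you can cap fan-out yourself at the design level. Here's a minimal concurrency limiter (my sketch, not a Sim feature) of the kind you'd reach for when orchestrating calls from a Function block or from the SDK side:

```typescript
// My sketch, not a Sim feature: run tasks with at most `limit` in
// flight at once, preserving result order by index.
async function mapWithConcurrency<T, R>(
  items: T[],
  limit: number,
  fn: (item: T) => Promise<R>,
): Promise<R[]> {
  const results: R[] = new Array(items.length);
  let next = 0;
  async function worker() {
    while (next < items.length) {
      const i = next++; // safe: JS is single-threaded between awaits
      results[i] = await fn(items[i]);
    }
  }
  // Spawn at most `limit` workers that pull from a shared queue.
  await Promise.all(Array.from({ length: Math.min(limit, items.length) }, worker));
  return results;
}
```

With the limit set below your provider's requests-per-minute ceiling, a wide fan-out degrades into a steady trickle instead of a burst of 429s.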
Finally, Copilot is genuinely useful but not a workflow designer replacement. I'd describe it as a skilled assistant who knows the Sim API but doesn't know your business domain. The more context you give it about what the workflow is supposed to accomplish, the better it performs. Treating it like a magic "build this for me" button leads to frustration.
Here's the thesis I keep coming back to: the current model for building AI pipelines — write Python, use LangGraph or CrewAI, deploy to a custom service, maintain your own logging, reinvent observability — is unsustainable at scale across organizations. It works fine for AI teams at AI companies. It's a barrier for every other engineering team that wants to ship AI-powered features.
Sim is part of a wave of tools (alongside n8n AI, Flowise, Dify, and others) that are making a bet that the interface for building AI pipelines should be visual and composable, not purely programmatic. The difference from the previous generation of no-code automation tools (Zapier, Make) is the explicit first-class support for non-deterministic, reasoning-based steps — the things that make AI pipelines categorically different from simple API chains.
The open-source Apache 2.0 license is a strong signal. It's not "open core with a moat," it's genuinely open. The self-hosting path is documented and functional. The Helm chart and Kubernetes deployment support suggest they're serious about enterprise deployments. The SOC2 compliance on the cloud version says something about who they're selling to.
Whether visual workflow builders become the dominant interface for AI pipelines or get absorbed into IDE-native AI coding experiences (a la Cursor agents) is an open question. My intuition is that both patterns persist for different teams and different problem shapes. But I'm confident that tools like Sim will matter for the large class of organizations who need AI automation and don't have dedicated ML engineering teams to build it from scratch.
If you're an engineer who regularly builds multi-step AI pipelines and spends meaningful time on orchestration boilerplate rather than the AI logic itself, Sim will feel like relief. The canvas makes it easier to reason about complex flow graphs than staring at nested function calls. The integrated logging and observability cut debugging time.
If you're a technical product manager or solutions architect who needs to prototype and deploy AI workflows without a full engineering sprint, the cloud version at sim.ai is one of the faster paths to a working demo.
If you're running an engineering team that wants to move AI workflow iteration out of the "write a PR, review a PR, deploy" loop and into something more interactive, the combination of Sim's visual editor, Copilot, and the REST API for programmatic integration gives you a reasonable story.
If you need air-gapped operation, have strict data policies that preclude even the Copilot API call to sim.ai's servers, or are building pipelines complex enough that you've outgrown visual representations, you'll run into the ceiling faster.
The fastest way to form your own opinion: npx simstudio, open http://localhost:3000, and try to recreate the last AI pipeline you built manually. My bet is you'll be surprised how far you get before you need to write any code.
Quick reference:
- npx simstudio to spin up a local instance at http://localhost:3000
- git clone https://github.com/simstudioai/sim.git && cd sim && docker compose -f docker-compose.prod.yml up -d for production self-hosting
- docker-compose.ollama.yml with --profile setup for local models
- Copy .env.example, run bunx drizzle-kit migrate, then bun run dev:full for a development environment
- Required env vars: DATABASE_URL, BETTER_AUTH_SECRET, BETTER_AUTH_URL, NEXT_PUBLIC_APP_URL, ENCRYPTION_KEY
- COPILOT_API_KEY enables Copilot on self-hosted instances