Python MIT orchestrationfoundationalecosystem

LangChain

The original agent framework that defined the chains, agents, tools, memory pattern

LangChain is the framework that most developers hit first when they start building with LLMs. Released in late 2022, it introduced the core abstractions that still dominate how people think about agent systems: chains, agents, tools, and memory. In 2026 it sits in an interesting position. The OSS library still serves as the fastest path from zero to a working prototype, and its integration ecosystem is genuinely hard to beat at over 600 connectors. But teams that try to push LangChain into production often find that the abstractions which made it easy to start become the things they fight hardest in month three. LangGraph has taken over for stateful, production-grade workflows, and LangChain itself now positions around rapid prototyping and onboarding. It's still worth knowing. It's not always worth keeping.

If you've been building with LLMs for more than a week, you've probably already touched LangChain. It launched in October 2022 when Harrison Chase open-sourced a library to wrap OpenAI calls with a consistent abstraction layer, and within a year it had become the de facto starting point for anyone building agent applications. The GitHub star count crossed 95,000 and the concept of "chains" spread so far into the ecosystem that frameworks that came later either borrowed the idea or explicitly positioned against it.

That success has a complicated tail. LangChain made it easy to build things, and it made it just as easy to build things that fall apart under load, drift in unexpected directions, or accumulate abstraction debt that takes weeks to untangle. Developers who learned on it have strong opinions. Some still reach for it by default. Others avoid it on principle. Most end up somewhere in the middle, using it where it genuinely helps and replacing it where it doesn't.

This review is for people who want an honest read on where LangChain sits in 2026, not a tour of the API.

Chains, agents, tools, memory

LangChain introduced four abstractions that defined a generation of LLM tooling. Chains are sequences of steps where the output of one feeds the input of the next. Agents pick which tool to call based on the model's reasoning. Tools are functions the agent can invoke. Memory stores conversation history so the model doesn't forget what happened three messages ago.

These abstractions are genuinely useful for getting started. They map onto how people naturally think about agent behavior, and that's not an accident. The problem shows up when your application grows. Chains are great when you need to do A then B then C. They get awkward when you need to do A, then decide between B and C, then retry B if C fails, then wait for a human to approve the result. The abstraction doesn't stretch that far without help.

Memory is the part that ages worst. LangChain's built-in memory classes handle the common cases like conversation buffer and summary memory, but they're essentially black boxes. When something goes wrong with context, and it does, you spend a lot of time digging into internals that weren't designed to be poked at.

The agent abstractions hold up better. Tool calling is core to how LLMs work now, and LangChain's interface for defining tools (decorated Python functions with docstrings) is clean and straightforward. The framework has kept up with how providers like OpenAI and Anthropic expose function calling, and that part of the API is stable.

Massive integration ecosystem

This is where LangChain genuinely pulls ahead of alternatives. The library ships with over 600 integrations: every major LLM provider, dozens of vector stores, document loaders for PDF, web, Notion, Google Drive, GitHub, databases, and a long list of third-party APIs. If you need to pull data from somewhere and feed it to a model, there's a good chance LangChain already has a loader for it.

For prototyping this is a huge time saver. Connecting to Pinecone or Weaviate takes two lines. Switching from OpenAI to Anthropic is a one-line change. Testing the same chain with three different models takes a loop.

The tradeoff is that integration quality varies. The most popular connectors (OpenAI, Anthropic, Chroma, Pinecone) are well-maintained and keep up with provider changes. The long tail of community integrations can lag, and breaking changes in underlying libraries sometimes surface as cryptic errors rather than clear messages. Before relying on a less common integration in production, it's worth reading the source and checking the issue tracker.

The Python package structure also matters here. LangChain split into langchain-core, langchain, and a collection of langchain-community packages over the past couple years. The split improved things (core is stable, integrations can update independently), but it added confusion for developers who started before the split and hit import errors when upgrading.

LCEL and the runnable interface

LangChain Expression Language (LCEL) is the current recommended way to compose pipelines. It uses a pipe operator (|) to chain runnables together, so a basic RAG pipeline reads like retriever | prompt | model | output_parser. The syntax is clean and the composition is genuinely flexible. You can swap any component in the chain without touching the others.

LCEL also handles streaming and async automatically. If you build a chain with LCEL and call .stream(), streaming propagates through every component that supports it. This is a real improvement over the older chain classes where you had to think explicitly about whether each piece handled streaming.

The runnable interface standardizes invoke, stream, batch, and ainvoke across every component. That consistency pays off when you're building something that needs to work synchronously in one context and asynchronously in another.

The honest limitation of LCEL is that it works best for linear or near-linear pipelines. Branching, loops, and stateful workflows quickly outgrow what the pipe syntax can express cleanly. That's not a flaw in the design exactly, it's just the point where LangGraph becomes the right tool instead.

There's also a testing story worth mentioning. Because LCEL chains are composable Python objects, you can unit test individual components in isolation without spinning up a real LLM. That's a meaningful improvement over the older chain classes, where testing often meant mocking internals you weren't supposed to touch. In practice, teams doing rigorous evals still reach for LangSmith's evaluation harness rather than unit tests alone, but LCEL at least doesn't fight you when you try to write tests.

LangSmith for tracing and evals

LangSmith is the commercial product built on top of the LangChain ecosystem, and it's genuinely worth knowing about even if you're skeptical of everything else. It traces every call your agent makes and stores the full input/output at each step. When an agent behaves unexpectedly, you can pull up the run and see exactly what happened at each node: what the model received, what it returned, how long each step took, and what it cost.

The evaluation tooling has gotten serious. You can set up LLM-as-judge evaluators that score your agent outputs on custom criteria and run them automatically on every deployment. Human feedback loops let you flag runs for review and feed corrections back into your eval dataset. The dashboard shows how your agent's scores change over time as you make changes.

LangSmith isn't free at scale. The free tier handles light development usage, but production traffic with hundreds of thousands of traces per month moves you into paid plans quickly. The pricing is reasonable for what you get, and the alternative is building your own tracing infrastructure, which takes real time.

One thing worth knowing: LangSmith works with any framework. You don't need to use LangChain or LangGraph to get value from it. Teams that have migrated away from LangChain's OSS libraries but stayed on LangSmith for observability are not uncommon.

The TypeScript port

LangChain ships a TypeScript version under langchain on npm. It covers the core abstractions and a solid subset of integrations, and for teams building Node.js applications or full-stack projects where the backend runs on TypeScript, it's often the only realistic option besides writing raw API calls.

The honest assessment is that the TypeScript port lags the Python library by a meaningful margin. Features land in Python first. Some integrations only exist in Python. Documentation examples overwhelmingly use Python, which means you'll spend time cross-referencing and adapting code rather than copying it directly. If you have a choice of language, Python will give you fewer surprises.

That said, the TypeScript library is not broken. The core agent loop, tool calling, LCEL, and the most popular integrations like OpenAI, Anthropic, and Chroma are all stable and production-usable. If you're building a Next.js backend and reach for langchain on npm, you'll get something that works. Just don't expect full feature parity.

LangGraph as the successor

LangChain Inc. is clear about how they see the relationship between LangChain and LangGraph. LangChain is for quick starts and standard agents. LangGraph is for workflows that need real control flow. That positioning is accurate and honest.

LangGraph lets you model your workflow as a directed graph where nodes are steps and edges define what runs next. You can have conditional edges, cycles, and checkpoints that persist state between runs. Human-in-the-loop review becomes a first-class feature, not an afterthought bolted onto a chain.

If you're starting a new project in 2026 that needs stateful agents, branching logic, or multi-agent coordination, start with LangGraph, not LangChain. The learning curve is steeper, but you'll spend less time fighting the framework six weeks in.

LangChain still makes sense as the first thing you learn, as a prototyping environment, and for applications that genuinely fit the simpler model. Not everything needs a graph. A customer support bot that retrieves from a knowledge base and calls a few tools can stay on LangChain and be perfectly maintainable. The mistake is trying to force LangChain abstractions onto workflows that have grown beyond what those abstractions handle well.

Who should use LangChain in 2026

LangChain is a good choice when you're prototyping and want to move fast, when you need to connect to an integration that LangGraph or another framework doesn't have yet, or when you're teaching someone the fundamentals of agent design and want clean, readable examples.

It's the wrong choice when you need explicit, auditable control flow for a production system, when you're building something with complex branching or retry logic, or when you need to optimize latency and cost at a level that requires seeing through the abstractions.

One useful way to think about it: LangChain optimizes for the time it takes to get something working. LangGraph optimizes for the time it takes to understand what went wrong. Early in a project, LangChain's tradeoff makes sense. In month four of production, LangGraph's tradeoff usually wins. The team at LangChain Inc. has accepted this and positioned accordingly. They're not pretending LangChain is the answer for everything, and that honesty is actually a mark in its favor.

Compared to LlamaIndex, LangChain is broader but shallower on retrieval. LlamaIndex has invested heavily in indexing strategies and query routing in ways that LangChain doesn't match. If your application is predominantly RAG, evaluate both before committing.

Compared to Pydantic AI, LangChain is less opinionated about type safety and validation but far more mature with more integrations and a larger community. Pydantic AI gives you stricter guarantees at the cost of a smaller ecosystem.

The teams that have the most friction with LangChain are the ones that used it to prototype, then kept all the LangChain code in production when the scope grew. The migration path exists, and LangGraph is the natural destination, but every month of LangChain-in-production in a complex application is a month of compounding abstraction debt.

Start there. Learn it. Know what it's good at. And know when to move.

For teams evaluating the broader coding agent space, see our roundup of top AI agent tools for coding for context on where LangChain-based tools sit relative to purpose-built coding agents.

Key features

Chains, agents, tools, and memory abstractions
600+ third-party integrations out of the box
LCEL runnable interface for composable pipelines
LangSmith tracing and evaluation platform
First-class async and streaming support
Works with every major LLM provider

Frequently Asked Questions

What is LangChain?

LangChain is an open-source Python and TypeScript framework for building LLM-powered applications. It introduced the chains-agents-tools-memory pattern that most agent frameworks still use as a mental model. You connect a language model to tools, give it memory, and define a chain of steps to accomplish a task. The library ships with over 600 integrations covering LLM providers, vector stores, document loaders, and third-party APIs, so you spend less time writing connectors and more time on application logic.

Is LangChain free?

Yes. The core LangChain Python and TypeScript libraries are fully open source under the MIT license and cost nothing to use. LangSmith, the observability and evaluation platform from the same team, is a separate paid product with a free tier for light usage. You can build production applications without ever paying for anything if you self-manage observability.

Should I use LangChain or LangGraph in 2026?

For prototyping or simple pipelines where you just need to chain a few steps together, LangChain is still the faster start. For anything that needs real state management, branching logic, retries, or human-in-the-loop review, use LangGraph instead. The two are complementary rather than competing. Most teams learn on LangChain and graduate to LangGraph once their workflows get complex enough to justify the explicit graph structure. Starting a new production project in 2026 on plain LangChain chains is a choice you'll likely revisit within a few months.

How does LangChain compare to LlamaIndex?

LangChain is a general-purpose agent and chain framework. LlamaIndex is purpose-built for retrieval-augmented generation, with deeper tooling around document ingestion, indexing strategies, and query engines. If your application is primarily RAG, LlamaIndex often gets you to a better result faster. If you need agents that do more than retrieve, LangChain's breadth wins. Many teams use both: LlamaIndex for the retrieval layer and LangChain or LangGraph to orchestrate agents around it.

What is LangSmith?

LangSmith is LangChain's commercial observability and evaluation platform. It traces every call your agent makes, so you can see exactly what prompt went in, what came out, how long it took, and how much it cost. It also provides evaluation tools including LLM-as-judge scoring and human feedback workflows. LangSmith works with any framework, not just LangChain or LangGraph, which makes it genuinely useful even if you've moved away from the rest of the ecosystem. The free tier covers light usage, paid plans start when you need more traces or team features.

Is LangChain still relevant?

Yes, with caveats. LangChain's 100 million monthly downloads tell you that it's still the default starting point for most developers entering the LLM space. The integration library is unmatched. But the honest picture in 2026 is that serious teams building production agents tend to migrate away from LangChain's higher-level abstractions once they need predictable control flow. LangGraph fills that gap. LangChain stays relevant as an entry point, an integration layer, and for projects where a working prototype is the goal, not a hardened production system.