Philosophy
The library is built on four principles:
1. Batch API first.
The Batch API has historically been neglected, even by major production LLM libraries. Our library is designed from the ground up to support the Batch API.
2. Synchronous and Batch interchangeable.
Switching from batch to sync lets you debug batch pipelines and prototype rapidly, without waiting on batch turnaround.
Switching from sync to batch scales your complex workflows (e.g. function calls, structured output, web search, images) to hundreds of thousands of calls; they translate directly to the Batch API.
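The principle above can be sketched as a pipeline written once and executed in either mode. This is a minimal illustration, not the library's real API: the `run` helper and its `mode` argument are assumptions, and `ask_llm` is stubbed out rather than making a real LLM call.

```python
def ask_llm(prompt: str) -> str:
    # Stub standing in for a real LLM call (assumption for illustration).
    return f"summary of: {prompt}"

def summarize(doc: str) -> str:
    # One pipeline step, written once, used in both modes.
    return ask_llm(f"Summarize: {doc}")

def run(pipeline, inputs, mode="sync"):
    # Hypothetical mode switch: the same pipeline runs immediately for
    # debugging, or (in a real implementation) is submitted wholesale
    # to the provider's Batch API for scale.
    if mode == "sync":
        # Immediate calls: easy to step through in a debugger.
        return [pipeline(x) for x in inputs]
    raise NotImplementedError("batch mode sketched only")

results = run(summarize, ["doc A", "doc B"], mode="sync")
```

The point of the sketch is that `summarize` itself never changes; only the execution mode does.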
3. Control flow described by Python code, not data structures.
Python can already express complex conditionals, arbitrary expressions, and branching; why use anything else?
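As a concrete (hypothetical) example of this principle, a retry loop with a conditional is just ordinary Python rather than a graph or state-machine DSL; `ask_llm` here is a stub, not the library's real call:

```python
def ask_llm(prompt: str) -> str:
    # Stub for illustration: refuses prompts containing "forbidden".
    return "REFUSE" if "forbidden" in prompt else "OK: " + prompt

def answer(question: str, max_retries: int = 2) -> str:
    # Plain Python control flow: a loop, a conditional, an early return.
    for attempt in range(max_retries + 1):
        reply = ask_llm(question)
        if reply != "REFUSE":
            return reply
        # Adjust the prompt and retry (placeholder rephrasing).
        question = f"rephrased({question})"
    return "gave up"
```

No special node types or edge declarations are needed to express "retry with a rephrased prompt up to N times".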
4. An "agent" is more than an LLM.
We believe that an agent is a program. It just so happens that the program uses LLM(s) to automate much of its decision making. But the program can also ask functions and the human user. This philosophy is reflected in the ask API (ask_llm, ask_functions, ask_human).
By decoupling the LLM from the agent, we conceptually allow multi-LLM pipelines, message history editing, non-linear message histories, and more.
See the Your first agent and Ask API docs for more information.
Developer Experience
We care deeply about developer experience: from code readability, expressiveness, and conciseness to command-line pretty-printing and import times.
The library is low-level and lightweight, with simple, unobtrusive abstractions.