sherma

API Reference

All public symbols are exported from the sherma package.

import sherma

Entities

EntityBase

Base class for all registry entities.

class EntityBase(BaseModel):
    id: str
    version: str = "*"
    tenant_id: str = DEFAULT_TENANT_ID  # "default"

Prompt

class Prompt(EntityBase):
    instructions: str

LLM

class LLM(EntityBase):
    model_name: str

Tool

class Tool(EntityBase):
    function: Callable

Skill

class Skill(EntityBase):
    front_matter: SkillFrontMatter
    body: Markdown = ""
    scripts: list[Tool] = []
    references: list[Markdown] = []
    assets: list[Any] = []

SkillFrontMatter

class SkillFrontMatter(BaseModel):
    name: str
    description: str
    license: str | None = None
    compatibility: str | None = None
    metadata: dict[str, Any] | None = None
    allowed_tools: list[str] | None = None

SkillCard

class SkillCard(EntityBase):
    name: str
    description: str
    base_uri: str
    files: list[str] = []
    mcps: dict[str, MCPServerDef] = {}
    local_tools: dict[str, LocalToolDef] = {}
    extensions: dict[str, Any] = {}

MCPServerDef

class MCPServerDef(BaseModel):
    id: str
    version: str = "*"
    url: str
    transport: str  # "stdio" | "sse" | "streamable-http"

LocalToolDef

class LocalToolDef(BaseModel):
    id: str
    version: str = "*"
    import_path: str

Agents

Agent

Abstract base class for all agents.

class Agent(EntityBase, ABC):
    agent_card: AgentCard | None = None
    input_schema: type[BaseModel] | None = None
    output_schema: type[BaseModel] | None = None

    def send_message(self, request: Message, ...) -> AsyncIterator[UpdateEvent | Message | Task]
    async def cancel_task(self, request: TaskIdParams, ...) -> Task
    async def get_card(self) -> AgentCard | None

LocalAgent

Agent running in the same process.

RemoteAgent

Proxy for an A2A-compatible remote agent.

LangGraphAgent

Agent backed by a LangGraph compiled state graph.

class LangGraphAgent(Agent):
    hook_manager: HookManager

    def register_hooks(self, executor: HookExecutor) -> None
    async def get_graph(self) -> CompiledStateGraph  # abstract

send_message and cancel_task are auto-implemented. You only implement get_graph().

DeclarativeAgent

Agent defined by YAML and CEL. Extends LangGraphAgent.

class DeclarativeAgent(LangGraphAgent):
    yaml_path: str | Path | None = None
    yaml_content: str | None = None
    config: DeclarativeConfig | None = None
    base_path: Path | None = None
    http_async_client: Any | None = None
    hooks: list[HookExecutor] = []
    tenant_id: str = DEFAULT_TENANT_ID  # "default"
    checkpointer: BaseCheckpointSaver = MemorySaver()  # State persistence

Provide one of yaml_path, yaml_content, or config.

When yaml_path is provided, base_path is automatically derived from the YAML file’s parent directory. When using yaml_content or config, set base_path explicitly to resolve relative file paths (skill card paths, sub-agent YAML paths).

Registries

RegistryEntry

class RegistryEntry(BaseModel, Generic[T]):
    id: str
    version: str = "*"
    tenant_id: str = DEFAULT_TENANT_ID  # "default"
    remote: bool = False
    instance: T | None = None
    factory: Callable[[], T | Awaitable[T]] | None = None
    url: str | None = None
    protocol: Protocol | None = None

Registry

Abstract base class for all registries.

class Registry(ABC, Generic[T]):
    async def add(entry: RegistryEntry[T]) -> None
    async def update(entry: RegistryEntry[T]) -> None
    async def get(entity_id: str, version: str = "*") -> T
    async def fetch(entry: RegistryEntry[T]) -> T  # abstract
    async def refresh(entry: RegistryEntry[T]) -> None

Typed Registries

RegistryBundle

Container for all per-tenant registry instances.

class RegistryBundle(BaseModel):
    tenant_id: str = DEFAULT_TENANT_ID
    tool_registry: ToolRegistry
    llm_registry: LLMRegistry
    prompt_registry: PromptRegistry
    skill_registry: SkillRegistry
    agent_registry: AgentRegistry
    skill_card_registry: SkillCardRegistry
    chat_models: dict[str, Any]

TenantRegistryManager

Manages per-tenant singleton RegistryBundle instances.

class TenantRegistryManager:
    def get_bundle(self, tenant_id: str = DEFAULT_TENANT_ID) -> RegistryBundle
    def has_tenant(self, tenant_id: str) -> bool
    def list_tenants(self) -> list[str]
    def remove_tenant(self, tenant_id: str) -> None

DEFAULT_TENANT_ID

DEFAULT_TENANT_ID = "default"

The default tenant ID used when no tenant is explicitly specified.

Hooks

HookExecutor

Protocol (interface) for hook executors. All methods are async and return Context | None.

BaseHookExecutor

Default implementation with all hooks returning None. Subclass and override what you need.

RemoteHookExecutor

Hook executor that delegates to a remote JSON-RPC 2.0 server. Implements the HookExecutor protocol.

class RemoteHookExecutor(BaseHookExecutor):
    def __init__(self, url: str, timeout: float = 30.0) -> None

The on_chat_model_create hook is a no-op (cannot return Python objects over JSON-RPC). On any network or protocol error, logs a warning and passes through.

HookHandler

Base class for implementing remote hook servers. The remote equivalent of BaseHookExecutor – works with plain dicts instead of dataclasses.

class HookHandler:
    async def before_llm_call(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def after_llm_call(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def before_tool_call(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def after_tool_call(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def before_agent_call(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def after_agent_call(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def before_skill_load(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def after_skill_load(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def node_enter(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def node_exit(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def before_interrupt(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def after_interrupt(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def before_graph_invoke(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def after_graph_invoke(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def on_node_error(self, params: dict[str, Any]) -> dict[str, Any] | None
    async def on_error(self, params: dict[str, Any]) -> dict[str, Any] | None

All methods return None by default. on_chat_model_create is intentionally absent.

HookFastAPIApplication

Builds a FastAPI application from a HookHandler.

class HookFastAPIApplication:
    def __init__(self, handler: HookHandler) -> None
    def build(self, rpc_url: str = "/hooks", **kwargs) -> FastAPI
    def add_routes_to_app(self, app: FastAPI, rpc_url: str = "/hooks") -> None

HookStarletteApplication

Builds a Starlette application from a HookHandler.

class HookStarletteApplication:
    def __init__(self, handler: HookHandler) -> None
    def build(self, rpc_url: str = "/hooks", **kwargs) -> Starlette
    def add_routes_to_app(self, app: Starlette, rpc_url: str = "/hooks") -> None
    def routes(self, rpc_url: str = "/hooks") -> list[Route]

HookManager

class HookManager:
    def register(self, executor: HookExecutor) -> None
    async def run_hook(self, hook_name: str, ctx: T) -> T

HookType

class HookType(Enum):
    BEFORE_LLM_CALL = "before_llm_call"
    AFTER_LLM_CALL = "after_llm_call"
    BEFORE_TOOL_CALL = "before_tool_call"
    AFTER_TOOL_CALL = "after_tool_call"
    BEFORE_AGENT_CALL = "before_agent_call"
    AFTER_AGENT_CALL = "after_agent_call"
    BEFORE_SKILL_LOAD = "before_skill_load"
    AFTER_SKILL_LOAD = "after_skill_load"
    NODE_ENTER = "node_enter"
    NODE_EXIT = "node_exit"
    BEFORE_INTERRUPT = "before_interrupt"
    AFTER_INTERRUPT = "after_interrupt"
    ON_CHAT_MODEL_CREATE = "on_chat_model_create"
    BEFORE_GRAPH_INVOKE = "before_graph_invoke"
    AFTER_GRAPH_INVOKE = "after_graph_invoke"
    ON_NODE_ERROR = "on_node_error"
    ON_ERROR = "on_error"

Hook Context Types

The hook context types are imported from sherma.hooks.types.

Declarative Config

DeclarativeConfig

Top-level YAML schema model.

class DeclarativeConfig(BaseModel):
    manifest_version: int             # Required: schema version (currently 1)
    agents: dict[str, AgentDef] = {}
    llms: list[LLMDef] = []
    tools: list[ToolDef] = []
    prompts: list[PromptDef] = []
    skills: list[SkillDef] = []
    hooks: list[HookDef] = []
    sub_agents: list[SubAgentDef] = []
    default_llm: RegistryRef | None = None
    checkpointer: CheckpointerDef | None = None

manifest_version is a required integer that tracks which version of the declarative agent schema the config uses. The current version is 1. This enables the runtime to handle configs with different schema versions.

default_llm is an optional RegistryRef that call_llm nodes inherit when they omit the step-level llm field. A step-level llm always takes precedence.

HookDef

class HookDef(BaseModel):
    import_path: str | None = None  # Local Python hook executor
    url: str | None = None          # Remote JSON-RPC hook server

Exactly one of import_path or url must be provided.

CheckpointerDef

class CheckpointerDef(BaseModel):
    type: Literal["memory"] = "memory"

load_declarative_config

def load_declarative_config(
    yaml_path: str | Path | None = None,
    yaml_content: str | None = None,
) -> DeclarativeConfig

Schema Utilities

Constants

SCHEMA_INPUT_URI = "urn:sherma:schema:input"
SCHEMA_OUTPUT_URI = "urn:sherma:schema:output"

Functions

def validate_data(data: dict, schema_model: type[BaseModel]) -> BaseModel
def schema_to_extension(uri: str, schema_model: type[BaseModel]) -> AgentExtension
def make_schema_data_part(data: dict, schema_uri: str, *, extra_metadata=None) -> Part

def create_agent_input_as_message_part(data, schema_uri, *, role=Role.user, ...) -> Message
def create_agent_output_as_message_part(data, schema_uri, *, role=Role.agent, ...) -> Message
def get_agent_input_from_message_part(message: Message, schema_model) -> BaseModel
def get_agent_output_from_message_part(message: Message, schema_model) -> BaseModel

LangGraph Utilities

combine_ai_messages

from sherma.langgraph.agent import combine_ai_messages

def combine_ai_messages(messages: list[AIMessage]) -> AIMessage

Merges multiple AIMessage instances into one by concatenating their content into a single list. The content collapses to a plain string when the result contains exactly one text block.

LazyChatModel

from sherma.langgraph.declarative.loader import LazyChatModel

proxy = LazyChatModel(factory=lambda: ChatOpenAI(model="gpt-4o"))

A transparent proxy that defers chat model construction until first attribute access. Used internally when on_chat_model_create hooks set chat_model to a callable factory. All attribute access and method calls are forwarded to the real model after construction.

Skill Tools

def create_skill_tools(
    skill_registry: SkillRegistry,
    tool_registry: ToolRegistry,
    hook_manager: HookManager | None = None,
) -> list[BaseTool]

Returns: list_skills, load_skill_md, list_skill_resources, load_skill_resource, list_skill_assets, load_skill_asset.

Types

Markdown = str  # Type alias

class Protocol(StrEnum):
    A2A = "a2a"
    MCP = "mcp"
    CUSTOM = "custom"

class EntityType(StrEnum):
    PROMPT = "prompt"
    LLM = "llm"
    TOOL = "tool"
    SKILL = "skill"
    AGENT = "agent"

Exceptions

All exceptions inherit from ShermaError:

Exception — Description
ShermaError — Base exception
EntityNotFoundError — Entity not in registry
VersionNotFoundError — No matching version found
RegistryError — General registry error
RemoteEntityError — Failed to fetch remote entity
DeclarativeConfigError — Invalid YAML config
GraphConstructionError — Error building graph from config
CelEvaluationError — CEL expression evaluation failed
SchemaValidationError — Input/output schema validation failed