How to Do Structured LLM Generation in Flows#

python-icon Download Python Script

Python script/notebook for this guide.

Prompt execution step how-to script

Prerequisites

This guide assumes familiarity with Flows.

WayFlow enables you to leverage LLMs to generate text and structured outputs. This guide will show you how to:

Basic implementation#

In this how-to guide, you will learn how to perform structured LLM generation with Flows.

WayFlow supports several LLM API providers. Select an LLM from the options below:

from wayflowcore.models import OCIGenAIModel, OCIClientConfigWithApiKey

llm = OCIGenAIModel(
    model_id="provider.model-id",
    compartment_id="compartment-id",
    client_config=OCIClientConfigWithApiKey(
        service_endpoint="https://url-to-service-endpoint.com",
    ),
)

Assuming you want to summarize this article:

article = """Sea turtles are ancient reptiles that have been around for over 100 million years. They play crucial roles in marine ecosystems, such as maintaining healthy seagrass beds and coral reefs. Unfortunately, they are under threat due to poaching, habitat loss, and pollution. Conservation efforts worldwide aim to protect nesting sites and reduce bycatch in fishing gear."""

WayFlow offers the PromptExecutionStep for this type of query. Use the code below to generate a 10-word summary:

from wayflowcore.controlconnection import ControlFlowEdge
from wayflowcore.dataconnection import DataFlowEdge
from wayflowcore.flow import Flow
from wayflowcore.property import StringProperty
from wayflowcore.steps import PromptExecutionStep, StartStep

start_step = StartStep(input_descriptors=[StringProperty("article")])
summarize_step = PromptExecutionStep(
    llm=llm,
    prompt_template="""Summarize this article in 10 words:\n {{article}}""",
    output_mapping={PromptExecutionStep.OUTPUT: "summary"},
)
summarize_step_name = "summarize_step"
flow = Flow(
    begin_step=start_step,
    steps={
        "start_step": start_step,
        summarize_step_name: summarize_step,
    },
    control_flow_edges=[
        ControlFlowEdge(source_step=start_step, destination_step=summarize_step),
        ControlFlowEdge(source_step=summarize_step, destination_step=None),
    ],
    data_flow_edges=[
        DataFlowEdge(start_step, "article", summarize_step, "article"),
    ],
)

Note

In the prompt, article is a Jinja2 syntax to specify a placeholder for a variable, which will appear as an input for the step. If you use {{var_name}}, the variable named var_name will be of type StringProperty. If you specify anything else Jinja2 compatible (for loops, filters, and so on), it will be of type AnyProperty.

Now execute the flow:

from wayflowcore.executors.executionstatus import FinishedStatus

conversation = flow.start_conversation(inputs={"article": article})
status = conversation.execute()
if isinstance(status, FinishedStatus):
    print(status.output_values["summary"])
else:
    print(f"Invalid execution status, expected FinishedStatus, received {type(status)}")
# Sea turtles face threats from poaching, habitat loss, and pollution globally.

As expected, your flow has generated the article summary!

Structured generation with Flows#

In many cases, generating raw text within a flow is not very useful, as it is difficult to leverage in later steps. Instead, you might want to generate attributes that follow a particular schema. The PromptExecutionStep class enables this through the output_descriptors parameter.

from wayflowcore.property import ListProperty, StringProperty
from wayflowcore.steps import PromptExecutionStep, StartStep

animal_output = StringProperty(
    name="animal_name",
    description="name of the animal",
    default_value="",
)
danger_level_output = StringProperty(
    name="danger_level",
    description='level of danger of the animal. Can be "HIGH", "MEDIUM" or "LOW"',
    default_value="",
)
threats_output = ListProperty(
    name="threats",
    description="list of threats for the animal",
    item_type=StringProperty("threat"),
    default_value=[],
)


start_step = StartStep(input_descriptors=[StringProperty("article")])
summarize_step = PromptExecutionStep(
    llm=llm,
    prompt_template="""Extract from the following article the name of the animal, its danger level and the threats it's subject to. The article:\n\n {{article}}""",
    output_descriptors=[animal_output, danger_level_output, threats_output],
)
summarize_step_name = "summarize_step"
flow = Flow(
    begin_step=start_step,
    steps={
        "start_step": start_step,
        summarize_step_name: summarize_step,
    },
    control_flow_edges=[
        ControlFlowEdge(source_step=start_step, destination_step=summarize_step),
        ControlFlowEdge(source_step=summarize_step, destination_step=None),
    ],
    data_flow_edges=[
        DataFlowEdge(start_step, "article", summarize_step, "article"),
    ],
)

conversation = flow.start_conversation(inputs={"article": article})
status = conversation.execute()
if isinstance(status, FinishedStatus):
    print(status.output_values)
else:
    print(f"Invalid execution status, expected FinishedStatus, received {type(status)}")
# {'threats': ['poaching', 'habitat loss', 'pollution'], 'danger_level': 'HIGH', 'animal_name': 'Sea turtles'}

Complex JSON objects#

Sometimes, you might need to generate an object that follows a specific JSON Schema. You can do that by using an output descriptor of type ObjectProperty, or directly converting your JSON Schema into a descriptor:

from wayflowcore.property import Property, StringProperty
from wayflowcore.steps import PromptExecutionStep, StartStep

animal_json_schema = {
    "title": "animal_object",
    "description": "information about the animal",
    "type": "object",
    "properties": {
        "animal_name": {
            "type": "string",
            "description": "name of the animal",
            "default": "",
        },
        "danger_level": {
            "type": "string",
            "description": 'level of danger of the animal. Can be "HIGH", "MEDIUM" or "LOW"',
            "default": "",
        },
        "threats": {
            "type": "array",
            "description": "list of threats for the animal",
            "items": {"type": "string"},
            "default": [],
        },
    },
}
animal_descriptor = Property.from_json_schema(animal_json_schema)

start_step = StartStep(input_descriptors=[StringProperty("article")])
summarize_step = PromptExecutionStep(
    llm=llm,
    prompt_template="""Extract from the following article the name of the animal, its danger level and the threats it's subject to. The article:\n\n {{article}}""",
    output_descriptors=[animal_descriptor],
)
summarize_step_name = "summarize_step"
flow = Flow(
    begin_step=start_step,
    steps={
        "start_step": start_step,
        summarize_step_name: summarize_step,
    },
    control_flow_edges=[
        ControlFlowEdge(source_step=start_step, destination_step=summarize_step),
        ControlFlowEdge(source_step=summarize_step, destination_step=None),
    ],
    data_flow_edges=[
        DataFlowEdge(start_step, "article", summarize_step, "article"),
    ],
)

conversation = flow.start_conversation(inputs={"article": article})
status = conversation.execute()
if isinstance(status, FinishedStatus):
    print(status.output_values)
else:
    print(f"Invalid execution status, expected FinishedStatus, received {type(status)}")
# {'animal_object': {'animal_name': 'Sea turtles', 'danger_level': 'MEDIUM', 'threats': ['Poaching', 'Habitat loss', 'Pollution']}}

Structured generation with Agents#

In certain scenarios, you might need an agent to generate well-formatted outputs within your flow. You can instruct the agent to generate them, and use it in the AgentExecutionStep class to perform structured generation.

from wayflowcore.agent import Agent, CallerInputMode
from wayflowcore.controlconnection import ControlFlowEdge
from wayflowcore.steps import AgentExecutionStep, StartStep

start_step = StartStep(input_descriptors=[])
agent = Agent(
    llm=llm,
    custom_instruction="""Extract from the article given by the user the name of the animal, its danger level and the threats it's subject to.""",
    initial_message=None,
    caller_input_mode=CallerInputMode.NEVER,  # <- ensure the agent does not ask the user questions, just produces the expected outputs
    output_descriptors=[animal_output, danger_level_output, threats_output],
)

summarize_agent_step = AgentExecutionStep(agent=agent)
summarize_step_name = "summarize_step"
flow = Flow(
    begin_step=start_step,
    steps={
        "start_step": start_step,
        summarize_step_name: summarize_agent_step,
    },
    control_flow_edges=[
        ControlFlowEdge(source_step=start_step, destination_step=summarize_agent_step),
        ControlFlowEdge(source_step=summarize_agent_step, destination_step=None),
    ],
    data_flow_edges=[],
)

conversation = flow.start_conversation()
conversation.append_user_message("Here is the article: " + article)
status = conversation.execute()
if isinstance(status, FinishedStatus):
    print(status.output_values)
else:
    print(f"Invalid execution status, expected FinishedStatus, received {type(status)}")
# {'animal_name': 'Sea turtles', 'danger_level': 'HIGH', 'threats': ['poaching', 'habitat loss', 'pollution']}

How to write secure prompts with Jinja templating#

Jinja2 is a fast and flexible templating engine for Python, enabling dynamic generation of text-based formats by combining templates with data.

However, enabling all Jinja templating capabilities poses some security challenges. For this reason, WayFlow relies on a stricter implementation of the Jinja2’s SandboxedEnvironment for higher security. Every callable is considered unsafe, and every attribute and item access is prevented, except for:

  • The attributes index0, index, first, last, length of the jinja2.runtime.LoopContext;

  • The entries of a Python dictionary (only native type is accepted);

  • The items of a Python list (only native type is accepted).

You should never write a template that includes a function call, or access to any internal attribute or element of an arbitrary variable: that is considered unsafe, and it will raise a SecurityException.

Moreover, WayFlow performs additional checks on the inputs provided for rendering. In particular, only elements and sub-elements that are of basic Python types (str, int, float, bool, list, dict, tuple, set, NoneType) are accepted. In any other case, a SecurityException is raised.

What you can write#

Here’s a set of common patterns that are accepted by WayFlow’s restricted Jinja templating.

Templates that access variables of base Python types:

my_var: str = "simple string"
template = "{{ my_var }}"
# Expected outcome: "simple string"

Templates that access elements of a list of base Python types:

my_var: list[str] = ["simple string"]
template = "{{ my_var[0] }}"
# Expected outcome: "simple string"

Templates that access dictionary entries of base Python types:

my_var: dict[str, str] = {"k1": "simple string"}
template = "{{ my_var['k1'] }}"
# Expected outcome: "simple string"

my_var: dict[str, str] = {"k1": "simple string"}
template = "{{ my_var.k1 }}"
# Expected outcome: "simple string"

Builtin functions of Jinja, like length or format:

my_var: list[str] = ["simple string"]
template = "{{ my_var | length }}"
# Expected outcome: "1"

Simple expressions:

template = "{{ 7*7 }}"
# Expected outcome: "49"

For loops, optionally accessing the LoopContext:

my_var: list[int] = [1, 2, 3]
template = "{% for e in my_var %}{{e}}{{ ', ' if not loop.last }}{% endfor %}"
# Expected outcome: "1, 2, 3"

If conditions:

my_var: int = 4
template = "{% if my_var % 2 == 0 %}even{% else %}odd{% endif %}"
# Expected outcome: "even"

Our general recommendation is to avoid complex logic in templates, and to pre-process the data you want to render instead. For example, in case of complex objects, in order to comply with restrictions above, you should conveniently transform them recursively into a dictionary of entries of basic Python types (see list of accepted types above).

What you cannot write#

Here’s a set of common patterns that are NOT accepted by WayFlow’s restricted Jinja templating.

Templates that access arbitrary objects:

my_var: MyComplexObject = MyComplexObject()
template = "{{ my_var }}"
# Expected outcome: SecurityException

Templates that access attributes of arbitrary objects:

my_var: MyComplexObject = MyComplexObject(attribute="my string")
template = "{{ my_var.attribute }}"
# Expected outcome: SecurityException

Templates that access internals of any type and object:

my_var: dict = {"k1": "my string"}
template = "{{ my_var.__init__ }}"
# Expected outcome: SecurityException

Templates that access non-existing keys of a dictionary:

my_var: dict = {"k1": "my string"}
template = "{{ my_var['non-existing-key'] }}"
# Expected outcome: SecurityException

Templates that access keys of a dictionary of type different from int or str:

my_var: dict = {("complex", "key"): "my string"}
template = "{{ my_var[('complex', 'key')] }}"
# Expected outcome: SecurityException

Templates that access callables:

my_var: Callable = lambda x: f"my value {x}"
template = "{{ my_var(2) }}"
# Expected outcome: SecurityException

my_var: list = [1, 2, 3]
template = "{{ len(my_var) }}"
# Expected outcome: SecurityException

my_var: MyComplexObject = MyComplexObject()
template = "{{ my_var.to_string() }}"
# Expected outcome: SecurityException

For more information, please check our Security considerations page.

Agent Spec Exporting/Loading#

You can export the assistant configuration to its Agent Spec configuration using the AgentSpecExporter.

from wayflowcore.agentspec import AgentSpecExporter
serialized_assistant = AgentSpecExporter().to_json(flow)

Here is what the Agent Spec representation will look like ↓

Click here to see the assistant configuration.
{
  "component_type": "Flow",
  "id": "e075072d-6aa0-4da4-84bd-dd54ce060ff4",
  "name": "flow_021b3829__auto",
  "description": "",
  "metadata": {
    "__metadata_info__": {}
  },
  "inputs": [],
  "outputs": [
    {
      "description": "name of the animal",
      "type": "string",
      "title": "animal_name",
      "default": ""
    },
    {
      "description": "list of threats for the animal",
      "type": "array",
      "items": {
        "type": "string",
        "title": "threat"
      },
      "title": "threats",
      "default": []
    },
    {
      "description": "level of danger of the animal. Can be \"HIGH\", \"MEDIUM\" or \"LOW\"",
      "type": "string",
      "title": "danger_level",
      "default": ""
    }
  ],
  "start_node": {
    "$component_ref": "5c227b46-f36a-413c-8090-3be75fafbdec"
  },
  "nodes": [
    {
      "$component_ref": "5c227b46-f36a-413c-8090-3be75fafbdec"
    },
    {
      "$component_ref": "4c691ba3-2d4f-4d30-8714-11fe71a731a0"
    },
    {
      "$component_ref": "6bb4d577-8da4-4e05-83da-e16d10dbfa1c"
    }
  ],
  "control_flow_connections": [
    {
      "component_type": "ControlFlowEdge",
      "id": "de3e74f8-9b09-402f-a1ec-b8d5a35d9a3d",
      "name": "start_step_to_summarize_step_control_flow_edge",
      "description": null,
      "metadata": {
        "__metadata_info__": {}
      },
      "from_node": {
        "$component_ref": "5c227b46-f36a-413c-8090-3be75fafbdec"
      },
      "from_branch": null,
      "to_node": {
        "$component_ref": "4c691ba3-2d4f-4d30-8714-11fe71a731a0"
      }
    },
    {
      "component_type": "ControlFlowEdge",
      "id": "47583cac-4599-4e3a-8cab-7edb2cba870f",
      "name": "summarize_step_to_None End node_control_flow_edge",
      "description": null,
      "metadata": {},
      "from_node": {
        "$component_ref": "4c691ba3-2d4f-4d30-8714-11fe71a731a0"
      },
      "from_branch": null,
      "to_node": {
        "$component_ref": "6bb4d577-8da4-4e05-83da-e16d10dbfa1c"
      }
    }
  ],
  "data_flow_connections": [
    {
      "component_type": "DataFlowEdge",
      "id": "3620497d-5f8e-4280-a496-439cc2f32936",
      "name": "summarize_step_animal_name_to_None End node_animal_name_data_flow_edge",
      "description": null,
      "metadata": {},
      "source_node": {
        "$component_ref": "4c691ba3-2d4f-4d30-8714-11fe71a731a0"
      },
      "source_output": "animal_name",
      "destination_node": {
        "$component_ref": "6bb4d577-8da4-4e05-83da-e16d10dbfa1c"
      },
      "destination_input": "animal_name"
    },
    {
      "component_type": "DataFlowEdge",
      "id": "137eb74d-0a75-48c5-8084-99b679567f44",
      "name": "summarize_step_threats_to_None End node_threats_data_flow_edge",
      "description": null,
      "metadata": {},
      "source_node": {
        "$component_ref": "4c691ba3-2d4f-4d30-8714-11fe71a731a0"
      },
      "source_output": "threats",
      "destination_node": {
        "$component_ref": "6bb4d577-8da4-4e05-83da-e16d10dbfa1c"
      },
      "destination_input": "threats"
    },
    {
      "component_type": "DataFlowEdge",
      "id": "90669e70-722c-459c-9f64-cc8ba6fac091",
      "name": "summarize_step_danger_level_to_None End node_danger_level_data_flow_edge",
      "description": null,
      "metadata": {},
      "source_node": {
        "$component_ref": "4c691ba3-2d4f-4d30-8714-11fe71a731a0"
      },
      "source_output": "danger_level",
      "destination_node": {
        "$component_ref": "6bb4d577-8da4-4e05-83da-e16d10dbfa1c"
      },
      "destination_input": "danger_level"
    }
  ],
  "$referenced_components": {
    "4c691ba3-2d4f-4d30-8714-11fe71a731a0": {
      "component_type": "AgentNode",
      "id": "4c691ba3-2d4f-4d30-8714-11fe71a731a0",
      "name": "summarize_step",
      "description": "",
      "metadata": {
        "__metadata_info__": {}
      },
      "inputs": [],
      "outputs": [
        {
          "description": "name of the animal",
          "type": "string",
          "title": "animal_name",
          "default": ""
        },
        {
          "description": "level of danger of the animal. Can be \"HIGH\", \"MEDIUM\" or \"LOW\"",
          "type": "string",
          "title": "danger_level",
          "default": ""
        },
        {
          "description": "list of threats for the animal",
          "type": "array",
          "items": {
            "type": "string",
            "title": "threat"
          },
          "title": "threats",
          "default": []
        }
      ],
      "branches": [
        "next"
      ],
      "agent": {
        "component_type": "Agent",
        "id": "eb2f9b51-e5b2-44e2-91d5-69711665b550",
        "name": "agent_72c8e146__auto",
        "description": "",
        "metadata": {
          "__metadata_info__": {}
        },
        "inputs": [],
        "outputs": [
          {
            "description": "name of the animal",
            "type": "string",
            "title": "animal_name",
            "default": ""
          },
          {
            "description": "level of danger of the animal. Can be \"HIGH\", \"MEDIUM\" or \"LOW\"",
            "type": "string",
            "title": "danger_level",
            "default": ""
          },
          {
            "description": "list of threats for the animal",
            "type": "array",
            "items": {
              "type": "string",
              "title": "threat"
            },
            "title": "threats",
            "default": []
          }
        ],
        "llm_config": {
          "component_type": "VllmConfig",
          "id": "ce3c577c-9e03-44f8-bec2-258bda52789e",
          "name": "llm_8052f2ad__auto",
          "description": null,
          "metadata": {
            "__metadata_info__": {}
          },
          "default_generation_parameters": null,
          "url": "LLAMA_API_URL",
          "model_id": "LLAMA_MODEL_ID"
        },
        "system_prompt": "Extract from the article given by the user the name of the animal, its danger level and the threats it's subject to.",
        "tools": []
      }
    },
    "6bb4d577-8da4-4e05-83da-e16d10dbfa1c": {
      "component_type": "EndNode",
      "id": "6bb4d577-8da4-4e05-83da-e16d10dbfa1c",
      "name": "None End node",
      "description": "End node representing all transitions to None in the WayFlow flow",
      "metadata": {},
      "inputs": [
        {
          "description": "name of the animal",
          "type": "string",
          "title": "animal_name",
          "default": ""
        },
        {
          "description": "list of threats for the animal",
          "type": "array",
          "items": {
            "type": "string",
            "title": "threat"
          },
          "title": "threats",
          "default": []
        },
        {
          "description": "level of danger of the animal. Can be \"HIGH\", \"MEDIUM\" or \"LOW\"",
          "type": "string",
          "title": "danger_level",
          "default": ""
        }
      ],
      "outputs": [
        {
          "description": "name of the animal",
          "type": "string",
          "title": "animal_name",
          "default": ""
        },
        {
          "description": "list of threats for the animal",
          "type": "array",
          "items": {
            "type": "string",
            "title": "threat"
          },
          "title": "threats",
          "default": []
        },
        {
          "description": "level of danger of the animal. Can be \"HIGH\", \"MEDIUM\" or \"LOW\"",
          "type": "string",
          "title": "danger_level",
          "default": ""
        }
      ],
      "branches": [],
      "branch_name": "next"
    },
    "5c227b46-f36a-413c-8090-3be75fafbdec": {
      "component_type": "StartNode",
      "id": "5c227b46-f36a-413c-8090-3be75fafbdec",
      "name": "start_step",
      "description": "",
      "metadata": {
        "__metadata_info__": {}
      },
      "inputs": [],
      "outputs": [],
      "branches": [
        "next"
      ]
    }
  },
  "agentspec_version": "25.4.1"
}

You can then load the configuration back to an assistant using the AgentSpecLoader.

from wayflowcore.agentspec import AgentSpecLoader
new_assistant = AgentSpecLoader().load_json(serialized_assistant)

Recap#

In this guide, you learned how to incorporate LLMs into flows to:

  • generate raw text

  • produce structured output

  • perform structured generation using the agent and AgentExecutionStep

Next steps#

Having learned how to perform structured generation in WayFlow, you may now proceed to:

Full code#

Click on the card at the top of this page to download the full code for this guide or copy the code below.

  1# Copyright © 2025 Oracle and/or its affiliates.
  2#
  3# This software is under the Apache License 2.0
  4# %%[markdown]
  5# Wayflow Code Example - How to Do Structured LLM Generation in Flows
  6# -------------------------------------------------------------------
  7
  8# How to use:
  9# Create a new Python virtual environment and install the latest WayFlow version.
 10# ```bash
 11# python -m venv venv-wayflowcore
 12# source venv-wayflowcore/bin/activate
 13# pip install --upgrade pip
 14# pip install "wayflowcore==26.1" 
 15# ```
 16
 17# You can now run the script
 18# 1. As a Python file:
 19# ```bash
 20# python howto_promptexecutionstep.py
 21# ```
 22# 2. As a Notebook (in VSCode):
 23# When viewing the file,
 24#  - press the keys Ctrl + Enter to run the selected cell
 25#  - or Shift + Enter to run the selected cell and move to the cell below# (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0) or Universal Permissive License
 26# (UPL) 1.0 (LICENSE-UPL or https://oss.oracle.com/licenses/upl), at your option.
 27
 28
 29# %%[markdown]
 30## Define the article
 31
 32# %%
 33article = """Sea turtles are ancient reptiles that have been around for over 100 million years. They play crucial roles in marine ecosystems, such as maintaining healthy seagrass beds and coral reefs. Unfortunately, they are under threat due to poaching, habitat loss, and pollution. Conservation efforts worldwide aim to protect nesting sites and reduce bycatch in fishing gear."""
 34
 35
 36# %%[markdown]
 37## Define the llm
 38
 39# %%
 40from wayflowcore.models import VllmModel
 41
 42llm = VllmModel(
 43    model_id="LLAMA_MODEL_ID",
 44    host_port="LLAMA_API_URL",
 45)
 46
 47# %%[markdown]
 48## Create the flow using the prompt execution step
 49
 50# %%
 51from wayflowcore.controlconnection import ControlFlowEdge
 52from wayflowcore.dataconnection import DataFlowEdge
 53from wayflowcore.flow import Flow
 54from wayflowcore.property import StringProperty
 55from wayflowcore.steps import PromptExecutionStep, StartStep
 56
 57start_step = StartStep(input_descriptors=[StringProperty("article")])
 58summarize_step = PromptExecutionStep(
 59    llm=llm,
 60    prompt_template="""Summarize this article in 10 words:\n {{article}}""",
 61    output_mapping={PromptExecutionStep.OUTPUT: "summary"},
 62)
 63summarize_step_name = "summarize_step"
 64flow = Flow(
 65    begin_step=start_step,
 66    steps={
 67        "start_step": start_step,
 68        summarize_step_name: summarize_step,
 69    },
 70    control_flow_edges=[
 71        ControlFlowEdge(source_step=start_step, destination_step=summarize_step),
 72        ControlFlowEdge(source_step=summarize_step, destination_step=None),
 73    ],
 74    data_flow_edges=[
 75        DataFlowEdge(start_step, "article", summarize_step, "article"),
 76    ],
 77)
 78
 79
 80# %%[markdown]
 81## Run the flow to get the summary
 82
 83# %%
 84from wayflowcore.executors.executionstatus import FinishedStatus
 85
 86conversation = flow.start_conversation(inputs={"article": article})
 87status = conversation.execute()
 88if isinstance(status, FinishedStatus):
 89    print(status.output_values["summary"])
 90else:
 91    print(f"Invalid execution status, expected FinishedStatus, received {type(status)}")
 92# Sea turtles face threats from poaching, habitat loss, and pollution globally.
 93
 94from wayflowcore.controlconnection import ControlFlowEdge
 95from wayflowcore.dataconnection import DataFlowEdge
 96
 97
 98# %%[markdown]
 99## Use structured generation to extract formatted information
100
101# %%
102from wayflowcore.property import ListProperty, StringProperty
103from wayflowcore.steps import PromptExecutionStep, StartStep
104
105animal_output = StringProperty(
106    name="animal_name",
107    description="name of the animal",
108    default_value="",
109)
110danger_level_output = StringProperty(
111    name="danger_level",
112    description='level of danger of the animal. Can be "HIGH", "MEDIUM" or "LOW"',
113    default_value="",
114)
115threats_output = ListProperty(
116    name="threats",
117    description="list of threats for the animal",
118    item_type=StringProperty("threat"),
119    default_value=[],
120)
121
122
123start_step = StartStep(input_descriptors=[StringProperty("article")])
124summarize_step = PromptExecutionStep(
125    llm=llm,
126    prompt_template="""Extract from the following article the name of the animal, its danger level and the threats it's subject to. The article:\n\n {{article}}""",
127    output_descriptors=[animal_output, danger_level_output, threats_output],
128)
129summarize_step_name = "summarize_step"
130flow = Flow(
131    begin_step=start_step,
132    steps={
133        "start_step": start_step,
134        summarize_step_name: summarize_step,
135    },
136    control_flow_edges=[
137        ControlFlowEdge(source_step=start_step, destination_step=summarize_step),
138        ControlFlowEdge(source_step=summarize_step, destination_step=None),
139    ],
140    data_flow_edges=[
141        DataFlowEdge(start_step, "article", summarize_step, "article"),
142    ],
143)
144
145conversation = flow.start_conversation(inputs={"article": article})
146status = conversation.execute()
147if isinstance(status, FinishedStatus):
148    print(status.output_values)
149else:
150    print(f"Invalid execution status, expected FinishedStatus, received {type(status)}")
151# {'threats': ['poaching', 'habitat loss', 'pollution'], 'danger_level': 'HIGH', 'animal_name': 'Sea turtles'}
152
153from wayflowcore.controlconnection import ControlFlowEdge
154from wayflowcore.dataconnection import DataFlowEdge
155
156
157# %%[markdown]
158## Use structured generation with JSON schema
159
160# %%
161from wayflowcore.property import Property, StringProperty
162from wayflowcore.steps import PromptExecutionStep, StartStep
163
164animal_json_schema = {
165    "title": "animal_object",
166    "description": "information about the animal",
167    "type": "object",
168    "properties": {
169        "animal_name": {
170            "type": "string",
171            "description": "name of the animal",
172            "default": "",
173        },
174        "danger_level": {
175            "type": "string",
176            "description": 'level of danger of the animal. Can be "HIGH", "MEDIUM" or "LOW"',
177            "default": "",
178        },
179        "threats": {
180            "type": "array",
181            "description": "list of threats for the animal",
182            "items": {"type": "string"},
183            "default": [],
184        },
185    },
186}
187animal_descriptor = Property.from_json_schema(animal_json_schema)
188
189start_step = StartStep(input_descriptors=[StringProperty("article")])
190summarize_step = PromptExecutionStep(
191    llm=llm,
192    prompt_template="""Extract from the following article the name of the animal, its danger level and the threats it's subject to. The article:\n\n {{article}}""",
193    output_descriptors=[animal_descriptor],
194)
195summarize_step_name = "summarize_step"
196flow = Flow(
197    begin_step=start_step,
198    steps={
199        "start_step": start_step,
200        summarize_step_name: summarize_step,
201    },
202    control_flow_edges=[
203        ControlFlowEdge(source_step=start_step, destination_step=summarize_step),
204        ControlFlowEdge(source_step=summarize_step, destination_step=None),
205    ],
206    data_flow_edges=[
207        DataFlowEdge(start_step, "article", summarize_step, "article"),
208    ],
209)
210
211conversation = flow.start_conversation(inputs={"article": article})
212status = conversation.execute()
213if isinstance(status, FinishedStatus):
214    print(status.output_values)
215else:
216    print(f"Invalid execution status, expected FinishedStatus, received {type(status)}")
217# {'animal_object': {'animal_name': 'Sea turtles', 'danger_level': 'MEDIUM', 'threats': ['Poaching', 'Habitat loss', 'Pollution']}}
218
219
220
221# %%[markdown]
222## Use structured generation with Agents in flows
223
224# %%
225from wayflowcore.agent import Agent, CallerInputMode
226from wayflowcore.controlconnection import ControlFlowEdge
227from wayflowcore.steps import AgentExecutionStep, StartStep
228
229start_step = StartStep(input_descriptors=[])
230agent = Agent(
231    llm=llm,
232    custom_instruction="""Extract from the article given by the user the name of the animal, its danger level and the threats it's subject to.""",
233    initial_message=None,
234    caller_input_mode=CallerInputMode.NEVER,  # <- ensure the agent does not ask the user questions, just produces the expected outputs
235    output_descriptors=[animal_output, danger_level_output, threats_output],
236)
237
238summarize_agent_step = AgentExecutionStep(agent=agent)
239summarize_step_name = "summarize_step"
240flow = Flow(
241    begin_step=start_step,
242    steps={
243        "start_step": start_step,
244        summarize_step_name: summarize_agent_step,
245    },
246    control_flow_edges=[
247        ControlFlowEdge(source_step=start_step, destination_step=summarize_agent_step),
248        ControlFlowEdge(source_step=summarize_agent_step, destination_step=None),
249    ],
250    data_flow_edges=[],
251)
252
253conversation = flow.start_conversation()
254conversation.append_user_message("Here is the article: " + article)
255status = conversation.execute()
256if isinstance(status, FinishedStatus):
257    print(status.output_values)
258else:
259    print(f"Invalid execution status, expected FinishedStatus, received {type(status)}")
260# {'animal_name': 'Sea turtles', 'danger_level': 'HIGH', 'threats': ['poaching', 'habitat loss', 'pollution']}
261
262
263# %%[markdown]
264## Export config to Agent Spec
265
266# %%
267from wayflowcore.agentspec import AgentSpecExporter
268serialized_assistant = AgentSpecExporter().to_json(flow)
269
270# %%[markdown]
271## Load Agent Spec config
272
273# %%
274from wayflowcore.agentspec import AgentSpecLoader
275new_assistant = AgentSpecLoader().load_json(serialized_assistant)