Generate structured data
Guardrails AI is effective for generating structured data across from a variety of LLMs. This guide contains the following:
- General instructions on generating structured data from Guardrails using Pydanticor Markup (i.e.RAIL), and
- Examples to generate structured data using Pydanticor Markup.
Syntax for generating structured data
There are two ways to generate structured data with Guardrails AI: using Pydantic or Markup (i.e. RAIL).
- Pydantic: In order to generate structured data with Pydantic models, create a Pydantic model with the desired fields and types, then create a Guardobject that uses the Pydantic model to generate structured data, and finally call the LLM of your choice with theguardobject to generate structured data.
- RAIL: In order to generate structured data with RAIL specs, create a RAIL spec with the desired fields and types, then create a Guardobject that uses the RAIL spec to generate structured data, and finally call the LLM of your choice with theguardobject to generate structured data.
Below is the syntax for generating structured data with Guardrails AI using Pydantic or Markup (i.e. RAIL).
- Pydantic
- RAIL
In order to generate structured data, first create a Pydantic model with the desired fields and types.
from pydantic import BaseModel
class Person(BaseModel):
    name: str
    age: int
    is_employed: bool
Then, create a Guard object that uses the Pydantic model to generate structured data.
from guardrails import Guard
guard = Guard.for_pydantic(Person)
Finally, call the LLM of your choice with the guard object to generate structured data.
import openai
res = guard(
    openai.chat.completion.create,
    model="gpt-3.5-turbo",
)
In order to generate structured data, first create a RAIL spec with the desired fields and types.
<rail version="0.1">
  <output>
    <string name="name" />
    <integer name="age" />
    <boolean name="is_employed" />
  </output>
</rail>
Then, create a Guard object that uses the RAIL spec to generate structured data.
from guardrails import Guard
guard = Guard.for_rail_string("""
  <rail version="0.1">
    <output>
      <string name="name" />
      <integer name="age" />
      <boolean name="is_employed" />
    </output>
  </rail>
""")
Finally, call the LLM of your choice with the guard object to generate structured data.
import openai
res = guard(
    openai.chat.completion.create,
    model="gpt-3.5-turbo",
)
Generate a JSON object with simple types
- JSON
- Pydantic
- Markup
{
  "name": "John Doe",
  "age": 30,
  "is_employed": true
}
from pydantic import BaseModel
class Person(BaseModel):
    name: str
    age: int
    is_employed: bool
<rail version="0.1">
  <output>
    <string name="name" />
    <integer name="age" />
    <boolean name="is_employed" />
  </output>
</rail>
Generate a dictionary of nested types
- JSON
- Pydantic
- Markup
{
  "name": "John Doe",
  "age": 30,
  "is_employed": true,
  "address": {
    "street": "123 Main St",
    "city": "Anytown",
    "zip": "12345"
  }
}
from pydantic import BaseModel
class Address(BaseModel):
    street: str
    city: str
    zip: str
class Person(BaseModel):
    name: str
    age: int
    is_employed: bool
    address: Address
<rail version="0.1">
  <output>
    <string name="name" />
    <integer name="age" />
    <boolean name="is_employed" />
    <object name="address">
      <string name="street" />
      <string name="city" />
      <string name="zip" />
    </object>
  </output>
</rail>
Generate a list of types
- JSON
- Pydantic
- Markup
[
  {
    "name": "John Doe",
    "age": 30,
    "is_employed": true
  },
  {
    "name": "Jane Smith",
    "age": 25,
    "is_employed": false
  }
]
from pydantic import BaseModel
class Person(BaseModel):
    name: str
    age: int
    is_employed: bool
people = list[Person]
<rail version="0.1">
  <output type="list">
    <object>
      <string name="name" />
      <integer name="age" />
      <boolean name="is_employed" />
    </object>
  </output>
</rail>