2-Week GenAI & LLM Engineering Crash Training

KirkYagami/2-Week-GenAI-LLM-Engineering-Crash-Training

Structured Extraction

Extraction is one of the most common production uses of tool calling: take a messy email, ticket, document, or web page and pull out typed, validated fields. This note covers two approaches (raw tool calling and LangChain's `with_structured_output` wrapper) and builds a realistic extraction pipeline.

Learning objectives

- Build Pydantic models for extraction schemas
- Use `with_structured_output` for clean, type-safe extraction
- Handle optional fields, nested objects, and lists
- Implement batch extraction with error recovery
- Validate and post-process extracted data

Pydantic models as extraction schemas

from pydantic import BaseModel, Field
from typing import Optional

class Address(BaseModel):
    street: Optional[str] = None
    city: str
    state: Optional[str] = None
    country: str = "US"

class ContactInfo(BaseModel):
    full_name: str = Field(description="Person's full name")
    email: Optional[str] = Field(default=None, description="Email address")
    phone: Optional[str] = Field(default=None, description="Phone number with country code")
    company: Optional[str] = Field(default=None, description="Employer or organization")
    address: Optional[Address] = Field(default=None, description="Physical address if mentioned")
    role: Optional[str] = Field(default=None, description="Job title or role")

class Invoice(BaseModel):
    vendor: str
    invoice_number: Optional[str] = None
    date: str = Field(description="Invoice date in YYYY-MM-DD format")
    due_date: Optional[str] = Field(default=None, description="Payment due date in YYYY-MM-DD")
    total: float = Field(description="Total amount in USD")
    items: list[str] = Field(default_factory=list, description="List of line item descriptions")
    paid: bool = False
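
The schemas above ask for dates in YYYY-MM-DD through field descriptions, but a `str` field accepts whatever the model returns. One way to enforce the format, sketched here under the assumption of Pydantic v2 (which the `model_json_schema()` call later in this note implies), is to type the fields as `datetime.date` and add a cross-field check. `InvoiceDates` and `try_parse` are hypothetical names used only for illustration.

```python
import datetime as dt
from typing import Optional

from pydantic import BaseModel, ValidationError, model_validator


class InvoiceDates(BaseModel):
    # Pydantic parses ISO "YYYY-MM-DD" strings into date objects and
    # rejects other layouts such as "15/05/2025".
    invoice_date: dt.date
    due_date: Optional[dt.date] = None

    @model_validator(mode="after")
    def due_not_before_invoice(self) -> "InvoiceDates":
        # Cross-field rule (an assumption for this sketch): a due date
        # earlier than the invoice date is treated as an extraction error.
        if self.due_date is not None and self.due_date < self.invoice_date:
            raise ValueError("due_date is earlier than invoice_date")
        return self


def try_parse(**fields) -> Optional[InvoiceDates]:
    """Return a validated model, or None if validation fails."""
    try:
        return InvoiceDates(**fields)
    except ValidationError:
        return None


ok = try_parse(invoice_date="2025-05-15", due_date="2025-06-15")
wrong_format = try_parse(invoice_date="15/05/2025")
wrong_order = try_parse(invoice_date="2025-06-15", due_date="2025-05-15")
```

Keeping the description on the field still helps the model produce the right format; the typed field only catches the cases where it does not.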

`with_structured_output`: the clean approach

import os
from langchain_openai import ChatOpenAI
from langchain_core.prompts import ChatPromptTemplate
from pydantic import BaseModel, Field
from typing import Optional, Literal

llm = ChatOpenAI(model="gpt-4o-mini", temperature=0.0, api_key=os.getenv("OPENAI_API_KEY"))

class SupportTicket(BaseModel):
    """Extracted fields from a customer support ticket."""
    category: Literal["billing", "technical", "account", "shipping", "other"]
    priority: Literal["low", "medium", "high", "urgent"]
    product_mentioned: Optional[str] = Field(default=None, description="Product name if mentioned")
    order_id: Optional[str] = Field(default=None, description="Order or transaction ID if present")
    sentiment: Literal["positive", "neutral", "negative", "very_negative"]
    summary: str = Field(description="One-sentence summary of the issue")
    action_required: bool = Field(description="Whether immediate action is needed")

extractor = llm.with_structured_output(SupportTicket)

prompt = ChatPromptTemplate.from_messages([
    ("system", "Extract structured data from customer support tickets."),
    ("human", "{ticket}")
])

chain = prompt | extractor

tickets = [
    "ORDER-4521: My laptop stopped charging after 2 weeks. I need a replacement ASAP or I'm disputing the charge!",
    "Hi, I just wanted to say your team was super helpful yesterday. Issue resolved, thanks!",
    "Can't log into my account since the password reset yesterday. No email received.",
]

for t in tickets:
    result = chain.invoke({"ticket": t})
    print(f"\nTicket: {t[:60]}...")
    print(f"  Category: {result.category} | Priority: {result.priority}")
    print(f"  Sentiment: {result.sentiment} | Action: {result.action_required}")
    print(f"  Summary: {result.summary}")
    if result.order_id:
        print(f"  Order ID: {result.order_id}")

Batch extraction with error recovery

import os
from openai import OpenAI
from pydantic import BaseModel, ValidationError
from typing import Optional
import json

client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

class ProductMention(BaseModel):
    name: str
    price: Optional[float] = None
    sentiment: str  # "positive", "negative", "neutral"
    features_mentioned: list[str] = []

EXTRACT_TOOL = {
    "type": "function",
    "function": {
        "name": "extract_product_mention",
        "description": "Extract product information from text.",
        "parameters": ProductMention.model_json_schema()  # Generate schema from Pydantic
    }
}

def extract_with_retry(text: str, max_retries: int = 2) -> ProductMention | None:
    messages = [
        {"role": "system", "content": "Extract product mention details from the text."},
        {"role": "user", "content": text}
    ]

    for attempt in range(max_retries + 1):
        try:
            response = client.chat.completions.create(
                model="gpt-4o-mini",
                messages=messages,
                tools=[EXTRACT_TOOL],
                tool_choice={"type": "function", "function": {"name": "extract_product_mention"}},
            )
            tool_call = response.choices[0].message.tool_calls[0]
            raw = json.loads(tool_call.function.arguments)
            return ProductMention(**raw)  # Pydantic validates types

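        # IndexError/TypeError cover a response with an empty or missing tool_calls list
        except (json.JSONDecodeError, ValidationError, KeyError, IndexError, TypeError) as e: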
            if attempt < max_retries:
                messages.append({"role": "user", "content": f"The previous output was malformed: {e}. Try again."})
                continue
            print(f"  Extraction failed after {max_retries + 1} attempts: {e}")
            return None

texts = [
    "The Sony WH-1000XM5 headphones at $349 have incredible noise cancellation. Best purchase I've made.",
    "Bought this $25 cable and it stopped working in a week. Total garbage.",
    "Just upgraded my home office setup today — very happy with the monitor.",
]

print(f"{'Text':<55} {'Product':<25} {'Price':>8} {'Sentiment'}")
print("-" * 100)
for t in texts:
    result = extract_with_retry(t)
    if result:
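        # Compare against None so a genuine price of 0.0 is not shown as N/A
        price = "N/A" if result.price is None else f"{result.price:.2f}"
        print(f"{t[:55]:<55} {result.name[:25]:<25} {price:>8} {result.sentiment}")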
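
The loop above prints successes and skips failed inputs in the table. A batch job usually also needs to know which inputs failed and why, so they can be retried or reviewed later. The following is a minimal sketch, not part of the original pipeline: `BatchReport`, `run_batch`, and `fake_extract` are hypothetical names, and the stub extractor stands in for `extract_with_retry` so the example runs without an API key.

```python
from dataclasses import dataclass, field
from typing import Callable, Generic, Optional, TypeVar

T = TypeVar("T")


@dataclass
class BatchReport(Generic[T]):
    # (input index, extracted value) for every input that succeeded
    succeeded: list[tuple[int, T]] = field(default_factory=list)
    # (input index, reason) for every input that failed
    failed: list[tuple[int, str]] = field(default_factory=list)


def run_batch(texts: list[str], extract: Callable[[str], Optional[T]]) -> BatchReport[T]:
    """Run `extract` over every text; one bad input never aborts the batch."""
    report: BatchReport[T] = BatchReport()
    for index, text in enumerate(texts):
        try:
            result = extract(text)
        except Exception as exc:  # record the failure and keep going
            report.failed.append((index, f"{type(exc).__name__}: {exc}"))
            continue
        if result is None:
            report.failed.append((index, "extractor returned None"))
        else:
            report.succeeded.append((index, result))
    return report


def fake_extract(text: str) -> Optional[dict]:
    """Stub extractor: raises on 'boom', returns None on empty input."""
    if "boom" in text:
        raise ValueError("bad payload")
    if not text.strip():
        return None
    return {"name": text}


report = run_batch(["Sony headphones", "", "boom"], fake_extract)
```

In the real pipeline, `extract_with_retry` could be passed as `extract` unchanged, since it already returns `None` after exhausting its retries.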

Nested extraction: complex schemas

import os
from langchain_openai import ChatOpenAI
from pydantic import BaseModel, Field
from typing import Optional

llm = ChatOpenAI(model="gpt-4o-mini", temperature=0.0, api_key=os.getenv("OPENAI_API_KEY"))

class LineItem(BaseModel):
    description: str
    quantity: int = 1
    unit_price: float
    total: float

class InvoiceExtraction(BaseModel):
    vendor_name: str
    vendor_email: Optional[str] = None
    invoice_number: Optional[str] = None
    invoice_date: str = Field(description="Date in YYYY-MM-DD format")
    due_date: Optional[str] = Field(default=None, description="Due date in YYYY-MM-DD format")
    line_items: list[LineItem] = Field(default_factory=list)
    subtotal: Optional[float] = None
    tax_rate: Optional[float] = Field(default=None, description="Tax rate as decimal, e.g. 0.08 for 8%")
    total_amount: float
    currency: str = "USD"
    payment_status: str = Field(description="paid, pending, or overdue")

extractor = llm.with_structured_output(InvoiceExtraction)

invoice_text = """
INVOICE #INV-2025-0847
From: Acme Supplies Inc. (billing@acme.com)
Date: 2025-05-15 | Due: 2025-06-15

Items:
- Standing Desk (x2) @ $349.00 each = $698.00
- Monitor Arm (x2) @ $89.99 each = $179.98
- Cable Management Kit (x1) @ $24.99 = $24.99

Subtotal: $902.97
Tax (8%): $72.24
TOTAL: $975.21

Status: PENDING
"""

result = extractor.invoke(invoice_text)
print(f"Vendor: {result.vendor_name}")
print(f"Invoice: {result.invoice_number} | Date: {result.invoice_date} | Due: {result.due_date}")
print(f"Items: {len(result.line_items)}")
for item in result.line_items:
    print(f"  - {item.description}: {item.quantity}x ${item.unit_price} = ${item.total}")
print(f"Total: ${result.total_amount} {result.currency} | Status: {result.payment_status}")
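
A schema guarantees types, not arithmetic: the model can return well-typed numbers that do not add up. A post-processing check along the following lines can flag such invoices for review. This is a sketch under stated assumptions: `find_total_mismatches` is a hypothetical helper, `Line` is a stand-in that exposes the same attribute names as `LineItem` so the example runs without an LLM call, and the one-cent tolerance is an arbitrary choice to absorb rounding.

```python
from typing import NamedTuple, Optional, Sequence


class Line(NamedTuple):
    """Stand-in for LineItem with the same attribute names."""
    quantity: int
    unit_price: float
    total: float


def find_total_mismatches(
    lines: Sequence[Line],
    subtotal: Optional[float],
    tax_rate: Optional[float],
    total_amount: float,
    tolerance: float = 0.01,
) -> list[str]:
    """Return human-readable problems; an empty list means the numbers agree."""
    problems = []
    # Each line: quantity * unit_price should match the line total.
    for i, line in enumerate(lines):
        expected = line.quantity * line.unit_price
        if abs(expected - line.total) > tolerance:
            problems.append(f"line {i}: expected {expected:.2f}, got {line.total:.2f}")
    # Line totals should sum to the subtotal, when one was extracted.
    items_sum = sum(line.total for line in lines)
    if subtotal is not None and abs(items_sum - subtotal) > tolerance:
        problems.append(f"subtotal: items sum to {items_sum:.2f}, got {subtotal:.2f}")
    # Subtotal plus tax should match the grand total.
    if subtotal is not None and tax_rate is not None:
        expected_total = subtotal * (1 + tax_rate)
        if abs(expected_total - total_amount) > tolerance:
            problems.append(f"total: expected {expected_total:.2f}, got {total_amount:.2f}")
    return problems


# Figures taken from the sample invoice above.
lines = [Line(2, 349.00, 698.00), Line(2, 89.99, 179.98), Line(1, 24.99, 24.99)]
clean = find_total_mismatches(lines, subtotal=902.97, tax_rate=0.08, total_amount=975.21)
dirty = find_total_mismatches(lines, subtotal=902.97, tax_rate=0.08, total_amount=985.21)
```

Because the function only reads `.quantity`, `.unit_price`, and `.total`, the `line_items` from an `InvoiceExtraction` result could be passed in directly.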

Choosing a schema design approach

| Approach | Best for | Tradeoffs |
|---|---|---|
| `with_structured_output` + Pydantic | LangChain pipelines, clean code | LangChain dependency |
| Raw tool calling + `json.loads` | Minimal dependencies, full control | More boilerplate |
| JSON mode + manual parsing | Simple schemas | No type guarantees |
| `model_json_schema()` from Pydantic | Reuse Pydantic models as tool schemas | Verbose schema output |

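Prefer `with_structured_output` for extraction-only tasks

It eliminates the two-round conversation pattern (no need to send tool results back), handles schema generation from Pydantic automatically, and surfaces failed extractions as exceptions. Depending on the method and LangChain version, that is typically an `OutputParserException` for malformed output or a Pydantic `ValidationError` for output that violates the schema. Passing `include_raw=True` returns the parsing error in the result instead of raising it.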
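
For pipelines that should not raise on a bad extraction, `with_structured_output` accepts `include_raw=True`, in which case the runnable returns a dict with the keys `raw`, `parsed`, and `parsing_error` rather than the model instance. The sketch below shows one way to unpack that dict. `unwrap_structured` is a hypothetical helper, and the two hand-built dicts stand in for real results so the example runs without LangChain or an API key.

```python
from typing import Any, Optional


def unwrap_structured(result: dict[str, Any]) -> tuple[Optional[Any], Optional[str]]:
    """Split an include_raw=True style result into (parsed, error_message)."""
    error = result.get("parsing_error")
    if error is not None:
        return None, f"{type(error).__name__}: {error}"
    parsed = result.get("parsed")
    if parsed is None:
        # No exception, but no structured output either
        return None, "model returned no structured output"
    return parsed, None


# Hand-built stand-ins for what the runnable would return.
good = {"raw": "<AIMessage>", "parsed": {"category": "billing"}, "parsing_error": None}
bad = {"raw": "<AIMessage>", "parsed": None, "parsing_error": ValueError("invalid priority")}

parsed_ok, error_ok = unwrap_structured(good)
parsed_bad, error_bad = unwrap_structured(bad)
```

Keeping the `raw` message around is useful for logging, since it shows exactly what the model produced when parsing fails.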
