Skip to content

Agenda — Hugging Face Ecosystem

Hugging Face is where the open-source ML community lives. Over 500,000 models, 200,000 datasets, and a growing collection of hosted demo environments — all accessible through a consistent Python SDK. For LLM engineers, the Hub is where you find fine-tuned models for your domain, test them without infrastructure, and publish your own work.

Learning objectives

By the end of this session you will be able to:

  • Navigate the Hub to find, evaluate, and download models
  • Load and run inference with the transformers library
  • Call hosted models via the Inference API without GPU hardware
  • Deploy a demo app to Spaces with Gradio

Schedule

Time Topic File
0:00 – 0:25 Hugging Face Hub — models, datasets, orgs 01-hugging-face-hub
0:25 – 1:05 Transformers library — pipelines, tokenizers, model loading 02-transformers-library
1:05 – 1:35 Inference API — serverless hosted inference 03-inference-api
1:35 – 2:00 Spaces — hosting demos with Gradio 04-spaces
2:00 – 2:30 Datasets and model cards 05-datasets-and-models
2:30 – 3:00 Practice exercises 06-practice-exercises

Setup

pip install transformers datasets huggingface_hub accelerate torch gradio
import os
from huggingface_hub import login

# Authenticate — get your token at https://huggingface.co/settings/tokens
login(token=os.getenv("HF_TOKEN"))

Free tier is enough for this session

The Hugging Face Inference API has a free tier that supports most small models. For larger models (70B+), you need a Pro subscription or self-hosted inference. The exercises in this session use models that run on the free tier.

← Day 04 Part 2 | Start →