Agent Developer Roadmap

Project 1: Build an LLM Playground

LLM Overview and Foundations
Pre-Training

Data collection (manual crawling, Common Crawl)
Data cleaning (RefinedWeb, Dolma, FineWeb)
Tokenization (e.g., BPE)
Architecture (neural networks, Transformers, GPT family, DeepSeek, Qwen, Gemma)
Text generation (greedy and beam search, top-k, top-p) Post-Training
SFT
RL and RLHF (verifiable tasks, reward models, PPO, etc.) Evaluation
Traditional metrics
Task-specific benchmarks
Human evaluation and leaderboards Chatbots’ Overall Design

Project 1

Project 2: Build a Customer Support Chatbot using RAGs and Prompt Engineering

Overview of Adaptation Techniques
Finetuning

Parameter-efficient fine-tuning (PEFT)
Adapters and LoRA Prompt Engineering
Few-shot and zero-shot prompting
Chain-of-thought prompting
Role-specific and user-context prompting RAGs Overview
Retrieval
Document parsing (rule-based, AI-based) and chunking strategies
Indexing (keyword, full-text, knowledge-based, vector-based, embedding models) Generation
Search methods (exact and approximate nearest neighbor)
Prompt engineering for RAGs RAFT: Training technique for RAGs
Evaluation (context relevance, faithfulness, answer correctness)
RAGs’ Overall Design

Project 2

Project 3: Build an “Ask-the-Web” Agent similar to Perplexity with Tool calling

Agents Overview

Agents vs. agentic systems vs. LLMs
Agency levels (e.g., workflows, multi-step agents) Workflows
Prompt chaining
Routing
Parallelization (sectioning, voting)
Reflection
Orchestration-worker Tools
Tool calling
Tool formatting
Tool execution
MCP Multi-Step Agents
Planning autonomy
ReACT
Reflexion, ReWOO, etc.
Tree search for agents Multi-Agent Systems (challenges, use-cases, A2A protocol)
Agent Evaluation

Project 3

Project 4: Build “Deep Research” Capability with Web Search and Reasoning Models

Reasoning and Thinking LLMs

Overview of reasoning models like OpenAI’s “o” family and DeepSeek-R1 Inference-time Techniques
Inference-time scaling
CoT prompting
Parallel sampling
Sequential sampling
Tree of Thoughts (ToT)
Search against a verifier Training-time techniques
SFT on reasoning data (e.g., STaR)
Reinforcement learning with a verifier
Reward modeling (ORM, PRM)
Self-refinement
Internalizing search (e.g., Meta-CoT) Local Deployment

Project 4

Overview of Image and Video Generation

VAE
GANs
Auto-regressive models
Diffusion models Text-to-Image (T2I)
Data preparation
Diffusion architectures (U-Net, DiT)
Diffusion training (forward process, backward process)
Diffusion sampling
Evaluation (image quality, diversity, image-text alignment, IS, FID, and CLIP score) Text-to-Video (T2V)
Latent-diffusion modeling (LDM) and compression networks
Data preparation (filtering, standardization, video latent caching)
DiT architecture for videos
Large-scale training challenges
T2V’s overall system

Project 5

Project 6: Capstone Project

Ship a portfolio-ready AI project from idea to demo

Choose: pick your own idea, or start from a curated list
Build: implement using techniques from the course
Iterate: get real-time feedback from the instructor as you build
Optional Demo: present your project on final demo day

Project 6

wwsun

探索

Agent Developer Roadmap

Project 1: Build an LLM Playground

Project 2: Build a Customer Support Chatbot using RAGs and Prompt Engineering

Project 3: Build an “Ask-the-Web” Agent similar to Perplexity with Tool calling

Project 4: Build “Deep Research” Capability with Web Search and Reasoning Models

Project 6: Capstone Project

关系图谱

目录

最近更新

我的 Agent Skills Workflow

bat 但带有 git 集成和语法高亮.

Claude Code

SKILL.md Frontmatter 完整字段

Skills For Real Engineers

反向链接

wwsun

探索

Agent Developer Roadmap

Project 1: Build an LLM Playground

Project 2: Build a Customer Support Chatbot using RAGs and Prompt Engineering

Project 3: Build an “Ask-the-Web” Agent similar to Perplexity with Tool calling

Project 4: Build “Deep Research” Capability with Web Search and Reasoning Models

Project 5: Build a Multi-modal Generation Agent

Project 6: Capstone Project

关系图谱

目录

最近更新

我的 Agent Skills Workflow

bat 但带有 git 集成和语法高亮.

Claude Code

SKILL.md Frontmatter 完整字段

Skills For Real Engineers

反向链接