Module I - LLM, Prompt, RAG & Vector DB¶

Tổng quan¶

Module I xây dựng nền tảng toàn diện cho LLM Engineer: từ gọi API đến chạy model local, từ fine-tuning đến xây dựng RAG pipeline hoàn chỉnh có khả năng đánh giá và giám sát.

Lộ trình học¶

graph LR
    A[LLM APIs] --> B[Local LLMs]
    B --> C[Fine-tuning]
    A --> D[Embeddings & Vector DB]
    D --> E[RAG Pipeline]
    E --> F[Agentic RAG]
    F --> G[Evaluation & Guardrails]
    G --> H[Capstone Project]

Danh sách bài học¶

#	Bài	Chủ đề chính
1	LLM APIs Hands-on	Chat completions, Prompt patterns, Structured outputs, Function calling
2	Local LLMs	Attention, Transformer Block, Pre-train/Post-train, Quantization, Ollama
3	Fine-tuning với LoRA/QLoRA	PEFT, LoRA/QLoRA, TRL SFTTrainer, HF Hub
4	Embeddings & Vector Databases	Sentence-transformers, ANN, Chroma/FAISS/Qdrant
5	RAG Pipeline	Ingestion, Retrieval, Generation, Vietnamese PDFs
6	Agentic RAG	Self-RAG, CRAG, Query decomposition, LangGraph
7	LLM Evaluation & Guardrails	RAGAS, LLM-as-judge, Prompt injection, LangSmith
8	Capstone Project	Vietnamese Legal Assistant end-to-end

Framework tư duy: Khi nào dùng gì?¶

graph TD
    Q{Vấn đề của bạn?} --> A[Cần model hiểu ngữ cảnh<br/>từ tài liệu bên ngoài]
    Q --> B[Cần model thực hiện<br/>tác vụ chuyên biệt]
    Q --> C[Cần kiểm soát<br/>chi phí / privacy]
    A --> RAG[→ RAG Pipeline]
    B --> FT[→ Fine-tuning]
    C --> LOCAL[→ Local LLMs]
    RAG --> AGENT[Dữ liệu phức tạp,<br/>multi-hop?]
    AGENT --> ARAG[→ Agentic RAG]