Fine-Tuning Large Language Models vs Building Systems with the ChatGPT API
Same Bayesian formula, same rubric — so the difference in scores reflects the difference in the courses, not the difference in how we evaluated them.
DeepLearning.AI · AI & ML Courses
Fine-Tuning Large Language Models
DeepLearning.AI · AI & ML Courses
Building Systems with the ChatGPT API
Per-criterion
The course is structured around five core modules: why fine-tune versus prompt engineering, how to prepare and format training data for instruction-following, full-weight fine-tuning mechanics using the Lamini library, training loop internals (loss curves, learning rates, batch sizes), and evaluation of fine-tuned model outputs. For a one-hour short course it is remarkably focused — Sharon Zhou stays disciplined about scope and the conceptual framing of when fine-tuning is the right tool is praised across reviews as the most practically useful part. The recurring mark-down is that the course covers only full-weight fine-tuning and does not address parameter-efficient methods (LoRA, QLoRA, adapters) that dominate practical fine-tuning work in 2025-2026, when GPU cost and accessibility are real constraints for most learners. Reviewers also note that the Lamini-specific API means some of what is taught does not transfer directly to HuggingFace Transformers workflows without re-reading documentation.
Sharon Zhou is the co-founder and CEO of Lamini AI and a Stanford adjunct instructor who has taught machine learning at the university level. Reviewers across Class Central, blogs, and the DeepLearning.AI forum consistently single out her clarity and authoritative delivery as the course's defining strength — she explains technical concepts like gradient updates, loss functions, and the distinction between pre-training, fine-tuning, and RLHF with enough precision for practitioners while keeping the pace accessible to learners with a basic ML background. The criticism directed at instruction is almost always actually criticism of the Lamini dependency rather than of Zhou's teaching itself, which reviewers separate clearly.
The course is free on the DeepLearning.AI platform with all notebooks runnable in-browser using a provided Lamini API key — no local GPU, no cloud compute bill, and no subscription required. For roughly one hour of instruction from a practitioner who helped build a fine-tuning platform, the price-to-value ratio is high by any comparison. The only cost caveat is that learners who want to run the notebooks outside the sandbox need their own Lamini API credits or must re-implement the training loops against HuggingFace Transformers — neither is expensive, but both require additional setup work the course does not walk you through.
The in-browser notebook environment removes all setup friction for the duration of the course, which reviewers describe as genuinely useful — you are fine-tuning a real LLM within minutes of starting. Outside the sandbox, support shows its limits. The DeepLearning.AI community forum contains threads where learners ask how to replicate the Lamini training loop against HuggingFace Transformers or open-source alternatives, and community responses are helpful but unofficial. There is no teaching assistant response mechanism, no office hours, and DeepLearning.AI does not update short courses at a pace that keeps them current with rapidly evolving tooling. Learners asking about LoRA or QLoRA integration find the forum useful but the course itself silent.
The conceptual content — understanding when fine-tuning beats prompt engineering, how to format instruction data, what the loss curve tells you, and how to evaluate whether the fine-tuned model is better — transfers directly to real work regardless of which library you use. Several practitioner reviewers note that the course gave them the mental model they needed to approach fine-tuning projects confidently. The applicability ceiling is the Lamini dependency and the absence of parameter-efficient methods. Full-weight fine-tuning of a base LLM requires GPU resources that most practitioners do not run locally, and the industry has largely moved to LoRA and QLoRA for cost-effective fine-tuning. A learner who finishes this course and tries to apply the skills immediately in a typical cloud ML environment will find a gap between what was taught and what the tools they are most likely to use expect.
At no cost with in-browser compute provided, the course delivers a credible conceptual foundation for fine-tuning from one of the field's genuine practitioners. The value is real — reviewers describe it as the clearest available explanation of why and how to fine-tune, which is a question most AI practitioners eventually face. The value ceiling is that a learner who wants to move from conceptual understanding to hands-on practice in their own environment will need to supplement with HuggingFace documentation, LoRA tutorials, and compute resources not covered here.
Every lesson is paired with a Jupyter notebook, and the course's running example is fine-tuning a base language model on a custom dataset to produce a model that follows instructions in a particular style. Learners run real training steps and observe loss curves drop. The limitation is the Lamini API abstraction — the notebooks handle infrastructure concerns automatically in ways that obscure the HuggingFace Trainer API, the PEFT library, or the raw PyTorch training loop that practitioners most commonly use outside this environment. The practical exercise is genuine but somewhat sandboxed.
Fine-tuning is a genuine and growing skill demand. The course provides vocabulary, conceptual grounding, and a completion certificate that can be added to a LinkedIn profile or CV. Multiple reviewers describe using the course as a launchpad to deeper reading and their first real fine-tuning project. The career ceiling is that the Lamini-specific implementation does not directly translate to the HuggingFace ecosystem that most job descriptions and ML engineering roles expect, and the absence of parameter-efficient methods (LoRA, QLoRA, PEFT) means employers looking for practical fine-tuning experience will want evidence of work beyond this course.
The end-to-end example — preparing a dataset, launching a fine-tuning run, monitoring loss, and evaluating the result — covers the full lifecycle at a high level of realism. The instructional design is solid: Zhou explains each step before the notebook executes it, and the notebooks surface real outputs (loss numbers, model responses) rather than simulated ones. The project is limited by its Lamini dependency and by the dataset scale — learners do not grapple with the data curation challenges that dominate real fine-tuning projects.
The course is tightly structured across 11 short lessons: how LLMs and tokenization work, the chat format, input classification, the Moderation API, chain-of-thought reasoning, prompt chaining, output checking and system-level evaluation, all tied together by a running customer-service example. Reviewers repeatedly praise the clarity and the theory-to-practice balance. The honest mark-down is depth and age: it was built on GPT-3.5 Turbo in 2023, so it predates tool calling, structured JSON outputs and reasoning models, and it does not go deep on real-world deployment beyond the safety checks.
Isa Fulford (Member of Technical Staff at OpenAI) demonstrates while Andrew Ng frames the concepts, and reviewers consistently call the pairing knowledgeable and effective communicators. The teacher-demonstrator dynamic mirrors how a beginner actually thinks through each step, and the pacing of 5-20 minute lessons keeps momentum. This is the most authoritative free source for building multi-step LLM systems, and it shows.
Free on the DeepLearning.AI platform with runnable in-browser notebooks, and free to audit the Coursera version. For roughly 90 minutes of content that teaches a reusable architecture for chaining LLM calls, the value is hard to beat. The only caveats are that the platform's graded assignment and certificate sit behind a Pro upgrade, and that the aging notebook code can eat time if you insist on running it locally rather than in-browser.
The standout feature for most reviewers is the hands-on coding: you build prompt chains that consume prior completions, glue Python around model calls, and assemble a full customer-service chatbot that classifies queries, moderates input, reasons step by step and evaluates its own output. The caveat is that there is no graded, kept portfolio artefact on the free tier, and the supplied notebooks now require fixes (deprecated API syntax, missing Utils.py and products.json) to run outside the course sandbox.
The patterns taught — chaining, moderation, evaluation, routing — are exactly the building blocks of production LLM features, and developers report the course gave them a structured mental model they could apply immediately. But it is a one-hour primer with no certificate on the free tier and no capstone, so on its own it is a strong foundation rather than a credential. Its career value is as the second step in a sequence, not a destination.
Scoring methodology applies identically to every course on the site — see the formula.