Fine-Tuning Large Language Models vs Generative AI with Large Language Models
Same Bayesian formula, same rubric — so the difference in scores reflects the difference in the courses, not the difference in how we evaluated them.
DeepLearning.AI · AI & ML Courses
Fine-Tuning Large Language Models
DeepLearning.AI & AWS (Coursera) · AI & ML Courses
Generative AI with Large Language Models
Per-criterion
The course is structured around five core modules: why fine-tune versus prompt engineering, how to prepare and format training data for instruction-following, full-weight fine-tuning mechanics using the Lamini library, training loop internals (loss curves, learning rates, batch sizes), and evaluation of fine-tuned model outputs. For a one-hour short course it is remarkably focused — Sharon Zhou stays disciplined about scope and the conceptual framing of when fine-tuning is the right tool is praised across reviews as the most practically useful part. The recurring mark-down is that the course covers only full-weight fine-tuning and does not address parameter-efficient methods (LoRA, QLoRA, adapters) that dominate practical fine-tuning work in 2025-2026, when GPU cost and accessibility are real constraints for most learners. Reviewers also note that the Lamini-specific API means some of what is taught does not transfer directly to HuggingFace Transformers workflows without re-reading documentation.
Sharon Zhou is the co-founder and CEO of Lamini AI and a Stanford adjunct instructor who has taught machine learning at the university level. Reviewers across Class Central, blogs, and the DeepLearning.AI forum consistently single out her clarity and authoritative delivery as the course's defining strength — she explains technical concepts like gradient updates, loss functions, and the distinction between pre-training, fine-tuning, and RLHF with enough precision for practitioners while keeping the pace accessible to learners with a basic ML background. The criticism directed at instruction is almost always actually criticism of the Lamini dependency rather than of Zhou's teaching itself, which reviewers separate clearly.
The course is free on the DeepLearning.AI platform with all notebooks runnable in-browser using a provided Lamini API key — no local GPU, no cloud compute bill, and no subscription required. For roughly one hour of instruction from a practitioner who helped build a fine-tuning platform, the price-to-value ratio is high by any comparison. The only cost caveat is that learners who want to run the notebooks outside the sandbox need their own Lamini API credits or must re-implement the training loops against HuggingFace Transformers — neither is expensive, but both require additional setup work the course does not walk you through.
The in-browser notebook environment removes all setup friction for the duration of the course, which reviewers describe as genuinely useful — you are fine-tuning a real LLM within minutes of starting. Outside the sandbox, support shows its limits. The DeepLearning.AI community forum contains threads where learners ask how to replicate the Lamini training loop against HuggingFace Transformers or open-source alternatives, and community responses are helpful but unofficial. There is no teaching assistant response mechanism, no office hours, and DeepLearning.AI does not update short courses at a pace that keeps them current with rapidly evolving tooling. Learners asking about LoRA or QLoRA integration find the forum useful but the course itself silent.
The conceptual content — understanding when fine-tuning beats prompt engineering, how to format instruction data, what the loss curve tells you, and how to evaluate whether the fine-tuned model is better — transfers directly to real work regardless of which library you use. Several practitioner reviewers note that the course gave them the mental model they needed to approach fine-tuning projects confidently. The applicability ceiling is the Lamini dependency and the absence of parameter-efficient methods. Full-weight fine-tuning of a base LLM requires GPU resources that most practitioners do not run locally, and the industry has largely moved to LoRA and QLoRA for cost-effective fine-tuning. A learner who finishes this course and tries to apply the skills immediately in a typical cloud ML environment will find a gap between what was taught and what the tools they are most likely to use expect.
At no cost with in-browser compute provided, the course delivers a credible conceptual foundation for fine-tuning from one of the field's genuine practitioners. The value is real — reviewers describe it as the clearest available explanation of why and how to fine-tune, which is a question most AI practitioners eventually face. The value ceiling is that a learner who wants to move from conceptual understanding to hands-on practice in their own environment will need to supplement with HuggingFace documentation, LoRA tutorials, and compute resources not covered here.
Every lesson is paired with a Jupyter notebook, and the course's running example is fine-tuning a base language model on a custom dataset to produce a model that follows instructions in a particular style. Learners run real training steps and observe loss curves drop. The limitation is the Lamini API abstraction — the notebooks handle infrastructure concerns automatically in ways that obscure the HuggingFace Trainer API, the PEFT library, or the raw PyTorch training loop that practitioners most commonly use outside this environment. The practical exercise is genuine but somewhat sandboxed.
Fine-tuning is a genuine and growing skill demand. The course provides vocabulary, conceptual grounding, and a completion certificate that can be added to a LinkedIn profile or CV. Multiple reviewers describe using the course as a launchpad to deeper reading and their first real fine-tuning project. The career ceiling is that the Lamini-specific implementation does not directly translate to the HuggingFace ecosystem that most job descriptions and ML engineering roles expect, and the absence of parameter-efficient methods (LoRA, QLoRA, PEFT) means employers looking for practical fine-tuning experience will want evidence of work beyond this course.
The end-to-end example — preparing a dataset, launching a fine-tuning run, monitoring loss, and evaluating the result — covers the full lifecycle at a high level of realism. The instructional design is solid: Zhou explains each step before the notebook executes it, and the notebooks surface real outputs (loss numbers, model responses) rather than simulated ones. The project is limited by its Lamini dependency and by the dataset scale — learners do not grapple with the data curation challenges that dominate real fine-tuning projects.
Across three weeks (roughly 16 hours), the course covers the full generative AI project lifecycle: the Transformer architecture from the "Attention Is All You Need" paper, prompt engineering, in-context learning, Chinchilla scaling laws, instruction fine-tuning, parameter-efficient fine-tuning (LoRA), and reinforcement learning from human feedback (RLHF). Reviewers repeatedly praise how it grounds each technique in the relevant research paper before showing the "how," which builds genuine understanding of the "why." The most consistent content criticism is that week three squeezes too many topics (RLHF, model optimisation, RAG, ReAct) in at shallow depth and feels disjointed after the RLHF section.
The course is fronted by Andrew Ng with AWS instructors Antje Barth, Mike Chambers, Shelbee Eigenbrode and Chris Fregly delivering the technical content. Reviewers describe the delivery as technically clear, well-diagrammed and well-paced, with one calling Andrew Ng "like a rock star in Artificial Intelligence teaching." The multi-instructor AWS panel draws consistently positive marks for explaining production concepts from real experience, though it is a panel format rather than a single narrative voice.
At roughly USD 49 with six months of access — and the AWS SageMaker lab compute included in that price — multiple reviewers explicitly call it "not overpriced" for the breadth of current, applied content. The main value caveats are that the labs do not require writing original code (so you can finish for the certificate without coding), and that the included lab budget is finite — at least one learner exhausted it after a technical glitch on the very first lab and could not continue.
The three SageMaker labs (dialogue summarisation prompt engineering, PEFT fine-tuning with LoRA, and RLHF detoxification) give learners an end-to-end view of real LLM pipelines using PyTorch and the Hugging Face transformers library. The near-universal complaint is that the labs are "run all the cells" walkthroughs with no original coding, no graded homework, and no self-built project — you can submit by clicking through. Reviewers value them as illustrations but warn they do not verify skill or prepare you to build a similar application from scratch.
The curriculum maps closely to how LLM applications are actually scoped, adapted and deployed in industry — model selection, cost-aware optimisation (quantisation, pruning, distillation), fine-tuning strategy, RLHF alignment and RAG-style augmentation. The modern toolchain (SageMaker, Hugging Face, PyTorch) is exactly what practitioners use. The gap is between conceptual fluency and hands-on ability: because the labs require no original code, several reviewers recommend pairing the course with a build-it-yourself resource such as the Hugging Face NLP course to close the implementation gap.
Scoring methodology applies identically to every course on the site — see the formula.