Fine-Tuning Large Language Models vs Building Systems with the ChatGPT API

Same Bayesian formula, same rubric — so the difference in scores reflects the difference in the courses, not the difference in how we evaluated them.

DeepLearning.AI · AI & ML Courses

Fine-Tuning Large Language Models

4.0/ 5 · 38 opinions

26 positive8 neutral4 negative/ 38 total

Read full review

DeepLearning.AI · AI & ML Courses

Building Systems with the ChatGPT API

4.3/ 5 · 32 opinions

23 positive6 neutral3 negative/ 32 total

Read full review

Per-criterion

Content quality4.1 / 5

The course is structured around five core modules: why fine-tune versus prompt engineering, how to prepare and format training data for instruction-following, full-weight fine-tuning mechanics using the Lamini library, training loop internals (loss curves, learning rates, batch sizes), and evaluation of fine-tuned model outputs. For a one-hour short course it is remarkably focused — Sharon Zhou stays disciplined about scope and the conceptual framing of when fine-tuning is the right tool is praised across reviews as the most practically useful part. The recurring mark-down is that the course covers only full-weight fine-tuning and does not address parameter-efficient methods (LoRA, QLoRA, adapters) that dominate practical fine-tuning work in 2025-2026, when GPU cost and accessibility are real constraints for most learners. Reviewers also note that the Lamini-specific API means some of what is taught does not transfer directly to HuggingFace Transformers workflows without re-reading documentation.

Instructor4.7 / 5

Sharon Zhou is the co-founder and CEO of Lamini AI and a Stanford adjunct instructor who has taught machine learning at the university level. Reviewers across Class Central, blogs, and the DeepLearning.AI forum consistently single out her clarity and authoritative delivery as the course's defining strength — she explains technical concepts like gradient updates, loss functions, and the distinction between pre-training, fine-tuning, and RLHF with enough precision for practitioners while keeping the pace accessible to learners with a basic ML background. The criticism directed at instruction is almost always actually criticism of the Lamini dependency rather than of Zhou's teaching itself, which reviewers separate clearly.

Value for money4.5 / 5

The course is free on the DeepLearning.AI platform with all notebooks runnable in-browser using a provided Lamini API key — no local GPU, no cloud compute bill, and no subscription required. For roughly one hour of instruction from a practitioner who helped build a fine-tuning platform, the price-to-value ratio is high by any comparison. The only cost caveat is that learners who want to run the notebooks outside the sandbox need their own Lamini API credits or must re-implement the training loops against HuggingFace Transformers — neither is expensive, but both require additional setup work the course does not walk you through.

Support3.3 / 5

The in-browser notebook environment removes all setup friction for the duration of the course, which reviewers describe as genuinely useful — you are fine-tuning a real LLM within minutes of starting. Outside the sandbox, support shows its limits. The DeepLearning.AI community forum contains threads where learners ask how to replicate the Lamini training loop against HuggingFace Transformers or open-source alternatives, and community responses are helpful but unofficial. There is no teaching assistant response mechanism, no office hours, and DeepLearning.AI does not update short courses at a pace that keeps them current with rapidly evolving tooling. Learners asking about LoRA or QLoRA integration find the forum useful but the course itself silent.

Real-world use3.7 / 5

The conceptual content — understanding when fine-tuning beats prompt engineering, how to format instruction data, what the loss curve tells you, and how to evaluate whether the fine-tuned model is better — transfers directly to real work regardless of which library you use. Several practitioner reviewers note that the course gave them the mental model they needed to approach fine-tuning projects confidently. The applicability ceiling is the Lamini dependency and the absence of parameter-efficient methods. Full-weight fine-tuning of a base LLM requires GPU resources that most practitioners do not run locally, and the industry has largely moved to LoRA and QLoRA for cost-effective fine-tuning. A learner who finishes this course and tries to apply the skills immediately in a typical cloud ML environment will find a gap between what was taught and what the tools they are most likely to use expect.

Value4.2 / 5

At no cost with in-browser compute provided, the course delivers a credible conceptual foundation for fine-tuning from one of the field's genuine practitioners. The value is real — reviewers describe it as the clearest available explanation of why and how to fine-tune, which is a question most AI practitioners eventually face. The value ceiling is that a learner who wants to move from conceptual understanding to hands-on practice in their own environment will need to supplement with HuggingFace documentation, LoRA tutorials, and compute resources not covered here.

Practical projects3.8 / 5

Every lesson is paired with a Jupyter notebook, and the course's running example is fine-tuning a base language model on a custom dataset to produce a model that follows instructions in a particular style. Learners run real training steps and observe loss curves drop. The limitation is the Lamini API abstraction — the notebooks handle infrastructure concerns automatically in ways that obscure the HuggingFace Trainer API, the PEFT library, or the raw PyTorch training loop that practitioners most commonly use outside this environment. The practical exercise is genuine but somewhat sandboxed.

Career impact3.5 / 5

Fine-tuning is a genuine and growing skill demand. The course provides vocabulary, conceptual grounding, and a completion certificate that can be added to a LinkedIn profile or CV. Multiple reviewers describe using the course as a launchpad to deeper reading and their first real fine-tuning project. The career ceiling is that the Lamini-specific implementation does not directly translate to the HuggingFace ecosystem that most job descriptions and ML engineering roles expect, and the absence of parameter-efficient methods (LoRA, QLoRA, PEFT) means employers looking for practical fine-tuning experience will want evidence of work beyond this course.

Project quality3.9 / 5

The end-to-end example — preparing a dataset, launching a fine-tuning run, monitoring loss, and evaluating the result — covers the full lifecycle at a high level of realism. The instructional design is solid: Zhou explains each step before the notebook executes it, and the notebooks surface real outputs (loss numbers, model responses) rather than simulated ones. The project is limited by its Lamini dependency and by the dataset scale — learners do not grapple with the data curation challenges that dominate real fine-tuning projects.

Content quality4.2 / 5

Across 11 short lessons (roughly 90 minutes total), the course covers a complete pipeline for multi-step LLM systems: how language models and tokenisation work, the chat format and system-user message separation, input classification for query routing, the OpenAI Moderation API, chain-of-thought prompting to handle multi-step questions, chaining several focused prompts where each consumes the previous output, output checking, and a two-part section on evaluating LLM responses at the system level. Reviewers consistently praise the logical progression and the theory-to-practice balance. The principal mark-down is age and depth: the course was built on GPT-3.5 Turbo in 2023 and has not been meaningfully updated, so it predates tool calling, structured JSON outputs, and reasoning models, and it stops short of real-world deployment concerns such as latency management, cost at scale, and production observability.

Instructor4.8 / 5

Isa Fulford, Member of Technical Staff at OpenAI, leads the code demonstrations while Andrew Ng frames the broader concepts and asks the questions a beginner would actually ask. Reviewers across blogs and Coursera call the pairing "highly knowledgeable and effective communicators." The teacher-demonstrator dynamic mirrors how a learner thinks through a new problem step by step, keeping each lesson of five to twenty minutes focused and coherent. Because Fulford comes directly from the team that built the ChatGPT API, the design decisions behind the Moderation API, the chat format, and tokenisation carry genuine authority rather than third-hand explanation.

Value for money4.9 / 5

The course is free on the DeepLearning.AI platform with every Jupyter notebook runnable directly in-browser — no OpenAI API key, no local Python environment, and no subscription required. The Coursera guided-project version is also free to audit. For roughly 90 minutes of hands-on instruction from two of the most credible names in the field, delivering reusable architecture patterns for multi-step LLM systems, the value proposition is essentially unmatched among paid or free alternatives. The only caveats are that a graded assignment and certificate on the Coursera version sit behind a paid enrolment, and the free tier leaves no portfolio artefact by default.

Real-world use4.0 / 5

The patterns taught — classify the input, moderate for safety, reason in steps, chain focused prompts rather than one monolithic prompt, then evaluate the output — are exactly how production LLM features are structured in practice. Multiple reviewers note that the progression from basic API calls to a multi-stage orchestrated system reflects real engineering work. The gap is that the 2023 course predates the patterns now central to production LLM development (tool calling, structured outputs, retrieval-augmented generation), and at least one practitioner reviewer noted that the finished chatbot example would require substantial hardening before it approached something ready for deployment beyond a prototype.

Practical projects4.2 / 5

Every lesson pairs a video with a runnable Jupyter notebook, and the course builds one coherent end-to-end example: a customer-service chatbot that classifies incoming queries, runs them through the Moderation API, applies chain-of-thought prompting to multi-step reasoning, chains successive focused prompts, retrieves product information, and evaluates whether its own output actually addresses the user's question. The Coursera version holds a 4.7/5 rating across 346 learners. The caveat is that there is no graded project or kept portfolio artefact on the free tier, and the supplied notebooks now require fixes (deprecated API syntax, missing helper files) to run locally outside the course sandbox.

Scoring methodology applies identically to every course on the site — see the formula.