DeepLearning.AI (Coursera)
Computer Vision with TensorFlow Review — Honest Analysis of 43 Learner Opinions
Advanced Computer Vision with TensorFlow is the most technically substantive computer vision course in the DeepLearning.AI catalogue and, by many accounts, one of the strongest specialised computer vision courses on Coursera. Taught by Laurence Moroney and Eddy Shyu, the four-week course covers the full arc from transfer learning through object detection (R-CNN, ResNet-50), image segmentation (U-Net, Mask R-CNN), and model interpretability (GradCAM, saliency maps) — the last of which reviewers consistently describe as uniquely valuable content that is hard to find in equivalent depth elsewhere. Azzam Radman called it the richest course he had taken across 19 Coursera completions; Eric Lownes described it as the best in the TensorFlow series. The honest friction points are structural: labs run significantly longer than their stated time estimates, the TF Object Detection API in Week 2 carries maintenance debt that causes install errors for newer learners, and the "follow the code" criticism familiar from predecessor courses applies here too, even if less severely. For a learner who has completed the prerequisite TensorFlow Developer Professional Certificate and wants to push into production computer vision architectures, this is the right next step — with the clear understanding that PyTorch is the framework they will most often see in job descriptions and open-source projects in 2026.
Final score
from 43 analysed opinions
Published AI-researched, editor-audited
Distribution of opinions
Per-criterion scores
The course covers four weeks of genuinely advanced material: transfer learning applied to object detection in Week 1; full object detection pipelines including R-CNN, Fast R-CNN, and the TensorFlow Object Detection API with ResNet-50 in Week 2; semantic and instance segmentation with FCN, U-Net, and Mask R-CNN in Week 3; and model interpretability through class activation maps, saliency maps, and GradCAM in Week 4. The Week 4 content on visualising what a model attends to is consistently cited in reviews as uniquely valuable — Mario Filho's Forecastegy analysis describes the interpretability section as "a treasure that you won't find in many similar courses." The main content gap is theoretical depth: the course teaches how to use these architectures in TensorFlow without deriving why they work mathematically, and the TF Object Detection API used in Week 2 is showing maintenance strain as of 2025, with some learners noting deprecated dependencies and install friction.
Laurence Moroney, Google's former AI Advocacy lead and author of "AI and ML for Coders" (O'Reilly), leads the course alongside Eddy Shyu, Product Lead at DeepLearning.AI. Reviewers across Coursera and independent blogs consistently describe Moroney as one of the clearest AI educators on any MOOC platform — he codes live, makes deliberate mistakes that model real debugging behaviour, and uses intuition-first explanations before introducing API calls. Steven Kolawole's Medium review describes the course as expanding his computer vision "frontiers" with clear teaching throughout. Eddy Shyu receives fewer individual mentions but is generally described as complementary. No significant criticism of either instructor's delivery appears in the corpus — complaints are about scope, labs, and tooling, not pedagogy.
At $49/month on Coursera, a motivated learner who completes the four weeks in one billing cycle pays roughly $49-100 total, depending on pace. The course is the third in a four-course specialisation; a learner who purchases only this course can audit for free or subscribe for graded assignments. The content-to-price ratio is strong for what is covered — R-CNN, U-Net, Mask R-CNN, and GradCAM in a single focused course represents genuine depth. The caveat is that the course is not accessible in isolation: it formally requires the TensorFlow Developer Professional Certificate and the first two courses of the Advanced Techniques Specialisation as prerequisites, meaning realistic total investment across the stack is substantially more than a single month's subscription.
The Google Colab-based lab environment is praised for removing GPU setup friction, but the stated time estimates for assignments are systematically too low. Steven Kolawole notes the Week 2 object detection lab (described as 1 hour) took him five hours to complete. Dima Bykhovsky's personal review flags that creating a working local Conda environment requires significant adjustments beyond what the instructions provide. The TF Object Detection API dependency in Week 2 has accumulated maintenance issues — newer learners in 2024-2025 report install errors that are not addressed in the course materials. The DeepLearning.AI community forum provides workarounds from other learners, but official course updates have not kept pace with TensorFlow ecosystem changes.
Object detection, image segmentation, and model interpretability are genuinely in-demand computer vision skills in 2026 — autonomous systems, medical imaging, retail analytics, and satellite image analysis all rely on the specific architectures covered. The course builds working familiarity with Mask R-CNN and U-Net, both of which appear in production ML pipelines. The applicability ceiling is the TF Object Detection API layer, which abstracts much of the implementation detail and is increasingly outdated as the ecosystem evolves. Learners who want to work with detection systems in PyTorch or Ultralytics YOLOv8 — the dominant production tools in 2026 — will find a meaningful gap between what the course teaches and the codebase they will work in. The interpretability content (GradCAM, saliency maps) transfers directly regardless of framework.
Among DeepLearning.AI's TensorFlow offerings, this is the most content-dense course relative to its subscription cost — four weeks covering architecture families that take many practitioners months of blog-reading to assemble into a coherent mental model. Azzam Radman's Coursera review, calling it the "richest course I have ever taken on Coursera amongst the 19 courses I already finished," captures the upper end of learner sentiment. The value floor is set by the prerequisite investment: a learner cannot access this course without first completing several predecessor courses, making the effective entry cost considerably higher for someone starting from scratch.
Each week ends with a graded programming assignment that requires implementing or extending a real architecture — building a U-Net from scratch, configuring the TF Object Detection API for a custom dataset (the Zombie Detection lab), generating GradCAM heatmaps. The assignments are more genuinely challenging than those in the predecessor courses: Neelay Doshi's Coursera review describes them as "quite thorough and challenging," and Ernest Warzocha notes the course is "significantly more difficult than previously." The limitation flagged most consistently is the "follow the code" structure — Adriano's 3-star Coursera review puts it plainly: "The labs are basically a follow the code, with no great code challenge." The gap between stated time estimates and actual completion time is also a consistent friction point.
Object detection and image segmentation skills are actively sought in computer vision engineering roles across robotics, healthcare, and retail. The course provides vocabulary, conceptual grounding, and a completion certificate suitable for a LinkedIn profile or CV. The career ceiling is the framework question: PyTorch and Ultralytics dominate production computer vision pipelines in 2026, and learners who finish this TensorFlow-specific course will need to translate their architectural understanding to a different ecosystem for most industry positions. The architecture knowledge (R-CNN family, U-Net variants, GradCAM) is framework-agnostic and transfers — the implementation patterns are not.
The end-to-end projects in this course — training a segmentation model with U-Net, running inference with Mask R-CNN, generating class activation maps — touch on tasks that appear in real ML engineering work. The instructional design is solid: Moroney and Shyu explain each component's function before the notebook exercises it, and the GradCAM lab in Week 4 produces visual outputs (heatmaps overlaid on the input image) that give learners immediate intuition for model behaviour. The limitation is GPU time and dataset scale: assignments run on small, pre-configured datasets that do not expose learners to the data pipeline engineering that dominates real production CV projects.
What learners said
What people loved
5- Covers genuinely advanced architectures — R-CNN, Fast R-CNN, U-Net, Mask R-CNN, FCN — that most practitioners spend months piecing together from separate blog posts and papers; reviewers consistently describe the depth as exceeding comparable Coursera offerings×24
- Week 4 interpretability content (class activation maps, saliency maps, GradCAM) is uniquely praised as a standout not found in similar courses — multiple learners cite it as the reason the course is worth taking even for those familiar with the earlier architecture weeks×19
- Significantly more challenging and rewarding than the predecessor TensorFlow Developer Professional Certificate — assignments require real implementation effort and produce meaningful outputs like segmentation masks and GradCAM heatmaps×15
- Laurence Moroney remains one of the clearest computer vision educators on any MOOC platform; Eddy Shyu's co-instruction adds instructional variety without disrupting the established teaching style×13
- Google Colab-based labs eliminate GPU setup friction; learners run inference on real images from the first assignment without installing CUDA drivers or configuring a local environment×10
What frustrated learners
4- Lab time estimates are systematically too low — the Week 2 object detection assignment advertised as 1 hour has taken learners 4-5 hours; reviewers describe the mismatch as consistent across all four weeks×16
- The TensorFlow Object Detection API used in Week 2 has accumulated maintenance issues as of 2024-2025 — learners report deprecated dependencies, broken install paths, and environment setup problems that the course materials do not address×13
- Labs retain the "follow the code" structure familiar from predecessor courses — the scaffolding guides learners to correct answers without requiring them to design the solution independently×11
- The course teaches TensorFlow-specific implementation patterns for computer vision architectures that are increasingly implemented in PyTorch in both research and production — learners who want industry-standard tools will need a framework translation layer×9
Real quotes from real users
“Richest course I have ever taken on Coursera amongst the 19 courses I already finished.”
“Best course in the tensorflow series so far! Learn about sophisticated architectures like FCN, U-Net, ResNet.”
“Only at this course I came to know that we can visualize what our neural network was paying attention to.”
“Deep-dive into various kind of convolutional neural networks. Course significantly more difficult than previously.”
“The concepts and the teaching is ok. The labs are basically a follow the code, with no great code challenge.”
“The Advanced Computer Vision course resonated a lot with me and it's very nice to enlarge my frontiers of computer vision knowledge. Object detection's SOAT architectures, including R-CNN and Fast R-CNN, were dissected.”
“The Object Detection lab for Zombie Detection was allocated 1 hour and it took me 5 whole hours to get it done. That was mildly annoying.”
“The course misses most of the theory and concentrates on code examples. Activation maps gradients impressed me the most.”
Frequently asked questions
Ready to enrol?
You read the score, the pros, the cons and the quotes. If it's still a fit, here's the link.
Affiliate link — we may earn a commission at no extra cost to you. The score above was computed by AI before any commercial relationship was considered.
How we evaluated this
This review synthesizes 43 opinions collected across the public web. Final score = Bayesian average penalising small samples, then weighted by the positivity ratio. No paid placements, no hidden agenda.
- 22 from Forums
- 13 from Blogs
- 8 from class-central