Chevron Left
Back to Machine Learning Data Lifecycle in Production

Learner Reviews & Feedback for Machine Learning Data Lifecycle in Production by DeepLearning.AI

814 ratings

About the Course

In the second course of Machine Learning Engineering for Production Specialization, you will build data pipelines by gathering, cleaning, and validating datasets and assessing data quality; implement feature engineering, transformation, and selection with TensorFlow Extended and get the most predictive power out of your data; and establish the data lifecycle by leveraging data lineage and provenance metadata tools and follow data evolution with enterprise data schemas. Understanding machine learning and deep learning concepts is essential, but if you’re looking to build an effective AI career, you need production engineering capabilities as well. Machine learning engineering for production combines the foundational concepts of machine learning with the functional expertise of modern software development and engineering roles to help you develop production-ready skills. Week 1: Collecting, Labeling, and Validating data Week 2: Feature Engineering, Transformation, and Selection Week 3: Data Journey and Data Storage Week 4: Advanced Data Labeling Methods, Data Augmentation, and Preprocessing Different Data Types...

Top reviews


Jul 2, 2021

Interesting material. There are quite a lot of typos and many code snippets are directly from the tfx manual pages however the instructions provided and logic of the course is clear.


Oct 13, 2021

It is a very informative course. I learned a lot about data, metadata, schema and feature engineering, Also, Robert Crowe sir is a very good teacher.

Filter by:

1 - 25 of 169 Reviews for Machine Learning Data Lifecycle in Production

By Mouni R

Dec 5, 2021

I really enjoyed the first course in the specialization but the instruction for the 2nd course (ML Data Life Cycle) was terrible. There were too many concepts packed and explained in an abstract fashion.

I am Technical PM at Amazon and work on ML systems. I was hoping to learn best practices with practical examples. I got a lot of it in the first course but this one was a mash of abstract academic concepts with very few practical examples.

It was irritating to go through the TFX sales pitch without explaining why exactly we are doing something in the ML pipeline.

I did learn some new concepts around auto-labelling and feature selection but I can teach the useful portion of this course in 60 minutes.

I urge Deep Learning AI to replace and rethink this course entirely.

By Tyler G

Jun 11, 2021

A somewhat disappointing and misleading followup to the excellent first course in this specialization. It's heavily focused on shallow learning on structured data, which is not at all what I think of when I think of the challenges in prod ML.

TFX feels more like a solution to technologies that were available well before the deep learning revolution. There are certainly some useful, albeit complicated, tools coming out of google/tensorflow. We'll see if TFX sticks or just becomes another tensorflow.estimator in the shadow of keras.

By Liqiang D

Jun 29, 2021

Too many concepts packed in the lecture videos. The lecturor basically reads the slide instead of going through them.

I have been using TF in a professional job for three years. I still find TF is too complex to used.

By Riju M

Jun 15, 2021

The labs and assignments were interesting but the lectures, content videos were not engaging.

By Arthur F

Aug 16, 2021

Most of the course feels like an advertisement for Tensorflow Extended data pipeline management tool. If you are using TF then the tool may be a necessity in some cases, but otherwise it is largely not useful. There is very little which is transferrable outside TF. For the parts which are high level and not TF specific either you know it because you've encountered production systems before, or you don't know it, in which case this course is not really going to help you that much to ramp up.

By Jungwei F

Jul 11, 2021

I have no doubt in Robert's knowledge on the subject, but delivering clear instruction with just right amount of contexts is an art that takes another few years to master. Way to go!

By Fanny J

Jan 21, 2022

This course is not up to the usual level of Andrew Ng's specializations on Coursera. In my opinion, it needs a big review to order contents better. The order in which topics are covered sometimes feels like it's just random. You see a subject in week one, video 4 and you see it again at the end of week 2, presented differently. It feels very weird, and if you're trying to summarise the class in a document, it's very hard. Also, there is surprisingly often no link from one slide to the next. It's just "let's change subject and speak about this now". Some quiz questions (for instance in week 2) are referring to material learned AFTER the quiz itself. Also, N-th level negation questions in quizzes are just a way of having us loose our time, not a clever way of checking if some material is well understood. The overall impression this course gave me is that if I had no knowledge of any of this before, it would unfortuntaley not have helped much. The course in general demotivated me into following it, because I was expecting something more interesting and better structured.

By Kamlesh K

Jun 28, 2021

Access to the code is not available. Most of the concepts are too complicated in implementation. Having used model management before, i think many things should be made much simpler and developer friendlier.

By Roger S P M

Aug 22, 2021

Sadly the lectures are rather dull. But take heart, the material is much more interesting in the next course. Press on. Get this one over with. You will be happier in course #3.

By Arturo M

May 10, 2022

I'm quite dissapointed by this continuation of the otherwise excellent Andrew Ng specialization.

I was expecting a course on frameworks and best practices for managing data in MLOps environments. Instead, this course is basically a commercial of Tensor Flow Extended, a MLOps framework by Google. Other tools often used in commercial applications (like cloud ML platforms) are not even mentioned.

It's true that the course does provide some tips, but they are often too general to be of practical use, specially for people with some experience in the field (e.g. "you need to validate your inputs").

I hope the next courses in the specialization are better.

By Chandra B

Apr 6, 2022

This course was nothing short of painful for someone who has had some industry experience, as well as som experience teaching. The video instructions were disjointed, unclear and did precious little to prepare one for the graded lab assignments. There was significant lack of cohesion between the ungraded and graded labs. Finally, the TFX ecosystem is esoteric, unnecessarily complex and a nightmare to use. As someone looking to adopt a data pipeline for their production ML model, all this course has done is convince me not to use TFX.

By Hitesh K

Jul 16, 2021

If you're new to ML pipelines this is an excellent course to understand ML pipelines. Moreover, the labs and assignment are of good quality. If you already are familiar with ML pipelines tech like Amazon sagemaker then this course might seem repetitive of many things but still you get to learn about google's Ml pipeline stack which is TFX.

By Nithiwat S

Jun 23, 2022

The course is poorly prepared and presented. The instructor basically talks through slides with no concrete technical content, simply babling from one bullet point to another, from one slide to the next, unorganized. Lectures were horrible -- broad, technical content barely scratches the surface, uninteresting way to deliver and speak. This is a practical course. The intructor should have structured the lecture around a practical implementation through a real-life example. It's not there at all. Very difficult to continue listening and it's very frustrating. Lab and Assignments in Jupyter Notebook are good. Overall, a huge disappointment considering the first course in the Specialization taught by Andrew Ng was so good.

By ChenChang S

Jun 23, 2021

This handful course allows me to understand how the tft works and how to inspect with statistic aspect of view about data. Much interesting is the practice, it offers much practical example about data preparation, especially the optional week 4 time series data !

By Aadidev S

May 16, 2021

This was quite exciting! A lot of new, innovative content in the TFX libraries along with all the theoretical background necessary for determining when to use each component in the data life-cycle, highly recommend!

By Robert K

Dec 24, 2021

Unfortunately, the Tensorflow documentation and guides would be much better here.

I felt like the teacher liked to show his ego, it was cringy at some times, and superficial at others. I felt like the entire course could be just put into one notebook, and read in 2 hours. It is a shame as I expected better.

By Hui J

Jan 4, 2022

A lot of the concepts are not well-explained. I feel like my mind is constantly drifting away when watching the video, to me, this course is more like a workshop/ads for tensorflow rather than explaining the data lifecycle properly.

By Francis Q L

Sep 9, 2021

I am kind of disappointed with this class. First, I feel this class is way too oriented towards structured data. But more importantly, I find that the labs are overly simple and I can't say that I would feel proficient applying the concepts learned in production.

By Rafayel D

Jul 7, 2022

Tensorflow is not a convinient tool

By Panagiotis S

Jan 25, 2022

Very poor content. Also it was not engaging at all. The instructor was just reading the slides and gave only a slight explanation on more advanced concepts. Also the graded assignments were too easy and only focused on Tensorflow products which not everybody out there uses. Personally I was dissapointed that using ONLY tensorflow components that cannot be used alongside with other libraries like Pytorch or MXNet. Very dissapointing..

By Gustavo Z F

Aug 12, 2021

The course takes an overall look over the general data life-cycle pipeline in production. That includes: (1) data collection, labeling and validation; (2) feature engineering, transformation and selection; and (3) data journey and storage. The instructor, Robert Crowe (TF engineer), presented a plain domain of the studied subjects and was fully able to explain them understandably. The technologies and libraries presented through the course are modern and applicable to the majority of my current projects. I would recommend this course to anyone interested into better understanding the data behavior in the production environment, as well as, how to use the introduce libraries to correct data anomalies/problems (e.g. data skew, data drift, others).

By Dr. F T

Aug 15, 2021

Great course. Looking for one on TFX since the tool was open sourced few years ago. While TFX could be quite technical and hard to undertsand, Robert may it clear with many examples to practice and better understand it. Data Scientist that plan to deploy model in production should take it.

By Reza M

Sep 8, 2021

This is to understand that Data Lifecycle is the rest of the iceberg, compared to Machine Learning Models being the tip of the iceberg. It is very good demonstration of TensorFlow capabilities processing and maintaining the data for operation.

By raveesh k

Sep 1, 2021

This is the best course for understanding the data lifecycle in production, everything has been explained in video and the assignments given in the course are the real life practical scenario for data pipeline management for machine learning.

By Srinesh C

Jul 3, 2021

Interesting material. There are quite a lot of typos and many code snippets are directly from the tfx manual pages however the instructions provided and logic of the course is clear.