When you enroll in this course, you'll also be enrolled in this Professional Certificate.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate from CertNexus
There are 4 modules in this course
This course is designed for business and data professionals seeking to learn the first technical phase of the data science process, known as Extract, Transform, and Load (ETL).
Learners will first be taught how to collect data from multiple sources so that it is available to be transformed and cleaned. They will then dive into the collected data sets to prepare and clean the data so that it can later be loaded into its ultimate destination. At the conclusion of the course, learners will load the data into that destination so that it can be analyzed and modeled.
The typical student in this course will have experience working with data and an aptitude for computer programming.
The first truly hands-on technical phase of the data science process is actually a combination of related tasks known as extract, transform, and load (ETL). This is where you, the data science practitioner, start to mold and shape the data so that it can be as useful as possible for the later steps in the data science process. In this course, you'll go through each ETL task in order, starting with "E" (extract).
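As a rough illustration of how the three tasks fit together, the following is a minimal sketch using only the Python standard library. It is not taken from the course materials: the sample data, the column names, the two accepted date formats, and the function names `extract`, `transform`, and `load` are all assumptions made for the example.

```python
import csv
import io
from datetime import datetime

# Hypothetical raw input: inconsistent whitespace, mixed date formats,
# and one exact duplicate row.
RAW_CSV = """id,name,signup_date
1, Ada ,2021-03-04
2,Grace,04/03/2021
2,Grace,04/03/2021
"""


def extract(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def parse_date(value):
    """Try each assumed format and return an ISO 8601 date string."""
    for fmt in ("%Y-%m-%d", "%d/%m/%Y"):
        try:
            return datetime.strptime(value, fmt).date().isoformat()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date: {value!r}")


def transform(rows):
    """Transform: trim whitespace, normalize dates, drop exact duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        record = (
            int(row["id"]),
            row["name"].strip(),
            parse_date(row["signup_date"].strip()),
        )
        if record not in seen:
            seen.add(record)
            cleaned.append(record)
    return cleaned


def load(records, destination):
    """Load: append cleaned records to the destination (a plain list here)."""
    destination.extend(records)
    return destination


# Run the tasks in order: E, then T, then L.
warehouse = load(transform(extract(RAW_CSV)), [])
print(warehouse)
```

In practice the destination would more likely be a file or a database than an in-memory list; the list simply keeps the sketch self-contained.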
Correction of Data Formats and Date Conversion•9 minutes
Deduplication•4 minutes
Word Embedding•8 minutes
Image Data Representation•4 minutes
7 readings•Total 60 minutes
Overview•2 minutes
Guidelines for Parsing Data•3 minutes
Guidelines for Handling Irregular and Unusable Data•10 minutes
Guidelines for Correcting Data Formats•15 minutes
Guidelines for Deduplicating Data•5 minutes
Text Data Transformation Techniques•15 minutes
Guidelines for Transforming Data•10 minutes
1 assignment•Total 30 minutes
Transforming Data•30 minutes
1 discussion prompt•Total 5 minutes
Reflect on What You've Learned•5 minutes
4 ungraded labs•Total 145 minutes
Handling Irregular and Unusable Data•20 minutes
Correcting Data Formats•30 minutes
Deduplicating Data•20 minutes
Handling Textual Data•75 minutes
Load Data
Module 3•3 hours to complete
The last step in the ETL process is loading. In this module, you'll take the data you transformed and put it into a destination format and location, where it will be ready for you to work on as the project progresses.
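One possible shape for the loading step is sketched below, assuming the destination is a SQLite database accessed through Python's built-in `sqlite3` module. The table name `customers`, its columns, the sample records, and the helper `load_records` are hypothetical and chosen only for illustration; the course may use a different destination and format.

```python
import sqlite3


def load_records(conn, records):
    """Load transformed records into a (hypothetical) customers table.

    Returns the number of rows in the table after loading.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customers ("
        "id INTEGER PRIMARY KEY, "
        "name TEXT NOT NULL, "
        "signup_date TEXT NOT NULL)"
    )
    # The connection context manager commits on success and rolls back
    # on error, so a failed load does not leave partial data behind.
    with conn:
        conn.executemany(
            "INSERT OR REPLACE INTO customers (id, name, signup_date) "
            "VALUES (?, ?, ?)",
            records,
        )
    return conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]


# An in-memory database keeps the example self-contained; a real load
# would connect to a file path or a database server instead.
conn = sqlite3.connect(":memory:")
transformed = [(1, "Ada", "2021-03-04"), (2, "Grace", "2021-03-04")]
row_count = load_records(conn, transformed)
print(row_count)
```

Using `INSERT OR REPLACE` keyed on the primary key is one design choice among several: it makes the load safe to rerun, at the cost of silently overwriting existing rows with the same `id`.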
CertNexus is a vendor-neutral certification body that provides emerging technology certifications and micro-credentials for Business, Data, Development, IT, and Security professionals. CertNexus’ exams meet the most rigorous development standards possible, which outline a global framework for developing personnel certification programs to narrow the widening skills gap.
When will I have access to the lectures and assignments?
To access the course materials and assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. It also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.