As data collection has increased exponentially, so has the need for people who are skilled at using and interacting with data, who can think critically, and who can provide insights that lead to better decisions and better-run businesses. This is the data scientist: “part mathematician, part computer scientist, and part trend spotter” (SAS Institute, Inc.). According to Glassdoor, data scientist is the best job in America, with a median base salary of $110,000 and thousands of job openings at a time. A good data scientist must be able to retrieve and work with data, and doing that requires being well versed in SQL, the standard language for communicating with database systems.



SQL for Data Science
This course is part of the Learn SQL Basics for Data Science Specialization

Instructor: Sadie St. Lawrence
690,026 already enrolled
(17,025 reviews)
What you'll learn
- Identify a subset of data needed from a column or set of columns and write a SQL query to limit to those results. 
- Use SQL commands to filter, sort, and summarize data. 
- Create an analysis table from multiple queries using the UNION operator. 
- Manipulate strings, dates, & numeric data using functions to integrate data from different sources into fields with the correct format for analysis. 
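
As a rough illustration of the UNION objective above, the following sketch builds one result set from two queries. It assumes Python's built-in sqlite3 module as the database engine; the tables `online_orders` and `store_orders` and their rows are hypothetical, invented only for this example.

```python
import sqlite3

# Hypothetical tables and data, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE online_orders (customer TEXT, amount REAL);
    CREATE TABLE store_orders  (customer TEXT, amount REAL);
    INSERT INTO online_orders VALUES ('Ana', 20.0), ('Ben', 35.5);
    INSERT INTO store_orders  VALUES ('Ben', 35.5), ('Cleo', 12.0);
""")

# UNION stacks the results of two queries and drops duplicate rows.
# Both SELECTs must return the same number of columns.
union_rows = conn.execute("""
    SELECT customer, amount FROM online_orders
    UNION
    SELECT customer, amount FROM store_orders
    ORDER BY customer
""").fetchall()

print(union_rows)
conn.close()
```

The row for Ben appears in both tables but only once in the result. UNION ALL would keep both copies.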


There are 4 modules in this course
In this module, you will be able to define SQL and discuss how SQL differs from other computer languages. You will be able to compare and contrast the roles of a database administrator and a data scientist, and explain the differences between one-to-one, one-to-many, and many-to-many relationships in databases. You will be able to use the SELECT statement and describe some basic syntax rules. You will be able to add comments to your code and explain why they are important.
What's included
11 videos, 3 readings, 4 assignments, 2 discussion prompts
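
The SELECT statement and the two comment styles mentioned in this module might look like the sketch below. It assumes Python's built-in sqlite3 module as the engine; the `products` table and its rows are hypothetical. The ORDER BY and LIMIT clauses are included only to keep the output small and deterministic.

```python
import sqlite3

# Hypothetical table and data, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE products (id INTEGER PRIMARY KEY, name TEXT, price REAL);
    INSERT INTO products (name, price)
    VALUES ('pen', 1.5), ('notebook', 4.0), ('lamp', 19.99);
""")

query = """
    -- Single-line comment: name the columns you need instead of SELECT *
    SELECT name, price
    /* Block comment:
       LIMIT caps the number of rows returned (SQLite syntax). */
    FROM products
    ORDER BY id
    LIMIT 2
"""
selected = conn.execute(query).fetchall()

print(selected)
conn.close()
```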
In this module, you will be able to use several new clauses and operators, including WHERE, BETWEEN, IN, OR, NOT, LIKE, ORDER BY, and GROUP BY. You will be able to use wildcards to search for specific records or parts of records, weigh their advantages and disadvantages, and decide how best to use them. You will be able to discuss how to use basic math operators, as well as aggregate functions such as AVG (average), COUNT, MAX, and MIN, to begin analyzing your data.
What's included
9 videos, 1 reading, 3 assignments
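
A minimal sketch of the filtering, sorting, and aggregation clauses named in this module, assuming Python's built-in sqlite3 module as the engine. The `sales` table and its values are hypothetical and chosen so the arithmetic is easy to check by hand.

```python
import sqlite3

# Hypothetical table and data, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sales (region TEXT, product TEXT, units INTEGER, unit_price REAL);
    INSERT INTO sales VALUES
        ('North', 'pen',      10, 1.5),
        ('North', 'pencil',    4, 0.5),
        ('South', 'pen',       6, 1.5),
        ('South', 'notebook',  3, 4.0),
        ('West',  'lamp',      1, 20.0);
""")

# Filter with LIKE (% matches any run of characters), IN, and BETWEEN,
# then sort the surviving rows from most to fewest units.
filtered = conn.execute("""
    SELECT region, product, units
    FROM sales
    WHERE product LIKE 'pe%'
      AND region IN ('North', 'South')
      AND units BETWEEN 5 AND 10
    ORDER BY units DESC
""").fetchall()

# Summarize per region: a math operator (*) inside SUM, plus COUNT and AVG.
summary = conn.execute("""
    SELECT region,
           COUNT(*)                AS n_rows,
           SUM(units * unit_price) AS revenue,
           AVG(units)              AS avg_units
    FROM sales
    GROUP BY region
    ORDER BY region
""").fetchall()

print(filtered)
print(summary)
conn.close()
```

Note that the aggregate function for averages is spelled AVG in SQL.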
In this module, you will be able to discuss subqueries, including their advantages and disadvantages, and when to use them. You will be able to recall the concept of a key field and discuss how key fields help us link data together with JOINs. You will be able to identify and define several types of JOINs, including Cartesian joins, inner joins, left and right joins, full outer joins, and self joins. You will be able to use aliases and pre-qualifiers to make your SQL code cleaner and more efficient.
What's included
10 videos, 2 readings, 3 assignments, 1 discussion prompt
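
The sketch below shows how a key field, aliases, joins, and a subquery from this module can fit together. It assumes Python's built-in sqlite3 module as the engine; the `customers` and `orders` tables are hypothetical. Right and full outer joins are left out because older SQLite versions do not support them.

```python
import sqlite3

# Hypothetical tables and data; customer_id is the key field linking them.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL);
    INSERT INTO customers VALUES (1, 'Ana'), (2, 'Ben'), (3, 'Cleo');
    INSERT INTO orders VALUES (10, 1, 25.0), (11, 1, 40.0), (12, 2, 15.0);
""")

# INNER JOIN keeps only customers that have at least one matching order.
# The aliases c and o qualify each column with the table it comes from.
inner_rows = conn.execute("""
    SELECT c.name, o.total
    FROM customers AS c
    INNER JOIN orders AS o ON o.customer_id = c.customer_id
    ORDER BY o.order_id
""").fetchall()

# LEFT JOIN keeps every customer, including those with no orders.
left_rows = conn.execute("""
    SELECT c.name, COUNT(o.order_id) AS n_orders
    FROM customers AS c
    LEFT JOIN orders AS o ON o.customer_id = c.customer_id
    GROUP BY c.customer_id, c.name
    ORDER BY c.customer_id
""").fetchall()

# Subquery: the inner SELECT produces the list of ids that the outer query filters on.
subquery_rows = conn.execute("""
    SELECT name
    FROM customers
    WHERE customer_id IN (SELECT customer_id FROM orders WHERE total > 20)
    ORDER BY name
""").fetchall()

print(inner_rows)
print(left_rows)
print(subquery_rows)
conn.close()
```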
In this module, you will be able to discuss how to modify strings by concatenating, trimming, changing the case, and using the substring function. You will be able to work with date and time strings specifically. You will be able to use CASE statements, and you will finish the module by discussing data governance and profiling. You will also be able to apply fundamental principles, tips, and tricks when using SQL in a data science context.
What's included
11 videos, 3 readings, 4 assignments, 1 discussion prompt
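
The string, date, and CASE techniques from this module could be combined as in the sketch below. It assumes Python's built-in sqlite3 module as the engine, so the `||` concatenation operator and the STRFTIME function follow SQLite's dialect; other database systems may spell these differently. The `signups` table and its rows are hypothetical.

```python
import sqlite3

# Hypothetical table with deliberately messy names, invented for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE signups (first_name TEXT, last_name TEXT, signup_date TEXT, spend REAL);
    INSERT INTO signups VALUES
        ('  ana ', 'Lopez', '2020-03-15', 120.0),
        ('BEN',    'Okoro', '2019-01-29',  30.0);
""")

cleaned = conn.execute("""
    SELECT
        -- TRIM strips surrounding spaces, UPPER changes case, || concatenates
        UPPER(TRIM(first_name)) || ' ' || last_name AS full_name,
        -- SUBSTR(text, start, length): first three characters of the last name
        SUBSTR(last_name, 1, 3)                     AS last_prefix,
        -- STRFTIME extracts the year from an ISO-8601 date string
        STRFTIME('%Y', signup_date)                 AS signup_year,
        -- CASE turns a numeric value into a category label
        CASE WHEN spend >= 100 THEN 'high' ELSE 'low' END AS spend_band
    FROM signups
    ORDER BY signup_date
""").fetchall()

print(cleaned)
conn.close()
```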




Learner reviews
17,025 reviews
- 5 stars: 73.49%
- 4 stars: 19.96%
- 3 stars: 3.55%
- 2 stars: 1.43%
- 1 star: 1.53%
Reviewed on Mar 15, 2020
Good course for beginners.
- classes are fine
- quizzes evaluate quite well your understanding I think
- the final assessment contains plenty of VERY ambiguous questions and I think it should be reworked
Reviewed on Jan 29, 2019
Perfect course! But recommend the last main exercise do more readable and understandable. It is difficult to read in a text file, single-color task and it is not always clear what needs to be done.
Reviewed on Aug 27, 2020
The course is nice but I suppose the PowerPoint presentation that accompanies it could use some effects to show us the clauses one by one, as they are spoke. That would make understanding them easier.
¹ Some assignments in this course are AI-graded. For these assignments, your data will be used in accordance with Coursera's Privacy Notice.


