SQL for Data Science on Coursera

Created by UC Davis Continuing and Professional Education and hosted on the Coursera platform, this online course consists of pre-recorded video lectures, auto-graded and peer-reviewed assignments and community discussion forums. This course is self-paced and designed to teach you a new skill in as little as 4-6 weeks.

As data collection has increased exponentially, so has the need for people skilled at using and interacting with data—in order to think critically and provide insights that enable better decision making in organizations. Data scientists are “part mathematician, part computer scientist and part trend spotter” and are in higher demand than ever before (SAS Institute, Inc.). Good data scientists must be able to retrieve and work with data, and to do that, you must be well versed in SQL, the standard language for communicating with database systems.

This course is designed to give you a primer in the fundamentals of SQL and working with data so you can begin analyzing it for data science purposes. You will begin to ask the right questions and come up with good answers to deliver valuable insights for your organization. This course starts with the basics and assumes you do not have any prior knowledge or skills in SQL. It will build on that foundation and gradually have you write both simple and complex queries to help you select data from tables. You'll start to work with different types of data like strings and numbers and discuss methods to filter and pare down your results.

You will create new tables and be able to move data into them. You will learn common operators and how to combine the data. You will use case statements and concepts like data governance and profiling. You will discuss topics on data, and practice using real-world programming assignments. You’ll also interpret the structure, meaning and relationships in source data and use SQL as a professional to shape your data for targeted analysis purposes.

Although we do not have any specific prerequisites or software requirements to take this course, a simple text editor is recommended for the final project. So what are you waiting for? This is your first step in what could be a great new career.

Data scientist course reviews with Kristofer Björnström

Course review: SQL for Data Science on Coursera

This is a course review of SQL for Data Science on Coursera, part 1 of 4 in a series of SQL courses by UC Davis (University of California, Davis), with Sadie St. Lawrence as course instructor.

SQL for Data Science is an easy-to-follow, short SQL course that can be done in about 10 hours of work. The ideal audience is the complete beginner in SQL. The course is divided into four “weeks”, and the first two weeks of videos are just about selecting data, sorting, filtering and summing up with aggregate functions.

In week three the student is introduced to subqueries, joins and unions. Week four shows some functions to manipulate data by changing formats and you also get some tips about how to handle data when doing data analysis.

Overall, this course will be too simple for most people who want to break into data science, but it’s very pedagogical and could be a great starting point when you start off with SQL.

Facts about SQL for Data Science

7-day free trial - after that a monthly fee is charged.

The DBMS used in the course is SQLite, but the instructor highlights some differences for users of MySQL and Oracle.

No installation needed, everything is done in your browser.

Time required to complete the course is about 10 hours.

The course is mainly made up of videos, quizzes and programming exercises, but there is also a final assignment that needs to be sent in and reviewed by peers. It might be hard to finish and receive a grade on this final assignment within the 7-day free trial.

Review: SQL for Data Science Specialization by UC Davis

Program Overview

There are winners and losers in the realm of analytics, and SQL is definitely a winner. After aggregating thousands of job descriptions for analytics careers, we identified SQL as the #2 most employer sought-after skill behind Tableau (which UC Davis also covers, btw). Enter UC Davis's Learn SQL Basics for Data Science Specialization.

This specialization runs the gamut of concepts, from the basics of SQL syntax all the way to more theoretical concepts like SQL in Machine Learning and AB testing. If you're interested in learning more about the general field of analytics first, check out our pieces on data scientists and product analysts to get a better handle on the responsibilities. Either way, SQL is a very applicable skill for many different tech careers.

The courses also cover topics such as the importance of cleaning data, how to optimize queries for performance, and how to use Databricks to enhance a company's data science program. The program finishes with a hands-on capstone where students can apply their newfound skills to complete a comprehensive project with several custom datasets and metrics. The four courses to complete the specialization are:

  • SQL for Data Science
  • Data Wrangling, Analysis and AB Testing with SQL
  • Distributed Computing with Spark SQL
  • SQL for Data Science Capstone

The material is taught by real (and impressive, might we add) industry professionals, including two data scientists at Databricks and Sadie St. Lawrence, the founder of Women in Data, who is an even kinder soul in person! If you only have time for one course and want to be extra dangerous this year (maybe you're a product manager looking to influence others with your data skills), we especially loved the very first course, which focuses on SQL fundamentals. We loved it because it was super practical, and someone who completes it could walk away able to analyze a database with the most popular queries. The curriculum is meant to be consumed over a 16-ish week period, but it can also be completed at your own pace. From the reviews, we think you could finish in as little as 6-8 weeks if you really push yourself.

Best for: Beginners, SQL Newbs and Aspiring Analytics Professionals

This course has some great material, but it is definitely aimed at an audience with no prior knowledge of data science or SQL. If that's you, great! We definitely recommend weighing this course as a way to broaden your expertise of the role and its responsibilities. If you have experience with data or web analytics tools, even in another program like Tableau, this specialization may be taught at a pace that's a little slow for you. If that's you, we've compiled some supplemental resources for your learning that progress at a more engaging pace. After completing the specialization, you'll be equipped to apply for entry-level analyst positions, and have a leg up at companies that use Tableau (which is a lot of them!).

Weekly Breakdowns

We’ve recapped the learning objectives from each week to set your expectations for course material. The great part about this program is that you can jump to any course, and any section if it’s interesting to you. For example, if you’re looking to just learn the rudimentary syntactical commands of SQL, check out course 1, week 2.

To audit an individual week, find the exact course (we've linked them individually here) and click "audit" to save it to your profile. Then open the desired week on the side panel that aligns with our recaps.

Course 1: SQL for Data Science

Learning Objectives from Week 1: Getting Started and Selecting & Retrieving Data with SQL

  • Learn the difference between SQL for data science applications and SQL for common data management.
  • Utilize an Entity Relationship diagram to show the relationships and inter-dependencies of data elements to answer business questions.
  • Practice retrieving relevant columns of data from a table using SQL queries.
  • Review basics of SQL and practice adding comments in queries for collaborators to understand.
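
The last two objectives above (retrieving only the relevant columns and commenting queries for collaborators) can be sketched roughly as follows. This is a minimal illustration, assuming Python's built-in sqlite3 module as a stand-in for the course's browser-based SQLite environment; the `products` table, its columns, and its rows are hypothetical and not taken from the course.

```python
import sqlite3

# Hypothetical table and rows, invented purely for illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE products (prod_id INTEGER PRIMARY KEY, prod_name TEXT, prod_price REAL)"
)
conn.executemany(
    "INSERT INTO products VALUES (?, ?, ?)",
    [(1, "Shampoo", 4.50), (2, "Soap", 1.25), (3, "Toothpaste", 2.99)],
)

query = """
-- Single-line comment: select only the columns the analysis needs
SELECT prod_name, prod_price
FROM products
ORDER BY prod_id
/* Block comment: LIMIT works in SQLite and MySQL;
   row-limiting syntax varies between database systems */
LIMIT 2;
"""

rows = conn.execute(query).fetchall()
print(rows)  # [('Shampoo', 4.5), ('Soap', 1.25)]
```

Naming the columns instead of writing `SELECT *` keeps the result small and makes the intent of the query visible to whoever reads it next.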

Learning Objectives from Week 2: Filtering, Sorting and Calculating Data with SQL

  • Showcase the difference between working with filtered and unfiltered data sets, including performance metrics.
  • Get familiar with SQL syntax, including basic filtering clauses and operators (WHERE, IN, NOT, AND, OR), aggregate functions (AVG, COUNT, MAX, MIN, SUM), sorting (ORDER BY) and grouping clauses (GROUP BY and HAVING).
  • Learn how to utilize wildcards in data filtering and searching situations.
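
As a rough sketch of the syntax listed in these objectives, the example below combines filtering, a wildcard search, and grouped aggregates. It assumes Python's built-in sqlite3 module; the `orders` table and its contents are hypothetical, made up only to keep the queries small enough to check by hand.

```python
import sqlite3

# Hypothetical orders table, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer TEXT, category TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?, ?)",
    [
        (1, "Ann", "Books", 12.0),
        (2, "Ann", "Games", 30.0),
        (3, "Bob", "Books", 8.0),
        (4, "Bob", "Garden", 50.0),
        (5, "Cara", "Games", 20.0),
        (6, "Cara", "Books", 15.0),
    ],
)

# Filtering with WHERE, IN and AND, then sorting with ORDER BY.
filtered = conn.execute(
    """
    SELECT order_id, amount
    FROM orders
    WHERE category IN ('Books', 'Games') AND amount > 10
    ORDER BY amount DESC
    """
).fetchall()
print(filtered)  # [(2, 30.0), (5, 20.0), (6, 15.0), (1, 12.0)]

# Wildcard search: % matches any run of characters after the leading 'G'.
g_categories = conn.execute(
    "SELECT DISTINCT category FROM orders WHERE category LIKE 'G%' ORDER BY category"
).fetchall()
print(g_categories)  # [('Games',), ('Garden',)]

# Aggregates per customer; HAVING filters groups after aggregation.
big_spenders = conn.execute(
    """
    SELECT customer, COUNT(*) AS n_orders, SUM(amount) AS total, AVG(amount) AS average
    FROM orders
    GROUP BY customer
    HAVING SUM(amount) > 40
    ORDER BY customer
    """
).fetchall()
print(big_spenders)  # [('Ann', 2, 42.0, 21.0), ('Bob', 2, 58.0, 29.0)]
```

The distinction worth noticing is that WHERE filters individual rows before grouping, while HAVING filters whole groups after the aggregates have been computed.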

Learning Objectives from Week 3: Subqueries and Joins in SQL

  • Use subqueries to connect multiple tables and retrieve data.
  • Practice filtering a dataset using set theory by joining tables using Natural, Inner, Outer, and Self Joins.
  • Compare the pros and cons of Cross (Cartesian) Joins.
  • Learn how to create an analysis table from multiple queries using the UNION operator.
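
The Week 3 topics can be sketched side by side on one small dataset. This is an illustrative example only, assuming Python's built-in sqlite3 module; the `customers` and `orders` tables and their rows are hypothetical.

```python
import sqlite3

# Hypothetical tables, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL)"
)
conn.executemany("INSERT INTO customers VALUES (?, ?)", [(1, "Ann"), (2, "Bob"), (3, "Cara")])
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)", [(1, 1, 12.0), (2, 1, 30.0), (3, 2, 8.0)]
)

# Subquery: customers with at least one order above 10.
subquery = conn.execute(
    """
    SELECT name FROM customers
    WHERE customer_id IN (SELECT customer_id FROM orders WHERE amount > 10)
    ORDER BY name
    """
).fetchall()
print(subquery)  # [('Ann',)]

# Inner join: only customers that have matching orders appear.
inner = conn.execute(
    """
    SELECT c.name, o.amount
    FROM customers AS c
    INNER JOIN orders AS o ON o.customer_id = c.customer_id
    ORDER BY o.order_id
    """
).fetchall()
print(inner)  # [('Ann', 12.0), ('Ann', 30.0), ('Bob', 8.0)]

# Left outer join: every customer is kept, even with zero orders.
left = conn.execute(
    """
    SELECT c.name, COUNT(o.order_id) AS n_orders
    FROM customers AS c
    LEFT JOIN orders AS o ON o.customer_id = c.customer_id
    GROUP BY c.customer_id, c.name
    ORDER BY c.name
    """
).fetchall()
print(left)  # [('Ann', 2), ('Bob', 1), ('Cara', 0)]

# Cross (Cartesian) join: every customer paired with every order.
cross_count = conn.execute("SELECT COUNT(*) FROM customers CROSS JOIN orders").fetchone()[0]
print(cross_count)  # 9

# UNION: stacks two result sets and removes duplicate rows.
union = conn.execute(
    """
    SELECT name FROM customers WHERE customer_id = 1
    UNION
    SELECT name FROM customers WHERE customer_id IN (1, 2)
    ORDER BY name
    """
).fetchall()
print(union)  # [('Ann',), ('Bob',)]
```

The cross join count (3 customers × 3 orders = 9 rows) shows why Cartesian joins get expensive quickly: the result grows with the product of the table sizes.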

Learning Objectives from Week 4: Modifying and Analyzing Data with SQL

  • Integrate data from different sources by using strings, dates, and numeric data.
  • Identify circumstances that need a join when preparing data for analysis, including organizational, governance, business, and data considerations.
  • Practice implementing the 3 rules for translating an analysis question into a SQL statement: (1) identify the columns needed for the analysis, (2) specify the conditions for filtering the data, and (3) define the desired level of aggregation.
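
The three rules in the last objective map fairly directly onto the parts of a query. As a sketch, take the hypothetical question "What was the average order amount per category in 2023?" The example assumes Python's built-in sqlite3 module; the `sales` table, its columns, and its rows are invented for illustration.

```python
import sqlite3

# Hypothetical sales table, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (category TEXT, amount REAL, order_date TEXT)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [
        ("Books", 10.0, "2023-01-05"),
        ("Books", 20.0, "2023-03-01"),
        ("Games", 30.0, "2022-12-31"),  # outside 2023, so filtered out
        ("Games", 40.0, "2023-06-15"),
    ],
)

answer = conn.execute(
    """
    -- Rule 1: identify the columns needed (category, amount)
    SELECT category, AVG(amount) AS avg_amount
    FROM sales
    -- Rule 2: specify the filter conditions (orders dated in 2023)
    WHERE order_date >= '2023-01-01' AND order_date < '2024-01-01'
    -- Rule 3: define the level of aggregation (one row per category)
    GROUP BY category
    ORDER BY category
    """
).fetchall()
print(answer)  # [('Books', 15.0), ('Games', 40.0)]
```

Storing dates as ISO-8601 text (`YYYY-MM-DD`) is what makes the plain string comparison in the WHERE clause behave like a date range here.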

Course 2: Data Wrangling, Analysis and AB Testing with SQL

Learning Objectives from Week 1: Data of Unknown Quality

  • Practice identifying trustworthy and unreliable data points.
  • Troubleshoot why some data might be missing.
  • Learn how to answer more ambiguous questions by defining new metrics to measure.

Learning Objectives from Week 2: Creating Clean Datasets

  • Practice naming categories of data types and using tools to create trustworthy tables.
  • Show how unfiltered data can be worked into a table, and learn why a data warehouse is different than a production database.

Learning Objectives from Week 3: SQL Problem Solving

  • Practice mapping out joins and identifying the level of detail needed to answer different types of questions.
  • Practice creating plans to answer all questions with a data model.

Learning Objectives from Week 4: AB Testing Case Study

  • Learn how to use SQL with an AB testing calculator tool.
  • Practice checking data quality and identifying key metrics that are tied to business value.
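
To give a feel for what the SQL side of an AB test might look like, here is a minimal sketch that counts users and conversions per test group, which are the kind of inputs an AB testing calculator typically asks for. It assumes Python's built-in sqlite3 module; the `assignments` and `orders` tables, the definition of "converted" (placed at least one order), and all rows are hypothetical and not taken from the course's case study.

```python
import sqlite3

# Hypothetical test assignments and orders, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE assignments (user_id INTEGER PRIMARY KEY, test_group TEXT)")
conn.execute("CREATE TABLE orders (order_id INTEGER PRIMARY KEY, user_id INTEGER)")
conn.executemany(
    "INSERT INTO assignments VALUES (?, ?)",
    [(1, "control"), (2, "control"), (3, "control"), (4, "control"),
     (5, "treatment"), (6, "treatment"), (7, "treatment"), (8, "treatment")],
)
# User 1 ordered twice; the DISTINCT subquery below avoids double counting.
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 1), (2, 1), (3, 5), (4, 6)])

results = conn.execute(
    """
    SELECT a.test_group,
           COUNT(*) AS users,
           SUM(CASE WHEN o.user_id IS NOT NULL THEN 1 ELSE 0 END) AS converted,
           AVG(CASE WHEN o.user_id IS NOT NULL THEN 1.0 ELSE 0.0 END) AS conversion_rate
    FROM assignments AS a
    LEFT JOIN (SELECT DISTINCT user_id FROM orders) AS o
           ON o.user_id = a.user_id
    GROUP BY a.test_group
    ORDER BY a.test_group
    """
).fetchall()
print(results)  # [('control', 4, 1, 0.25), ('treatment', 4, 2, 0.5)]
```

The LEFT JOIN keeps users who never ordered, so they count toward the denominator; an INNER JOIN here would silently drop them and inflate the conversion rate, which is the kind of data quality check this week's objectives point to.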

Course 3: Distributed Computing with Spark SQL

Learning Objectives from Week 1: Introduction to Spark

  • Learn the basic data structure of Spark, also known as a DataFrame.
  • Practice using the collaborative Databricks workspace and writing SQL code that executes against a machine cluster.
  • Identify the core concepts of distributed computing, and when it's useful for an organization.

Learning Objectives from Week 2: Spark Core Concepts

  • Learn the core concepts of Spark and use Spark SQL.
  • Practice caching data and making configurations for increased performance.
  • Show how Spark UI can analyze performance and identify bottlenecks.

Learning Objectives from Week 3: Engineering Data Pipelines

  • Access and compare the tradeoffs between a variety of data formats.
  • Examine semi-structured JSON data.
  • Create and train an end-to-end pipeline that reads, transforms, and saves data.

Learning Objectives from Week 4: Data Lakes, Warehouses and Lakehouses

  • Identify characteristics of data lakes and warehouses, and highlight the advantages of a lakehouse architecture.
  • Demonstrate the value add of combining Delta Lake with Apache Spark.
  • Practice building your own lakehouse with Delta Lake.
  • Recap how to utilize Spark for data science & machine learning use cases.

Course 4: SQL for Data Science Capstone Project

Week 1 Milestone: Project Proposal and Data Selection/Preparation

  • Select your client, audience, datasets and create a project proposal for a data analysis.
  • Develop a working Entity Relationship Diagram (ERD).

Week 2 Milestone: Descriptive Stats & Understanding Your Data

  • Characterize, analyze and clean your dataset.
  • Prove or disprove your project hypothesis.

Week 3 Milestone: Beyond Descriptive Stats

  • Track relationships in your data by using advanced SQL techniques.

Week 4 Milestone: Present your Findings with a Data Story

  • Use best practices to identify your audience and nail your presentation.
  • Present your insights and recommendations to a group.

Cost and Auditing

The program is only $39/month, and comes with a LinkedIn certificate on behalf of the University of California, Davis. If you complete the curriculum on the proposed timeline, it should take about 4-5 months, though you could blitz through it on a break in far less. While that seems steep, compared to a degree or bootcamp this micro-certification is a steal!

If you have a learning budget, or are dedicated to upskilling your career with a data focus, we recommend paying for and completing the program to get the shareable certificate (GET RECEIPTS!). This will help make your LinkedIn more searchable to recruiters who may be looking for specific keywords and programs.

To audit the program and simply learn the material, this program is completely free! Thanks Coursera!

Student Reviews

This program has been around since 2020, and the concepts are fairly up to date and relevant. The capstone has a little under 200 reviews, which is to be expected for a predominantly online class. By Coursera standards, the course is incredibly popular and highly rated!

Some of our favorite positive review points:

  • I thought this course was great! Great introduction to Relational Databases and SQLite. Highly recommend for anyone new to SQL, Databases, or someone looking to get started with a data science career. - Joshua G.
  • The course starts with the definition of SQL and how it is different from other computer languages. This course also provides related reading resources, which helped me gain more insights into this field and come to know about good resources from where I can practice this newly acquired skill set. This course also introduced ER diagrams, necessary clauses, and operators, including WHERE, BETWEEN, IN, OR, NOT, LIKE, ORDER BY, and GROUP BY, subqueries, and joins with advantages and disadvantages. You will be able to use the wildcard function to search for more specific or parts of records, including their advantages and disadvantages and how best to use them. You will be able to discuss how to use basic math operators and aggregate functions like AVERAGE, COUNT, MAX, MIN, and others to begin analyzing our data. It also discussed how to modify strings by concatenating, trimming, changing the case, and using the substring function. Also discussed the date and time strings specifically. You will be able to use case statements and finish this module by discussing data governance and profiling. You will also be able to apply fundamental principles when using SQL for data science. You'll be able to use tips and tricks to apply SQL in a data science context. - Alpesh G.
  • Great high level overview for Spark beginners with focus on application. Course materials are reasonably up to date and well designed. Might be nice if there was a PySpark complement to this course but I understand that it's part of the SQL specialization. Would highly recommend. - David Y.

Aggregations of negative review points:

  • Material took way less than the given timeline (this feels like a good thing to us)
  • Course 3 tried to cover too much ground and had a lot of superficial Databricks content.
  • Capstone project is peer-reviewed and felt like an unnecessary addition to the specialization.

... and our favorite overall review:

Well-crafted course for a beginner. You don't need any prior knowledge of programming or other languages like C. The course pushes you to read about SQL from other sources as well. The quizzes are also designed well for enhanced learning. The final assignment was also interesting as it requires you to work like a data scientist, design your own problem and solve it. Final word: Go for this course if you're looking for a basic introduction to SQL. -Vishal G.

Supplemental Materials

For the fans of the data-storytelling piece: Udemy's Data Storytelling

This class has more than ten thousand reviews and averages a 4.5 star rating. Upon completion you will be able to use a 5-stage arc to tell memorable stories and engage your audience. It's tool agnostic, so if you're just looking for the story piece, this might be for you. Plus, it's only twenty bucks so your risk is low.

For the fans of data visualization: Mastering Data Visualization: Theory and Foundations

This class is extremely highly rated with a 4.7 and more than two thousand students. It focuses on how to present data clearly and effectively. It's a more high-level course and not based on a tool (take that how you will), but a great start for beginners who may be interested in the concept. The professor brags that once you take this class, you'll look at all charts in a new light!

For analysts who just need Tableau Help: Tableau 2022 A-Z: Hands-On Tableau Training for Data Science

This course is extremely popular on Udemy with more than eighty thousand reviews and a bestseller tag. It's also extremely highly rated with a 4.6 star average. The material is more updated than UC Davis, and goes way deeper into the functionality of Tableau as a tool. We'd recommend this as a follow-on to the course after you've learned the basics, or if you're familiar with another analytics tool and looking for something with a quicker pace.

Best Data Science Specializations in 2023

This is specifically aimed at Coursera Data Analytics programs, so if you sign up for Coursera unlimited, theoretically you could stack all of these. We don't recommend that, but we do recommend checking these out and seeing if any of them hit particular concepts that are interesting to you.

For the web analytics power-user: Google's Data Analytics Specialization

Google also sponsors a data analytics certificate program through Coursera. This is one of the more coveted certificates in the industry for learning the Google Analytics tool specifically, hence our recommendation of this course from Duke. Google's course is also free to audit, but same rules apply if you want the certificate to show off at $49 a month.

Comparable mid-level program: University of Minnesota's Analytics for Decision Making

The University of Minnesota runs a great program with a 4.7 star average. It's free to audit, but if you want the certificate it's covered under a $49/month Coursera subscription. We especially love course 2 for the experimenters out there... you can never go wrong with data-driven optimization strategies. This is a newer beginner-level class that has a great overview of types of analytics, and when to use each method to maximize effectiveness.

For a comprehensive overview of analytics fields: Wharton's (UPenn) Business Analytics Specialization

Wharton is a prestigious business school and offers a great overview of different analytics fields, including marketing, ops and HR analytics. This is definitely a beginner-level specialization for people looking to identify their favorite concepts. Read our full writeup here.

More data science-y: University of Michigan's Applied Data Science Specialization

The University of Michigan also runs a great specialization that focuses on Python techniques for effective data science. The reviews said it was pretty tough, but it might be worth it if you're looking to expand your skillset into data science and are already enrolled in other Coursera options. Read our full writeup here.

UC Davis's program is a great way to sharpen your SQL skillset in a traditional online classroom format. A certification in SQL is a great way to show diligence and focus on analytics technology without breaking the bank with an additional college degree or bootcamp.

Here at Bridged we are huge fans of stacking micro-certifications to achieve desired career results. This program could be one notch in your arsenal to really kick your technical expertise into gear!

COMMENTS

  1. SQL for Data Data Science: Peer-review Assignment

    Explore and run machine learning code with Kaggle Notebooks | Using data from No attached data sources.

  2. hkim1c/Coursera_SQL_for_Data_Science

    Final project in "SQL for Data Science". ... Coursera: SQL for Data Science - Peer-graded Assignment. Hwanpyo Kim's Answer. ...

  3. SQL-for-Data-Science-Final-Assignment/README.md at main

    Contribute to SophiaQY/SQL-for-Data-Science-Final-Assignment development by creating an account on GitHub. ...

  4. How to create / integrate the Yelp Dataset in SQLite (Coursera SQL for

    I am doing the Coursera course SQL for Data Science. Logically I am a bloody beginner. For the final assignment you have to analyze the Yelp dataset. However, there is no description on how to obtain the dataset, how to transform it, how to integrate it etc. ... It is the last of the 4 sections in the Peer Review Assignment panel. Good luck ...

  5. GitHub

    Peer-graded Assignment: Data Scientist Role Play: Profiling and Analyzing the Yelp Dataset. This is a 2-part assignment. In the first part, you are asked a series of questions that will help you profile and understand the data just like a data scientist would. For this first part of the assignment, you will be assessed both on the correctness ...

  6. SQL Guide to Yelp Database (Final Assignment of "SQL for Data Science

    73. 1. This article will guide you on how to answer the final assignment* of SQL for Data Science, a course created by the University of California-Davis and offered by Coursera. I highly recommend the course if you want to have a certificate on and learn about SQL coding syntax. The final assignment accesses the Yelp dataset and provides a ...

  7. SQL for Data Science

    Data Models, Part 1: Thinking About Your Data • 5 minutes. Data Models, Part 2: The Evolution of Data Models • 3 minutes. Data Models, Part 3: Relational vs. Transactional Models • 5 minutes. Retrieving Data with a SELECT Statement • 4 minutes. Creating Tables • 7 minutes. Creating Temporary Tables • 4 minutes.

  8. SQL for Data Science Capstone Project

    There are 4 modules in this course. Data science is a dynamic and growing career field that demands knowledge and skills-based in SQL to be successful. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems. Whether you have successfully completed the other courses ...

  9. IBM SQL for data science course Peer-Graded Assignment

    hi I'm on SQL for data science course, I have a problem to connect to IBM database. ... could you explain me how did you connect with this example : %sql ibm_d... Stack Overflow. About; Products For Teams; ... IBM SQL for data science course Peer-Graded Assignment. Ask Question Asked 2 years, 3 months ago. Modified 1 year, ...

  10. SQL for Data Science on Coursera

    SQL for Data Science on Coursera. Created by UC Davis Continuing and Professional Education and hosted on the Coursera platform, this online course consists of pre-recorded video lectures, auto-graded and peer-reviewed assignments and community discussion forums. This course is self-paced and designed to teach you a new skill in as little as 4 ...

  11. SQL for Data Science Capstone Project

    A 4-week course by UC Davis, focusing on applying SQL skills to analyze data, solve business problems, and create a portfolio-worthy project. Includes peer-review feedback.

  12. SQL for Data Science

    This course is part of the Learn SQL Basics for Data Science Specialization. When you enroll in this course, you'll also be enrolled in this Specialization. Learn new concepts from industry experts. Gain a foundational understanding of a subject or tool. Develop job-relevant skills with hands-on projects.

  13. sql for data science with python peer-assignment

    Explore and run machine learning code with Kaggle Notebooks | Using data from No attached data sources.

  14. GitHub

    You will be assessed both on the correctness of your SQL queries and results. A Jupyter notebook is provided to help you complete this assignment. Follow the instructions to complete all the problems, then share the Queries and Results with your peers for review. There are 25 points possible for this assignment. Here is the breakdown:

  15. Databases and SQL for Data Science with Python

    Accessing Databases using Python. Module 4 • 4 hours to complete. In this module you will learn the basic concepts of using Python to connect to databases. In a Jupyter Notebook, you will create tables, load data, query data using SQL magic and SQLite python library. You will also learn how to analyze data using Python.

  16. SQL For Data Science Peer Graded Assignment and Review For Weak 4

    In This Video I show you SQL For Data Science Peer Graded Assignment and Review For Weak 4 SolutionsAssignment link :https://coursera-assessments.s3.amazonaw...

  17. Course review: SQL for Data Science on Coursera

    This is a course review of SQL for Data Science on Coursera, part 1 of 4 in a series of SQL courses by UC Davis University of California with Sadie St. Lawrence as course instructor. SQL for Data Science was an easy to follow and short SQL course that can be done in about 10 hours of work. The ideal audience is the complete beginner in SQL. The course is divided in four "weeks" and the ...

  18. GitHub

    Course 2: Tools for Data Science. Course 5: Python Project for Data Science. Course 6: Databases and SQL for Data Science with Python. Course 7: Data Analysis with Python. Course 8: Data Visualization with Python. Course 10: Applied Data Science Capstone

  19. SQL for Data Science with R

    The purpose of this course is to introduce relational database concepts and help you learn and apply foundational knowledge of the SQL and R languages. It is also intended to get you started with performing SQL access in a data science environment. The emphasis in this course is on hands-on and practical learning.

  20. Review: SQL for Data Science Specialization by UC Davis

    Program Overview. There are winners and losers in the realm of analytics, and SQL is definitely a winner. After aggregating thousands of job descriptions for analytics careers, we identified SQL as the #2 most employer sought-after skill behind Tableau (which UC Davis also covers, btw). Enter UC Davis's SQL Basics for Data Science Specialization.

  21. SQL for Data Science Capstone Project

    Whether you have successfully completed the other courses in the Learn SQL Basics for Data Science Specialization or are taking just this course, this project is your chance to apply the knowledge and skills you have acquired to practice important SQL querying and solve problems with data. ... you will receive feedback through the peer-review ...

  22. Coursera-SQL-for-Data-Science/README.md at main

    This is a 2-part assignment. In the first part, you are asked a series of questions that will help you profile and understand the data just like a data scientist would. For this first part of the assignment, you will be assessed both on the correctness of your findings, as well as the code you used to arrive at your answer.

  23. Learner Reviews & Feedback for SQL for Data Science Course

    1 - 25 of 4,174 Reviews for SQL for Data Science. By lkflrs. •. Apr 30, 2019. I come from a background of teaching for 18 years, and not in SQL. My biggest criticism for this course is the lack of preparation for the culminating activities. The course saves all the challenging work for the last 2% of the course.