Data Analytics Foundations for Accountancy II

Data Analytics Foundations for Accountancy II

Instructor: Robert J. Brunner

3,906 already enrolled

Included with Learn more

Ask Coursera

9 modules

Gain insight into a topic and learn the fundamentals.

11 reviews

Beginner level

No prior experience required

7 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

9 modules

Gain insight into a topic and learn the fundamentals.

11 reviews

Beginner level

No prior experience required

7 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

Skills you'll gain

Tools you'll learn

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

9 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

There are 9 modules in this course

Welcome to Data Analytics Foundations for Accountancy II! I'm excited to have you in the class and look forward to your contributions to the learning community.

To begin, I recommend taking a few minutes to explore the course site. Review the material we’ll cover each week, and preview the assignments you’ll need to complete to pass the course. Click Discussions to see forums where you can discuss the course material with fellow students taking the class. If you have questions about course content, please post them in the forums to get help from others in the course community. For technical problems with the Coursera platform, visit the Learner Help Center. Good luck as you get started, and I hope you enjoy the course!

You will become familiar with the course, your classmates, and our learning environment. The orientation will also help you obtain the technical skills required for the course.

What's included

3 videos5 readings1 assignment1 discussion prompt

3 videosTotal 9 minutes

Welcome to Data Analytics Foundations for Accountancy II5 minutes
Meet Professor Brunner4 minutes
Learn on Your Terms1 minute

5 readingsTotal 50 minutes

Syllabus10 minutes
About the Discussion Forums10 minutes
Online Education at Gies College of Business10 minutes
Updating Your Profile10 minutes
Social Media10 minutes

1 assignmentTotal 30 minutes

Orientation Quiz30 minutes

1 discussion promptTotal 10 minutes

Getting to Know Your Classmates10 minutes

This module provides the basis for the rest of the course by introducing the basic concepts behind machine learning, and, specifically, how to perform machine learning by using Python and the scikit learn machine learning module. First, you will learn how machine learning and artificial intelligence are disrupting businesses. Next, you will learn about the basic types of machine learning and how to leverage these algorithms in a Python script. Third, you will learn how linear regression can be considered a machine learning problem with parameters that must be determined computationally by minimizing a cost function. Finally, you will learn about neighbor-based algorithms, including the k-nearest neighbor algorithm, which can be used for both classification and regression tasks.

What's included

4 videos3 readings1 assignment1 programming assignment4 ungraded labs

4 videosTotal 47 minutes

Introduction to Module 16 minutes
Introduction to Machine Learning14 minutes
Introduction to Linear Regression15 minutes
Introduction to k-nn13 minutes

3 readingsTotal 30 minutes

Module 1 Overview10 minutes
Lesson 1-1 Readings10 minutes
Lesson 1-2 Readings10 minutes

1 assignmentTotal 20 minutes

Module 1 Graded Quiz20 minutes

1 programming assignmentTotal 180 minutes

Module 1 Programming Assignment180 minutes

4 ungraded labsTotal 240 minutes

Introduction to Machine Learning Notebook60 minutes
Introduction to Linear Regression Notebook60 minutes
Introduction to k-nn Notebook60 minutes
Module 1 Programming Assignment Notebook60 minutes

This module introduces several of the most important machine learning algorithms: logistic regression, decision trees, and support vector machine. Of these three algorithms, the first, logistic regression, is a classification algorithm (despite its name). The other two, however, can be used for either classification or regression tasks. Thus, this module will dive deeper into the concept of machine classification, where algorithms learn from existing, labeled data to classify new, unseen data into specific categories; and, the concept of machine regression, where algorithms learn a model from data to make predictions for new, unseen data. While these algorithms all differ in their mathematical underpinnings, they are often used for classifying numerical, text, and image data or performing regression in a variety of domains. This module will also review different techniques for quantifying the performance of a classification and regression algorithms and how to deal with imbalanced training data.

What's included

5 videos4 readings1 assignment1 programming assignment4 ungraded labs

5 videosTotal 52 minutes

Introduction to Module 26 minutes
Introduction to Fundamental Algorithms4 minutes
Introduction to Logistics Regression14 minutes
Introduction to Decision Trees15 minutes
Introduction to Support Vector Machine14 minutes

4 readingsTotal 40 minutes

Module 2 Overview10 minutes
Lesson 2-1 Readings10 minutes
Lesson 2-3 Readings10 minutes
Lesson 2-4 Readings10 minutes

1 assignmentTotal 30 minutes

Module 2 Graded Quiz30 minutes

1 programming assignmentTotal 180 minutes

Module 2 Programming Assignment180 minutes

4 ungraded labsTotal 240 minutes

Introduction to Logistic Regression Notebook60 minutes
Introduction to Decision Trees Notebook60 minutes
Introduction to Support Vector Machine Notebook60 minutes
Module 2 Programming Assignment Notebook60 minutes

This module introduces several important and practical concepts in machine learning. First, you will learn about the challenges inherent in applying data analytics (and machine learning in particular) to real world data sets. This also introduces several methodologies that you may encounter in the future that dictate how to approach, tackle, and deploy data analytic solutions. Next, you will learn about a powerful technique to combine the predictions from many weak learners to make a better prediction via a process known as ensemble learning. Specifically, this module will introduce two of the most popular ensemble learning techniques: bagging and boosting and demonstrate how to employ them in a Python data analytics script. Finally, the concept of a machine learning pipeline is introduced, which encapsulates the process of creating, deploying, and reusing machine learning models.

What's included

5 videos3 readings1 assignment1 programming assignment4 ungraded labs

5 videosTotal 40 minutes

Introduction to Module 34 minutes
Introduction to Modeling Success6 minutes
Introduction to Bagging11 minutes
Introduction to Boosting10 minutes
Introduction to ML Pipelines9 minutes

3 readingsTotal 30 minutes

Module 3 Overview10 minutes
Lesson 3-1 Readings10 minutes
Lesson 3-2 Readings10 minutes

1 assignmentTotal 30 minutes

Module 3 Graded Quiz30 minutes

1 programming assignmentTotal 180 minutes

Module 3 Programming Assignment180 minutes

4 ungraded labsTotal 240 minutes

Introduction to Bagging Notebook60 minutes
Introduction to Boosting Notebook60 minutes
Practical Concerns in Machine Learning60 minutes
Module 3 Programming Assignment Notebook60 minutes

This module introduces the concept of regularization, problems it can cause in machine learning analyses, and techniques to overcome it. First, the basic concept of overfitting is presented along with ways to identify its occurrence. Next, the technique of cross-validation is introduced, which can mitigate the likelihood that overfitting can occur. Next, the use of cross-validation to identify the optimal parameters for a machine learning algorithm trained on a given data set is presented. Finally, the concept of regularization, where an additional penalty term is applied when determining the best machine learning model parameters, is introduced and demonstrated for different regression and classification algorithms.

What's included

5 videos4 readings1 assignment1 programming assignment4 ungraded labs

5 videosTotal 48 minutes

Introduction to Module 44 minutes
Introduction to Overfitting5 minutes
Introduction to Cross-Validation14 minutes
Introduction to Model-Selection17 minutes
Introduction to Regularization9 minutes

4 readingsTotal 40 minutes

Module 4 Overview10 minutes
Lesson 4-1 Readings10 minutes
Lesson 4-2 Readings10 minutes
Lesson 4-3 Readings10 minutes

1 assignmentTotal 30 minutes

Module 4 Graded Quiz30 minutes

1 programming assignmentTotal 180 minutes

Module 4 Programming Assignment180 minutes

4 ungraded labsTotal 240 minutes

Introduction to Cross-Validation Notebook60 minutes
Introduction to Model-Selection Notebook60 minutes
Introduction to Regularization Notebook60 minutes
Module 4 Programming Assignment Notebook60 minutes

This module starts by discussing practical machine learning workflows that are deployed in production environments, which emphasizes the big picture view of machine learning. Next this module introduces two additional fundamental algorithms: naive Bayes and Gaussian Processes. These algorithms both have foundations in probability theory but operate under very different assumptions. Naive Bayes is generally used for classification tasks, while Gaussian Processes are generally used for regression tasks. This module also discusses practical issues in constructing machine learning workflows.

What's included

4 videos4 readings1 assignment1 programming assignment3 ungraded labs

4 videosTotal 22 minutes

Introduction to Module 54 minutes
Introduction to Practical Machine Learning3 minutes
Introduction to Naive Bayes5 minutes
Introduction to Gaussian Processes11 minutes

4 readingsTotal 40 minutes

Module 5 Overview10 minutes
Lesson 5-1 Readings10 minutes
Lesson 5-2 Readings10 minutes
Lesson 5-3 Readings10 minutes

1 assignmentTotal 30 minutes

Module 5 Graded Quiz30 minutes

1 programming assignmentTotal 180 minutes

Module 5 Programming Assignment180 minutes

3 ungraded labsTotal 180 minutes

Introduction to Naive Bayes Notebook60 minutes
Introduction to Gaussian Processes Notebook60 minutes
Module 5 Programming Assignment Notebook60 minutes

This module introduces an important concept in machine learning, the selection of the actual features that will be used by a machine learning algorithm. Along with data cleaning, this step in the data analytics process is extremely important, yet it is often overlooked as a method for improving the overall performance of an analysis. This module beings with a discussion of ethics in machine learning, in large part because the selection of features can have (sometimes) non-obvious impacts on the final performance of an algorithm. This can be important when machine learning is applied to data in a regulated industry or when the improper application of an algorithm might lead to discrimination. The rest of this module introduces different techniques for either selecting the best features in a data set, or the construction of new features from the existing set of features.

What's included

5 videos4 readings1 assignment1 programming assignment4 ungraded labs

5 videosTotal 40 minutes

Introduction to Module 65 minutes
Practical Concerns with Machine Learning6 minutes
Introduction to Feature Selection8 minutes
Introduction to Dimension Reduction12 minutes
Introduction to Manifold Learning9 minutes

4 readingsTotal 40 minutes

Module 6 Overview10 minutes
Lesson 6-1 Readings10 minutes
Lesson 6-3 Readings10 minutes
Lesson 6-4 Readings10 minutes

1 assignmentTotal 30 minutes

Module 6 Graded Quiz30 minutes

1 programming assignmentTotal 180 minutes

Module 6 Programming Assignment180 minutes

4 ungraded labsTotal 240 minutes

Introduction to Feature Selection Notebook60 minutes
Introduction to Dimension Reduction Notebook60 minutes
Introduction to Manifold Learning Notebook60 minutes
Module 6 Programming Assignment Notebook60 minutes

This module introduces clustering, where data points are assigned to larger groups of points based on some specific property, such as spatial distance or the local density of points. While humans often find clusters visually with ease in given data sets, computationally the problem is more challenging. This module starts by exploring the basic ideas behind this unsupervised learning technique, as well as different areas in which clustering can be used by businesses. Next, one of the most popular clustering techniques, K-means, is introduced. Next the density-based DB-SCAN technique is introduced. This module concludes by introducing the mixture models technique for probabilistically assigning points to clusters.

What's included

5 videos5 readings1 assignment1 programming assignment4 ungraded labs

5 videosTotal 38 minutes

Introduction to Module 75 minutes
Introduction to Clustering4 minutes
Introduction to Spatial Clustering12 minutes
Introduction to Density-Based Clustering9 minutes
Introduction to Mixture Models8 minutes

5 readingsTotal 50 minutes

Module 7 Overview10 minutes
Lesson 7-1 Readings10 minutes
Lesson 7-2 Readings10 minutes
Lesson 7-3 Readings10 minutes
Lesson 7-4 Readings10 minutes

1 assignmentTotal 30 minutes

Module 7 Graded Quiz30 minutes

1 programming assignmentTotal 180 minutes

Module 7 Programming Assignment180 minutes

4 ungraded labsTotal 240 minutes

Introduction to Spatial Clustering Notebook60 minutes
Introduction to Density-Based Clustering Notebook60 minutes
Introduction to Mixture Models Notebook60 minutes
Module 7 Programming Assignment Notebook60 minutes

This module introduces the concept of an anomaly, or outlier, and different techniques for identifying these unusual data points. First, the general concept of an anomaly is discussed and demonstrated in the business community via the detection of fraud, which in general should be an anomaly when compared to normal customers or operations. Next, statistical techniques for identifying outliers are introduced, which often involve simple descriptive statistics that can highlight data that are sufficiently far from the norm for a given data set. Finally, machine learning techniques are reviewed that can either classify outliers or identify points in low density (or outside normal clusters) areas as potential outliers.

What's included

4 videos4 readings1 assignment1 programming assignment3 ungraded labs1 plugin

4 videosTotal 20 minutes

Introduction to Module 84 minutes
Introduction to Anomaly Detection4 minutes
Statistical Anomaly Detection6 minutes
Machine Learning and Anomaly Detection6 minutes

4 readingsTotal 40 minutes

Module 8 Overview10 minutes
Lesson 8-1 Readings10 minutes
Congratulations on completing the course!10 minutes
Get Your Course Certificate10 minutes

1 assignmentTotal 30 minutes

Module 8 Graded Quiz30 minutes

1 programming assignmentTotal 180 minutes

Module 8 Programming Assignment180 minutes

3 ungraded labsTotal 180 minutes

Statistical Anomaly Detection Notebook60 minutes
Machine Learning and Anomaly Detection Notebook60 minutes
Module 8 Programming Assignment Notebook60 minutes

1 pluginTotal 15 minutes

How was the course15 minutes

Instructor

Robert J. Brunner

University of Illinois Urbana-Champaign

8 Courses37,422 learners

Offered by

University of Illinois Urbana-Champaign

Explore more from Business Essentials

Status: Preview
Association of International Certified Professional Accountants
Introduction to Data Analytics for Accounting Professionals
Course
Status: Free Trial
University of Illinois Urbana-Champaign
Introduction to Accounting Data Analytics and Visualization
Course
Status: Free Trial
University of Illinois Urbana-Champaign
Applying Data Analytics in Accounting
Course
Status: Free Trial
University of Pennsylvania
Accounting Analytics
Course

Why people choose Coursera for their career

Felipe M.

Learner since 2018

"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."

Jennifer J.

Learner since 2020

"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."

Larry W.

Learner since 2021

"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

5 stars
81.81%
4 stars
0%
3 stars
9.09%
2 stars
0%
1 star
9.09%

Showing 3 of 11

Reviewed on Jun 22, 2019

I like this course. Because it is very useful to accounting and auditing .

View more reviews

Unlock access to 10,000+ courses with a subscription
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Join over 4,700 global companies that choose Coursera for Business

Frequently asked questions

To access course materials, assignments, and earn a Certificate, you'll need to purchase the Certificate experience when you enroll in a course. Eligible learners may also have the option to start with a Free Trial. Some courses may also offer a Full Course, No Certificate option. This lets you access course materials, submit required assessments, and receive a final grade, but you won't be able to earn or purchase a Certificate.

When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.