The Predictive Analytics and Forecasting course is designed for advanced learners aiming to develop practical skills in analyzing data to make forward-looking business decisions. As organizations increasingly rely on data-driven strategies, this course equips future managers with the ability to understand and apply predictive analytics tools for improved decision-making. Learners will explore key concepts in data mining such as regression, classification, clustering, and forecasting, with a strong focus on real-world business applications.

Predictive Analytics and Forecasting

Predictive Analytics and Forecasting

Instructor: Krishan Kumar Pandey
Access provided by Yenepoya University
What you'll learn
Learn predictive analytics and data mining to uncover business insights.
Apply models to real-world challenges and enhance decision-making.
Skills you'll gain
Tools you'll learn
Details to know

Add to your LinkedIn profile
52 assignments
See how employees at top companies are mastering in-demand skills

There are 17 modules in this course
Welcome to the Predictive Analytics and Forecasting course! Predictive analytics is about using statistical data mining to analyze current and historical facts to make predictions about future events. As the business world rapidly progresses toward a paradigm of data-driven decision-making, the primary goal of this course is to understand both the power and limitations of some of the predictive analysis tools. This course will provide an overview of predictive analysis tools of data mining and their uses with the volume of data and business cases. The course is designed to allow future managers to communicate effectively with the data science team within an organization. The course further acquaints you with how to understand customer behavior and motivations, customers’ need, market segmentation, retailing, and business forecasting with the power of predictive data mining tools. Finally, the course will demonstrate a handful set of predictive analytics and data mining tools that can help young managers to make data-driven decisions in today’s business scenario. This is an advanced course intended for learners with a background in data analysis and interpretation. The knowledge you gain from this course will help you pursue analytics careers in any industry. To succeed in this course, you should have prior experience in or a basic understanding of regression, correlation, data visualization, and interpretation of statistical results. In this module, you will learn about various terminologies of data mining, such as predictive analytics, prescriptive analytics, data science, and business intelligence. Before starting with the core analytics, you should be first clear about the steps of data mining and how to pre-process your data before going for actual data analytics. This module also introduces you to various steps of data mining and data processing. After completing this module, you will be thorough with the preliminary steps of predictive analytics.
What's included
5 videos5 readings4 assignments1 discussion prompt
5 videos• Total 35 minutes
- Course Intro video• 2 minutes
- Data Mining and Predictive Analytics• 6 minutes
- Tools of Predictive Analytics • 7 minutes
- Fallacies and Steps in Data Mining • 10 minutes
- Preprocessing the Data • 11 minutes
5 readings• Total 250 minutes
- Course Overview• 10 minutes
- Recommended Reading: Introduction to Data Mining and Predictive Analytics • 60 minutes
- Recommended Reading: Overview of Tools of Predictive Analytics• 60 minutes
- Recommended Reading: Fallacies and Steps in Data Mining• 60 minutes
- Recommended Reading: Preprocessing the Data • 60 minutes
4 assignments• Total 18 minutes
- Introduction to Data Mining and Predictive Analytics• 6 minutes
- Overview of Tools of Predictive Analytics• 3 minutes
- Fallacies and Steps in Data Mining• 3 minutes
- Preprocessing the Data • 6 minutes
1 discussion prompt• Total 30 minutes
- Basics of Data Mining• 30 minutes
In this module, you will be able to develop your base for advanced predictive analytics through basic tools like correlation and regression. This module helps you differentiate between these two terms and acquaints you with how correlation measures the degree of association between two variables, whereas regression tells us about the functional relationship among the variables. In this module, you will be learning how to compute correlation coefficient, simple linear regression, and multiple linear regression for a given data set with the help of statistical software for social sciences. Finally, this module will cover the basic assumptions of multiple linear regression and help you test the significance of the correlation coefficient.
What's included
5 videos4 readings4 assignments
5 videos• Total 35 minutes
- Correlation and Its Significance • 4 minutes
- Zero-Order, Part, and Partial Correlation• 7 minutes
- Simple Linear Regression• 7 minutes
- Multiple Linear Regression: Part 1• 6 minutes
- Multiple Linear Regression: Part 2• 12 minutes
4 readings• Total 240 minutes
- Recommended Reading: Correlation and Its Significance • 60 minutes
- Recommended Reading: Zero-Order, Part, and Partial Correlation• 60 minutes
- Recommended Reading: Simple Linear Regression• 60 minutes
- Recommended Reading: Multiple Linear Regression• 60 minutes
4 assignments• Total 15 minutes
- Correlation and Its Significance • 3 minutes
- Zero-Order, Part, and Partial Correlation• 3 minutes
- Simple Linear Regression• 6 minutes
- Multiple Linear Regression• 3 minutes
In this module, you will be introduced to naïve Bayes classification. One of the most common predictive analytics models is the classification model. This module also introduces you to how these models work by categorizing information based on historical data. This module will help you understand how classification predicts the categorical class (or discrete values), whereas regression and other models predict continuous valued functions.
What's included
4 videos4 readings4 assignments
4 videos• Total 30 minutes
- Naïve Bayes Classification: Method Discussion• 5 minutes
- Manual Calculation of Naïve Bayes Classification Method• 10 minutes
- How to Run Naïve Bayes Classification in RStudio?• 12 minutes
- Benefits and Limitations of Naïve Bayes Classification• 3 minutes
4 readings• Total 240 minutes
- Recommended Reading: Introduction to Classification• 60 minutes
- Recommended Reading: Naïve Bayes Classification Working • 60 minutes
- Recommended Reading: Naïve Bayes Classification in RStudio• 60 minutes
- Recommended Reading: Naïve Bayes Classification: Advantages and Disadvantages• 60 minutes
4 assignments• Total 15 minutes
- Introduction to Classification• 3 minutes
- Naïve Bayes Classification Working• 6 minutes
- Naïve Bayes Classification in RStudio• 3 minutes
- Naïve Bayes Classification: Advantages and Disadvantages• 3 minutes
In this module, you will be continuing with classification modeling. This module will introduce you to the k nearest neighbors. This module will help you apply the k nearest neighbors method to business problems. This module will further explain the working of k nearest neighbors. After going through this module, you will be able to run k nearest neighbors in RStudio.
What's included
4 videos4 readings4 assignments1 discussion prompt
4 videos• Total 34 minutes
- Determining Neighbors and Classification Rule • 6 minutes
- Manual Classification of Sports Choice Example• 6 minutes
- Riding Mowers Example Classification in RStudio • 13 minutes
- Determining Value of k, Advantages, and Disadvantages of kNN Method• 10 minutes
4 readings• Total 240 minutes
- Recommended Reading: k Nearest Neighbors: Concept and Working• 60 minutes
- Recommended Reading: k Nearest Neighbors: Manual Computation• 60 minutes
- Recommended Reading: k Nearest Neighbors: Implementation in RStudio• 60 minutes
- Recommended Reading: k Nearest Neighbors: Determining Value of k• 60 minutes
4 assignments• Total 12 minutes
- k Nearest Neighbors: Concept and Working• 3 minutes
- k Nearest Neighbors: Manual Computation• 3 minutes
- k Nearest Neighbors: Implementation in RStudio• 3 minutes
- k Nearest Neighbors: Determining Value of k• 3 minutes
1 discussion prompt• Total 30 minutes
- Advantages and Disadvantages of Using a Small Value vs. a Large Value of k• 30 minutes
This assessment is a graded quiz based on the modules covered in this week.
What's included
1 assignment
1 assignment• Total 40 minutes
- Graded Quiz: Naïve Bayes and k Nearest Neighbors Classification Methods• 40 minutes
In this module, you will learn about logistic regression. When you are interested in predicting the likelihood of an event, the most widely used classification method is logistic regression. When the classification problem at hand is binary, true or false, and yes or no, then you use logistic regression-based classification.
What's included
4 videos4 readings4 assignments
4 videos• Total 36 minutes
- Concept of Odd Ratio and Probability• 5 minutes
- Attrition Example in RStudio, Concept of Null Deviance, and Residual Deviance• 14 minutes
- Attrition Example in Logistic Regression and Model Fit Verification• 8 minutes
- Logistics Regression: Model Validation in RStudio, Advantages, and Disadvantages• 10 minutes
4 readings• Total 240 minutes
- Recommended Reading: Logistics Regression: Method Discussion• 60 minutes
- Recommended Reading: Logistics Regression: Computation• 60 minutes
- Recommended Reading: Logistics Regression: Output Interpretation • 60 minutes
- Recommended Reading: Logistics Regression: Model Validation• 60 minutes
4 assignments• Total 15 minutes
- Logistics Regression: Method Discussion• 3 minutes
- Logistics Regression: Computation• 6 minutes
- Logistics Regression: Output Interpretation• 3 minutes
- Logistics Regression: Model Validation• 3 minutes
In this module, you will learn about discriminant analysis. When you know the groups a priori, the classification method used is discriminant analysis. This module will help you run discriminant analysis binomial and multinomial categorical variables.
What's included
4 videos4 readings4 assignments1 discussion prompt
4 videos• Total 37 minutes
- Concept of Discriminant Analysis• 6 minutes
- Panel Plot, Stacked Histogram, and Partition Plot• 15 minutes
- Multiple Category Categorical Variable Based DA in RStudio• 11 minutes
- Discriminant Analysis: Advantages and Disadvantages• 4 minutes
4 readings• Total 240 minutes
- Recommended Reading: Discriminant Analysis: Concept• 60 minutes
- Recommended Reading: Discriminant Analysis: Two Category Categorical Variable Implementation in RStudio• 60 minutes
- Recommended Reading: Discriminant Analysis: Multiple Category Categorical Variable Implementation in RStudio• 60 minutes
- Recommended Reading: Discriminant Analysis: Benefits and Limitations• 60 minutes
4 assignments• Total 12 minutes
- Discriminant Analysis• 3 minutes
- Discriminant Analysis: Two Category Categorical Variable Implementation in RStudio• 3 minutes
- Discriminant Analysis: Multiple Category Categorical Variable Implementation in RStudio• 3 minutes
- Discriminant Analysis: Benefits and Limitations• 3 minutes
1 discussion prompt• Total 30 minutes
- Supervised vs. Unsupervised Learning• 30 minutes
This assessment is a graded quiz based on the modules covered in this week.
What's included
1 assignment
1 assignment• Total 40 minutes
- Graded Quiz: Logistic Regression and Discriminant Analysis• 40 minutes
In this module, you will learn about decision trees. When there is non-linear data in hand for classification, the classification method that is used preferably is the decision tree. Their most important feature is the capability of capturing descriptive decision-making knowledge from the supplied data. This module will make you familiar with the concept of information gain and entropy. This module will further help you create the decision tree for business problems.
What's included
4 videos4 readings4 assignments
4 videos• Total 35 minutes
- Decision Tree: Recursive Partitioning, Information Gain, and Entropy• 8 minutes
- Manual Illustration on Decision Tree• 12 minutes
- Concept of Overfitting and Underfitting• 12 minutes
- Decision Tree: Bias and Variance – Advantages and Disadvantages• 3 minutes
4 readings• Total 240 minutes
- Recommended Reading: Decision Tree: Concept• 60 minutes
- Recommended Reading: Decision Tree: Manual Illustration• 60 minutes
- Recommended Reading: Decision Tree: Illustration in RStudio• 60 minutes
- Recommended Reading: Decision Tree: Bias and Variance• 60 minutes
4 assignments• Total 12 minutes
- Decision Tree: Concept• 3 minutes
- Decision Tree: Manual Illustration• 3 minutes
- Decision Tree: Illustration in RStudio• 3 minutes
- Decision Tree: Bias and Variance• 3 minutes
In this module, you will learn about neural networks. This module gives you an insight into how you can use a neural network when you have so much data with you (and computational power, of course), and accuracy matters the most to you. If it comes to predictive accuracy, then neural network–based classification models are the ones that are preferred.
What's included
4 videos4 readings4 assignments
4 videos• Total 42 minutes
- Type of Input and Output Requirement to Run NN• 8 minutes
- Sigmoid Activation Function and Manual Illustration• 19 minutes
- Neural Network: Illustration in RStudio• 10 minutes
- Neural Network: Termination Criteria, Advantages, and Disadvantages• 5 minutes
4 readings• Total 240 minutes
- Recommended Reading: Neural Network: Concept• 60 minutes
- Recommended Reading: Neural Network: Activation Function• 60 minutes
- Recommended Reading: Neural Network: Illustration in RStudio• 60 minutes
- Recommended Reading: Neural Network: Termination Criteria• 60 minutes
4 assignments• Total 12 minutes
- Neural Network: Concept• 3 minutes
- Neural Network: Activation Function• 3 minutes
- Neural Network: Illustration in RStudio• 3 minutes
- Neural Network: Termination Criteria• 3 minutes
In this module, you will learn about the important steps of dimension reduction. In data mining, one often encounters situations where there are a large number of variables in the database. Even when the initial number of variables is small, this set quickly expands in the data preparation step, where new derived variables are created, for instance, dummies for categorical variables and new forms of existing variables. In such situations, it is likely that subsets of variables are highly correlated with each other. Including highly correlated variables in a classification or prediction model or including variables that are unrelated to the outcome of interest can lead to overfitting, and accuracy and reliability can suffer.
What's included
4 videos4 readings4 assignments
4 videos• Total 30 minutes
- Meaning and Uses of EFA and CFA • 6 minutes
- Rules and Various Terminology Used in EFA • 10 minutes
- Running Factor Analysis on SPSS: Process and Result Interpretation – Part 1• 8 minutes
- Running Factor Analysis on SPSS: Process and Result Interpretation – Part 2 • 7 minutes
4 readings• Total 240 minutes
- Recommended Reading: Exploratory and Confirmatory Factor Analysis • 60 minutes
- Recommended Reading: Neural Network: Concept and Terminology of EFA• 60 minutes
- Recommended Reading: Exploratory Factor Analysis Computation and Inference – Part 1• 60 minutes
- Recommended Reading: Exploratory Factor Analysis Computation and Inference – Part 2• 60 minutes
4 assignments• Total 15 minutes
- Exploratory and Confirmatory Factor Analysis• 3 minutes
- Concept and Terminology of EFA• 6 minutes
- Exploratory Factor Analysis Computation and Inference – Part 1• 3 minutes
- Exploratory Factor Analysis Computation and Inference – Part 2• 3 minutes
In this module, you will learn how clustering refers to the grouping of records, observations, or cases into classes of similar objects. You will get insights into how a cluster is a collection of records that are similar to one another and dissimilar to records in other clusters. In this module, you will be able to understand distance measures and how different types of distance measures are used in clustering. You will also be introduced to the quality and an optimal number of clusters, and the various types of clustering methods, such as hierarchical clustering, single-linkage clustering, and complete-linkage clustering. Finally, you will learn about dendrograms, displaying the clustering process and results, and the limitations of hierarchical clustering.
What's included
4 videos4 readings4 assignments1 discussion prompt
4 videos• Total 30 minutes
- Meaning and Classification of Clusters• 6 minutes
- Distance and Dissimilarity Measures Used in Clustering• 7 minutes
- Hierarchical, Single-Linkage, and Complete-Linkage Clustering• 7 minutes
- Dendrograms and Limitations of Hierarchical Clustering• 9 minutes
4 readings• Total 240 minutes
- Recommended Reading: Basic Concepts of Clustering• 60 minutes
- Recommended Reading: Distance and Dissimilarity in Clustering• 60 minutes
- Recommended Reading: Hierarchical, Single-Linkage, and Complete-Linkage Clustering• 60 minutes
- Recommended Reading: Dendrograms and Its Limitation in Clustering • 60 minutes
4 assignments• Total 27 minutes
- Basic Concepts of Clustering• 9 minutes
- Distance and Dissimilarity in Clustering• 3 minutes
- Hierarchical, Single-Linkage, and Complete-Linkage Clustering• 9 minutes
- Dendrograms and Its Limitation in Clustering• 6 minutes
1 discussion prompt• Total 40 minutes
- Factor and Cluster Analysis• 40 minutes
This assessment is a graded quiz based on the modules covered in this week.
What's included
1 assignment
1 assignment• Total 40 minutes
- Graded Quiz: Dimension Reduction and Cluster Analysis• 40 minutes
In this module, you will be introduced to non-hierarchical clustering: the K-means clustering algorithm, its computation process, and its advantages. You will also learn to determine the correct number of clusters. Finally, you will be able to give the interpretation of clusters and market segmentation using conjoint analysis.
What's included
4 videos4 readings4 assignments
4 videos• Total 40 minutes
- Non-Hierarchical Clustering: k-Means Clustering Algorithm• 10 minutes
- Determine the Correct Number of Clusters and Their Interpretation• 9 minutes
- Market Segmentation Conjoint Analysis: Method Discussion • 9 minutes
- Market Segmentation Through Conjoint Analysis: An Example• 13 minutes
4 readings• Total 240 minutes
- Recommended Reading: Non-Hierarchical Clustering• 60 minutes
- Recommended Reading: Optimal Number of Clusters • 60 minutes
- Recommended Reading: Market Segmentation with Conjoint Analysis• 60 minutes
- Recommended Reading: Market Segmentation with Conjoint Analysis: An Example• 60 minutes
4 assignments• Total 12 minutes
- Non-Hierarchical Clustering• 3 minutes
- Optimal Number of Clusters• 3 minutes
- Market Segmentation with Conjoint Analysis• 3 minutes
- Market Segmentation with Conjoint Analysis: An Example• 3 minutes
In this module, you will learn how to use rule base machine learning models to analyze and discover interesting connections, patterns, and relationships between different item sets based on large volume transaction data. This module will give you an insight into how association rule mining measures the strength of co-occurrence between one item and another. The objective of this rule base data mining algorithm is not to predict an occurrence of an item, like classification or regression do, but to find usable patterns in the co-occurrences of the items. You will also learn about association rules learning, which is a branch of an unsupervised learning process that discovers hidden patterns in data, in the form of easily recognizable rules.
What's included
4 videos4 readings4 assignments1 discussion prompt
4 videos• Total 29 minutes
- What Is Association Rule Mining, and When to Use It?• 8 minutes
- Basic Concepts of Market Basket Analysis • 7 minutes
- Hands-On Market Basket Analysis – I• 9 minutes
- Hands-On Market Basket Analysis – II • 5 minutes
4 readings• Total 240 minutes
- Recommended Reading: Basic Concepts of Association Rule Mining • 60 minutes
- Recommended Reading: Basic Concepts of Market Basket Analysis• 60 minutes
- Recommended Reading: Market Basket Analysis Hands-On 1• 60 minutes
- Recommended Reading: Market Basket Analysis Hands-On 2• 60 minutes
4 assignments• Total 24 minutes
- Basic Concepts of Association Rule Mining• 9 minutes
- Basic Concepts of Market Basket Analysis• 9 minutes
- Market Basket Analysis Hands-On 1• 3 minutes
- Market Basket Analysis Hands-On 2• 3 minutes
1 discussion prompt• Total 30 minutes
- Association Rule Mining• 30 minutes
This assessment is a graded quiz based on the modules covered in this week.
What's included
1 assignment
1 assignment• Total 40 minutes
- Graded Quiz: Cluster Analysis and Association Rule Mining• 40 minutes
Course Wrap-Up video
What's included
1 video
1 video• Total 3 minutes
- Course Wrap-Up video• 3 minutes
Build toward a degree
This course is part of the following degree program(s) offered by O.P. Jindal Global University. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.¹
Build toward a degree
This course is part of the following degree program(s) offered by O.P. Jindal Global University. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.¹
O.P. Jindal Global University
MBA in Business Analytics
Degree · 12 - 24 months
¹Successful application and enrollment are required. Eligibility requirements apply. Each institution determines the number of credits recognized by completing this content that may count towards degree requirements, considering any existing credits you may have. Click on a specific course for more information.
Instructor

Offered by

Offered by

O.P. Jindal Global University is recognised as an Institution of Eminence by the Ministry of Education, Government of India. It is also ranked the No. 1 Private University in India in the QS World University Rankings 2021. The university has 9000+ students across 12 schools that offer 52 degree programs. The university maintains a 1:9 faculty-student ratio. It is a research-intensive university, deeply committed to institutional values of interdisciplinary and innovative learning, pluralism and rigorous scholarship, globalism, and international engagement.
Why people choose Coursera for their career

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
Explore more from Data Science
UUniversity of Colorado Boulder
Course
UUniversity of Minnesota
Course

Specialization

Course