When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 4 modules in this course
Over-utilization of market and accounting data over the last few decades has led to portfolio crowding, mediocre performance and systemic risks, incentivizing financial institutions which are looking for an edge to quickly adopt alternative data as a substitute to traditional data. This course introduces the core concepts around alternative data, the most recent research in this area, as well as practical portfolio examples and actual applications. The approach of this course is somewhat unique because while the theory covered is still a main component, practical lab sessions and examples of working with alternative datasets are also key. This course is fo you if you are aiming at carreers prospects as a data scientist in financial markets, are looking to enhance your analytics skillsets to the financial markets, or if you are interested in cutting-edge technology and research as they apply to big data. The required background is: Python programming, Investment theory , and Statistics. This course will enable you to learn new data and research techniques applied to the financial markets while strengthening data science and python skills.
The consumption module introduces students to the basics of consumption-based alternative data.
By aggregating online and offline consumer purchase activity and behavioral datasets including geolocation data (e.g., cell locations, satellite imagery etc.), transaction data (e.g., credit card transaction logs and point of sale data), as well as consumer interaction with brands and products on social media, researchers can learn about company performance ahead of official company earning announcements.
Such information may be extremely useful and can provide investment and risk management advantages. This module reviews the theoretical aspects of various consumption datasets, and provides practical demonstrations of relevant data analytics.
Lab session: Introduction to the Uber Dataset•6 minutes
Lab session: Points of Interest•5 minutes
Lab session: Mapping Data with Folium•9 minutes
Lab session: Testing Seasonality•12 minutes
Application: Consumption data and earning surprises•7 minutes
Application:Consumption-based proxies for private information and managers behavior•7 minutes
Application: Additional applications of consumption data•7 minutes
5 readings•Total 187 minutes
Material at your disposal•5 minutes
Note about HeatMapWithTime•2 minutes
Extra materials on consumption•60 minutes
Additional resources on the interest of real-time corporate sales'measures•60 minutes
Additional resources on Predicting Performance using Consumer Big Data•60 minutes
1 assignment•Total 30 minutes
Graded Quiz on Consumption•30 minutes
1 discussion prompt•Total 10 minutes
Data biases•10 minutes
1 ungraded lab•Total 60 minutes
Code and Data•60 minutes
Textual Analysis for Financial Applications
Module 2•4 hours to complete
Module details
Module 2 is an introduction to text mining as well as a demonstration of how to get from data retrieval (web scraping) to financial market insights. Some of the classic text mining methodologies are covered such as vectorization of text (the bag of words approach), stop words for filtering, and term frequency-inverse document frequency (TF-IDF). Students will learn how text can be mathematically represented, and regularized/filtered to reduce noise. Measures of text-similarity will be covered in theoretical and practice sessions. Lab sessions go through examples of web scraping data, regularizing with the described techniques and finally, insights will be derived from the textual data.
What's included
8 videos2 readings1 assignment1 discussion prompt
Show info about module content
8 videos•Total 75 minutes
Introduction to the open web•4 minutes
Introduction to textual analysis•4 minutes
Processing text into vectors•12 minutes
Normalizing textual data•6 minutes
Lab session: Introduction to Webscraping•12 minutes
Lab session: Applied Text Data Processing•11 minutes
Lab session: Company Distances and Industry Distances•16 minutes
Application: applying similarity analysis on corporate filings to predict returns•10 minutes
2 readings•Total 130 minutes
Extra materials on Textual Analysis for Financial Applications•70 minutes
Additional resources on textual analysis for financial applications•60 minutes
1 assignment
Graded Quiz on Textual Analysis for Financial Applications•0 minutes
1 discussion prompt•Total 10 minutes
Web scraping•10 minutes
Processing Corporate Filings
Module 3•4 hours to complete
Module details
Module 3 is a practical extension of the text mining lessons to 10-K and 13-F, two of the most commonly researched corporate filings. This type of data can be extremely daunting when used by individual analysts due to the sheer size of the documents, but module 3 describes the methodologies for quantitatively analyzing these documents with Python code. Both the 10-K and 13-F documents are worked through, and within the lab sessions it is demonstrated how one can automatically pull this kind of data as well as define metrics around them. We investigate implementations of research in this field around similarity of given companies 10-K statements over time as well as similarity between fund holdings from the 13-F in the lab.
Application: network centrality, competition links and stock returns•8 minutes
Application: Using location data to measure home bias to predict returns•4 minutes
6 readings•Total 157 minutes
Instructor's announcement•2 minutes
Important note about 10-K lab•10 minutes
Important message regarding 13F data•10 minutes
Extra materials on Processing Corporate Filings•30 minutes
Additional resources•30 minutes
Additional resources on processing corporate fillings•75 minutes
1 assignment
Graded Quiz on Processing Corporate Filings•0 minutes
1 discussion prompt•Total 10 minutes
10-K and 13F filings•10 minutes
Using Media-Derived Data
Module 4•7 hours to complete
Module details
The final module introduces both sentiment analysis in the context of textual data as well as network analysis in the context of connectivity of firms. Sentiment analysis is an avenue of potentially fruitful information that when done correctly can display what a general population might believe about a company (through for example social media) or even whether the company itself is positive or negative on future outlook (through analysis of tone in corporate filings). Network analysis, as shown in the research of course instructors and his colleagues, can be used to accurately capture how a financial network is oriented and what companies might perform well because of other firm’s mentioning them as a threat. The lab session of this module extends the corporate filings analysis to examine sentiment while also introducing a set of tweets which are then transformed into a network representation.
Founded in 1906, EDHEC is now one of Europe’s top 15 business schools .
Based in Lille, Nice, Paris, London and Singapore, and counting over 90 nationalities on its campuses, EDHEC is a fully international school directly connected to the business world. With over 60,000 graduates in 120 countries, it trains committed managers capable of dealing with the challenges of a fast-evolving world.
Harnessing its core values of excellence, innovation and entrepreneurial spirit, EDHEC has developed a strategic model founded on research of true practical use to society, businesses and students, and which is particularly evident in the work of EDHEC-Risk Institute and Scientific Beta.
The School functions as a genuine laboratory of ideas and plays a pioneering role in the field of digital education via EDHEC Online, the first fully online degree-level training platform.
These various components make EDHEC a centre of knowledge, experience and diversity, geared to preparing new generations of managers to excel in a world subject to transformational change.
EDHEC in figures: 5 campuses (Paris, Lille, Nice, London, Singapore) 9400 students in academic education (60 000+ alumni), 25+ degree programmes ranging from bachelor to PhD level, 175 professors and researchers, 15 specialist research centres.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.