We'll will be using the programming process as the framework for teaching you the fundamentals of the SAS programming language. Before we get into each step, let's run a program that encompasses the entire process. The program analyzes real data for international storms, such as hurricanes, typhoons, and cyclones that has been collected since 1980. The data is stored in an Excel workbook. And we want to turn this data into informative reports, data, and graphics. Let's start by getting familiar with the storm data that we'll be using in our demonstrations. I have an Excel spreadsheet open right now that's called Storm.xlsx. And the first worksheet is called Storm Summary. And we have here one row for every storm from the 1980 season through 2016. So you can see we have the name of each storm. Basin represents a location. And I'm actually going to jump ahead to a different worksheet, so that you can see what these different values of basin represent. Each of these basin codes represent an ocean and a location. We'll use these values quite a bit as we analyze our data going forward. Going back to the Storm Summary spreadsheet, we also have a type code. And let show you what those values represent. Each code represents a different type of storm. The other values we have in Storm Summary include the maximum wind that was measured during the storm. And this is recorded in miles per hour and the minimum pressure. Of course, we have a start and end date, hemispheres, and a location with latitude and longitude. The storm detailed data includes multiple rows for each storm. We actually record one measurement every six hours. And you can see we have several of the same values-- this season, basin, storm name, latitude, longitude, et cetera. In storm damage, we have the most expensive storms that occurred in North America. These are all storms where the cost exceeded $1 billion. You can see we have quite an extensive summary for each of these storms. Storm range also includes one row per storm between the 1980 and 2016 seasons. And for each of these storms, we have the top four or the highest wind measurements that were recorded. In the storm 2017 spreadsheet, we have storms recorded for the 2017 season. You'll notice we have some of the same values, such as basin, name, dates, maximum and minimum pressure. But we also have this extra column location that describes where the storm occurred. And then the remaining tabs were the lookout values that I mentioned earlier. It's hard to make sense of this data just by glancing at it here in Excel. So we're going to use SAS to make sense of it and answer interesting questions. So here in SAS Studio, I have a SAS program that we've written already that will illustrate the steps of the SAS programming process. Now, I don't want you to focus on the syntax quite yet. We'll be learning the rules of the SAS language going forward throughout our course. But for now, I want you to get an idea of what is possible with the SAS language. So in the first section of the program, we're accessing data. These statements are connecting to that Excel workbook and importing the data in a way that we can use it in the following steps. I'll highlight the section of the program and run it. So basically we've imported that Excel workbook into SAS. I'll return to the Code tab and look at the second section. In Section 2, we're exploring the data. We need to learn a little bit about the values that are included in our data to see if there's any problems, maybe inconsistencies or incorrect values that we may need to account for. So I'll run the second section. This code creates some reports for us. First, we start with some frequency reports based on the basin codes and the type codes. So one thing I notice right off the bat is that the basin codes include a lower case value for an a where all of the other codes are recorded in upper case. That's something we'll have to take care of later. Also included in our exploration are some summary statistics. So we can see values, such as the average, the minimum and maximum for our MaxWindMPH and MinPressure values. Notice we have minimum pressure values that are negative 9,999. Those are probably not correct. We're going to have to account for those as well. And we're also taking a peek at the first five rows of that Storm Damage table. That's a good start for exploration phase. So I return to the code and advance to the third section. Here's where we're preparing the data. We're fixing some of those issues that we saw, such as case inconsistencies. We're creating new columns by extracting values from the existing codes or concatenating values together, computing a storm length by determining how many days passed between the start date and the end date. We're also doing some conditional processing to determine what the ocean was based on the basin code. If I scroll down a bit further, I'll also show you that one other thing we're doing in this data preparation phase is joining tables. I'm going to highlight this section. You'll notice it's probably the longest in the entire program and run it. As a result, we created new tables that are clean, validated, and ready to move on to the next phase in the programming process. So back to the code. In Section 4, we're ready to analyze and report on the data and export the results that we create. In this section, we're doing all sorts of interesting things. We're creating frequency reports based on some of the codes within the data. It looks like we're also analyzing storms for the 2016 year and the Basin North Atlantic. We're computing some summary statistics, creating a map illustrating where the storms occurred and their maximum wind speeds and writing all of these results to an Excel spreadsheet. So I'll highlight that section of the program and run it. So now in our final results, we have several reports and graphics that help us to better understand storms. We can look at the results that were created here in SAS Studio. But I'm going to go back to the Excel workbook that we created as output. Here's the Excel workbook that we created. I'll download it and open to view. So the first tab summarizes the 2016 storms by basin. You can see the full descriptive basin name and frequency counts for the storms, as well as a nice bar chart that visualizes that distribution. A tab for wind statistics, and a map that shows us the most powerful storms in the North Atlantic Basin for 2016 season. You will be able to create all of these results at the end of our course.