September 2, 2019

AT40900 Capstone Project: Digging Deeper

Introduction:
            Our project deals with a lot of data, the initial challenges will be to join tables of disparate information and filter them so that we may begin to see patterns that we can investigate  further. There are two main ways to merge and filter the data: Microsoft Excel and ArcGIS Pro. There are advantages and disadvantages to using each program and our group will be using a hybrid of both programs. Below is table 1 showing some of the advantages and disadvantages to using Excel versus ArcGIS Pro for merging and filtering.
Table 1: Advantages & Disadvantages of ArcGIS Pro Versus Excel when merging and filtering

Problems and Certain Solutions:
            There are many challenges to be overcome in order to analyze the data. These challenges along with certain solutions are listed below

            Problem 1: Sam is working with the UAS Incidents database and has to reorganize the data, as the time/date, state,city, summary format does not stay consistently in the same cells. Additionally, U.S. territories and commonwealths are noted in these reports, so it will create a margin of error.Some Canadian UAS sightings are noted as well, which will need to be sorted through to determine if they should be mapped or not.  He also needs to find out the number of incidents reported by state.
            Solution 1: To reorganize the master document containing all UAS reports, Sam will have to manually cut and paste the information into the correct columns, and then re enter any deleted data from the original source documents. Using the find and replace tool, Sam will sort through all territories and commonwealth areas and remove them, as the group wants to focus only on the 50 states and the District of Columbia. Figure 1 below shows the find and replace tool being used.
Figure 1: Find and Replace Tool

            The process is as simple as hitting control+f and putting in the territory, and then hitting find next to find each term and manually deleting the selected row (see figure 2)
Figure 2: Manual Deletion of Incidents Outside of the Continental United States

            This same tool will be used to find all instances of Canada in the state column. Sam will look a each instance to determine if it occured in U.S. airspace or was mislabeled. If mislabeled, the instance will be deleted. To find the number of instances of an incident in each state, Sam will use the count IF function in excel (see figure 3).
Figure 3: Count IF function in Excel

            For example, this is his input for California. As the State row is the entire B column B:B is the input for range, and as we are looking for California, it is our criteria. The output will be the number of times excel finds our criteria in that range: 1676 in this case.

            Problem 2: Jaspar is working with the Airmen Certification Database which is separated into two separate datasets. The first, Pilot_Basic, includes a unique ID, name, address and medical class information while the second, Pilot_Cert, includes a unique ID, name, type, level, and rating.
            Solution 2: In order to merge the two datasets they were both converted into tables using the From Table/Range tool    located under the data ribbon. When loading the data Only Create Connection was selected. Once both datasets were converted to tables by clicking Get Data then Combine Queries then Merge (See figure 4). Within Merge, Pilot_Basic and Pilot were selected along with the unique ID as the key between the two (see figure 5). Once done, OK was clicked and the tables were joined. To learn more about the specifics of how to perform table joins, click here.
Figure 4: Merging Pilot_Basic with Pilot_Cert

Figure 5: Merge Tool Selections

            The problem with using excel to merge the datasets was that it was very difficult to implement and would often glitch extending the process time significantly.

            Alternate Solution 2: An alternate, more easily implemented solution tried is to merge the two datasets directly in ArcGIS Pro. This has the advantage of being able to both merge the datasets and geocode the data so that each point will be represented on the map. In order to merge the datasets, the data must first be geocoded.
            The process of geocoding is relatively simple when using the wizard. First, import the table, in our case, Pilot_ Basic. Next, right click on the table and click Geocode Table then follow the steps of the wizard (See figure 6).
Figure 6: How to Locate Geocode Table
            Supplementary information on how to perform the geocoding is available here. It is worth noting that geocoding is a cloud service that uses 40 credits costing $4 for every 1000 geocodes. This may not seem like much but for the entire Pilot_Basic database it costs $2380.72 to geocode. Professor Hupy already geocoded the entire Pilot_Basic database once so Jaspar utilized that database and merged other databases using the methods described in problem 3 below. Figure 7 shows the geocoded Pilot_Basic and Pilot_Cert tables merged.
Figure 7: Table Join of Pilot_Basic and Pilot_Cert Geocoded

            Problem 3: We were not sure how to perform table joins in ArcPro to associate our data with a shapefile so that we can show symbology.
            Solution 3: Using help from Zach Miller, a graduate student and the ArcGIS help page linked here, we found out how merge tables.
            Here is an example of how to do a table join using our UAS Incident table and the shapefile for the continental U.S. states. We first need to right click on our shape file and click joins and relates and then add join (see figure 8).
Figure 8: Locating Table Join in ArcPro

            In the add join tab, we have our main table, the state shape file, and the table that we are joining to it, the UAS sightings file (see figure 9 below).
Figure 9: Table Join Settings

            The common join field that we are going to use to combine these tables is the state name. This allows us to add the number of UAS incidents per state to the states shape file. Figure 10 below shows the Output Join Fields for our example being filled in.
Figure 10: Output Join Fields Set

            After clicking the run button at the bottom right hand corner of the “add join” box, you should see that the tables have been successfully joined (see figure 11).
Figure 11: Successful Table Join

            Problem 4: We need to figure out how to filter the datasets, especially with the Airmen Certification Database, so that it is easier to find details. Currently, the joined Pilot_Basic and Pilot_Cert is over 130 columns wide and needs to be slimmed down.
            Solution 4:  Open the table, highlight the column to be deleted, right click and click delete.

            Problem 5: we don’t know how to find hotspots in our data so that we can find where there are high concentrations of UAS sightings and UAS pilots.
            Solution 5: We found a link discussing how to perform hotspot analysis and applied it in an example shown below.
            The first step is to pull up the hotspot analysis by going to the “tools” bar and searching for it. For our input feature class,we will use our states shapefile, which will automatically display the output feature class as well (see figure 12).
Figure 12: Hot Spot Analysis Tool

            We then put in our # of UAS sightings as our input field and click run. Although there is some more sorting out to be done within the UAS sightings set itself, this map displays states with a large number of UAS incidents (see figure 13).
Figure 13: Hotspot Analysis Output

Allocations:
  • Sam did the reorganization of data in excel, how to perform table joins in ArcPro, as well as how to perform hotspot analysis.
  • Jaspar worked on the intro, how to merge data sets in excel and ArcPro, as well as filtering datasets.