[Project 3] Day 1: Initial observation about ‘Analyze Boston’ Dataset

Today I commenced my exploration of the ‘Economic Dataset’ from ‘Analyze Boston’.

  • There are 19 columns which are divided as follows:
    • Date: which gives 2 columns of ‘Year’ and ‘Month’
    • Tourism: Has 2 columns which give ‘Number of domestic and international passengers at Logan Airport’ and ‘Total international flights at Logan Airport’
    • Hotel Market: Has 2 columns which give ‘Hotel occupancy for Boston’ and ‘Hotel average daily rate for Boston’
    • Labor Market: Has 3 columns which give details of ‘Total Jobs’ ‘Unemployment rate for Boston’ and ‘Labor rate for Boston’
    • Real Estate Board approved development projects: Has 4 columns which give the details of ‘Number of units approved’, ‘Total development cost of approved projects’, ‘Square feet of approved projects’ and ‘Construction jobs’.
    • Real Estate (Housing): has 6 columns which give the details of ‘Foreclosure house petitions’, ‘Foreclosure house deeds’, ‘Median housing sales price’, ‘Number of houses sold’, ‘New housing construction permits’ and ‘New affordable construction permits’.
  • Since this dataset give reference to economic groups, my first thought is that i should perform some sort of cluster analysis.
  • It may also be possible to check the relation between the ‘Tourism’ and ‘Hotel market’ as well as the relation between ‘Labor market’ and ‘Real Estate’ variables.

Leave a Reply

Your email address will not be published. Required fields are marked *