Travel Data Analysis

Read More

Git:

git

Team:

Sarita Bhateja, Kavya Guruprasad and Nandini Goswami

Technologies:

Technologies: Spark, Python, Databricks, Tableau

Problem Statement

The project is to analyse the travel data which includes data such as destination city, source city, mode of commute, the hotel they will be staying at, type of audience such as youth, children, adults etc. This project will prove helpful for both the hospitality industry and customers.

Dataset Desciption

Attribute Information:

System Design

We have implemented this using Apache Spark and Map reduce and used python as the language for writing the code. We have tried to find the top 20 destinations that people travel to, from and the most frequently visited city pairs where people prefer travelling. We have printed the top 20 places in descending order. We have found the top 10 cities that generate high airline revenues for travel. From this analysis airline companies can give better deals and make better profit.Top source-destination sites that people prefer travelling is also found. This can help airlines give better deals for in these routes. We also found The most often visited hotels by travellers, what combination of traveling do people prefer(Air+Hotel or Hotel+Car or Air+Hotel+Car), Top places from where people generally travel by car, Which airlines do people prefer the most. This kind of analysis is especially helpful to the travel industry to make wise decisions in mak-ing their costumer's stay a better one.

Other Projects

FoodSalsa

A website for food lovers that will facilitate easy access of menu as per cuisine and food review posts for different food joints within the University.

Read More »

Diabetic nutritionist

An application which will serve as a personal nutritionist for diabetic patients that predicts whether a meal is suitable for a diabetic patient or not and provide healthier food substitutes.

Read More »