Descriptive analysis of Airbnb data from Seattle, Boston and Copenhagen
Descriptive analysis of Airbnb data from Seattle, Boston and Copenhagen (see also @dima806/digging-into-airbnb-data-reviews-sentiments-superhosts-and-prices-prediction-part1-6c80ccb26c6a">this Medium post for detailed description)
pip install eli5
pip install geopy
pip install matplotlib
pip install nltk
pip install numpy
pip install pandas
pip install scipy
pip install seaborn
pip install sklearn
pip install tqdm
The aim of the project is to analyze the latest Airbnb data publicly available for three different cities (Seattle, Boston and Copenhagen), to perform sentiment analysis of the reviews for their customers, to reveal the difference between Airbnb superhosts from ordinary hosts, and to understands main factors responsible for the prise of Airbnb apartments.
airbnb_final_analysis_v3.ipynb
- jupyter notebook with all details about preprocessing and analysisREADME.md
- this file> 95%
) of Airbnb reviews are either positive or neutral.3.5%
, or 4-6 times larger than in Boston and Seattle.10%
) than Seattle (40%
) or Boston (23%
).hosting_listings_count
is valuable for US cities (especially for Boston) but is negligible for Copenhagen.None.