Price Classification for NYC Airbnb listings

Evie Wei
Jan 24, 2021
1 min read

Updated: Jan 24, 2021

The objectives for this project were to predict the price category of Airbnbs in New York City in order to assist hosts when they price their Airbnbs.

I classified Airbnbs into two categories, budget-friendly or high-end, based on features such as neighborhood, room type, availability during the year, and frequency of reviews. To do this, I utilized five different machine learning methods: k-NN, Naive Bayes, SVM, Random Forest, and Logistic Regression.

After reviewing the confusion matrix, accuracy, and AUC scores from each model, I found that, overall, Random Forest provided the best fitting model for classifying both Airbnb price categories. I can also note that k-NN and SVM Linear performed well during high-end classification according to the AUC and F1 Score metrics.

However, ultimately, because I believe Airbnb consumers to be price sensitive, I recommend Random Forest as the best option for classifying price categories because it tends to generate lower prices than other algorithms and will be more attractive to consumers. I will use this algorithm to implement a Price Recommendation System for Airbnb hosts to use. Airbnb hosts will be able to input features of their rental and receive a recommended price category for their listing: budget-friendly, mid-range, or high-end.

For future analysis, to improve predictive accuracy, we can identify more variables that affect pricing such as square footage and additional services Airbnb hosts may provide.

Price Classification for NYC Airbnb listings

Recent Posts

Comments