Introduction to Kaggle and Scoring Top 7% in the Titanic Competition | by Oliver S | Apr, 2024
Kaggle is a fun platform hosting a variety of data science and machine learning competitions — covering topics such as sports, energy or autonomous driving.
In this post we will give an introduction to Kaggle, and tackle the introductory “Titanic” challenge. We will explain how to approach and solve such a challenge, and demonstrate this with a top 7% solution for “Titanic”.
You can find the full code on GitHub, so you can follow along while reading this article and reproduce my exact score. In it, we follow some things I consider best practice for Python and use helpful tools, such as mypy and poetry. With that being said, let’s dive right into it.
Kaggle offers a wide variety of data science / machine learning competitions (see the intro for examples). It is a great way to test and improve your data science / ML knowledge and learn how to solve problems hands-on. Plus, you can even win monetary prizes! However, Kaggle is populated by some of the best data scientists and ML people out there, and prizes are only given to the few top solutions (out of several hundreds or thousands), thus winning here is…