The Olympic Games have been the pinnacle of international sports since 1896. This project explores the historical dataset of the Olympics, uncovering trends, athlete performance, and country-wise participation. Through data cleaning, visualization, and analysis, we gain insights into how the Games have evolved over time.
The size of athlete_events.csv is more than 20mb so i had provided the link and description in the file name 'Dataset_link.py'
This analysis uses two primary datasets:
-
athlete_events.csv - Contains detailed records of Olympic athletes, including:
- Name, Age, Gender
- Sport, Event, Medal (if won)
- Country (NOC), Year, Season (Summer/Winter)
-
noc_regions.csv - Maps National Olympic Committees (NOCs) to country names, helping in regional analysis.
- Perform Exploratory Data Analysis (EDA) to understand athlete trends.
- Visualize country-wise medal counts and athlete participation.
- Analyze gender representation and its evolution in the Olympics.
- Identify the most successful athletes and countries over the years.
To run this project on your local m achine:
- Clone the Repository:
git clone https://github.com/VIPULbunny/olympics-analysis.git
- Navigate to the Project Directory:
cd olympics-analysis - Install Dependencies:
pip install numpy pandas matplotlib seaborn
- Run the Jupyter Notebook:
jupyter notebook
- Total Athletes Participated:
{total_athletes} - Total Olympic Games Editions:
{total_games} - Top 10 Countries by Athlete Count:
- USA, Germany, UK, France, China, etc.
- Most Successful Athletes:
- Michael Phelps, Usain Bolt, etc.
- Gender Representation Over Time:
- Increasing female participation in modern Olympics.
- Merged
athlete_events.csvwithnoc_regions.csvfor accurate country mapping. - Handled missing values in age, medal, and region data.
- Converted categorical features (
Sex,Medal) into structured formats for better analysis.
This project is licensed under the MIT License.
We welcome contributions! If you’d like to improve the analysis or add new insights:
- Fork the repository.
- Create a feature branch:
git checkout -b feature-branch - Commit your changes:
git commit -m "Added new analysis" - Push to GitHub:
git push origin feature-branch - Open a Pull Request 🚀
For queries or collaborations, reach out via email or open an issue on GitHub.
⭐ If you find this project useful, please give it a star! 🌟




