Felipe Espinoza

Logo

Senior Data Analyst

View the Project on GitHub f-espinoza/main-projects

Portfolio

Press the title of each project and you will be redirected to the project’s page.

If you want, you can find me in my other pages:

Project 1: Death in Chile 2016 - 2021(I)

Data exploration within Kaggle Notebook using Python on a dataset containing information about the date, primary cause of death and other information concerning deaths in Chile from 2016 to 2021.

With this data I was able to answer various questions:

  1. What was the primary cause of death in 2021 in Chile?
  2. What were the top five causes of deaths in Chile in 2021, each year since 2016, and accumulated?
  3. What is the most common cause of death by region?
  4. Is there any difference between the causes of death for men and women?
  5. What is the main cause of death among children of 3 and under?
  6. Evolution of deaths caused by COVID disease (2020-2021)

There is also a second part of this analysis that you can find here.

The data source was the chilean Department of Statistics and Health Information. The dataset can be found here

Project 2: Energy production, trade and consumption

SQL data exploration of an Energy, production, trade and consumption database. With this exploration I was able understand this dataset with insights like:

  1. Compare the Energy supply per capita in gigajoules between South America, North America and Europe.
  2. The average primary energy production in the world.
  3. Percentage difference between the country with the lowest energy suppply per capita and the rest in a list containing: Brazil, Chile, Argentina, Peru, Uruguay, Bolivia, Colombia and Ecuador.
  4. Seeing the evoution of energy production in Germany and comparing it with the Russian Federation.

You can also visit the Tableau visualization for this dataset here

The dataset can be found here

Project 3: Death in Chile 2016 - 2021. Tableau Visualization

Data visualization of the information obtained in the Death in Chile Python’s analysis. This is a Tableau Dashboard, so it is fully interactive. With this dashboard you can filter the visualizations for:

  1. Year
  2. Day of the week
  3. Month
  4. Biological sex

Project 4: Data visualization with Python: Github Public Repositories visualization

Data visualization only with Python’s library Seaborn. With this visualization I was able to show the evolution of the top 10 programming languages in the GitHub Repositories from 2011 to 2022. With this kind of visualizations you could compare and analyse any programming language present in the database with a hight level of control. Some of the insights you can get from this visualizations are:

  1. Top 10 by number of (public) repositories
  2. Programming language with the maximum amount of issues in a quarter from 2011 to 2022.
  3. Visualization for each of the top 10 programming languages by number of issues.
  4. Maximum number of pulling requests (2011-2022) in a quarter.

Project 5: Historical IDA statement of Credits and Grants in R

Data exploration of the historical International Development Association statement of Credits and Grants made in R. Examples of information we obtained thought this analysis were:

  1. Regions which were receiving credis.
  2. The median and the maximum of the Original Principal Amount by country
  3. The number of credits and grants that were for China, and what are their status
  4. The number of transactions and projects between IDA and China and India.

The dataset can be found here.

Project 6: Hate crimes in Catalunya

Simple Excel Dashboard showing hate crimes comitted in Catalunya during 2020. Also contains a pivot table for more information. With this dashboard you can filter the data by:

  1. Way in which the crime took place.
  2. Province.
  3. Total number of victims.

This dataset was cleaned with Python before using it in Excel. The dataset can be found here

You can download the Excel file here. This file is in catalan.