PORTFOLIO

Explore my analytics expertise and continuous learning journey through projects utilizing SQL, Tableau, Python, and R. Engage with interactive dashboards, in-depth data exploration, predictive modeling, and comprehensive insights on data visualization and algorithm development.

  • Advanced Data Pipeline Project

    Advanced Data Pipeline Project (Coming Soon)

    Data Engineering, Big Data Technologies

    An end-to-end data pipeline project involving data collection, processing, analysis, and visualization using technologies such as Kafka, Airflow, PySpark, and Cassandra.


    Coming Soon

  • COVID-19 DE and Dashboards

    COVID-19 Data Exploration and Dashboards

    SQL, Tableau

    Analyzed COVID-19 data to understand the global distribution of deaths and infection rates. Created a comprehensive dashboard in Tableau Public to visualize the data.

    Data Exploration

    Dashboards

  • Titanic Survival Prediction Project

    Titanic Survival Prediction Project


    R (logistic regression)

    Built a logistic regression model to predict the survival likelihood of Titanic passengers using demographic and socio-economic data. Performed data preprocessing, exploratory data analysis, and model evaluation in R.

    Kaggle

  • Python - Foundations of Data Science and Algorithms

    Foundations of Data Science and Algorithms

    Python (Data Science & Algorithms)

    A collection of Python projects showcasing data exploration, visualization, hypothesis testing, algorithms, and game development. Includes NumPy, SciPy, Matplotlib, Pandas, and interactive games like Hangman and Scrabble.

    Hypothesis Testing

    Games & Algorithms

  • SQL - Healthcare Sample Database Analysis

    Healthcare Sample Database Analysis

    SQL (Data exploration & analysis)

    Conducted an in-depth analysis of CDHealthcare's sample database using SQL. The project focused on data exploration and cleaning, followed by an analysis of the top five primary care providers (PCPs) who deliver the best customer service.


    GitHub

  • R - User Analysis of Fitbit Fitness Tracker Data

    User Analysis of Fitbit Fitness Tracker Data

    R (Data exploration, and visualization)

    Analyzed a dataset containing two months of usage data from 30+ eligible Fitbit users to gain insights into user habits of health-focused smart devices. Performed exploratory data analysis in R, created reports using R Markdown on Kaggle, and identified major customer groups to develop marketing strategies.

    Kaggle

SKILLS

Discover the skills section for an overview of my technical proficiencies. This part highlights the specific abilities and knowledge I've acquired, demonstrating my capability to excel in various projects and collaborative efforts

  • PROGRAMMING LANGUAGES

    Python (e.g., numpy, pandas)
    R (e.g., tidyverse, dplyr)
    SQL (e.g., MySQL, PostgreSQL)

  • CLOUD, BIG DATA, AND STREAMING

    AWS S3
    GCP
    Airflow
    Spark (PySpark)
    Hadoop
    Docker

  • DATABASES AND DATA WAREHOUSE

    Tableau
    Snowflake
    Looker (LookML)
    Google Data Studio