davidhellerw.com

Archives: Projects

Predicting Diabetes Onset Using Logistic Regression in R

This project focuses on predicting the onset of diabetes using logistic regression on
the Pima Indians Diabetes Database from Kaggle. The dataset includes variables such as glucose
concentration, blood pressure, BMI, and age. The model achieved an AUC value of 0.8396 and
an overall accuracy of 78.39%, effectively identifying key risk factors associated with diabetes.
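
For readers unfamiliar with the two metrics quoted above, here is a minimal Python sketch of how accuracy and AUC are computed from predicted probabilities. The project itself was done in R; the labels and probabilities below are hypothetical toy values, not the Pima dataset or the model's output, and the 0.5 cutoff is an assumption.

```python
def accuracy(labels, probs, threshold=0.5):
    """Share of cases whose thresholded prediction matches the true label."""
    hits = sum((p >= threshold) == bool(y) for y, p in zip(labels, probs))
    return hits / len(labels)


def auc(labels, probs):
    """Probability that a random positive case outscores a random negative one.

    Ties count as half a win, which equals the area under the ROC curve.
    """
    pos = [p for y, p in zip(labels, probs) if y == 1]
    neg = [p for y, p in zip(labels, probs) if y == 0]
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in pos for b in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = diabetes onset, 0 = no onset.
toy_labels = [1, 0, 1, 1, 0, 0, 1, 0]
toy_probs = [0.9, 0.2, 0.65, 0.4, 0.55, 0.1, 0.8, 0.3]

toy_accuracy = accuracy(toy_labels, toy_probs)
toy_auc = auc(toy_labels, toy_probs)
print(f"accuracy = {toy_accuracy:.3f}, AUC = {toy_auc:.4f}")
```

Accuracy depends on the chosen cutoff, while AUC summarizes ranking quality across all cutoffs, which is why the two are usually reported together.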

Tools/Skills

R, Statistical Modeling, Machine Learning, Logistic Regression

Read the Analysis
GitHub Repository

Regularized Regression: Comparing Ridge vs Lasso Models in Predicting College Graduation Rates in R

This project explores Ridge and Lasso regression techniques in predicting college
graduation rates using the College dataset. Ridge regression slightly outperformed Lasso
regression in terms of predictive accuracy, while Lasso offered more interpretable results by
performing feature selection.
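
The feature-selection behaviour of Lasso can be illustrated with a small Python sketch (the project itself used R). It assumes the simplified textbook case of an orthonormal design, where ridge minimizing 0.5*RSS + 0.5*lam*sum(b^2) rescales each least-squares coefficient and Lasso minimizing 0.5*RSS + lam*sum(|b|) soft-thresholds it. The coefficient values and `lam` are hypothetical and unrelated to the College dataset.

```python
def ridge_shrink(ols_coef, lam):
    """Ridge solution under an orthonormal design: proportional shrinkage."""
    return ols_coef / (1.0 + lam)


def lasso_shrink(ols_coef, lam):
    """Lasso solution under an orthonormal design: soft-thresholding."""
    magnitude = abs(ols_coef) - lam
    if magnitude <= 0.0:
        return 0.0  # small coefficients are set exactly to zero
    return magnitude if ols_coef > 0 else -magnitude


# Hypothetical least-squares coefficients for four standardized predictors.
ols_coefs = [3.0, -1.5, 0.4, -0.2]
lam = 0.5

ridge_coefs = [ridge_shrink(b, lam) for b in ols_coefs]
lasso_coefs = [lasso_shrink(b, lam) for b in ols_coefs]

print("ridge:", ridge_coefs)  # every coefficient shrinks but stays nonzero
print("lasso:", lasso_coefs)  # the two smallest coefficients become zero
```

Ridge keeps all predictors in the model with smaller weights, whereas Lasso drops the weakest ones entirely, which is what makes its output easier to interpret.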

Tools/Skills

R, Regularized Regression

Read the Analysis
GitHub Repository

Comparative Analysis of Principal Component Regression (PCR) and Partial Least Squares Regression (PLS) on Air Quality Data Using R

This project provides a comparative analysis of Principal Component Regression
(PCR) and Partial Least Squares Regression (PLS) for predicting benzene (C6H6) concentrations
using the Air Quality Dataset from the UCI Machine Learning Repository. The primary goal is to
address multicollinearity and dimensionality reduction to improve predictive accuracy. Results
showed that PLS had superior performance with RMSE of 0.972 and R-squared of 0.974, while
PCR had a higher RMSE of 1.572 and R-squared of 0.933.
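
As a reminder of how the two reported metrics are defined, the following Python sketch computes RMSE and R-squared for two sets of predictions on the same targets. The numbers are hypothetical and chosen for easy arithmetic; they are not the air quality data or the project's PCR and PLS predictions.

```python
import math


def rmse(actual, predicted):
    """Root mean squared error: typical size of a prediction error."""
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(actual))


def r_squared(actual, predicted):
    """Share of the target's variance explained by the predictions."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


# Hypothetical targets and predictions from two made-up models.
actual = [2.0, 4.0, 6.0, 8.0]
model_a = [2.5, 3.5, 6.5, 7.5]
model_b = [3.0, 3.0, 7.0, 7.0]

for name, preds in [("model_a", model_a), ("model_b", model_b)]:
    print(name, round(rmse(actual, preds), 3), round(r_squared(actual, preds), 3))
```

On a fixed test set, a lower RMSE always comes with a higher R-squared, which matches the direction of the PLS and PCR figures above.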

Tools/Skills

R, Principal Component Analysis, PLS, Regression Models, Data Cleaning

Read the Analysis
GitHub Repository

A/B Testing Analysis with Python

This A/B testing project evaluates whether a new design variant outperforms the
existing one. Using user interaction data, it compares completion rates, time spent on
steps, error rates, and abandonment rates between the two designs. After data
preparation and exploration, statistical tests run with Python libraries such as scipy and
statsmodels determine whether the proposed design changes lead to meaningful
improvements in user engagement and overall user experience.
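
One common test for a metric such as completion rate is the two-proportion z-test. The sketch below is an illustration using only the Python standard library rather than scipy or statsmodels; it is not necessarily the test the project ran, and the counts are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_ztest(successes_a, total_a, successes_b, total_b):
    """Two-sided z-test for a difference between two completion rates."""
    rate_a = successes_a / total_a
    rate_b = successes_b / total_b
    # Pooled rate under the null hypothesis that both designs perform equally.
    pooled = (successes_a + successes_b) / (total_a + total_b)
    std_err = sqrt(pooled * (1 - pooled) * (1 / total_a + 1 / total_b))
    z = (rate_b - rate_a) / std_err
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Hypothetical counts: completions out of visitors for control (A) and variant (B).
z_stat, p_val = two_proportion_ztest(400, 1000, 460, 1000)
print(f"z = {z_stat:.2f}, p = {p_val:.4f}")
```

A p-value below the chosen significance level (0.05 is a common convention) would suggest the difference in completion rates is unlikely to be due to chance alone.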

Tools/Skills

Python, A/B Testing, Hypothesis Testing, Data Visualization, Exploratory Data
Analysis

Read the Analysis
GitHub Repository

FIFA World Cup 2022 Analysis Using SQL

This is a comprehensive data analysis project focused on the FIFA World Cup 2022,
sourced from HiCounselor.com. It uses advanced SQL techniques, including
complex joins, subqueries, Common Table Expressions (CTEs), and stored procedures, to
analyze player and team performance, match outcomes, and key tournament insights.
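
To show what a CTE combined with a join and a subquery looks like, here is a small sketch that runs in Python against an in-memory SQLite database. The project used MySQL; SQLite is swapped in here only so the example is self-contained, and it has no stored procedures. The `teams` and `goals` tables, their columns, and all rows are hypothetical placeholders, not the project's schema or real tournament data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE teams (team_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE goals (goal_id INTEGER PRIMARY KEY, team_id INTEGER, player TEXT);
    INSERT INTO teams VALUES (1, 'Team A'), (2, 'Team B');
    INSERT INTO goals (team_id, player) VALUES
        (1, 'Player 1'), (1, 'Player 1'), (1, 'Player 2'),
        (2, 'Player 3');
""")

# The CTE counts goals per player; the outer query joins team names and keeps
# each team's top scorer by comparing against a correlated subquery.
query = """
    WITH player_goals AS (
        SELECT team_id, player, COUNT(*) AS n_goals
        FROM goals
        GROUP BY team_id, player
    )
    SELECT t.name, pg.player, pg.n_goals
    FROM player_goals AS pg
    JOIN teams AS t ON t.team_id = pg.team_id
    WHERE pg.n_goals = (
        SELECT MAX(n_goals) FROM player_goals WHERE team_id = pg.team_id
    )
    ORDER BY t.name;
"""
top_scorers = conn.execute(query).fetchall()
conn.close()

for row in top_scorers:
    print(row)
```

Naming the intermediate result in a CTE lets both the join and the subquery reuse it without repeating the aggregation.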

Tools/Skills

MySQL, Python

Read the Analysis
GitHub Repository

Spotify Top 50 Charts in Spanish-Speaking Countries Tableau Dashboard

This project focuses on analyzing the top 50 charts of all Spanish-speaking
countries (except Cuba) for a specific week. The analysis includes data collection from Spotify,
data storage and management using SQL, detailed data analysis through SQL queries in Python,
and visualization using Tableau.

Tools/Skills

MySQL, Python, Tableau, Data Collection, Data Analysis

View the Dashboard
GitHub Repository

Washington State Electric Vehicles Tableau Dashboard

This project provides an in-depth analysis of the electric vehicle (EV) landscape in
Washington State using Tableau. It covers data from the Washington State Department of
Licensing, detailing the distribution, growth, and characteristics of Battery Electric Vehicles
(BEVs) and Plug-in Hybrid Electric Vehicles (PHEVs).

Tools/Skills

Tableau, Data Visualization

View the Dashboard
GitHub Repository

US Apartment Rent Price Prediction and Data Analytics App

This Streamlit app predicts monthly rent prices for apartments across various states
in the U.S. The project involved data cleaning, exploratory data analysis, and applying machine
learning models to identify the best predictors. The app allows users to input apartment features
and compare rental price distributions and averages across states.

Tools/Skills

Python, Streamlit, Machine Learning, Data Cleaning, Data Analysis

Access the App
GitHub Repository

Cryptocurrency Explorer App

Cryptocurrency Explorer is a comprehensive application for real-time data tracking, historical analysis, and predictive modeling of cryptocurrency prices. It leverages technologies like Streamlit to offer a user-friendly experience for casual enthusiasts and data professionals alike. Users can track real-time prices, analyze historical trends, and predict future prices using the ARIMA forecasting model.
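
ARIMA combines autoregression, differencing, and moving-average terms. As a rough illustration of the autoregressive part only, the sketch below fits an AR(1) model, which is ARIMA(1,0,0), by least squares and rolls it forward. This is a deliberately simplified stand-in, not the app's forecasting code, and the price series is hypothetical.

```python
def fit_ar1(series):
    """Least-squares fit of x[t] = intercept + phi * x[t-1]."""
    prev, curr = series[:-1], series[1:]
    n = len(prev)
    mean_prev = sum(prev) / n
    mean_curr = sum(curr) / n
    cov = sum((p - mean_prev) * (c - mean_curr) for p, c in zip(prev, curr))
    var = sum((p - mean_prev) ** 2 for p in prev)
    phi = cov / var
    intercept = mean_curr - phi * mean_prev
    return intercept, phi


def forecast_ar1(last_value, intercept, phi, steps):
    """Iterate the fitted recurrence to produce multi-step forecasts."""
    forecasts = []
    value = last_value
    for _ in range(steps):
        value = intercept + phi * value
        forecasts.append(value)
    return forecasts


# Hypothetical series that follows x[t] = 2 + 0.5 * x[t-1] exactly.
prices = [10.0, 7.0, 5.5, 4.75, 4.375]

intercept, phi = fit_ar1(prices)
next_two = forecast_ar1(prices[-1], intercept, phi, steps=2)
print(intercept, phi, next_two)
```

Real price series are noisy and usually non-stationary, which is where the differencing and moving-average components of a full ARIMA model come in.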

Tools/Skills

Streamlit, Python, Data Visualization, ARIMA, Time Series Forecasting, CoinGecko API

Access the App
GitHub Repository
