Skip to content

Analytics Projects I’ve Contributed To

Corruption & HDI Visualizer

Analyzed the global correlation between Corruption (CPI) and Human Development (HDI). Built a publication-ready, Economist-styled scatter plot using R to highlight governance trends and outliers.

Economist-style aesthetic theme
Log-linear trend regression
R Language ggplot2 data.table ggthemes Data Viz
Deep Research Agent with Pydantic AI

Built a multi-step Deep Research Agent using Pydantic AI and GPT‑5‑mini that orchestrates DuckDuckGo web searches to generate structured, evidence-based reports on stock tickers and complex general topics.

Intent-aware stock vs. general research
Parallel deep dives with cited evidence
Python Pydantic AI GPT‑5‑mini DuckDuckGo search tools Structured research reports
Health Insurance Risk Modeling

Built regression-based cost models in Python to quantify how age, BMI, number of children, smoking status, and region influence annual medical insurance charges and risk tiers.

Smoker vs non‑smoker risk gaps
Age & BMI impact on charges
Python pandas scikit‑learn Linear & Ridge Regression
R Netflix Movie Ratings Regression

Applied multiple regression in R to analyze how genre, viewing hours, global release, and season impact Netflix movie ratings across diverse content.

Multi-Variable Analysis R OLS Regression
Pathfinder AI AI Impact Challenge

Developed full-stack AI platform with HTML/CSS/JavaScript frontend and n8n-powered Agentic AI backend to deliver personalized CTE & dual-enrollment recommendations for DC students.

Full-Stack Dev HTML/CSS/JS n8n Workflows Agentic AI AI for Good
R Ratings & Global Reach Analysis

Applied Chi-square tests & regression in R to explore statistical relationships between Netflix movie ratings and worldwide availability patterns.

Chi-Square Test Regression R
R Netflix Global Availability & Ratings

Applied simple linear regression in R to analyze how global availability impacts Netflix movie ratings, revealing statistical patterns in worldwide content performance.

Linear Regression R Data Analysis
R Spotify Song Popularity Drivers

Analyzed 1M+ Spotify tracks in R exploring how danceability, energy, genre, duration, and temporal trends influence song popularity using correlation and regression models.

1M+ Songs Multi-Feature Analysis R