Portfolio
This page is dedicated to my class projects, public projects done for employers, and personal projects.
Papers
Principal Components of Happiness
Skills: R, Principal Component Analysis, and Data Wrangling
Summary: In this article, published in Regional Science Policy and Practice, we discuss a novel use of Principal Component Analysis to index wellbeing in Mexico in a way that contrasts with the leading economic measures. The project started in a workshop in 2020 through the CIMAT summer system and involved a presentation at the 61st annual conference of the Western Regional Science Association.
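As a rough illustration of the general idea of a PCA-based index (not the method or data from the paper), the sketch below standardizes a few indicators and scores each region on the first principal component. The region names and indicator values are made up for the example, and the function names are hypothetical.

```python
import math

def standardize(rows):
    """Center each column and scale it to unit (population) standard deviation."""
    n, k = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(k)]
    stds = [math.sqrt(sum((r[j] - means[j]) ** 2 for r in rows) / n) for j in range(k)]
    return [[(r[j] - means[j]) / stds[j] for j in range(k)] for r in rows]

def first_component(z, iterations=500):
    """Leading eigenvector of the correlation matrix, found by power iteration."""
    n, k = len(z), len(z[0])
    corr = [[sum(r[a] * r[b] for r in z) / n for b in range(k)] for a in range(k)]
    v = [1.0 / math.sqrt(k)] * k  # start from an all-positive unit vector
    for _ in range(iterations):
        w = [sum(corr[a][b] * v[b] for b in range(k)) for a in range(k)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v

def pca_index(rows):
    """Return the first-component loadings and one index score per row."""
    z = standardize(rows)
    loadings = first_component(z)
    scores = [sum(l * x for l, x in zip(loadings, r)) for r in z]
    return loadings, scores

# Hypothetical data: one row per region, one column per wellbeing indicator.
regions = ["A", "B", "C", "D"]
data = [
    [7.1, 0.80, 62.0],
    [5.4, 0.55, 48.0],
    [6.3, 0.70, 57.0],
    [4.8, 0.50, 41.0],
]

loadings, scores = pca_index(data)
ranking = [name for _, name in sorted(zip(scores, regions), reverse=True)]
print(ranking)
```

The first component is only one possible choice of index; a real analysis would also check how much variance it explains before relying on it.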
Projects
Big Data (DSAN 6000) Final Project
Link to Deliverable | Github | December 2024
Skills: Python, SQL, Spark, SparkSQL, SparkNLP, Spark ML, AWS, Azure, Data Cleaning, Data Wrangling, and Data Visualization
Summary: We created a website with exploratory data analysis, natural language processing, and machine learning components, all applied to a 100 GB dataset of Reddit data. For this project I specifically found the subset of the data where US states were mentioned, created visualizations of this subset, found the sentiment associated with each state, and created a machine learning model to predict the score of a post based on all the variables.
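The state-mention filter can be sketched in plain Python as below. This is only an illustration of the idea: the project itself ran on Spark, and the short state list, the sample posts, and the function name here are assumptions made for the example.

```python
import re

# A few state names for illustration; a full run would list all fifty.
STATES = ["North Carolina", "Virginia", "Texas", "New York"]
CANONICAL = {s.lower(): s for s in STATES}

# One case-insensitive pattern that matches any listed state as a whole phrase.
STATE_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(s) for s in STATES) + r")\b",
    re.IGNORECASE,
)

def states_mentioned(text):
    """Return the sorted, de-duplicated state names found in the text."""
    return sorted({CANONICAL[m.lower()] for m in STATE_PATTERN.findall(text)})

# Hypothetical posts, made up for this example.
posts = [
    "Moving to North Carolina next month",
    "Best pizza is in new york, not Texas",
    "No place mentioned here",
]

# Keep only posts that mention at least one state, along with the matches.
subset = [(p, states_mentioned(p)) for p in posts if states_mentioned(p)]
print(subset)
```

A simple pattern like this has known gaps: for example, "Virginia" would also match inside "West Virginia", so a fuller version would need to handle overlapping names.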
Natural Language Processing (DSAN 5800) Final Project
Link to Deliverable | Github | December 2024
Skills: Python, Hugging Face, and Data Wrangling
Summary: For this project we created a presentation and report (available in the Github repository) on the use of various retrieval-augmented generation (RAG) methods, with the goal of finding cases relevant to a search. Large language models often hallucinate results, and RAG is one method for mitigating that issue.
Election Night Dashboard
Link to Deliverable | Github | November 2024
Skills: R, Shiny Dashboards, Data Wrangling and Cleaning, API Querying, Data Visualization, ggplot2, and Plotly
Summary: A dashboard built for the Mecklenburg County Democrats in Charlotte, NC. We worked with them during the 2024 election, and they wanted a way to review their results and compare them to the 2020 and 2022 election seasons. The dashboard was made in R, using Shiny and Plotly for the interactive graphs.
Returning Student Scholarship
Link to Deliverable | Github | June 2024
Skills: R, ggplot2 and Plotly, Data Visualization, Data Cleaning, Clustering, Tableau, and Regression
Summary: As students entered the second year of my two-year program, we were given a chance to compete for a merit-based scholarship through projects we made and our contributions to the program. I submitted a project based on soil water retention data from a study on Natural Bridges National Park and won the second-highest prize for my work.
Neural Networks (DSAN 6600) Final Project
Link to Deliverable | Github | May 2024
Skills: Python, TensorFlow, PyTorch, Convolutional and Artificial Neural Networks, Pandas, and Data Wrangling
Summary: For this group project we took a dataset of items from Amazon (found on Kaggle) that included the price and an image of each product. We fit a convolutional neural network to the images in order to predict the price of the product. In the process we transformed and permuted the original images to better train the model. We were mildly successful, with the interesting result that the model tended to look at logos or other branding for information about price.
Analytics for Statistical Learning (DSAN 5300) Final Project
Link to Deliverable | Github | May 2024
Skills: R, Python, Artificial Neural Networks, Linear Regression, Geospatial Mapping, Data Wrangling, and Data Visualization
Summary: In this group project we applied our knowledge of various advanced statistical methods, including Artificial Neural Networks, to analyze power use across Estonia. In particular, I structured and ran the artificial neural networks for a regression question and a classification question, and completed some of the initial data cleaning for the project.
Data Visualization (DSAN 5200) Final Project
Link to Deliverable | Github | May 2024
Skills: Python, R, Data Visualization, Data Cleaning, and D3
Summary: Using crime statistics from 2018-2022, we visualized various aspects of the reported incidents as if they were featured in a game of Clue: if a crime happened to you in DC, what was your relation to the perpetrator, where did it happen, and what weapon was used? We built interactive and linked visualizations using packages like ggplot, plotly, and D3.
Intro to Statistics (DSAN 5100) Final Project
Link to Deliverable | Github | December 2023
Skills: R, Data Visualization, Regression, ANOVA, and Predictive Analysis
Summary: For this final project I worked alone to analyze a dataset of Nobel Prize winners for differences in gender over time and across the different categories. The analysis for this project was more basic and mostly involved testing for statistical independence and differences to support the thesis of my research.
DSAN 5000 Final
Link to Deliverable | Github | December 2023
Skills: R, Python, Clustering, Naive Bayes, Dimensionality Reduction, Decision Trees, API Querying, Data Visualization, Data Cleaning, and Exploratory Data Analysis
Summary: For this project, we were asked to pick a dataset and apply a variety of machine learning methods. I chose a public knitting database, on which I cleaned the data; performed exploratory data analysis; applied techniques such as Naive Bayes predictive modeling, clustering, dimensionality reduction, and decision trees/random forests; and summarized the results, all while documenting my code.
Articles
The Swarthmore Phoenix
This links to all of the articles I wrote for the Swarthmore Phoenix while I was a staff writer there. I wrote for the Campus Journal, Opinions, and Arts sections. The writing didn’t overlap much with my academic interests, but I was happy to contribute to the periodical periodically.
Legacy Dilemma
I was interviewed by the New York Times for an article on legacy admissions in the context of the end of affirmative action.
Five Global Health Priorities for the future President
While interning for the Friends of the Global Fight, I helped write this article, published by Ambassador Mark Lagon in The American Interest.