Rachel Gao


All Projects


Pixelate Images and Cluster Colors Using K-Means

2024

Pixelate images using vanilla Python and then reduce the number of colors using Scikit-Learn K-Means clustering. The algorithm starts from the raw image with optional clustering, packaged in a Python class with supporting methods. Pydantic is used for parameter validation.

Learn More...
Project Image

Bird Song Classifier with Vision Transformer

2024

A continuation of the Bird Song Classifier with Machine Learning project by utilizing a vision transformer to classify bird species based on bird songs/calls. Python programming language is used with HuggingFace Transformers and Tensorflow as the primary machine learning libraries.

Learn More...
Project Image

Bird Song Classifier with Transfer Learning

2024

A continuation of the Bird Song Classifier with Machine Learning project by utilizing transfer learning techniques to classify bird species based on bird songs/calls. Python programming language is used with HuggingFace Transformers and Tensorflow as the primary machine learning libraries.

Learn More...
Project Image

Fine Tune LLM for Grammatical Classification

2023

Fine tuned RoBERTa using Tensorflow on the CoLA dataset to create a grammatical acceptability classifier. Experimented with different hyperparameters and achieved results on par with the original published paper. All data cleaning, analysis, and model building were conducted using the Python programming language.

Learn More...
Project Image

Evaluating the Impact of BMI on Cholesterol

2023

Utilizing Large Sample Ordinary Least Squares (OLS) Regression to evaluate the possible causal relationships between Body Mass Index (BMI) and cholesterol ratio. Data was sourced from the 2005-2006 National Health and Nutrition Examination Survey (NHANES), and all data cleaning, analysis, and model building were conducted using the R programming language.

Learn More...
Project Image

Bird Song Classifier with Machine Learning

2023

Utilizing various machine learning algorithms (traditional, shallow neural networks, and deep neural networks) to classify bird species based on bird songs/calls. Data was sourced from the BirdCLEF 2023 kaggle competition, and all data cleaning, analysis, and model building were conducted using the Python programming language.

Learn More...
Project Image