Developer & Data Scientist
From exploratory data analysis to modeling, I have experience applying my knowledge of statistical and machine learning methods to solve real-world problems. My academic training has culminated in a deep skillset, broadened further by my various Data Science projects. My work includes projects like wildfire trend analytics, climbing performance prediction, and building neural networks for handwriting recognition.
With a deep understanding of Python, I've engineered full-stack solutions that bridge back-end data processing / analytics, APIs, and intuitive front-end experiences. I have experience building Flask and Streamlit webservices from the ground up, crafting ETL pipelines with PostgreSQL and Pythom and deploying production-level software using Docker and cloud tools like Snowflake, Google Cloud, and AWS.
I specialize in turning complex datasets into meaningful narratives and model-driven decsions. I’ve built analytics pipelines with GCP and PostgreSQL to connect CRM, booking, and survey data, then visualized it in Tableau to track trends and optimize marketing strategy. This work directly informed business decisions and improved performance, much like my current position which contributes toward ethical AI research by evaluating and contributing to LLMs for helpfulness, safety, and honesty.
Developer and Data Scientist
From an early age, I’ve always gravitated toward problem-solving and STEM, driven by a deep fascination with how systems work. That passion evolved naturally into my pursuit of a B.S. in Statistics and Data Science at the University of California, Santa Barbara, where I immersed myself in the rapidly changing landscape of AI and modern analytics. I’m a fast learner, analytical thinker, and detail-oriented problem solver who values understanding concepts from the ground up rather than taking shortcuts. Data science and software engineering have long captured my curiosity because they merge creativity with logic. I wish to be at the forefront of technology and production, and I want my work to have meaningful impact.
This passion for problem-solving extends perhaps even deeper into my personal life. During the pandemic, I spent a majority of my time outside rock climbing near my home in Joshua Tree, California. The technical aspects of body, mind, and gear within the sport fascinated me immediately and will continue to for the rest of my life. The pure focus and determination that arises when rock climbing is incomparable, and I believe it is a litmus test that demonstrates my skills in those aspects. As of now, I rock climb at a near professional level, and I foresee that this passion will never conclude.
A selection of some of my current leading projects.
Developed a full-stack web application inspired by GeoGuessr, designing and deploying a Flask-based Python backend that supports dynamic image serving, user session management, and real-time gameplay logic. Implemented large-scale data integration pipelines using BeautifulSoup4 and Requests to web-scrape and process over 50,000 records, storing them in a fully managed PostgreSQL database hosted on AWS for efficient querying and scalability.
Integrated multiple Google Cloud APIs, including Gemini for LLM-powered text generation, Google Maps for geospatial visualiza-tion, and Google OAuth for secure user authentication and account management. Designed an interactive frontend interface using HTML, CSS, and JavaScript elements to deliver a smooth, responsive gameplay experience, emphasizing usability, fast load times, and seamless communication with the Flask backend.
Engineered an automated end-to-end ML data pipeline processing over 3,200 CAL FIRE wildfire incidents and California Department of Water Resources precipitation records, performing advanced feature engineering, data validation, and seamless database integration through Google Cloud SQL to support scalable predictive modeling. Developed and refined wildfire risk assessment models leveraging spatial clustering, seasonal decomposition, and time-series lag analysis with Scikit-learn, forecasting regional fire probabilities across California counties and visualizing dynamic trends through interactive Tableau dashboards and custom Streamlit interfaces.
Developed an interactive machine learning model in R to quantify rock climbing ability (V-grades) using physical and training metrics from 630 climbers, cleaning and engineering data through categorical encoding, regression-based imputation, and 10-fold cross-validation to ensure robust predictive performance. Trained, tuned, and compared multiple classification algorithms (LDA, KNN, Random Forest, SVM, Pruned Decision Trees), achieving an approx 85% training/testing accuracy, and integrated results into an interactive visualization interface that highlights key predictors influencing climbing difficulty and performance progression.