Hi there!👋🏼

I’m Tri Watanasuparp, an aspiring data scientist focused on transforming complex datasets into clear, actionable insights that spark innovation and drive impactful decisions. đź’» âś¨

problem solver â‹… storyteller â‹… lifelong learner

Get in touch 👉🏼 watanasuparp.t@northeastern.edu

background

I'm a student at Northeastern University pursuing a Master’s in Data Science. In particular, I enjoy working on interesting projects that lie at the intersection of data, technology, and storytelling.

During my undergraduate studies, as well as through various projects and internships, I developed strong proficiency in Python, SQL, and R for data analysis, visualization, and predictive modeling. I gained hands-on experience in data wrangling, exploratory data analysis, and machine learning, enhancing my ability to extract actionable insights and communicate findings effectively. I am eager to deepen my expertise in machine learning and natural language processing, and to explore advanced techniques and real-world applications that address complex challenges and drive innovative, data-driven solutions.

When I'm not in front of a computer screen, you’ll likely find me choreographing—or just wiggling—around a dance studio, capturing moments with my Canon EOS 77D, or exploring whichever city I’m in to uncover hidden culinary gems.

skills
Languages
  • Python
  • SQL
  • R
  • Java
  • JavaScript
  • HTML/CSS
Frameworks/Libraries
  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn
  • Scikit-learn
  • Plotly
  • Folium
  • D3.js
  • Apache Spark
  • NLTK
  • Altair
  • React
  • Flask
Tools
  • Jupyter Notebooks
  • VSCode
  • Tableau
  • Google Colab
  • Rtudio
  • LaTeX
  • Excel
  • Git
  • GitHub
  • Google Analytics
  • Notion
Design/Creative
  • Figma
  • Canva
  • Adobe Photoshop
  • Adobe Premiere Pro
experience
Jan 2023 - June 2023
IT Business Applications Co-op
IBM
June 2022 - Aug 2022
IBM Accelerate: Talent Identification Program - Software Developer Track
Apr 2022 - June 2022
Full-Stack Web Developer Intern
Jan 2022 - June 2022
Digital Marketing & Analysis Co-op
Click here to view my resume
projects
Predicting Exoplanet Habitability with Machine Learning | Apr. 2025 - May 2025

Analyzed the NASA Exoplanet Archive to identify Earth-like exoplanets using features like mass, radius, orbital distance, and stellar magnitude. Engineered a binary habitability label based on NASA/PNAS criteria. Built classification models (logistic regression, SVM, random forest) to predict potential habitability, multiclass models for planet type, and regression models for orbital period (ridge, polynomial). Conducted EDA to reveal trends in planetary and stellar characteristics. Evaluated models with F1-score, RMSE, cross-validation, addressing class imbalance and multicollinearity for improved performance and interpretability.

Python Pandas NumPy Scikit-learn Logistic Regression Polynomial Regression Random Forest SVM EDA
MassWeatherHub Dashboard | Nov. 2024 - Dec. 2024

Designed and developed a comprehensive data integration pipeline that seamlessly merges weather data from the Open-Meteo API, OpenWeather API, and TIGER/Line Shapefiles into a unified format for analysis. Created interactive maps and charts to visualize current, forecasted, and historical weather data for Massachusetts counties, delivering an intuitive user interface for clear and accessible data interpretation.

Python Flask JavaScript/HTML/CSS Pandas Matplotlib Seaborn NumPy Folium
Exploring New York City Airbnb Listings | Feb. 2024 - May 2024

Processed and analyzed Kaggle’s NYC Airbnb dataset to uncover key patterns and factors influencing the short-term rental market. Identified spatial trends using interactive heatmaps, choropleth maps, and custom plots, offering valuable insights into pricing, availability, and demand across various NYC neighborhoods.

Python JavaScript/HTML/CSS Matplotlib Seaborn Altair NumPy Pandas
Zooming into MyDramaList: Ratings of Top Dramas | Oct. 2023 - Nov. 2023

Analyzed and visualized viewer preferences across networks and genres through bar charts. Conducted regression analysis, including Leave-One-Out Cross-Validation and polynomial regression, to explore correlations between drama duration and generated scores. Used BeautifulSoup for web scraping and Pandas for data preprocessing, ensuring clean datasets for regression analysis and visualizations with Matplotlib and Plotly.

Python Pandas BeautifulSoup Matplotlib Seaborn Plotly Scikit-learn
Personal Website

You are here! I designed and developed this responsive website as an online portfolio to showcase my skills, experiences, projects, and other creative work.

HTML CSS JavaScript