career
Internships
Data Science Intern at Claritas
Duration: Apr 2023 - Feb 2024 Location: San Diego, California, United States · Hybrid
Responsibilities:
- Designed and implemented a scalable PySpark pipeline to filter and process over a million IP addresses from diverse data sources within a distributed data environment.
- Developed a program that extracts campaign data from Parquet files, processes it, and calculates the daily percentage of data quality warnings across multiple metrics and tables.
Intern at Nurlink Technology
Duration: Jul 2021 - Aug 2021 Location: San Diego, California, United States · Hybrid
Responsibilities:
- Worked on an automation tool that cleans and preprocesses circuitry data in preparation for loading into JMP software for data visualization.
- Built data visualizations using JMP software on circuit bench datasets.
- Ran analysis on features such as inductance and capacitance to measure their correlation with circuit KPIs like output power.
- Applied dimensionality reduction techniques like PCA to conduct 2-D visualization of noise in output power.
Intern at San Diego State University
Duration: Jan 2019 - Jun 2020 Location: San Diego, California, United States · Hybrid
Responsibilities:
- Worked with Professor Ke Huang at the SDSU ECE department to analyze data on integrated circuits.
- Developed a Python script to gather statistics on thousands of features in thousands of samples on circuit voltage measurements.
- Cleaned and preprocessed the data (e.g., min-max normalization on certain features) using Pandas.
- Visualized aggregate statistics using MatPlotLib.