Career

Internships

Data Science Intern at Claritas

Duration: Apr 2023 - Feb 2024
Location: San Diego, California, United States · Hybrid

Responsibilities:

  • Designed and implemented a scalable PySpark pipeline to filter and process over a million IP addresses from diverse data sources within a distributed data environment.
  • Developed a program that extracts campaign data from Parquet files, processes it, and calculates the daily percentage of data quality warnings across multiple metrics and tables.
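
The daily warning percentage described above can be sketched as follows. This is a minimal illustration in plain Python, not the original program: the record layout (day, table, metric, warning flag) and the function name `daily_warning_pct` are assumptions, and the Parquet extraction step is left out.

```python
from collections import defaultdict

def daily_warning_pct(records):
    """Return the percentage of warning rows per (day, table, metric).

    `records` is an iterable of (day, table, metric, is_warning) tuples.
    This layout is hypothetical and stands in for rows read from Parquet.
    """
    totals = defaultdict(int)    # number of checks per key
    warnings = defaultdict(int)  # number of checks that raised a warning
    for day, table, metric, is_warning in records:
        key = (day, table, metric)
        totals[key] += 1
        if is_warning:
            warnings[key] += 1
    return {key: 100.0 * warnings[key] / totals[key] for key in totals}

# Made-up sample rows for illustration only.
sample = [
    ("2023-05-01", "campaigns", "null_rate", True),
    ("2023-05-01", "campaigns", "null_rate", False),
    ("2023-05-01", "campaigns", "null_rate", False),
    ("2023-05-01", "campaigns", "null_rate", True),
    ("2023-05-02", "campaigns", "null_rate", False),
]
result = daily_warning_pct(sample)
```

Grouping by the full (day, table, metric) key keeps each metric's rate separate, so a noisy metric in one table does not mask a clean one in another.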

Duration: Jul 2021 - Aug 2021
Location: San Diego, California, United States · Hybrid

Responsibilities:

  • Worked on an automation tool that cleans and preprocesses circuitry data in preparation for loading into JMP software for data visualization.
  • Built data visualizations using JMP software on circuit bench datasets.
  • Analyzed features such as inductance and capacitance to measure their correlation with circuit KPIs such as output power.
  • Applied dimensionality reduction techniques such as PCA to visualize noise in output power in two dimensions.
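
The feature-to-KPI correlation mentioned above can be sketched with a Pearson coefficient. This is an assumption about the kind of correlation measured (the original analysis was carried out in JMP, and the exact statistic is not stated); the function name `pearson_r` and the sample values are hypothetical.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Covariance numerator and the two sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)

# Made-up values: a feature (e.g. inductance) against a KPI (e.g. output power).
feature = [1.0, 2.0, 3.0, 4.0]
kpi = [2.0, 4.0, 6.0, 8.0]
r = pearson_r(feature, kpi)
```

A value near 1 or -1 indicates a strong linear relationship between the feature and the KPI, while a value near 0 indicates little linear relationship.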

Intern at San Diego State University

Duration: Jan 2019 - Jun 2020
Location: San Diego, California, United States · Hybrid

Responsibilities:

  • Worked with Professor Ke Huang at the SDSU ECE department to analyze data on integrated circuits.
  • Developed a Python script to compute statistics across thousands of features and thousands of samples of circuit voltage measurements.
  • Cleaned and preprocessed the data (e.g., min-max normalization on certain features) using Pandas.
  • Visualized aggregate statistics using Matplotlib.
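
Min-max normalization, mentioned above, rescales a feature to the range [0, 1]. The sketch below uses plain Python rather than Pandas to stay self-contained; the function name `min_max_normalize` and the sample voltages are hypothetical, and mapping a constant feature to zeros is one possible convention, not necessarily the one used in the original script.

```python
def min_max_normalize(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant feature has no spread; map it to zeros (assumed convention).
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

# Made-up voltage measurements for illustration only.
voltages = [2.0, 4.0, 6.0]
normalized = min_max_normalize(voltages)
```

With a Pandas column the same step is typically written as `(col - col.min()) / (col.max() - col.min())`.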