- Programming Languages:
- Python, SQL, Java, R, JavaScript
- Platforms/Tools:
- AWS, GCP, Power BI, Databricks, PySpark, Airflow, Kafka, ADF, Scikit-Learn
- Databases/Datawarehouses:
- Microsoft SQL Server, Google BigQuery, Snowflake, MySQL
- Master of Data Science
- GPA: 3.65
- B.S. / Bachelor of Software Engineering
- GPA: 3.63
- Awards:
- Cum Laude
- Provost's List
- B.B.A. / Bachelor of Business Administration
- GPA: 3.56
- Awards:
- Academic Excellence in Business Administration
- Departmental Honors in Business Administration
Software Developer
- Developed ETL pipelines on GCP Databricks using Python, SQL, and Airflow, enabling real-time machine learning analysis.
- Modeled SQL data warehouse tables in Google BigQuery to store GPT chatbot data in a structured format.
- Improved data pipeline efficiency and reduced associated costs by migrating ETL pipelines from Azure Data Factory (ADF) and SQL Server Integration Services (SSIS) to GCP Databricks.
- Developed Java Kafka pipelines that helped automate the transition of thousands of loans from originations to servicing.
ETL Developer
- Architected and implemented a solution consisting of a Microsoft SQL Server database, a Python ETL program, and a Microsoft Power BI dashboard, giving IT business executives and security analysts near real-time insights into endpoint security.
- The SQL database was designed using ER diagrams and accommodates diverse data models.
- The ETL program extracts data from multiple sources via REST APIs and uses Python libraries and SQL to transform the data.
- The Power BI dashboard contains interactive visualizations with drill-down capabilities, letting users explore custom datasets.
Software Developer Intern
- Developed REST APIs using AWS services (EC2, S3), Python, and SQL to enable RBAC features for the Developer Console.
- Achieved a 15% improvement in monthly disk I/O operations, a KPI, by optimizing the SQL database’s indexes and queries, as measured through AWS CloudWatch monitoring of the EC2 server.
- Ensured data quality and integrity by integrating Python unit tests into the GitHub Actions CI/CD pipeline.
Web Developer Intern
- Developed an internal blog for the HatchLabs division of Pacific Life that serves as a centralized platform for employees to gain insights into the division's innovative projects.
- Built various UI pages using React.js and leveraged GraphQL to query data from Contentful CMS, enabling dynamic content rendering and seamless integration of data into the blog.
- Engineered a highly customizable React Grid component that became an integral part of the codebase by providing a flexible and consistent layout structure for nearly all pages and components, saving developer hours.
Research Intern
- Used Python and Scikit-Learn to develop a feature engineering framework that finds optimal Histogram of Oriented Gradients (HOG) feature descriptors and SVM hyperparameters for computer vision machine learning.
- Developed an explainable AI method for computer vision models by using weights from trained SVM models.
- Leveraged the TensorFlow Keras GradientTape API to determine which regions of images are most important for CNNs.
Undergraduate Researcher
- Organized and entered raw patient data, worked with medical staff to collect samples from patients, and performed spirometry (respiratory function) and DLCO tests on patients.
- Co-author of a 2018 American Thoracic Society abstract: Electronic Cigarette, Conventional Cigarette and Marijuana Use Patterns in San Diego County