Professional Experience
Jan. 2025 — Present
Auto-Owners Insurance
Software Developer
- Spearheaded the migration of a high-volume data pipeline to an Apache Iceberg architecture, automating the daily ingestion of 90k+ policy renewal records. Developed end-to-end workflows using Argo and Control-M, providing smooth data delivery to SQL Server while managing CI/CD monitoring through Jenkins.
- Leading an ELT project to extract policy data from the underwriting source system, landing it into the data lakehouse in Apache Iceberg, and applying curated data transformations in SQL via staging and conformed models in dbt Labs.
- In SQL, calculating and balancing billions of premiums to verify amounts accurately represents the financial cash flow from the company’s personal property insurance policies, accounting for all transaction types and coverages.
SQLPythondbt LabsApache IcebergStarburstBitBucketArgoControl-MJenkins
May. 2024 — Nov. 2024
EntryPoint
Software Engineer
- Engineered Google OAuth 2.0 integration using Google Cloud Platform to streamline user authentication, enhance security, and simplify access to the company’s application.
- Integrated Stripe’s API into backend code in Python, facilitating user payments/subscriptions to company products.
- As part of the CI/CD process, developed a testing suite in Python and JavaScript using GitHub Actions to ensure quality performance of code prior to company-wide deployment.
PythonJavascriptSQLAWS (Secrets, RDS)GCP (Oauth 2.0)PostgreSQL
Jan. 2024 — Jun. 2024
Northwestern University
Paid Research Assistant
Project: Detecting Childhood Speech Language Disorders via Machine Learning
- Preprocessed written phonetic data by leveraging Python’s pandas library and leading data mapping of n-grams for supervised learning.
- Trained and tested logistic regression models on preprocessed data using Python’s scikit-learn machine learning library to determine whether specific phonemes could be predicted in speech.
- Ran Automated Speech Recognition models on audio files to analyze models’ results and determine how to more accurately transcribe minority childhood speech.
Pythonpandasscikit-learnASR / STT
Jun. 2023 — Aug. 2023
TimeCredit
Software Engineer
- Parsed and preprocessed text documents, including extracting Q&As and text boxes from financial textbooks.
- Constructed a program in Python, utilizing the spaCy library, for redacting sensitive information from financial documents.
- Mastered fundamental Natural Language Processing concepts, including implementation of backpropagation and transformer model. Proficient with NumPy; familiar with PyTorch.
PythonPyMuPDF (fitz)spaCyNumpyPyTorch