Education
- M.S. in Information Studies - Data Science and Analytics, University of Texas at Austin, May 2025 (expected)
- GPA: 3.89/4.0
- Coursework: Data Wrangling, Scientific Machine Learning, Data Visualization, Natural Language Processing, Data Storytelling
- B.Tech. in Computer Science, Vellore Institute of Technology, Jun 2021
- GPA: 8.47/10
- Coursework: Database Management, Data Structures and Algorithms, Parallel and Distributed Computing, Statistics for Engineers
Work experience
- Data Science Intern, Texas Department of Transportation, May 2024 - Present
- Fine-tuned a BERT model with LORA to classify incident narratives to interpreted fields in CR-3 Texas Peace Officer’s crash report.
- Conducted exploratory data analysis and created balanced datasets to optimize model performance and increase its accuracy.
- Developed a multimodal model using visual features from crash site images, categorical and textual data, reaching an accuracy of 92%.
- Data Engineer, ZS Associates, Aug 2021 - May 2023
- Built and deployed 50+ ETL pipelines on AWS Step Functions using DynamoDB, AWS Glue, and Snowflake for data warehousing processes.
- Developed a PySpark module for data ingestion of unstructured S3 data into a structured format within 1hr, orchestrated via Airflow.
- Designed data models and profiled commercial clinical data across 5+ domains to identify KPIs and data anomalies.
- Performed integration testing to identify and resolve bugs during SIT/UAT phases, optimizing pipeline runtime and efficiency by 40%.
- Mentored 6+ junior colleagues on project onboarding and best practices through knowledge transfer sessions.
- Led CI/CD production deployments with a 100% on-time record.
- Data Engineering Intern, ZS Associates, Feb 2021 - Jul 2021
- Developed Python Flask microservices and REST APIs for an Angular web application, serving 2000+ users for a major east coast client.
- Designed a Python tool to auto-format SQL queries, saving up to 60% of time on script refinement for readability.
Download Resume