PRANAV DIXIT
Updated 60 days ago
Analyze activity on a music app by artists, songs, users over time. We do so using a star schema data warehouse architecture with dimensions: artists, songs, users, time with facts being the activity by the user of playing songs from the app. We are using spark's schema-on read to create fact and dimension tables stored as parquet files...
Create data pipeline to create data warehouse for immigration data analysis, specifically finding the correlation between immigration events and the temperature of the place visited. Implemented using Spark, S3, Redshift, Airflow...
Creation of new 3D chair models from given set of chairs using mix and match of part collections and with better alignment using part correspondence. Used SVM model to check the plausibility of chairs and k-mean clustering to create interesting chairs.