In this video, I am demonstrating, processing of a large dataset that contains over 42+ million rows and 18GB in size. The dataset contains the records of Parking Violation tickets issued in NYC in the time period around 2013 ~ 2017.
Video Topics
1. Demonstration
2. Downloading Dataset ( 01:15 )
3. Populating data to DB ( 02:50 )
4. Creating spring-boot project with Spring Data ( 03:24 )
5. Processing data ( 06:25 )
6. Final demonstration ( 09:56 )
Data from: https://www.kaggle.com/new-york-city/...
If you are interested in the code, please download project files from:
https://github.com/imuneer/jpa-stream...