Pratik Tilekar

Big Data Architecture | Lambda Architecture

  • Posted by Pratik Tilekar
  • In blog
Every business will not be willing to wait for hours to get the updated analytics, at the same time not all results need to be updated very quick and fast, So the trade of needs to happen between the time and the effort as well as cost that involves generating the results. Basically the trade […]

Clearing HDFS Trash

  • Posted by Pratik Tilekar
  • In blog
HDFS has a feature where whatever the file that you delete, it will get moved into trash, which acts like a recycle bin. that is controlled with 2 properties, Trash interval and Trash interval checkpoint whatever the value that we have within the trash interval, for that particular interval, the file will be kept in […]