A Review Paper on Big Data and Hadoop for Data Science




Big data is a collection of large datasets that cannot be processed using traditional computing techniques. It is not a single technique or a tool, rather it has become a complete subject, which involves various tools, technqiues and frameworks. Hadoop is an open source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

by Mr. Ketan Bagade | Mrs. Anjali Gharat | Mrs. Helina Tandel “A Review Paper on Big Data and Hadoop for Data Science”

Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-4 | Issue-1 , December 2019,

URL: https://www.ijtsrd.com/papers/ijtsrd29816.pdf

Paper URL: https://www.ijtsrd.com/computer-science/data-miining/29816/a-review-paper-on-big-data-and-hadoop-for-data-science/mr-ketan-bagade

manuscript submission, paper publication for engineering, ugc approved journals for social science