hadoop based projects



Hadoop performance tuning-a pragmatic iterative approach
free download

Hadoop represents a Java-based distributed computing framework that is designed to support applications that are implemented via the MapReduce programming model. In general, workload dependent Hadoop performance optimization efforts have to focus on 3

A Hadoop -based approach for efficient web service management
free download

In this paper, we propose a Hadoop -based approach for Web service management in Telecommunication and Internet domains. The basic idea of this approach is to adopt two components of Hadoop , HBase and MapReduce, to manage Web services. In HBase, we

Towards an approach based on hadoop to improve and organize online search results in big data environment
free download

In this article we study the technical specifications required for the proper conduct of online search process in Big Data environment, with the intention to evaluate the consistency of collected data and identify opportunities to improve and organize search results through a

Big Data Analytics: An Approach using Hadoop Distributed File System
free download

Todays world is driven by Growth and Innovation for a better future. All of which are based on analysis and harnessing of tons of data, typically known as Big Data. The tasks involved for achieving results at such a scale can be challenging and painfully slow. This paper works

Hamake: A Data Flow Approach to Data Processing in Hadoop .
free download

Most non-trivial data processing scenarios using Hadoop typically involve launching more than one MapReduce job. Usually, such processing is data-driven with the data funneled through a sequence of jobs. The processing model could be expressed in terms of dataflow

Computation of Big Data: Performance Optimization Approach towards a Parallel Frequent Item Set Mining Algorithm for Transaction Data based on Hadoop
free download

The Huge amount of Big Data is constantly arriving with the rapid development of business organizations and they are interested in extracting knowledgeable information from collected data. Frequent item mining of Big Data helps with business decision and to provide

An improved approach for analysis of hadoop data for all files
free download

Here in this paper an efficient Framework is implemented for Hadoop Platform for almost all types of Files. The Proposed Methodology implemented here is based on various algorithms implemented on Hadoop Platform such as Scan, Read, Sort etc. Various Workloads are

A novel approach for identification of hadoop cloud temporal patterns using map reduce
free download

Abstract− Due to the latest developments in the area of science and Technology resulted in the developments of efficient data transfer, capability of handling huge data and the retrieval of data efficiently. Since the data that is stored is increasing voluminously, methods to

Personalized movie recommender system using rank boosting approach on hadoop
free download

Today we are living in an era of Big Data. Large numbers of services are available to customers; from these services it is difficult for them to choose those that are most appropriate for them. In this scenario a wide variety of service recommender systems will

A novel approach for replica synchronization in hadoop distributed file systems
free download

Abstract The Map Reduce framework provides a scalable model for large scale data intensive computing and fault tolerance. In this paper, we propose an algorithm to improve the I/O performance of the Hadoop distributed file system. The results prove that the

An overall approach to achieve load balancing for Hadoop Distributed File System
free download

Hadoop Distributed File System (HDFS) is a popular cloud storage system that can scale up easily to meet the increasing demand for more storage capacity. In HDFS, files are divided into fixed-size blocks, which are then replicated and randomly stored on many DataNodes to

Towards an ontology-based semantic approach to tuning parameters to improve hadoop application performance
free download

Hadoop MapReduce assists companies and researchers to deal with processing large volumes of data. Hadoop has a lot of configuration parameters that must be tuned in order to obtain a better application performance. However, the best tuning of the parameters is not

Decision Tree Learning and Regression Models to Predict Endocrine Disruptor Chemicals-A Big Data Analytics Approach with Hadoop and Apache Spark
free download

Predictive toxicology calls for innovative and flexible approaches to mine and analyse the mounting quantity and complexity of data used in it. Classification and regression based machine learning algorithms are used in this study in order to computationally predict

Hadoop -Really a Preferred Approach over Relational Database Management Systems
free download

The Hadoop framework transparently provides both reliability and data motion to applications. Hadoop implements a computational paradigm named MapReduce, where the application is divided into many small fragments of work, each of which be executed or

A Text Sentimental Approach for Online Portals Using Hadoop
free download

Big data is an emerging technology to process the vast amount of both structured and unstructured data. Now a day social media such as twitter, face book, blogs and forums are the well suitable source to gathering the huge amount of data. Text sentiment analysis for

An Integrated Approach for Configuring Hadoop Clusters by Ambari on Horton Sandbox
free download

ABSTRACT In Information Technology satisfying customer needs is still remains a milestone because of their increasing demands. When taking the comparison strategy Industry is not only altered in the way of providing solutions, also handling of techniques and resource

Parallel Image Processing from Cloud using CUDA and HADOOP Architecture: A Novel Approach
free download

In There is an increased, large quantity if data with the super-resolution quality data, hence there is an increased demand in high quality image data. This requirements causes a challenge in disk space in single PC or computers. A primary solution to employ the storage

A clustering approach for network traffic classification in Hadoop distributed computing environment
free download

Various technological innovations are motivating the dramatic increase in data and data gathering. This is why big data has turn into a recent area of tactical investment for IT organizations. Big data need to be properly synthesized and analyzed, so that it provides

An efficient approach to optimize the performance of massive small files in hadoop MapReduce framework
free download

Abstract The most popular open source distributed computing framework called Hadoop was designed by Doug Cutting and his team, which involves thousands of nodes to process and analyze

A Novel Big Data Approach to Classify Bank Customers-Solution by Combining PIG, R and Hadoop
free download

Large amount of data that is characterized by its volume, velocity, veracity, value and variety is termed Big Data. Extracting hidden patterns, customer preferences, market trends, unknown correlations, or any other useful business information from large collection of