ENGINEERING RESEARCH PAPERS

CPU-Central Processing Unit 2016 IEEE PAPER




Coherence Stalls or Latency Tolerance: Informed CPU Scheduling for Socket and Core Sharing

Abstract The efficiency of modern multiprogrammed multicore machines is heavily impacted by traffic due to data sharing and contention due to competition for shared resources. In this paper, we demonstrate the importance of identifying latency tolerance coupled with

High Throughput Large Scale Sorting on a CPU-FPGA Heterogeneous Platform

Chi. Air traffic network; WWW; neural network. CPU-FPGA heterogeneous platform: FPGA and CPU share the coherent cache system

Improving mobile gaming performance through cooperative CPU-GPU thermal management

ABSTRACT State-of-the-art thermal management techniques independently throttle the frequencies of high-performance multi-core CPU and powerful graphics processing units (GPU) on heterogeneous multiprocessor system-on-chips deployed in latest mobile

PERFORMANCE EVALUATION OF SYNTHETIC JET COOLING FOR CPU

Abstract: Synthetic jet cooling increases the rate of heat transfer as compared to other cooling techniques. In the study, a synthetic jet is created for a single and multi-nozzle orifice with the help of vibrations from a sound system. The heat transfer characteristics of a synthetic

DRAMA: Exploiting DRAM addressing for cross-cpu attacks

Abstract In cloud computing environments, multiple tenants are often co-located on the same multi-processor system. Thus, preventing information leakage between tenants is crucial. While the hypervisor enforces software isolation, shared hardware, such as the CPU

Thermally-aware composite run-time CPU power models

Accurate and stable CPU power modelling is fundamental in modern system-on-chips (SoCs) for two main reasons: 1) they enable significant online energy savings by providing a run-time manager with reliable power consumption data for controlling CPU energy-

LVCSR System on a Hybrid GPU-CPU Embedded Platform for Real-Time Dialog Applications

Abstract We present the implementation of a large-vocabulary continuous speech recognition (LVCSR) system on NVIDIA's Tegra K1 hybrid GPU-CPU embedded platform. The system is trained on a standard 1000-hour corpus, LibriSpeech, features a trigram WFST-based

CPU FAN NOISE CONTROL BY ACTIVE NOISE CANCELLATION

Abstract Active Noise Cancellation (ANC) has been growing in integrated circuits for controlling the speed of cooling fans in CPU and other electronic equipment. Small electrical fans are cheap and have been used for cooling electronic equipment for more than half a

Enhancement of Cooling in Central Processing CPU by using Jet Impingement with and without Nano Fluid

Abstract The heat released from the flat surface is cooled by the use of jet impingement technique. The heat flux from the isothermally heated flat surface due to two-dimensional turbulent twin oblique confined slot-jet impingement is studied numerically using the

CPU Task Scheduling Using Gravitational Search Algorithm

Abstract: This paper includes the scientific method to develop a deeper understanding of CPU schedulers; explain and understand the sometimes erratic behavior of CPU schedulers. Scheduling is a decision making process which deals with the allocation of

Comparative Study of Computationally Intensive Algorithms on CPU and GPU

Abstract This paper presents a comparative study of one of the popular cryptographic algorithms, AES, implemented using CUDA on GPU and on CPU. In the present-day scenario, the AES algorithm suffers from very high CPU resource consumption, latency

Multi-queue CPU Process Prioritization using a Dynamic Quantum Time Algorithm Compared with Varying Time Quantum and Round-Robin Algorithms

ABSTRACT In Round-Robin Scheduling, the quantum time is static and tasks are scheduled such that no process uses CPU time more than one slice time each cycle. If quantum time is too large, the response time of the processes will not be tolerated in an interactive

Thermal Analysis of Heat Sink in CPU with Varying Fin Designs

ABSTRACT Due to the passage of current, the electronic components generate heat during the course of their operation. The main objective is to cool the electronic component by removing the heat generated in order to ensure the optimal working of the component.

Automated Forecasting Approach Minimizing Prediction Errors of CPU Availability in Distributed Computing Systems.

Abstract: Forecasting CPU availability in volunteer computing systems using a single prediction algorithm is insufficient due to the diversity of the world-wide distributed resources. In this paper, we draw up the main guidelines to develop an appropriate CPU

The speedup analysis in GEM detector based acquisition system algorithms with CPU and PCIe cards

The GEM detector based acquisition systems are compound multimodule electronic devices whose function is to provide quick acquisition, storage and processing of the data from the detector. The systems are described in [1, 2]. The scheme of main elements of the

Energy-Aware Power Management of Virtualized Multi-core Servers through DVFS and CPU Consolidation

Abstract: The considerable energy consumption of datacenters results in high service costs besides environmental pollution. Therefore, energy saving in operating data centers has received a lot of attention in recent years. In spite of the fact that modern multi-core

A Practical approach on MLQ–fuzzy logic in CPU Scheduling

ABSTRACT In this paper we propose using the scientific method to develop a deeper understanding of CPU schedulers; we use this approach to explain and understand the sometimes erratic behavior of CPU schedulers. Scheduling is a decision making process

A Survey on Parallel Processing in a CPU-GPU Collaborative Environment Using Ant Colony Optimization, Artificial Neural Networks Genetic Algorithm

Abstract: The purpose of this paper is to present a survey of various papers that show various aspects of ACO, ANN, GA and their respective strategies that can be applied in a CPU-GPU collaborative environment making use of the concept of parallel processing. The

CFD ANALYSIS OF CPU FOR COOLING OF DESKTOP COMPUTERS

ABSTRACT The life of the electronic components strongly depends upon the heat transfer which is generated within that component or the cooling of that device. For this reason the need for forced air cooling is the main factor that should be considered at the starting phase

Impact of Physical Exercise on functions of Human CPU-the Brain

Some areas perform complex functions like emotions, thinking, learning and memorizing. The brain's functions are very similar to the functions of a CPU (Central Processing Unit) of a computer.

Executing Database Query on Modern CPU-GPU Heterogeneous Architecture

Abstract Graphics processors (GPUs) have emerged as powerful co-processors for general-purpose computation. Compared with commodity CPUs, GPUs have an order of magnitude higher computation power as well as memory bandwidth. The execution time of database

Comparison of Different CPU Cache Technology Used in Testing for Research Purposes

Abstract: The objective of this paper is to understand different cache configurations. The higher the level of cache, the greater the cache capacity. There is a trend where level one cache will always be SRAM and the other two levels can have different RAM device

Comparison Analysis of CPU Scheduling: FCFS, SJF and Round Robin

Abstract-Task scheduling is needed to maintain every process that comes with a processor in parallel processing. In several conditions, not every algorithm works better on the significant problem. Sometimes FCFS algorithm is better than the other in short burst time

Study of CPU Usage Data Properties for Using in Performance Monitoring

CPU usage data (CPU user, system, iowait etc. load level) are often used for performance monitoring. These data are provided by the OS. In this paper we analyze different aspects of CPU usage data given by the Linux kernel. We examine kernel source code and provide test

PERFORMANCE ANALYSIS OF MORPHOLOGICAL OPERATIONS IN CPU AND GPU FOR ACCELERATING DIGITAL IMAGE APPLICATIONS

ABSTRACT In this paper, we evaluate the performance of morphological operations in central processing unit (CPU) and graphics processing unit (GPU) on various sizes of image and structuring element. The languages selected for algorithm implementation are C++,

Efficient HEVC Decoder for Heterogeneous CPU with GPU Systems

Abstract: The High Efficiency Video Coding (HEVC) standard provides higher compression efficiency than other video coding standards but at the cost of increased computational load, which makes it hard to achieve real-time encoding/decoding of high-resolution, high-

Review of CPU and GPU Faddeeva Implementations

Abstract The Faddeeva error function is frequently used when computing electric fields generated by two-dimensional Gaussian charge distributions. Numeric evaluation of the Faddeeva function is particularly challenging since there is no single expansion that

A Capability-Based Hybrid CPU/GPU Pattern Matching Algorithm for Deep Packet Inspection

Abstract: Network applications have been developed quickly during recent years, and communications between these applications involve a large quantity of data transfer through high speed networks. Deep packet inspection (DPI) becomes indispensable to ensure

Algorithms for CPU and DRAM DVFS Under Inefficiency Constraints

Abstract: Dynamic voltage and frequency scaling (DVFS) of both the core and DRAM provides opportunities to trade off performance in order to save energy. Previous approaches to core and DRAM power management using DVFS used performance,

Multi Distributed Server Load Balancing for SaaS: With Parameterized CPU and Memory Statistics

Abstract: The term load balancing refers to the distribution of load among the various nodes in order to improve job response time and resource utilization, and to avoid the scenario where some nodes do more work while others are lightly loaded. The load balancing facilitates

CPU Usage Prediction Technique for Live Migration of Virtual Machines

Abstract: Virtualization is a very important technology in cloud computing that partitions the physical host into several Virtual Machines (VMs). The number of active hosts and their power consumption are easily managed by migrating the virtual machines based on their resource

PERFORMANCE ANALYSIS OF Δ-STEPPING ALGORITHM ON CPU AND GPU

Abstract: Δ-stepping is an algorithm for solving the single-source shortest path problem. It is very efficient on a large class of graphs, providing nearly linear time complexity in sequential implementation, and can be parallelized. However, this algorithm requires some tuning

CPU vs. SOC–The battle for the future of computing

A central processing unit (CPU) is the electronic circuitry within a computer that carries out the instructions of a computer program by performing the basic arithmetic, logical, control and input/output (I/O) operations specified by the instructions. The term has been used in

Identification of CPU Latency and Energy Consumption Across Various Publications

Abstract: The analysis and comprehension of professional research publications is a vital skill for all aspiring computer engineers. In the spirit of exploring this often intimidating domain, the focus of this paper is thus to analyze 10 different professional research

Speeding-up Document Scoring with Tree Ensembles using CPU SIMD Extensions

Abstract. Scoring documents with learning-to-rank (LtR) models based on large ensembles of regression trees is currently deemed one of the best solutions to effectively rank query results to be returned by large scale Information Retrieval systems. This extended abstract

Genetic algorithms–CPU vs GPU implementation discussion

Since these types of algorithms require a high degree of computations, it is interesting to see how a normal implementation done using a Central Processing Unit (CPU) works and how long it takes.

Evaluation of a new designed microchannel heat sink for CPU cooling based on IR–thermography synchronized with high-speed flow visualization

Fault studies of MMC-HVDC links using FPGA and CPU on a real-time simulator with iteration capability

HVDC links is one of the most challenging tasks in power system validation today, requiring the combined use of CPU and FPGA technologies. The inclusion of surge arresters in the real-time fault tests further increases the difficulties because of the highly non-linear

Parallel Graph Partitioning on a CPU-GPU Architecture

Abstract: Graph partitioning has important applications in multiple areas of computing, including scheduling, social networks, and parallel processing. In recent years, GPUs have proven successful at accelerating several graph algorithms. However, the irregular nature

On Minimum Variance CPU-Scheduling Algorithm for Interactive Systems using Goal Programming

ABSTRACT Improving response time is considered a fundamental objective in interactive environments. CPU scheduling aims mainly to optimize the response time by minimizing its average in order to attain faster responses to users' requests. However, for interactive

Accelerating Equi-Join on a CPU-FPGA Heterogeneous Platform

Abstract: Accelerating database applications using FPGAs has recently been an area of growing interest in both academia and industry. Equi-join is one of the key database operations whose performance highly depends on sorting, which exhibits high memory

The effects of multilevel cache on CPU and main memory interactions

Abstract: The focus of this paper is to provide an overview of different cache technologies and memory designs. The purpose of multi-level cache is to provide the fastest data transfer between the Dynamic RAM with the lowest cost possible. The three main mapping

OSCAR: Orchestrating STT-RAM Cache Traffic for Heterogeneous CPU-GPU Architectures

Abstract: As we integrate data-parallel GPUs with general-purpose CPUs on a single chip, the enormous cache traffic generated by GPUs will not only exhaust the limited cache capacity, but also severely interfere with CPU requests. Such heterogeneous multicores

CPU Utilization-based Energy Consumption Model in Cloud Computing

ABSTRACT: This paper aims to set a proper and reliable energy consumption model for the server of the Cloud Data Center based on the cloud infrastructure, as well as analyze the effects of different sample modes and mathematical methods on the energy consumption

MIPS CPU DESIGN AND IMPLEMENTATION BASED CYCLONE II FPGA BOARD

Abstract: The aim of this work is to design and implement a simple MIPS CPU by using Cyclone II FPGA without complex control unit. MIPS-Processor has been studied and realized to design, simulate and implement its components by using VHDL and FPGA

An Enhanced Round Robin CPU Scheduling Algorithm

Abstract: CPU scheduling is a process which allows one process to use the CPU while another process is in a waiting state due to unavailability of any resource like I/O etc., thereby making full use of the CPU. The aim of CPU scheduling is to make the system efficient, fast

Overtaking CPU DBMSes with a GPU in Whole-Query Analytic Processing

Abstract. Existing work on accelerating analytic DB query processing with (discrete) GPUs fails to fully realize their potential for speedup through parallelism: Published results do not achieve significant speedup over more performant CPU-only DBMSes when processing

Elastic Multi-Resource Fairness: Balancing Fairness and Efficiency in Coupled CPU-GPU Architectures

Abstract: Fairness and efficiency are two important concerns for users in a shared computer system, and there tends to be a tradeoff between them. Heterogeneous computing poses new challenging issues on the fair allocation of computational resources among users

The real cost of a CPU hour

cash flow over time, taking into account the time value of money, when buying or leasing equipment for a fixed duration. The investment strategy that results in the higher NPV is the rational choice. However, current NPV models also make two

Comparison of the three CPU schedulers in Xen

Lucy Cherkasova (HP Labs), Diwaker Gupta (UCSD), Amin Vahdat (UCSD). April 20, 2007. History of three CPU schedulers in Xen. BVT: Borrowed Virtual Time. Lack of non-work-conserving mode

Theano: A CPU and GPU math compiler in Python

Abstract: Theano is a compiler for mathematical expressions in Python that combines the convenience of NumPy's syntax with the speed of optimized native machine language. The user composes mathematical expressions in a high-level description that mimics NumPy's

Measuring CPU Overhead for I/O Processing in the Xen Virtual Machine Monitor.

Abstract. Virtual Machine Monitors (VMMs) are gaining popularity in enterprise environments as a software-based solution for building shared hardware infrastructures via virtualization. In this work, using the Xen VMM, we present a lightweight monitoring system for measuring

Balancing batteries, power, and performance: system issues in cpu speed-setting for mobile computing

Abstract This thesis studies the problem of balancing power and performance in mobile computers, specifically, trading off power for performance by CPU speed-setting. The traditional approach to power-performance trade-offs assumes that batteries and memory

Hierarchical multiprocessor CPU reservations for the linux kernel

Fabio Checconi, Tommaso Cucinotta, Dario Faggioli and Giuseppe Lipari. December 10, 2009. CPU Scheduling in the IRMOS Project: IRMOS uses KVM to deploy its VMUs.

Beyond the CPU: Defeating hardware based RAM acquisition

Direct Memory Access does not involve the CPU. The acquisition device talks directly to the memory controller. Even if the whole OS is compromised, we can still get a real image of the

Understanding the Linux 2.6.8.1 CPU scheduler

Linux kernel development is relatively fast-paced given the size and complexity of the code base. This is because of its widespread adoption by hobbyists, home users, businesses (including many Fortune 500 companies), and educational institutions. The Linux kernel

Exploiting In-Kernel Data Paths to Improve I/O Throughput and CPU Availability.

Abstract We present the motivation, design, implementation, and performance evaluation of a UNIX kernel mechanism capable of establishing fast in-kernel data pathways between I/O objects. A new system call, splice(), moves data asynchronously and without user-process

Evaluating the software as a service business model: From CPU time-sharing to online innovation sharing

ABSTRACT The evolving literature on the new Software as a Service (SaaS) concept gives a coherent picture of the technical arrangements required between the vendor and the customer in order to enable the new online renting of applications. Many of these articles

Fastchart-a fast time deterministic CPU and hardware based real-time-kernel.

Abstract The designer of hard real-time systems requires deterministic behaviour of the system. Today there are problems because of the hardware and the real-time kernel. So one gets only statistical statements regarding timing. This article describes a new hardware

A fistful of red-pills: How to automatically generate procedures to detect CPU emulators

Abstract Malware includes several protections to complicate its analysis: the longer it takes to analyze a new malware sample, the longer the sample survives and the larger the number of systems it compromises. Nowadays, new malware samples are analyzed

Adapting Software Fault Isolation to Contemporary CPU Architectures.

Abstract Software Fault Isolation (SFI) is an effective approach to sandboxing binary code of questionable provenance, an interesting use case for native plugins in a Web browser. We present software fault isolation schemes for ARM and x86-64 that provide control-flow and

B-tree Indexes and CPU Caches

Abstract Since many existing techniques for exploiting CPU caches in the implementation of B-tree indexes have not been discussed in the literature, most of them are surveyed here. Rather than providing a detailed performance evaluation for one or two of them on some

Performance characterization of spec cpu benchmarks on intel's core microarchitecture based processor

January 21, 2007. System Design and Details. Performance Characterization Results of SPEC CPU Benchmarks

An optimized round robin scheduling algorithm for CPU scheduling

Abstract: The main objective of this paper is to develop a new approach for round robin scheduling which helps to improve CPU efficiency in real-time and time-sharing operating systems. There are many algorithms available for CPU scheduling. But we cannot

CPU performance evaluation and execution time prediction using narrow spectrum benchmarking

ABSTRACT Runs of a benchmark or a suite of benchmarks are inadequate either to characterize a given machine or to predict the running time of some benchmark not included in the suite. Further, the observed results are quite sensitive to the nature of the

Performance analysis of AES candidates on the 6805 CPU core

Abstract The AES candidate block ciphers Crypton, Mars, RC6, Rijndael, and Serpent were implemented on the Motorola 6805 series 8-bit architecture. Their performance, including ROM and RAM sizes and time to encrypt a single block, was measured in simulation, and

Attacking SMM memory via Intel CPU cache poisoning

Abstract In this paper we describe novel practical attacks on SMM memory (SMRAM) that exploit CPU caching semantics of Intel-based systems. keywords: CPU Cache, System

Evaluation of synthesizable CPU cores

Abstract Three synthesizable processors, LEON2 from Gaisler Research, MicroBlaze from Xilinx, and OpenRISC 1200 from OpenCores, are evaluated and discussed. Performance in terms of benchmark results and area resource usage is measured.

CPU Cache Consistency with Software Support and Using "one Time Identifiers"

Abstract Multiprocessors with shared memory are currently viewed as the best way to obtain high (aggregate) performance at moderate or low cost. Shared memory is needed for the efficient and effective cooperation of processes and high performance requires the use of

Using CPU system management mode to circumvent operating system security functions

Abstract. In this paper we show how hardware functionalities can be misused by an attacker to extend her control over a system. The originality of our approach is that it exploits seldom used processor and chipset functionalities, such as switching to system management

An improved Round Robin scheduling algorithm for CPU scheduling

Abstract There are many functions which are provided by operating system like process management, memory management, file management, input/output management, networking, protection system and command interpreter system. In these functions, the

Reducing GPU offload latency via fine-grained CPU-GPU synchronization.

Abstract GPUs are seeing increasingly widespread use for general purpose computation due to their excellent performance for highly-parallel, throughput-oriented applications. For many workloads, however, the performance benefits of offloading are hindered by the

A priority based round robin CPU scheduling algorithm for real time systems

Abstract: The main objective of this paper is to develop a new approach for the round robin CPU scheduling algorithm which improves the performance of the CPU in a real-time operating system. The proposed Priority based Round-Robin CPU Scheduling algorithm is based on the

Modeling and analyzing CPU power and performance: Metrics, methods, and abstractions

Battery technology: linear improvements, nowhere near the exponential power increases we've seen. Cooling techniques: air-cooled is reaching limits; fans often undesirable (noise, weight, expense); $1 per chip per Watt when operating in the 40W realm; water-cooled!

Ray-triangle intersection algorithm for modern CPU architectures

Abstract We present an algorithm for determining if a ray intersects a triangle interior; and computing intersection point parameters as well as distance of intersection in response to the ray intersecting a triangle interior. Particularly a variation of a hybrid test having all

Accelerating Kirchhoff Migration by CPU and GPU Cooperation.

Abstract We discuss the performance of Petrobras production Kirchhoff prestack seismic migration on a cluster of 64 GPUs and 256 CPU cores. Porting and optimization of the application hot spot (98.2% of a single CPU core execution time) to a single GPU reduces

Functional Verification of a Multiple-issue, Pipelined, Superscalar Alpha Processor the Alpha 21164 CPU Chip

Digital Technical Journal, Vol. 7, No. 1, 1995, page 136: chip to ensure that no serious functional bugs remained. The goal for second-pass silicon was to be bug-free so that these chips could be shipped to customers for use in revenue-producing systems. Secondary goals

Implementing an interior point method for linear programs on a CPU-GPU system

Abstract. Graphics processing units (GPUs), present in every laptop and desktop computer, are potentially powerful computational engines for solving numerical problems. We present a mixed precision CPU-GPU algorithm for solving linear programming problems using

Fast ray tracing for modern general purpose CPU

Abstract We present a study of the system implications of various aspects of the classic Ray Tracing algorithm. We show how these algorithms can be modified to yield the highest

Implementation of 2D Convolution on FPGA, GPU and CPU

Abstract The 2D convolution algorithm is a memory intensive algorithm with a regular access structure. Implementation on an FPGA can exploit data streaming and pipelining. The GPU is unable to hold onto previously accessed data; this report exemplifies this limitation.

Nehalem-ex cpu architecture

This presentation contains forward-looking statements that involve a number of risks and uncertainties. These statements do not reflect the potential impact of any mergers, acquisitions, divestitures, investments or other similar transactions that may be completed

Analysis of multi-megabyte secondary CPU cache memories

The memory system is a critical component of any high-performance computer system. Memory speed is often a major component of the perceived execution speed of the computer since the processor can only execute as fast as the memory system provides

Linux kernel hotplug CPU support

Abstract During the 2.5 development series, many people collaborated on the infrastructure to add (easy) and remove (hard) CPUs under Linux. This paper will cover the approaches we used, tracing back to the initial PowerPC hack with Anton Blanchard in February 2001,

Extending the Thread Programming Model Across Hybrid FPGA/CPU Architectures

Embedded PPC 405 CPU + sea of free FPGA gates (CLBs) + BRAM provides efficient storage to save system states. System components provided as libraries or soft IPs: system buses (PLB, OPB); interrupt controllers, UARTs. Migration of system services from

A 60 MHz 50 W fine-grain package integrated VR powering a CPU from 3.3 V

Gerhard Schrom, Fabrice Paillet, Jaehong Hahn. A laptop modified to use the IVR-powered CPU booted

Designing a simple fpga-optimized risc cpu and system-on-a-chip

Abstract–This paper presents the complete design of a simple FPGA RISC processor core and system-on-a-chip in synthesizable Verilog. It defines a RISC instruction set architecture and then describes how to implement every part of the processor. Next, an interrupt facility

High speed CPU simulation using JIT binary translation

Abstract: Instruction set simulators are indispensable tools for exploring the design-space of innovative processor architectures, for processor verification, and for software development. Traditional interpretive simulators are too slow to cope with the increasing

Wabi CPU emulation

Emulator goals: run correct Windows applications on multiple RISC platforms, as fast as possible, in as small a footprint as possible; favor speed over space, within reason; x86 memory little-endian regardless of host (simplifies external interfacing); emulate 80486