Mat-Core is a research processor aiming at exploiting the increasingly number of transistors per IC to improve the performance of a wide range of applications. It extends a general-purpose scalar processor with a matrix unit for processing vector/matrix data, where scalar and vector/matrix instructions are executed out-of-order. To hid memory latency, the extended matrix unit is decoupled into two components: address generation and data computation, which communicate through data queues and executed also out-of-order. Like vector architectures, the data computation unit is organized in parallel lanes. However, on parallel lanes, Mat-Core can execute scalar-matrix, vector-matrix, and matrix-matrix instructions in addition to scalar-vector and vector-vector instructions. By extending the well known scoreboard algorithm, these instructions are executed out-of-order on parallel pipelines. This paper describes the SystemC (system level modeling language) implementation of Mat-Core and evaluates its performance on vector and matrix kernels. On four parallel lanes and matrix registers of size 4×8 or 32 elements each, the performance of Mat-Core with queues size of 10, start up time of 6 clock cycles, and memory latency of 10 clock cycles, is about 1.4, 2.1, 4.2, 2.6, 4.2, and 6.4 FLOPs per clock cycle; achieved on scalar-vector multiplication, SAXPY, Givens, rank-1 update, vector-matrix multiplication, and matrix-matrix multiplication, respectively.
Abstract
Key Words
1. Introduction
2. The Architecture of Out-of-Order Mat-Core Processor
3. Performance Evaluation of the Out-of-Order Mat-Core
4. Summaries
Refrences
Abstract
I. Introduction
II. Process of Airplane Configuration Management
III. Module-Based Simplified Airplane Configuration Management
IV. Conclusions
V. References
Abstract
Key Words
1. Introduction
2. Problem Formulation
3. Robust Constrained LSCMA
4. Sinmulation
5. Conclusions
References
Abstract
Key Words
1. Introduction
2. Particle Swarm Optimization
3. Cylindricity Analysis
4. Examples
5. Conclusions
References
Abstract
Keywords
1. Introduction
2. Intelligent-Based Methods for Integration Enhancement
3. ABS/GPS Navigation System Architecture
4. ABS Navigation Algorithm: Kinematic and Dynamic Model of a Kind of Land Vehicle
5. Fuzzy Navigation Algorithm
6. Implementation and Expriment Results
7. Conclusion
References
Abstract
Key Words
1. Introduction
2. Model Description
3. Extraction of Model Parameters
4. Summaries
References
Abstract
Key Words
1. Introduction
2. Analytical Cutting Force Component
3. Description of the Experimental Equipment
5. Description of the Neural Networks Model
5. Results
6. Conclusions
References
Abstract
Key Words
1. Introduction
2. System Architecture
3. Search Engine Based on Nutch
4. Retrieval System
5. Application Based on DPRSP
6. Conclusions and Future Work
References
Abstract
Key Words
1 Introduction
2. The System
3. Structured Light
4. Camera Stereo Calibration
5. Regiatration
6. Reconstruction of a Sole Surface
Summaries
Acknowledgment
References
Abstract
1 Introduction
2 Related Works
3 Web Services Mapping Mechanism
4 Hierarchical Composing Structure
5 Implement and Experimentation
6 Conclusions
References
Abstract
Key Words
1 Introduction
2. System Description
3. Modeling of Prop Control System
4. System Simulations and Performance Analysis in the Process of Support Lifting
5. Simulation Results and System Test in the Process of Support Loading
6. Conclusions
Acknowledgment
References
Abstract
Key Words
1 Introduction
2. The Model Structure of CGF
3 The Behavior Model of CGF
4. Application of the Novel Behavior Modeling Method
5. Conclusion
References
Abstract
Key Words
1 Introduction
2. Pipeline ADC Architecture
3 Design of a 4Bit Pipeline ADC
4. Results
Conclusion
References
Abstract
Key Words
1. Introduction
2. The Modelling Activity Paradigms and Formalisms
3. Approach towards the SMA Modelling
4. The Producer Consumer Agents Structure Concepts
5. The Producer Consumer Agents Behaviour Concepts
6. Summaries
References
Abstract
Key Words
1 Being Bought Forward of Session Authentication Protocol
2 The Problems in the Original Protocol
3 The Improvement for Session Authentication Mechanism
4 Summaries
References