CS 6461 Computer Architecture

Reference Links

 

Parallel Computing

Parallel Programming

Flynn Very High-Speed Computing Systems - Categories

Cost-Effective Parallel Computing – David A. Wood

The Landscape of Parallel Computing Research: A View from Berkeley

A Six Lecture Primer on Parallel Computing

EPIC-IA-64

 

Cache

David Patterson Cache Lecture – Berkeley

Cache Memories

Cache Write Policies and Performance

Cache Coherence Tutorial

Cache Coherence Tutorial Part 2: Directory Based

VICTIM CACHE STRATEGIES  Stallings (content from publisher site)

Synchronization, Coherence, and Event Ordering in Multiprocessors

An Overview of Cache Optimization Techniques and Cache Aware Numerical Algorithms*

Shared Memory Consistency Models: A Tutorial

 

Architecture and Computer Security

          RowHammer

          Spectre


Application of Architecture

          DNA Read Mapping

          Genome Realignment Filter

          Neural Nets and Branch Prediction

          Rowclone Fast Copy


Graphics Processors

NVidia

AMD

Machine Learning with GPU

 

Input/Output

A Study of I/O System Organizations

 

Memory/Virtual Memory

INTERLEAVED MEMORY   Stallings (content from publisher site)

Shared Memory Consistency Models: A Tutorial

Virtual Memory

Working Sets Past and Present

Working Set

 

Multiprocessor

A Survey of Multicore Processors

Completing an MIMD Multiprocessor Taxonomy

Computer Interconnection Structures:  Taxonomy, Characteristics, and Examples

 

Performance Optimization

Clock Rate versus IPC: The End of the Road for Conventional Microarchitectures

Instruction-Level Parallel Processing:  History, Overview and Perspective

Multiscalar Processors

A STUDY OF BRANCH PREDICTION STRATEGIES

An Efficient Algorithm for Exploiting Multiple Arithmetic Units - Tomasulo

Reorder Buffer

 

Pipelines

Pipeline Architecture

Implementation of Precise Interrupt in Pipelined Processors

 

RISC

Reduced Instruction Set Computers

Performance from Architecture:  Comparing a RISC and a CISC with Similar Hardware Organization

 

Vector Processors

Appendix G From Computer Architecture Publisher Web Site