posted on 2010-05-10, 07:59authored byVassilios Chouliaras, James FlintJames Flint, Yibin Li, Jose L. Nunez-Yanez
We discuss a configurable, System-on-Chip vector
multiprocessor for accelerating the Transmission Line
Modeling (TLM) algorithm with an architecture capable of
exploiting the two primary forms of parallelism in the code,
thread and data level parallelism. Theoretical results
demonstrate an order of magnitude reduction in the dynamic
instruction count for a scalar-processor/vector-coprocessor
configuration at a vector length of sixteen 32-bit singleprecision
elements. Furthermore, a multi-vector SoC
architecture consisting of ten such vector accelerators provides
a near-linear theoretical performance benefit of the order of
88% in three out of four benchmark configurations which is
orthogonal to the benefit realized by vectorization alone. We
discuss in detail this potent architecture and present
implementation data for the 2-way multi-processor VLSI
macrocell.
History
School
Mechanical, Electrical and Manufacturing Engineering
Citation
CHOULIARAS, V.A. ... et al., 2005. A system-on-chip vector multiprocessor for transmission line modelling acceleration. IN: IEEE Workshop on Signal Processing Systems Design and Implementation (SiPS 2005), Athens, Greece, 2-4 November, pp. 568-572.