Compilation of Nested Loop Programs into Process Networks.
Researchers: |
Bart Kienhuis |
Advisor: | Edward A. Lee |
Novel high-performance, domain specific, embedded architectures are
often composed of a microprocessor, some memory, and a number
of dedicated coprocessors. On these embedded architectures,
real-time applications will be executed that belong to the domains of
multi-media processing, mobile communication, and adaptive array
processing. These applications are often written in the C or
Matlab language and contain parts that have a specific structure,
referred to as a nested loop program.
The C and Matlab languages realize an imperative model of
computation. Although this model of computation is well accepted to
specify applications, it does not reveal parallelism due to its
inherent sequential nature. But many applications contain alot of
parallelism. Compilers exist that are able to extract instruction level
parallelism from the original specifications at a very fine level of
granularity. They are, however, unable to exploit coarse-grained
parallelism offered by the coprocessors of the architectures. A better
specification format would be to use an inherently parallel model of
computation like process networks.
This research focuses on the automatic transformation of nested loop
programs into process network specifications. The transformation
consists of a number of steps.
- Extraction of a parameterized dependence graph from the nested
loop program using parametric integer programming (PIP) [1]. The
dependence graph expresses the available parallelism in the nested
loop program.
- Conversion of the parameterized dependence graph into a
polyhedral reduced dependence graph model. This model is based on
polytopes, making the dependence graph more amenable to further
mathematical manipulation.
- Manipulation of the polytope model. Various techniques are used
like Ehrhart polynomials [2], Fourier/Motzkin, and various projection
techniques to further manipulate polytopes and (re)construct missing
polytopes [3].
- Generation of the network and the individual process network
actors from the polyhedral reduced dependence graph model.
The resulting process network model is presented in the Ptolemy II
system. This allows the combination of the derived process network
description with other domains, thus enabling the description and
simulation of more complex systems. Also, by attributing the process
network model with power and latency numbers, various kinds of design
space exploration can be done.
This work is done in
cooperation with Delft University of Technology.
- [1]
- P. Feautrier, "Dataflow Analysis of Arrays and Scalar Reference"
Recherche Operationelle; Operations Research. 1991
- [2]
- Ph. Clauss and V. Loechner, "Parametric Analysis of Polyhedral
Iteration Spaces". Journal of VLSI Signal Processing. 1998
- [3]
- Patrice Quinton, Sanjay Rajopadhye, and Tanguy Risset, "On
Manipulating Z-polyhedra", Tech. Rep., Institut de Recherche en
Informatique et Systemes Aleatoires. 1996
Last updated 11/05/99