- Multi-Vdd, variable Vdd and Multi-Vth seems to be a good global solution.
- Reducing the clock speed will result in low power consumption, but on the cost of performance.
- Using power headers and power footer transistors on logic gates cuts down power.
- You could separate the design in blocks, which can go in to sleep mode.
- Another solutions is variable VDD and variable frequency (as Intel or AMD do).This means, you adapt VDD and frequency to the necessary performance.
- Gated clocks and Logic Addressable clocks, dis adv - timing problems due to improper latching of signals, and difficult to test.
- -ve edge triggered flops, (nor+inv) = 1.5 gates, +ve edge triggered flops, (and+inv) = 2.5 gates, so -ve has less gates, less glitching and hence low power.
Any more thoughts and ideas are welcome.
Your comments will be moderated before it can appear here.