



Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
These are the Lecture Slides of Program Optimization for Multi Core Architectures which includes Triangular Lower Limits, Multiple Loop Limits, Dependence System Solvers, Single Equation, Simple Test, Extreme Value Test etc.Key important points are: Multi Core Computing, Contents, Scheduling, Scheduling Criteria, Scheduling Algorithms, Preemptive, Adding Priority
Typology: Slides
1 / 7
This page cannot be seen from the preview
Don't miss anything!
Hardware for extracting ILP has reached the point of diminishing return Need a large number of in-flight instructions Supporting such a large population inside the chip requires power-hungry delay- sensitive logic and storage Verification complexity is getting out of control How to exploit so many transistors? Must be a de-centralized design which avoids long wires
Put a few reasonably complex processors or many simple processors on the chip Each processor has its own primary cache and pipeline Often a processor is called a core Often called a chip-multiprocessor (CMP) Did we use the transistors properly? Depends on if you can keep the cores busy Introduces the concept of thread-level parallelism (TLP)
Ideal for shared address space Fast on-chip hardwired communication through cache (no OS intervention) Two types of architectures Tiled CMP: each core has its private cache hierarchy (no cache sharing); Intel Pentium D, Dual Core Opteron , Intel Montecito, Sun UltraSPARC IV, IBM Cell (more specialized) Shared cache CMP: Outermost level of cache hierarchy is shared among cores; Intel Woodcrest (server-grade Core duo), Intel Conroe (Core2 duo for desktop), Sun Niagara, IBM Power4, IBM Power
A good reading is Parallel Computer Architecture by Culler, Singh with Gupta Caveat: does not talk about multi-core, but introduces the general area of shared memory multiprocessors Papers Check out the most recent issue of Intel Technology Journal http://www.intel.com/technology/itj/ http://www.intel.com/technology/itj/archive.htm Conferences: ASPLOS, ISCA, HPCA, MICRO, PACT Journals: IEEE Micro, IEEE TPDS, ACM TACO