Steve Behling example of TLB miss effect

375 MHz Power3
TLB is 256-entry, 2-way set associative, maps 4 KB pages (= 1 MB)
64 KB L1 dcache, 128-way set associative; 4 MB L2 cache

Loop order k, j, i (i innermost): Time = 21.5 s, 329.7 LD/TLB miss

      subroutine sub_nest(a1,a2,a3,n)
      parameter (NCELL=400)
      real(8) a1(NCELL,NCELL,NCELL)
      real(8) a2(NCELL,NCELL,NCELL)
      real(8) a3(NCELL,NCELL,NCELL)
      integer(4) n
      integer(4) i,j,k
      real(8) s
      s=1.1d0
      do 10 k=1,NCELL
        do 10 j=1,NCELL
          do 10 i=1,NCELL
            a3(i,j,k)=a2(i,j,k) + s*a1(i,j,k)
   10 continue
      end

Loop order i, j, k (k innermost): Time = 980.6 s, 0.667 LD/TLB miss

      do 10 i=1,NCELL
        do 10 j=1,NCELL
          do 10 k=1,NCELL
            a3(i,j,k)=a2(i,j,k) + s*a1(i,j,k)
   10 continue
      end

Loop order k, i, j (j innermost): Time = 178.0 s, 0.853 LD/TLB miss

      do 10 k=1,NCELL
        do 10 i=1,NCELL
          do 10 j=1,NCELL
            a3(i,j,k)=a2(i,j,k) + s*a1(i,j,k)
   10 continue
      end
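
The three timings follow the memory stride of the innermost loop, since Fortran stores arrays in column-major order (the first index varies fastest in memory). The sketch below works out that stride arithmetic. It is illustrative only: the module name tlb_stride and the functions stride_bytes and tlb_reach_bytes are hypothetical names that do not appear on the slide, it is written in free-form Fortran rather than the fixed form used above, and it assumes real(8) occupies 8 bytes.

```fortran
module tlb_stride
  implicit none
  integer, parameter :: i8 = selected_int_kind(18)
  integer(i8), parameter :: page_bytes  = 4096_i8   ! 4 KB pages (from the slide)
  integer(i8), parameter :: tlb_entries = 256_i8    ! 256-entry TLB (from the slide)
  integer(i8), parameter :: elem_bytes  = 8_i8      ! assumption: real(8) is 8 bytes
contains

  ! Byte distance between consecutive inner-loop accesses when array index
  ! number idx (1 = i, 2 = j, 3 = k) is the innermost loop variable of an
  ! ncell x ncell x ncell column-major array.
  pure function stride_bytes(idx, ncell) result(nbytes)
    integer, intent(in) :: idx, ncell
    integer(i8) :: nbytes
    nbytes = elem_bytes * int(ncell, i8)**(idx - 1)
  end function stride_bytes

  ! Total memory the TLB can map at one time.
  pure function tlb_reach_bytes() result(nbytes)
    integer(i8) :: nbytes
    nbytes = tlb_entries * page_bytes
  end function tlb_reach_bytes

end module tlb_stride
```

With NCELL=400, stride_bytes(1, 400) is 8 bytes, so 512 consecutive elements share one 4 KB page. stride_bytes(2, 400) is 3,200 bytes, so most steps land on a new page. stride_bytes(3, 400) is 1,280,000 bytes, which is larger than the 1,048,576 bytes returned by tlb_reach_bytes(), so every step lands on a page far from the previous one. This matches the measured ordering: i innermost is fastest and k innermost is slowest.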