LOCK STATISTICS

- WHAT

As the name suggests, it provides statistics on locks.

- WHY

Because things like lock contention can severely impact performance.

- HOW

Lockdep already has hooks in the lock functions and maps lock instances to
lock classes. We build on that (see Documentation/lockdep-design.txt).
The graph below shows the relation between the lock functions and the various
hooks therein.

        __acquire
            |
           lock _____
            |        \
            |    __contended
            |         |
            |       <wait>
            | _______/
            |/
            |
       __acquired
            |
            .
          <hold>
            .
            |
       __release
            |
         unlock

lock, unlock    - the regular lock functions
__*             - the hooks
<>              - states
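
The graph above can be read as a small state machine. The following is a toy
Python model of it, not the kernel implementation: the class name LockStat,
its method names and the explicit "now" timestamps are all assumptions made
for illustration, and the __acquire hook is left out because it does not
affect the timing.

```python
class LockStat:
    """Toy model of the __contended/__acquired/__release hooks."""

    def __init__(self):
        self.contentions = 0
        self.acquisitions = 0
        self.wait_total = 0.0
        self.hold_total = 0.0
        self._wait_start = None
        self._hold_start = None

    def contended(self, now):
        # __contended: the lock was busy, so the <wait> state begins.
        self.contentions += 1
        self._wait_start = now

    def acquired(self, now):
        # __acquired: any pending <wait> ends and the <hold> state begins.
        if self._wait_start is not None:
            self.wait_total += now - self._wait_start
            self._wait_start = None
        self.acquisitions += 1
        self._hold_start = now

    def release(self, now):
        # __release: the <hold> state ends.
        self.hold_total += now - self._hold_start
        self._hold_start = None


stat = LockStat()
# Uncontended acquisition: held from t=0 to t=3.
stat.acquired(0.0)
stat.release(3.0)
# Contended acquisition: waits from t=5 to t=7, then held until t=8.
stat.contended(5.0)
stat.acquired(7.0)
stat.release(8.0)
print(stat.contentions, stat.acquisitions, stat.wait_total, stat.hold_total)
```

Only the second acquisition passes through __contended, so it is the only one
that contributes to the wait time.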

With these hooks we provide the following statistics:

 con-bounces       - number of lock contentions that involved cross-CPU data
 contentions       - number of lock acquisitions that had to wait
 wait time min     - shortest (non-0) time we ever had to wait for a lock
           max     - longest time we ever had to wait for a lock
           total   - total time we spent waiting on this lock
 acq-bounces       - number of lock acquisitions that involved cross-CPU data
 acquisitions      - number of times we took the lock
 hold time min     - shortest (non-0) time we ever held the lock
           max     - longest time we ever held the lock
           total   - total time this lock was held

From these numbers various other statistics can be derived, such as:

 hold time average = hold time total / acquisitions

These numbers are gathered per lock class, per read/write state (when
applicable).

It also tracks 4 contention points per class. A contention point is a call site
that had to wait on lock acquisition.
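
As a hedged illustration of deriving statistics, the Python sketch below
computes the hold time average defined above. The other two ratios (wait time
average and contention ratio) are not named in this document; they are further
examples chosen here. The function name derived_stats is hypothetical, and the
input values are those of the dcache_lock row in the sample output of the
USAGE section.

```python
def derived_stats(contentions, waittime_total, acquisitions, holdtime_total):
    """Derive per-class averages from the raw lock_stat counters."""

    def ratio(numerator, denominator):
        # A class that was never taken or never contended has no average.
        return numerator / denominator if denominator else 0.0

    return {
        # Defined in the text: hold time total / acquisitions.
        "hold time average": ratio(holdtime_total, acquisitions),
        # Illustrative extras, not named in the original document.
        "wait time average": ratio(waittime_total, contentions),
        "contention ratio": ratio(contentions, acquisitions),
    }


result = derived_stats(
    contentions=623,
    waittime_total=1053.02,
    acquisitions=91930,
    holdtime_total=118423.41,
)
for key, value in result.items():
    print(f"{key}: {value:.4f}")
```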

- CONFIGURATION

Lock statistics are enabled via CONFIG_LOCK_STAT.

- USAGE

Enable collection of statistics:

# echo 1 >/proc/sys/kernel/lock_stat

Disable collection of statistics:

# echo 0 >/proc/sys/kernel/lock_stat

Look at the current lock statistics:

( line numbers not part of actual output, done for clarity in the explanation
  below )

# less /proc/lock_stat

01 lock_stat version 0.3
02 -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
03                               class name     con-bounces    contentions   waittime-min   waittime-max waittime-total    acq-bounces   acquisitions   holdtime-min   holdtime-max holdtime-total
04 -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
05
06                          &mm->mmap_sem-W:            233            538 18446744073708       22924.27      607243.51           1342          45806           1.71        8595.89     1180582.34
07                          &mm->mmap_sem-R:            205            587 18446744073708       28403.36      731975.00           1940         412426           0.58      187825.45     6307502.88
08                          ---------------
09                            &mm->mmap_sem            487          [<ffffffff8053491f>] do_page_fault+0x466/0x928
10                            &mm->mmap_sem            179          [<ffffffff802a6200>] sys_mprotect+0xcd/0x21d
11                            &mm->mmap_sem            279          [<ffffffff80210a57>] sys_mmap+0x75/0xce
12                            &mm->mmap_sem             76          [<ffffffff802a490b>] sys_munmap+0x32/0x59
13                          ---------------
14                            &mm->mmap_sem            270          [<ffffffff80210a57>] sys_mmap+0x75/0xce
15                            &mm->mmap_sem            431          [<ffffffff8053491f>] do_page_fault+0x466/0x928
16                            &mm->mmap_sem            138          [<ffffffff802a490b>] sys_munmap+0x32/0x59
17                            &mm->mmap_sem            145          [<ffffffff802a6200>] sys_mprotect+0xcd/0x21d
18
19 ...............................................................................................................................................................................................
20
21                              dcache_lock:            621            623           0.52         118.26        1053.02           6745          91930           0.29         316.29      118423.41
22                              -----------
23                              dcache_lock            179          [<ffffffff80378274>] _atomic_dec_and_lock+0x34/0x54
24                              dcache_lock            113          [<ffffffff802cc17b>] d_alloc+0x19a/0x1eb
25                              dcache_lock             99          [<ffffffff802ca0dc>] d_rehash+0x1b/0x44
26                              dcache_lock            104          [<ffffffff802cbca0>] d_instantiate+0x36/0x8a
27                              -----------
28                              dcache_lock            192          [<ffffffff80378274>] _atomic_dec_and_lock+0x34/0x54
29                              dcache_lock             98          [<ffffffff802ca0dc>] d_rehash+0x1b/0x44
30                              dcache_lock             72          [<ffffffff802cc17b>] d_alloc+0x19a/0x1eb
31                              dcache_lock            112          [<ffffffff802cbca0>] d_instantiate+0x36/0x8a

This excerpt shows the first two lock class statistics. Line 01 shows the
output version - each time the format changes this will be updated. Lines 02-04
show the header with column descriptions. Lines 05-18 and 20-31 show the actual
statistics. These statistics come in two parts: the actual stats, separated by a
short separator (lines 08, 13) from the contention points.

The first lock (05-18) is a read/write lock, and shows two lines above the
short separator. The contention points don't match the column descriptors;
they have two columns: contentions and [<IP>] symbol. The second set of
contention points are the points we're contending with.

The integer part of the time values is in microseconds (us).
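
The two line shapes described above (a statistics line ending its class name
with ":" followed by ten values, and a contention point line with a count and
an [<IP>] symbol) can be told apart mechanically. The Python sketch below is
one possible way to do so, assuming the lines are read from the real file and
therefore carry no explanatory line numbers. The function names and the
regular expression are assumptions for illustration, not part of any kernel
interface.

```python
import re

COLUMNS = (
    "con-bounces", "contentions",
    "waittime-min", "waittime-max", "waittime-total",
    "acq-bounces", "acquisitions",
    "holdtime-min", "holdtime-max", "holdtime-total",
)

# Contention point line: class name, count, "[<IP>]", symbol.
POINT_RE = re.compile(r"^\s*(\S+)\s+(\d+)\s+\[<([0-9a-fA-F]+)>\]\s+(.+?)\s*$")


def parse_stat_line(line):
    """Return (class name, {column: value}) or None if not a stat line."""
    name, sep, rest = line.rpartition(":")
    fields = rest.split()
    if not sep or len(fields) != len(COLUMNS):
        return None
    try:
        values = [float(field) for field in fields]
    except ValueError:
        return None
    return name.strip(), dict(zip(COLUMNS, values))


def parse_contention_point(line):
    """Return (class name, contentions, address, symbol) or None."""
    match = POINT_RE.match(line)
    if match is None:
        return None
    name, count, address, symbol = match.groups()
    return name, int(count), int(address, 16), symbol


stat = parse_stat_line(
    "&mm->mmap_sem-W: 233 538 18446744073708 22924.27 607243.51 "
    "1342 45806 1.71 8595.89 1180582.34"
)
point = parse_contention_point(
    "&mm->mmap_sem 487 [<ffffffff8053491f>] do_page_fault+0x466/0x928"
)
print(stat[0], stat[1]["contentions"])
print(point[0], point[1], point[3])
```

Header, separator and blank lines match neither parser and yield None, so a
caller can simply skip them.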

When dealing with nested locks, subclasses may appear:

32 ...............................................................................................................................................................................................
33
34                                &rq->lock:          13128          13128           0.43         190.53      103881.26          97454        3453404           0.00         401.11    13224683.11
35                                ---------
36                                &rq->lock            645          [<ffffffff8103bfc4>] task_rq_lock+0x43/0x75
37                                &rq->lock            297          [<ffffffff8104ba65>] try_to_wake_up+0x127/0x25a
38                                &rq->lock            360          [<ffffffff8103c4c5>] select_task_rq_fair+0x1f0/0x74a
39                                &rq->lock            428          [<ffffffff81045f98>] scheduler_tick+0x46/0x1fb
40                                ---------
41                                &rq->lock             77          [<ffffffff8103bfc4>] task_rq_lock+0x43/0x75
42                                &rq->lock            174          [<ffffffff8104ba65>] try_to_wake_up+0x127/0x25a
43                                &rq->lock           4715          [<ffffffff8103ed4b>] double_rq_lock+0x42/0x54
44                                &rq->lock            893          [<ffffffff81340524>] schedule+0x157/0x7b8
45
46 ...............................................................................................................................................................................................
47
48                              &rq->lock/1:          11526          11488           0.33         388.73      136294.31          21461          38404           0.00          37.93      109388.53
49                              -----------
50                              &rq->lock/1          11526          [<ffffffff8103ed58>] double_rq_lock+0x4f/0x54
51                              -----------
52                              &rq->lock/1           5645          [<ffffffff8103ed4b>] double_rq_lock+0x42/0x54
53                              &rq->lock/1           1224          [<ffffffff81340524>] schedule+0x157/0x7b8
54                              &rq->lock/1           4336          [<ffffffff8103ed58>] double_rq_lock+0x4f/0x54
55                              &rq->lock/1            181          [<ffffffff8104ba65>] try_to_wake_up+0x127/0x25a

Line 48 shows statistics for the second subclass (/1) of the &rq->lock class
(subclass numbering starts at 0). It appears because, as line 50 suggests,
double_rq_lock acquires two spinlocks, one nested inside the other.
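
A class name as printed can therefore carry an optional "/N" subclass and an
optional "-R" or "-W" read/write suffix. The Python sketch below splits such a
name into its parts. The function name split_class_name is hypothetical, and
the assumed ordering (subclass before read/write suffix) is inferred from the
samples in this document rather than stated by it. Any other decoration, such
as the "#2" seen in some class names, is left untouched in the base name.

```python
import re

# base name, optional "/<subclass>", optional "-R" or "-W" (assumed order).
_NAME_RE = re.compile(r"^(?P<base>.+?)(?:/(?P<sub>\d+))?(?:-(?P<rw>[RW]))?$")


def split_class_name(name):
    """Return (base name, subclass number, 'R'/'W'/None) for a class name."""
    match = _NAME_RE.match(name)
    if match is None:
        # Unrecognized (for example empty) names are returned unchanged.
        return name, 0, None
    # A missing "/N" means subclass 0, since subclass numbering starts at 0.
    subclass = int(match.group("sub")) if match.group("sub") else 0
    return match.group("base"), subclass, match.group("rw")


for example in ("&rq->lock", "&rq->lock/1", "&mm->mmap_sem-W"):
    print(example, "->", split_class_name(example))
```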

View the top contending locks:

# grep : /proc/lock_stat | head
              &inode->i_data.tree_lock-W:             15          21657           0.18     1093295.30 11547131054.85             58          10415           0.16          87.51        6387.60
              &inode->i_data.tree_lock-R:              0              0           0.00           0.00           0.00          23302         231198           0.25           8.45       98023.38
                             dcache_lock:           1037           1161           0.38          45.32         774.51           6611         243371           0.15         306.48       77387.24
                         &inode->i_mutex:            161            286 18446744073709       62882.54     1244614.55           3653          20598 18446744073709       62318.60     1693822.74
                         &zone->lru_lock:             94             94           0.53           7.33          92.10           4366          32690           0.29          59.81       16350.06
             &inode->i_data.i_mmap_mutex:             79             79           0.40           3.77          53.03          11779          87755           0.28         116.93       29898.44
                        &q->__queue_lock:             48             50           0.52          31.62          86.31            774          13131           0.17         113.08       12277.52
                        &rq->rq_lock_key:             43             47           0.74          68.50         170.63           3706          33929           0.22         107.99       17460.62
                      &rq->rq_lock_key#2:             39             46           0.75           6.68          49.03           2979          32292           0.17         125.17       17137.63
                         tasklist_lock-W:             15             15           1.45          10.87          32.70           1201           7390           0.58          62.55       13648.47
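
The grep above shows the statistics lines in the order the file lists them. To
rank classes by a different column, the lines can be re-sorted. The Python
sketch below does this for holdtime-total on three rows copied from the output
above. The function name top_by_column and the zero-based column index are
assumptions for illustration; on a live system the text would come from
reading /proc/lock_stat.

```python
def top_by_column(text, column_index, limit=3):
    """Rank lock classes by one of the ten numeric columns, largest first."""
    rows = []
    for line in text.splitlines():
        # Statistics lines are "<class name>: <ten values>".
        name, sep, rest = line.rpartition(":")
        fields = rest.split()
        if not sep or len(fields) != 10:
            continue  # header, separator or contention point line
        rows.append((name.strip(), float(fields[column_index])))
    rows.sort(key=lambda row: row[1], reverse=True)
    return rows[:limit]


SAMPLE = """\
&inode->i_data.tree_lock-W: 15 21657 0.18 1093295.30 11547131054.85 58 10415 0.16 87.51 6387.60
dcache_lock: 1037 1161 0.38 45.32 774.51 6611 243371 0.15 306.48 77387.24
&zone->lru_lock: 94 94 0.53 7.33 92.10 4366 32690 0.29 59.81 16350.06
"""

HOLDTIME_TOTAL = 9  # zero-based index of the last column
ranking = top_by_column(SAMPLE, HOLDTIME_TOTAL)
for name, value in ranking:
    print(name, value)
```

In this sample the class with the most contentions has the smallest total
hold time, so the ranking depends on which column is chosen.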

Clear the statistics:

# echo 0 > /proc/lock_stat