associativity.rst 4.6 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105
  1. ============================
  2. NUMA resource associativity
  3. ============================
  4. Associativity represents the groupings of the various platform resources into
  5. domains of substantially similar mean performance relative to resources outside
  6. of that domain. Resources subsets of a given domain that exhibit better
  7. performance relative to each other than relative to other resources subsets
  8. are represented as being members of a sub-grouping domain. This performance
  9. characteristic is presented in terms of NUMA node distance within the Linux kernel.
  10. From the platform view, these groups are also referred to as domains.
  11. PAPR interface currently supports different ways of communicating these resource
  12. grouping details to the OS. These are referred to as Form 0, Form 1 and Form2
  13. associativity grouping. Form 0 is the oldest format and is now considered deprecated.
  14. Hypervisor indicates the type/form of associativity used via "ibm,architecture-vec-5 property".
  15. Bit 0 of byte 5 in the "ibm,architecture-vec-5" property indicates usage of Form 0 or Form 1.
  16. A value of 1 indicates the usage of Form 1 associativity. For Form 2 associativity
  17. bit 2 of byte 5 in the "ibm,architecture-vec-5" property is used.
  18. Form 0
  19. ------
  20. Form 0 associativity supports only two NUMA distances (LOCAL and REMOTE).
  21. Form 1
  22. ------
  23. With Form 1 a combination of ibm,associativity-reference-points, and ibm,associativity
  24. device tree properties are used to determine the NUMA distance between resource groups/domains.
  25. The “ibm,associativity” property contains a list of one or more numbers (domainID)
  26. representing the resource’s platform grouping domains.
  27. The “ibm,associativity-reference-points” property contains a list of one or more numbers
  28. (domainID index) that represents the 1 based ordinal in the associativity lists.
  29. The list of domainID indexes represents an increasing hierarchy of resource grouping.
  30. ex:
  31. { primary domainID index, secondary domainID index, tertiary domainID index.. }
  32. Linux kernel uses the domainID at the primary domainID index as the NUMA node id.
  33. Linux kernel computes NUMA distance between two domains by recursively comparing
  34. if they belong to the same higher-level domains. For mismatch at every higher
  35. level of the resource group, the kernel doubles the NUMA distance between the
  36. comparing domains.
  37. Form 2
  38. -------
  39. Form 2 associativity format adds separate device tree properties representing NUMA node distance
  40. thereby making the node distance computation flexible. Form 2 also allows flexible primary
  41. domain numbering. With numa distance computation now detached from the index value in
  42. "ibm,associativity-reference-points" property, Form 2 allows a large number of primary domain
  43. ids at the same domainID index representing resource groups of different performance/latency
  44. characteristics.
  45. Hypervisor indicates the usage of FORM2 associativity using bit 2 of byte 5 in the
  46. "ibm,architecture-vec-5" property.
  47. "ibm,numa-lookup-index-table" property contains a list of one or more numbers representing
  48. the domainIDs present in the system. The offset of the domainID in this property is
  49. used as an index while computing numa distance information via "ibm,numa-distance-table".
  50. prop-encoded-array: The number N of the domainIDs encoded as with encode-int, followed by
  51. N domainID encoded as with encode-int
  52. For ex:
  53. "ibm,numa-lookup-index-table" = {4, 0, 8, 250, 252}. The offset of domainID 8 (2) is used when
  54. computing the distance of domain 8 from other domains present in the system. For the rest of
  55. this document, this offset will be referred to as domain distance offset.
  56. "ibm,numa-distance-table" property contains a list of one or more numbers representing the NUMA
  57. distance between resource groups/domains present in the system.
  58. prop-encoded-array: The number N of the distance values encoded as with encode-int, followed by
  59. N distance values encoded as with encode-bytes. The max distance value we could encode is 255.
  60. The number N must be equal to the square of m where m is the number of domainIDs in the
  61. numa-lookup-index-table.
  62. For ex:
  63. ibm,numa-lookup-index-table = <3 0 8 40>;
  64. ibm,numa-distace-table = <9>, /bits/ 8 < 10 20 80 20 10 160 80 160 10>;
  65. ::
  66. | 0 8 40
  67. --|------------
  68. |
  69. 0 | 10 20 80
  70. |
  71. 8 | 20 10 160
  72. |
  73. 40| 80 160 10
  74. A possible "ibm,associativity" property for resources in node 0, 8 and 40
  75. { 3, 6, 7, 0 }
  76. { 3, 6, 9, 8 }
  77. { 3, 6, 7, 40}
  78. With "ibm,associativity-reference-points" { 0x3 }
  79. "ibm,lookup-index-table" helps in having a compact representation of distance matrix.
  80. Since domainID can be sparse, the matrix of distances can also be effectively sparse.
  81. With "ibm,lookup-index-table" we can achieve a compact representation of
  82. distance information.