123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119 |
- PCIe Device AER statistics
- --------------------------
- These attributes show up under all the devices that are AER capable. These
- statistical counters indicate the errors "as seen/reported by the device".
- Note that this may mean that if an endpoint is causing problems, the AER
- counters may increment at its link partner (e.g. root port) because the
- errors may be "seen" / reported by the link partner and not the
- problematic endpoint itself (which may report all counters as 0 as it never
- saw any problems).
- What: /sys/bus/pci/devices/<dev>/aer_dev_correctable
- Date: July 2018
- KernelVersion: 4.19.0
- Contact: [email protected], [email protected]
- Description: List of correctable errors seen and reported by this
- PCI device using ERR_COR. Note that since multiple errors may
- be reported using a single ERR_COR message, thus
- TOTAL_ERR_COR at the end of the file may not match the actual
- total of all the errors in the file. Sample output::
- localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_correctable
- Receiver Error 2
- Bad TLP 0
- Bad DLLP 0
- RELAY_NUM Rollover 0
- Replay Timer Timeout 0
- Advisory Non-Fatal 0
- Corrected Internal Error 0
- Header Log Overflow 0
- TOTAL_ERR_COR 2
- What: /sys/bus/pci/devices/<dev>/aer_dev_fatal
- Date: July 2018
- KernelVersion: 4.19.0
- Contact: [email protected], [email protected]
- Description: List of uncorrectable fatal errors seen and reported by this
- PCI device using ERR_FATAL. Note that since multiple errors may
- be reported using a single ERR_FATAL message, thus
- TOTAL_ERR_FATAL at the end of the file may not match the actual
- total of all the errors in the file. Sample output::
- localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_fatal
- Undefined 0
- Data Link Protocol 0
- Surprise Down Error 0
- Poisoned TLP 0
- Flow Control Protocol 0
- Completion Timeout 0
- Completer Abort 0
- Unexpected Completion 0
- Receiver Overflow 0
- Malformed TLP 0
- ECRC 0
- Unsupported Request 0
- ACS Violation 0
- Uncorrectable Internal Error 0
- MC Blocked TLP 0
- AtomicOp Egress Blocked 0
- TLP Prefix Blocked Error 0
- TOTAL_ERR_FATAL 0
- What: /sys/bus/pci/devices/<dev>/aer_dev_nonfatal
- Date: July 2018
- KernelVersion: 4.19.0
- Contact: [email protected], [email protected]
- Description: List of uncorrectable nonfatal errors seen and reported by this
- PCI device using ERR_NONFATAL. Note that since multiple errors
- may be reported using a single ERR_FATAL message, thus
- TOTAL_ERR_NONFATAL at the end of the file may not match the
- actual total of all the errors in the file. Sample output::
- localhost /sys/devices/pci0000:00/0000:00:1c.0 # cat aer_dev_nonfatal
- Undefined 0
- Data Link Protocol 0
- Surprise Down Error 0
- Poisoned TLP 0
- Flow Control Protocol 0
- Completion Timeout 0
- Completer Abort 0
- Unexpected Completion 0
- Receiver Overflow 0
- Malformed TLP 0
- ECRC 0
- Unsupported Request 0
- ACS Violation 0
- Uncorrectable Internal Error 0
- MC Blocked TLP 0
- AtomicOp Egress Blocked 0
- TLP Prefix Blocked Error 0
- TOTAL_ERR_NONFATAL 0
- PCIe Rootport AER statistics
- ----------------------------
- These attributes show up under only the rootports (or root complex event
- collectors) that are AER capable. These indicate the number of error messages as
- "reported to" the rootport. Please note that the rootports also transmit
- (internally) the ERR_* messages for errors seen by the internal rootport PCI
- device, so these counters include them and are thus cumulative of all the error
- messages on the PCI hierarchy originating at that root port.
- What: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_cor
- Date: July 2018
- KernelVersion: 4.19.0
- Contact: [email protected], [email protected]
- Description: Total number of ERR_COR messages reported to rootport.
- What: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_fatal
- Date: July 2018
- KernelVersion: 4.19.0
- Contact: [email protected], [email protected]
- Description: Total number of ERR_FATAL messages reported to rootport.
- What: /sys/bus/pci/devices/<dev>/aer_stats/aer_rootport_total_err_nonfatal
- Date: July 2018
- KernelVersion: 4.19.0
- Contact: [email protected], [email protected]
- Description: Total number of ERR_NONFATAL messages reported to rootport.
|