usbmon.txt 14 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356
  1. * Introduction
  2. The name "usbmon" in lowercase refers to a facility in kernel which is
  3. used to collect traces of I/O on the USB bus. This function is analogous
  4. to a packet socket used by network monitoring tools such as tcpdump(1)
  5. or Ethereal. Similarly, it is expected that a tool such as usbdump or
  6. USBMon (with uppercase letters) is used to examine raw traces produced
  7. by usbmon.
  8. The usbmon reports requests made by peripheral-specific drivers to Host
  9. Controller Drivers (HCD). So, if HCD is buggy, the traces reported by
  10. usbmon may not correspond to bus transactions precisely. This is the same
  11. situation as with tcpdump.
  12. Two APIs are currently implemented: "text" and "binary". The binary API
  13. is available through a character device in /dev namespace and is an ABI.
  14. The text API is deprecated since 2.6.35, but available for convenience.
  15. * How to use usbmon to collect raw text traces
  16. Unlike the packet socket, usbmon has an interface which provides traces
  17. in a text format. This is used for two purposes. First, it serves as a
  18. common trace exchange format for tools while more sophisticated formats
  19. are finalized. Second, humans can read it in case tools are not available.
  20. To collect a raw text trace, execute following steps.
  21. 1. Prepare
  22. Mount debugfs (it has to be enabled in your kernel configuration), and
  23. load the usbmon module (if built as module). The second step is skipped
  24. if usbmon is built into the kernel.
  25. # mount -t debugfs none_debugs /sys/kernel/debug
  26. # modprobe usbmon
  27. #
  28. Verify that bus sockets are present.
  29. # ls /sys/kernel/debug/usb/usbmon
  30. 0s 0u 1s 1t 1u 2s 2t 2u 3s 3t 3u 4s 4t 4u
  31. #
  32. Now you can choose to either use the socket '0u' (to capture packets on all
  33. buses), and skip to step #3, or find the bus used by your device with step #2.
  34. This allows to filter away annoying devices that talk continuously.
  35. 2. Find which bus connects to the desired device
  36. Run "cat /sys/kernel/debug/usb/devices", and find the T-line which corresponds
  37. to the device. Usually you do it by looking for the vendor string. If you have
  38. many similar devices, unplug one and compare the two
  39. /sys/kernel/debug/usb/devices outputs. The T-line will have a bus number.
  40. Example:
  41. T: Bus=03 Lev=01 Prnt=01 Port=00 Cnt=01 Dev#= 2 Spd=12 MxCh= 0
  42. D: Ver= 1.10 Cls=00(>ifc ) Sub=00 Prot=00 MxPS= 8 #Cfgs= 1
  43. P: Vendor=0557 ProdID=2004 Rev= 1.00
  44. S: Manufacturer=ATEN
  45. S: Product=UC100KM V2.00
  46. "Bus=03" means it's bus 3. Alternatively, you can look at the output from
  47. "lsusb" and get the bus number from the appropriate line. Example:
  48. Bus 003 Device 002: ID 0557:2004 ATEN UC100KM V2.00
  49. 3. Start 'cat'
  50. # cat /sys/kernel/debug/usb/usbmon/3u > /tmp/1.mon.out
  51. to listen on a single bus, otherwise, to listen on all buses, type:
  52. # cat /sys/kernel/debug/usb/usbmon/0u > /tmp/1.mon.out
  53. This process will be reading until killed. Naturally, the output can be
  54. redirected to a desirable location. This is preferred, because it is going
  55. to be quite long.
  56. 4. Perform the desired operation on the USB bus
  57. This is where you do something that creates the traffic: plug in a flash key,
  58. copy files, control a webcam, etc.
  59. 5. Kill cat
  60. Usually it's done with a keyboard interrupt (Control-C).
  61. At this point the output file (/tmp/1.mon.out in this example) can be saved,
  62. sent by e-mail, or inspected with a text editor. In the last case make sure
  63. that the file size is not excessive for your favourite editor.
  64. * Raw text data format
  65. Two formats are supported currently: the original, or '1t' format, and
  66. the '1u' format. The '1t' format is deprecated in kernel 2.6.21. The '1u'
  67. format adds a few fields, such as ISO frame descriptors, interval, etc.
  68. It produces slightly longer lines, but otherwise is a perfect superset
  69. of '1t' format.
  70. If it is desired to recognize one from the other in a program, look at the
  71. "address" word (see below), where '1u' format adds a bus number. If 2 colons
  72. are present, it's the '1t' format, otherwise '1u'.
  73. Any text format data consists of a stream of events, such as URB submission,
  74. URB callback, submission error. Every event is a text line, which consists
  75. of whitespace separated words. The number or position of words may depend
  76. on the event type, but there is a set of words, common for all types.
  77. Here is the list of words, from left to right:
  78. - URB Tag. This is used to identify URBs, and is normally an in-kernel address
  79. of the URB structure in hexadecimal, but can be a sequence number or any
  80. other unique string, within reason.
  81. - Timestamp in microseconds, a decimal number. The timestamp's resolution
  82. depends on available clock, and so it can be much worse than a microsecond
  83. (if the implementation uses jiffies, for example).
  84. - Event Type. This type refers to the format of the event, not URB type.
  85. Available types are: S - submission, C - callback, E - submission error.
  86. - "Address" word (formerly a "pipe"). It consists of four fields, separated by
  87. colons: URB type and direction, Bus number, Device address, Endpoint number.
  88. Type and direction are encoded with two bytes in the following manner:
  89. Ci Co Control input and output
  90. Zi Zo Isochronous input and output
  91. Ii Io Interrupt input and output
  92. Bi Bo Bulk input and output
  93. Bus number, Device address, and Endpoint are decimal numbers, but they may
  94. have leading zeros, for the sake of human readers.
  95. - URB Status word. This is either a letter, or several numbers separated
  96. by colons: URB status, interval, start frame, and error count. Unlike the
  97. "address" word, all fields save the status are optional. Interval is printed
  98. only for interrupt and isochronous URBs. Start frame is printed only for
  99. isochronous URBs. Error count is printed only for isochronous callback
  100. events.
  101. The status field is a decimal number, sometimes negative, which represents
  102. a "status" field of the URB. This field makes no sense for submissions, but
  103. is present anyway to help scripts with parsing. When an error occurs, the
  104. field contains the error code.
  105. In case of a submission of a Control packet, this field contains a Setup Tag
  106. instead of an group of numbers. It is easy to tell whether the Setup Tag is
  107. present because it is never a number. Thus if scripts find a set of numbers
  108. in this word, they proceed to read Data Length (except for isochronous URBs).
  109. If they find something else, like a letter, they read the setup packet before
  110. reading the Data Length or isochronous descriptors.
  111. - Setup packet, if present, consists of 5 words: one of each for bmRequestType,
  112. bRequest, wValue, wIndex, wLength, as specified by the USB Specification 2.0.
  113. These words are safe to decode if Setup Tag was 's'. Otherwise, the setup
  114. packet was present, but not captured, and the fields contain filler.
  115. - Number of isochronous frame descriptors and descriptors themselves.
  116. If an Isochronous transfer event has a set of descriptors, a total number
  117. of them in an URB is printed first, then a word per descriptor, up to a
  118. total of 5. The word consists of 3 colon-separated decimal numbers for
  119. status, offset, and length respectively. For submissions, initial length
  120. is reported. For callbacks, actual length is reported.
  121. - Data Length. For submissions, this is the requested length. For callbacks,
  122. this is the actual length.
  123. - Data tag. The usbmon may not always capture data, even if length is nonzero.
  124. The data words are present only if this tag is '='.
  125. - Data words follow, in big endian hexadecimal format. Notice that they are
  126. not machine words, but really just a byte stream split into words to make
  127. it easier to read. Thus, the last word may contain from one to four bytes.
  128. The length of collected data is limited and can be less than the data length
  129. reported in the Data Length word. In the case of an Isochronous input (Zi)
  130. completion where the received data is sparse in the buffer, the length of
  131. the collected data can be greater than the Data Length value (because Data
  132. Length counts only the bytes that were received whereas the Data words
  133. contain the entire transfer buffer).
  134. Examples:
  135. An input control transfer to get a port status.
  136. d5ea89a0 3575914555 S Ci:1:001:0 s a3 00 0000 0003 0004 4 <
  137. d5ea89a0 3575914560 C Ci:1:001:0 0 4 = 01050000
  138. An output bulk transfer to send a SCSI command 0x28 (READ_10) in a 31-byte
  139. Bulk wrapper to a storage device at address 5:
  140. dd65f0e8 4128379752 S Bo:1:005:2 -115 31 = 55534243 ad000000 00800000 80010a28 20000000 20000040 00000000 000000
  141. dd65f0e8 4128379808 C Bo:1:005:2 0 31 >
  142. * Raw binary format and API
  143. The overall architecture of the API is about the same as the one above,
  144. only the events are delivered in binary format. Each event is sent in
  145. the following structure (its name is made up, so that we can refer to it):
  146. struct usbmon_packet {
  147. u64 id; /* 0: URB ID - from submission to callback */
  148. unsigned char type; /* 8: Same as text; extensible. */
  149. unsigned char xfer_type; /* ISO (0), Intr, Control, Bulk (3) */
  150. unsigned char epnum; /* Endpoint number and transfer direction */
  151. unsigned char devnum; /* Device address */
  152. u16 busnum; /* 12: Bus number */
  153. char flag_setup; /* 14: Same as text */
  154. char flag_data; /* 15: Same as text; Binary zero is OK. */
  155. s64 ts_sec; /* 16: gettimeofday */
  156. s32 ts_usec; /* 24: gettimeofday */
  157. int status; /* 28: */
  158. unsigned int length; /* 32: Length of data (submitted or actual) */
  159. unsigned int len_cap; /* 36: Delivered length */
  160. union { /* 40: */
  161. unsigned char setup[SETUP_LEN]; /* Only for Control S-type */
  162. struct iso_rec { /* Only for ISO */
  163. int error_count;
  164. int numdesc;
  165. } iso;
  166. } s;
  167. int interval; /* 48: Only for Interrupt and ISO */
  168. int start_frame; /* 52: For ISO */
  169. unsigned int xfer_flags; /* 56: copy of URB's transfer_flags */
  170. unsigned int ndesc; /* 60: Actual number of ISO descriptors */
  171. }; /* 64 total length */
  172. These events can be received from a character device by reading with read(2),
  173. with an ioctl(2), or by accessing the buffer with mmap. However, read(2)
  174. only returns first 48 bytes for compatibility reasons.
  175. The character device is usually called /dev/usbmonN, where N is the USB bus
  176. number. Number zero (/dev/usbmon0) is special and means "all buses".
  177. Note that specific naming policy is set by your Linux distribution.
  178. If you create /dev/usbmon0 by hand, make sure that it is owned by root
  179. and has mode 0600. Otherwise, unpriviledged users will be able to snoop
  180. keyboard traffic.
  181. The following ioctl calls are available, with MON_IOC_MAGIC 0x92:
  182. MON_IOCQ_URB_LEN, defined as _IO(MON_IOC_MAGIC, 1)
  183. This call returns the length of data in the next event. Note that majority of
  184. events contain no data, so if this call returns zero, it does not mean that
  185. no events are available.
  186. MON_IOCG_STATS, defined as _IOR(MON_IOC_MAGIC, 3, struct mon_bin_stats)
  187. The argument is a pointer to the following structure:
  188. struct mon_bin_stats {
  189. u32 queued;
  190. u32 dropped;
  191. };
  192. The member "queued" refers to the number of events currently queued in the
  193. buffer (and not to the number of events processed since the last reset).
  194. The member "dropped" is the number of events lost since the last call
  195. to MON_IOCG_STATS.
  196. MON_IOCT_RING_SIZE, defined as _IO(MON_IOC_MAGIC, 4)
  197. This call sets the buffer size. The argument is the size in bytes.
  198. The size may be rounded down to the next chunk (or page). If the requested
  199. size is out of [unspecified] bounds for this kernel, the call fails with
  200. -EINVAL.
  201. MON_IOCQ_RING_SIZE, defined as _IO(MON_IOC_MAGIC, 5)
  202. This call returns the current size of the buffer in bytes.
  203. MON_IOCX_GET, defined as _IOW(MON_IOC_MAGIC, 6, struct mon_get_arg)
  204. MON_IOCX_GETX, defined as _IOW(MON_IOC_MAGIC, 10, struct mon_get_arg)
  205. These calls wait for events to arrive if none were in the kernel buffer,
  206. then return the first event. The argument is a pointer to the following
  207. structure:
  208. struct mon_get_arg {
  209. struct usbmon_packet *hdr;
  210. void *data;
  211. size_t alloc; /* Length of data (can be zero) */
  212. };
  213. Before the call, hdr, data, and alloc should be filled. Upon return, the area
  214. pointed by hdr contains the next event structure, and the data buffer contains
  215. the data, if any. The event is removed from the kernel buffer.
  216. The MON_IOCX_GET copies 48 bytes to hdr area, MON_IOCX_GETX copies 64 bytes.
  217. MON_IOCX_MFETCH, defined as _IOWR(MON_IOC_MAGIC, 7, struct mon_mfetch_arg)
  218. This ioctl is primarily used when the application accesses the buffer
  219. with mmap(2). Its argument is a pointer to the following structure:
  220. struct mon_mfetch_arg {
  221. uint32_t *offvec; /* Vector of events fetched */
  222. uint32_t nfetch; /* Number of events to fetch (out: fetched) */
  223. uint32_t nflush; /* Number of events to flush */
  224. };
  225. The ioctl operates in 3 stages.
  226. First, it removes and discards up to nflush events from the kernel buffer.
  227. The actual number of events discarded is returned in nflush.
  228. Second, it waits for an event to be present in the buffer, unless the pseudo-
  229. device is open with O_NONBLOCK.
  230. Third, it extracts up to nfetch offsets into the mmap buffer, and stores
  231. them into the offvec. The actual number of event offsets is stored into
  232. the nfetch.
  233. MON_IOCH_MFLUSH, defined as _IO(MON_IOC_MAGIC, 8)
  234. This call removes a number of events from the kernel buffer. Its argument
  235. is the number of events to remove. If the buffer contains fewer events
  236. than requested, all events present are removed, and no error is reported.
  237. This works when no events are available too.
  238. FIONBIO
  239. The ioctl FIONBIO may be implemented in the future, if there's a need.
  240. In addition to ioctl(2) and read(2), the special file of binary API can
  241. be polled with select(2) and poll(2). But lseek(2) does not work.
  242. * Memory-mapped access of the kernel buffer for the binary API
  243. The basic idea is simple:
  244. To prepare, map the buffer by getting the current size, then using mmap(2).
  245. Then, execute a loop similar to the one written in pseudo-code below:
  246. struct mon_mfetch_arg fetch;
  247. struct usbmon_packet *hdr;
  248. int nflush = 0;
  249. for (;;) {
  250. fetch.offvec = vec; // Has N 32-bit words
  251. fetch.nfetch = N; // Or less than N
  252. fetch.nflush = nflush;
  253. ioctl(fd, MON_IOCX_MFETCH, &fetch); // Process errors, too
  254. nflush = fetch.nfetch; // This many packets to flush when done
  255. for (i = 0; i < nflush; i++) {
  256. hdr = (struct ubsmon_packet *) &mmap_area[vec[i]];
  257. if (hdr->type == '@') // Filler packet
  258. continue;
  259. caddr_t data = &mmap_area[vec[i]] + 64;
  260. process_packet(hdr, data);
  261. }
  262. }
  263. Thus, the main idea is to execute only one ioctl per N events.
  264. Although the buffer is circular, the returned headers and data do not cross
  265. the end of the buffer, so the above pseudo-code does not need any gathering.