发布时间:2012-8-22 16:43
分类名称:PKI
今天偶尔看MSND,发现里面讲解的DER编码通俗易懂,图文并茂。
Abstract Syntax Notation One (ASN.1) defines the following rule sets that govern how data structures that are being sent between computers are encoded and decoded.
The original rule set was defined by the BER specification. CER and DER were developed later as specialized subsets of BER. PER was developed in response to criticisms about the amount of bandwidth required to transmit data using BER or its variants. PER provides a significant savings.
DER was created to satisfy the requirements of the X.509 specification for secure data transfer. The Certificate Enrollment API uses DER exclusively.
DER Transfer Syntax
Applying an encoding rule to the data structures described by an abstract syntax provides a transfer syntax that governs how bytes in a stream are organized when sent between computers. The transfer syntax used by Distinguished Encoding Rules always follows a Tag, Length, Value format. The format is usually referred to as a TLV triplet in which each field (T, L, or V) contains one or more bytes.
The Tag field specifies the type of the data structure being sent, the Length field specifies the number of bytes of content being transferred, and the Value field contains the content. Note that the Value field can be a triplet if it contains a constructed data type as shown by the following illustration.
Encoded Tag Bytes
The Tag field in a TLV triplet identifies the type of the data structure being sent between computers. For example, the tag for an integer is 0x02, and the tag for an object identifier is 0x06. Although multiple bytes are permitted, none of the data types used by the Certificate Enrollment API require more than one. The following illustration shows the breakdown of a Tag value. Bits 7 and 6 identify the ASN.1 tagging class. There are four available classes, but the Certificate Enrollment API uses data types that belong only to the UNIVERSAL class. Bit 5 identifies whether the encoding form is primitive or constructed. Basic and string types are encoded by using primitive forms, constructed types by using a constructed form. For more information, see ASN.1 Type System. Bits 4 through 0 contain the tag number.
The following table lists the data types supported by the Certificate Enrollment API, the encoding form used, and the tag value.
Type | ASN.1 class | Encoding form | Tag value |
BIT STRING | UNIVERSAL | Primitive | 00000011 (0x03) |
BOOLEAN | UNIVERSAL | Primitive | 00000001 (0x01) |
INTEGER | UNIVERSAL | Primitive | 00000010 (0x02) |
NULL | UNIVERSAL | Primitive | 00000101 (0x05) |
OBJECT IDENTIFIER | UNIVERSAL | Primitive | 00000110 (0x06) |
OCTET STRING | UNIVERSAL | Primitive | 00000100 (0x04) |
BMPString | UNIVERSAL | Primitive | 00011110 (0x1E) |
IA5String | UNIVERSAL | Primitive | 00010110 (0x16) |
PrintableString | UNIVERSAL | Primitive | 00010011 (0x13) |
TeletexString | UNIVERSAL | Primitive | 00010100 (0x14) |
UTF8String | UNIVERSAL | Primitive | 00001100 (0x0C) |
SEQUENCE | UNIVERSAL | Constructed | 00110000 (0x30) |
SEQUENCE OF | UNIVERSAL | Constructed | 00110000 (0x30) |
SET | UNIVERSAL | Constructed | 00110001 (0x31) |
SET OF | UNIVERSAL | Constructed | 00110001 (0x31) |
Encoded Length and Value Bytes
The Length field in a TLV triplet identifies the number of bytes encoded in the Value field. The Value field contains the content being sent between computers. If the Value field contains fewer than 128 bytes, the Length field requires only one byte. Bit 7 of the Length field is zero (0) and the remaining bits identify the number of bytes of content being sent. If the Value field contains more than 127 bytes, bit 7 of the Length field is one (1) and the remaining bits identify the number of bytes needed to contain the length. Examples are shown in the following illustration.
DER Encoding of ASN.1 Types
BIT STRING
The BIT STRING data type is encoded into a TLV triplet that begins with a Tag byte of 0x03. The Value field of the TLV triplet contains a leading byte that specifies the number of bits left unused in the final byte of content. In the following example, the Length field is set to 0x03 because three content bytes follow, and the leading byte of the Value field is set to 0x04 because there are four unused bits in the last content byte. Each unused bit is denoted by the letter x.
The following example, adapted from the PKCS #10 Encoded ASN.1 topic, shows the encoded signature of a sample PKCS #10 certificate request. The first byte contains the Tag value for the BIT STRING data type, 0x03. The second and third bytes contain the length of the byte array. Bit 7 of the second byte is set to 1 because there are more than 127 bytes of content. Bits 0 through 6 of the second byte specify the number of trailing Length bytes, in this case one. The third byte specifies the number of content bytes, 0x81. The fourth byte, 0x00, specifies the number of unused bits that exist in the last content byte. Note that the signature is encoded in big-endian byte order.
0299: 03 81 81 ; BIT_STRING (81 Bytes)
029c: 00
029d: 47 eb 99 5a df 9e 70 0d fb a7 31 32 c1 5f 5c 24
02ad: c2 e0 bf c6 24 af 15 66 0e b8 6a 2e ab 2b c4 97
02bd: 1f e3 cb dc 63 a5 25 ec c7 b4 28 61 66 36 a1 31
02cd: 1b bf dd d0 fc bf 17 94 90 1d e5 5e c7 11 5e c9
02dd: 55 9f eb a3 3e 14 c7 99 a6 cb ba a1 46 0f 39 d4
02ed: 44 c4 c8 4b 76 0e 20 5d 6d a9 34 9e d4 d5 87 42
02fd: eb 24 26 51 14 90 b4 0f 06 5e 52 88 32 7a 95 20
030d: a0 fd f7 e5 7d 60 dd 72 68 9b f5 7b 05 8f 6d 1e
OBJECT IDENTIFIER
3 out of 3 rated this helpful - Rate this topic
The OBJECT IDENTIFIER data type is encoded into a TLV triplet that begins with a Tag value of 0x06. Each integer of a dotted decimal object identifier (OID) is encoded according to the following rules:
These points are shown by the following illustration.
The following example shows how the ClientId attribute is encoded in a certificate request.
Copy
06 09 ; OBJECT_ID (9 Bytes)
| 2b 06 01 04 01 82 37 15 14 ; 1.3.6.1.4.1.311.21.20
31 4a ; SET (4a Bytes)
30 48 ; SEQUENCE (48 Bytes)
02 01 ; INTEGER (1 Bytes)
| 09
0c 23 ; UTF8_STRING (23 Bytes)
| 76 69 63 68 33 64 2e 6a ; vich3d.j
| 64 6f 6d 63 73 63 2e 6e ; domcsc.n
| 74 74 65 73 74 2e 6d 69 ; ttest.mi
| 63 72 6f 73 6f 66 74 2e ; crosoft.
| 63 6f 6d ; com
0c 15 ; UTF8_STRING (15 Bytes)
| 4a 44 4f 4d 43 53 43 5c ; JDOMCSC\
| 61 64 6d 69 6e 69 73 74 ; administ
| 72 61 74 6f 72 ; rator
0c 07 ; UTF8_STRING (7 Bytes)
63 65 72 74 72 65 71 ; certreq
BOOLEAN
A Boolean value can be TRUE or FALSE. FALSE is encoded as a TLV triplet in which the Value field is zero (0). TRUE is encoded as a nonzero value. The Tag for the BOOLEAN data type is 0x01.
INTEGER
Integer values are encoded into a TLV triplet that begins with a Tag value of 0x02. The Value field of the TLV triplet contains the encoded integer if it is positive, or its two's complement if it is negative. If the integer is positive but the high order bit is set to 1, a leading 0x00 is added to the content to indicate that the number is not negative. For example, the high order byte of 0x8F (10001111) is 1. Therefore a leading zero byte is added to the content as shown in the following illustration.
If the integer contains fewer than 128 bytes, the Length field requires only one byte to specify the content length. If the integer is more than 127 bytes, bit 7 of the Length field is set to 1 and bits 6 through 0 specify the number of additional bytes used to identify the content length. For more information, see Encoded Length and Value Bytes.
The following example, from PKCS #10 Encoded ASN.1, shows the encoding for a 128 byte public key. The first byte contains the Tag value for the INTEGER data type, 0x02. The second and third bytes contain the Length value. Bit 7 of the second byte is set to 1 because there are more than 127 bytes of content. Bits 0 through 6 of the second byte specify the number of trailing bytes needed, in this case one, to accurately specify the content length. The third byte specifies the number of content bytes, 0x81. The fourth byte, 0x00, is added to the content to indicate that the integer is indeed a positive value even though the sign bit of the leading content byte (0x8F) is set to 1.
02 81 81 ; INTEGER (81 Bytes)
| 00
| 8f e2 41 2a 08 e8 51 a8 8c b3 e8 53 e7 d5 49 50
| b3 27 8a 2b cb ea b5 42 73 ea 02 57 cc 65 33 ee
| 88 20 61 a1 17 56 c1 24 18 e3 a8 08 d3 be d9 31
| f3 37 0b 94 b8 cc 43 08 0b 70 24 f7 9c b1 8d 5d
| d6 6d 82 d0 54 09 84 f8 9f 97 01 75 05 9c 89 d4
| d5 c9 1e c9 13 d7 2a 6b 30 91 19 d6 d4 42 e0 c4
| 9d 7c 92 71 e1 b2 2f 5c 8d ee f0 f1 17 1e d2 5f
| 31 5b b1 9c bc 20 55 bf 3a 37 42 45 75 dc 90 65
The following example shows how the integer value 0x03 is encoded. The Tag byte contains 0x02, and the Length byte specifies that there is one byte of content.
02 01 ; INTEGER (1 Bytes)
| 03
NULL
The NULL value is encoded into a TLV triplet that begins with a Tag value of 0x05, a Length of 0x00, and no Value byte as shown by the following illustration.
The fifth line of the following example, adapted from the PKCS #10 Encoded ASN.1 topic, shows an encoded NULL value. The first byte is 0x05, and the second byte is 0x00. There is no content byte.
30 81 9f ; SEQUENCE (9f Bytes)
| 30 0d ; SEQUENCE (d Bytes)
| | 06 09 ; OBJECT_ID (9 Bytes)
| | | 2a 86 48 86 f7 0d 01 01 01 ; 1.2.840.113549.1.1.1
| | 05 00 ; NULL (0 Bytes)
| 03 81 8d ; BIT_STRING (8d Bytes)
| 00
| 30 81 89 ; SEQUENCE (89 Bytes)
| 02 81 81 ; INTEGER (81 Bytes)
| | 00
| | 8f e2 41 2a 08 e8 51 a8 8c b3 e8 53 e7 d5 49 50
| | b3 27 8a 2b cb ea b5 42 73 ea 02 57 cc 65 33 ee
| | 88 20 61 a1 17 56 c1 24 18 e3 a8 08 d3 be d9 31
| | f3 37 0b 94 b8 cc 43 08 0b 70 24 f7 9c b1 8d 5d
| | d6 6d 82 d0 54 09 84 f8 9f 97 01 75 05 9c 89 d4
| | d5 c9 1e c9 13 d7 2a 6b 30 91 19 d6 d4 42 e0 c4
| | 9d 7c 92 71 e1 b2 2f 5c 8d ee f0 f1 17 1e d2 5f
| | 31 5b b1 9c bc 20 55 bf 3a 37 42 45 75 dc 90 65
| 02 03 ; INTEGER (3 Bytes)
| 01 00 01
OCTET STRING
The ASN.1 OCTET STRING data type is encoded into a TLV triplet that begins with a Tag byte of 0x04. The OCTET STRING and BIT STRING data types are very similar. Thus, the two types are encoded in a similar manner except that, because the trailing byte of an OCTET STRING cannot have unused bits, no leading bytes must be added to the content. The following example, adapted from the CMC Encoded ASN.1 topic, shows how the name of a certificate template is encoded as a byte array.
30 17 ; SEQUENCE (17 Bytes)
| 06 09 ; OBJECT_ID (9 Bytes)
| | 2b 06 01 04 01 82 37 14 02 ; 1.3.6.1.4.1.311.20.2
| 04 0a ; OCTET_STRING (a Bytes)
| 1e 08 00 55 00 73 00 65 00 72 ; ...U.s.e.r
If the byte array contains fewer than 128 bytes, the Length field of the TLV triplet requires only one byte to specify the content length. If it is more than 127 bytes, bit 7 of the Length field is set to 1 and bits 6 through 0 specify the number of additional bytes used to identify the content length. This is shown in the following example where the high order bit of the second byte on the first line is set to 1 and the byte indicates that there is a trailing Length byte. The third byte therefore specifies that the content is 0x80 bytes long.
04 81 80 ; OCTET_STRING (80 Bytes)
38 10 60 e2 70 69 91 4a ; 8.`.pi.J
8b b5 22 57 2a 62 ef de ; .."W*b..
15 7d 59 d6 4e 20 9a 45 ; .}Y.N .E
2b e3 fd fc 68 ba af bf ; +...h...
9c 17 b0 8e 6d c4 29 1e ; ....m.).
e3 21 ac bb 5a 8a c9 67 ; .!..Z..g
0a d4 45 93 10 c0 26 eb ; ..E...&.
0a 83 c2 b1 40 87 36 f7 ; ....@.6.
a0 26 da b9 bb 46 73 88 ; .&...Fs.
7a 67 b9 e6 b3 6f ea 59 ; zg...o.Y
28 8a d3 92 72 f6 7b 89 ; (...r.{.
a0 d8 2d 9e 40 eb 1e bb ; ..-.@...
6e ae f0 5a ed 16 c9 e3 ; n..Z....
27 59 37 8f f3 4a 98 60 ; 'Y7..J.`
f8 fb a7 0a ee 1b 6e 91 ; ......n.
95 96 cf 0d 56 ac ab 35 ; ....V..5
BMPString
1 out of 1 rated this helpful - Rate this topic
The ASN.1 BMPString data type, called a UNICODE_STRING in the Certificate Enrollment API, is encoded into a TLV triplet that begins with a Tag byte of 0x1E. The following example, adapted from the CMC Encoded ASN.1 topic, shows the encoding for a TemplateName extension. The name can be specified by using the IX509ExtensionTemplateName interface. The object identifier for the extension is 1.3.6.1.4.1.311.13.2.1.
Copy
06 0a ; OBJECT_ID (a Bytes)
| 2b 06 01 04 01 82 37 0d 02 01 ; 1.3.6.1.4.1.311.13.2.1
31 34 ; SET (34 Bytes)
30 32 ; SEQUENCE (32 Bytes)
1e 26 ; UNICODE_STRING (26 Bytes)
| 00 43 00 65 00 72 00 74 ; .C.e.r.t
| 00 69 00 66 00 69 00 63 ; .i.f.i.c
| 00 61 00 74 00 65 00 54 ; .a.t.e.T
| 00 65 00 6d 00 70 00 6c ; .e.m.p.l
| 00 61 00 74 00 65 ; .a.t.e
1e 08 ; UNICODE_STRING (8 Bytes)
00 55 00 73 00 65 00 72 ; .U.s.e.r
If the string contains fewer than 128 bytes, the Length field of the TLV triplet requires only one byte to specify the content length. If the string is more than 127 bytes, bit 7 of the Length field is set to 1 and bits 6 through 0 specify the number of additional bytes used to identify the content length. For more information, see Encoded Length and Value Bytes.
IA5String
The ASN.1 IA5tring data type is encoded into a TLV triplet that begins with a Tag byte of 0x16. The following example, adapted from the CMC Encoded ASN.1 topic, shows how the OSVersion attribute is encoded as an IA5tring type. The version number can be specified by using the IX509AttributeOSVersion interface. The object identifier for the attribute is 1.3.6.1.4.1.311.13.2.3.
06 0a ; OBJECT_ID (a Bytes)
| 2b 06 01 04 01 82 37 0d 02 03 ; 1.3.6.1.4.1.311.13.2.3
31 0c ; SET (c Bytes)
16 0a ; IA5_STRING (a Bytes)
36 2e 30 2e 35 33 36 31 2e 32 ; 6.0.5361.2
If the string contains fewer than 128 bytes, the Length field of the TLV triplet requires only one byte to specify the content length. If the string is more than 127 bytes, bit 7 of the Length field is set to 1 and bits 6 through 0 specify the number of additional bytes used to identify the content length. For more information, see Encoded Length and Value Bytes.
PrintableString
The ASN.1 PrintableString data type is encoded into a TLV triplet that begins with a Tag byte of 0x13. The following example, from the PKCS #10 Encoded ASN.1 topic, shows how a user common name of TestCN is encoded as a PrintableString type. The object identifier for a common name is 2.5.4.3.
06 03 ; OBJECT_ID (3 Bytes)
| 55 04 03 ; 2.5.4.3 Common Name (CN)
13 06 ; PRINTABLE_STRING (6 Bytes)
54 65 73 74 43 4e ; TestCN
If the string contains fewer than 128 bytes, the Length field of the TLV triplet requires only one byte to specify the content length. If the string is more than 127 bytes, bit 7 of the Length field is set to 1 and bits 6 through 0 specify the number of additional bytes used to identify the content length. For more information, see Encoded Length and Value Bytes.
UTF8String
The ASN.1 UTF8String data type is encoded into a TLV triplet that begins with a Tag byte of 0x0C. The following example, from the CMC Encoded ASN.1 topic, shows how the ClientId attribute is encoded as an integer and three UTF8String types. The object identifier for the attribute is 1.3.6.1.4.1.311.21.20. The information, which can be specified by using the IX509AttributeClientId interface, includes a client ID number, the Domain Name System (DNS) computer name, the Security Accounts Manager (SAM) user name, and the name of the application that created the certificate request.
06 09 ; OBJECT_ID (9 Bytes)
| 2b 06 01 04 01 82 37 15 14 ; 1.3.6.1.4.1.311.21.20
31 4a ; SET (4a Bytes)
30 48 ; SEQUENCE (48 Bytes)
02 01 ; INTEGER (1 Bytes)
| 09
0c 23 ; UTF8_STRING (23 Bytes)
| 76 69 63 68 33 64 2e 6a ; vich3d.j
| 64 6f 6d 63 73 63 2e 6e ; domcsc.n
| 74 74 65 73 74 2e 6d 69 ; ttest.mi
| 63 72 6f 73 6f 66 74 2e ; crosoft.
| 63 6f 6d ; com
0c 15 ; UTF8_STRING (15 Bytes)
| 4a 44 4f 4d 43 53 43 5c ; JDOMCSC\
| 61 64 6d 69 6e 69 73 74 ; administ
| 72 61 74 6f 72 ; rator
0c 07 ; UTF8_STRING (7 Bytes)
63 65 72 74 72 65 71 ; certreq
If the string contains fewer than 128 bytes, the Length field of the TLV triplet requires only one byte to specify the content length. If the string is more than 127 bytes, bit 7 of the Length field is set to 1 and bits 6 through 0 specify the number of additional bytes used to identify the content length. For more information, see Encoded Length and Value Bytes.
SEQUENCE
A SEQUENCE contains an ordered field of one or more types. It is encoded into a TLV triplet that begins with a Tag byte of 0x30. The following Certutil.exe output from the PKCS #10 Encoded ASN.1 topic provides multiple examples of SEQUENCE data structures. The output shows a 128 byte public key and three byte exponent.
30 81 9f ; SEQUENCE (9f Bytes)
| 30 0d ; SEQUENCE (d Bytes)
| | | 06 09 ; OBJECT_ID (9 Bytes)
| | | 2a 86 48 86 f7 0d 01 01 01 ; 1.2.840.113549.1.1.1
| | 05 00 ; NULL (0 Bytes)
| 03 81 8d ; BIT_STRING (8d Bytes)
| 00
| 30 81 89 ; SEQUENCE (89 Bytes)
| 02 81 81 ; INTEGER (81 Bytes)
| | 00
| | 8f e2 41 2a 08 e8 51 a8 8c b3 e8 53 e7 d5 49 50
| | b3 27 8a 2b cb ea b5 42 73 ea 02 57 cc 65 33 ee
| | 88 20 61 a1 17 56 c1 24 18 e3 a8 08 d3 be d9 31
| | f3 37 0b 94 b8 cc 43 08 0b 70 24 f7 9c b1 8d 5d
| | d6 6d 82 d0 54 09 84 f8 9f 97 01 75 05 9c 89 d4
| | d5 c9 1e c9 13 d7 2a 6b 30 91 19 d6 d4 42 e0 c4
| | 9d 7c 92 71 e1 b2 2f 5c 8d ee f0 f1 17 1e d2 5f
| | 31 5b b1 9c bc 20 55 bf 3a 37 42 45 75 dc 90 65
| 02 03 ; INTEGER (3 Bytes)
| 01 00 01
If the SEQUENCE contains fewer than 128 bytes, the Length field of the TLV triplet requires only one byte to specify the content length. If it is more than 127 bytes, bit 7 of the Length field is set to 1 and bits 6 through 0 specify the number of additional bytes used to identify the content length. For example, the second byte of the first line in the preceding example indicates that there is one trailing Length byte that specifies 0x9F bytes of content (most of the SEQUENCE is not shown). For more information, see Encoded Length and Value Bytes.
SET
A SET contains an unordered series of fields of one or more types. It is encoded into a TLV triplet that begins with a Tag byte of 0x31. The following example, adapted from the CMC Encoded ASN.1 topic, shows how a ClientId attribute is encoded in a SET data structure. The attribute can be specified by using the IX509AttributeClientId interface.
31 59 ; SET (59 Bytes)
30 57 ; SEQUENCE (57 Bytes)
06 09 ; OBJECT_ID (9 Bytes)
| 2b 06 01 04 01 82 37 15 14 ; 1.3.6.1.4.1.311.21.20
31 4a ; SET (4a Bytes)
30 48 ; SEQUENCE (48 Bytes)
02 01 ; INTEGER (1 Bytes)
| 09
0c 23 ; UTF8_STRING (23 Bytes)
| 76 69 63 68 33 64 2e 6a ; vich3d.j
| 64 6f 6d 63 73 63 2e 6e ; domcsc.n
| 74 74 65 73 74 2e 6d 69 ; ttest.mi
| 63 72 6f 73 6f 66 74 2e ; crosoft.
| 63 6f 6d ; com
0c 15 ; UTF8_STRING (15 Bytes)
| 4a 44 4f 4d 43 53 43 5c ; JDOMCSC\
| 61 64 6d 69 6e 69 73 74 ; administ
| 72 61 74 6f 72 ; rator
0c 07 ; UTF8_STRING
If the SET contains fewer than 128 bytes, the Length field of the TLV triplet requires only one byte to specify the content length. If it is more than 127 bytes, bit 7 of the Length field is set to 1 and bits 6 through 0 specify the number of additional bytes used to identify the content length. For more information, see Encoded Length and Value Bytes.