|
BDE 4.14.0 Production release
|
Macros | |
| #define | BDLT_ISO8601UTIL_DEPRECATE_GENERATE |
| #define | BDLT_ISO8601UTIL_DEPRECATE_GENERATE_ORDER |
| #define | BDLT_ISO8601UTIL_DEPRECATE_GENERATERAW |
| #define | BDLT_ISO8601UTIL_DEPRECATE_PARSERELAXED |
Provide conversions between date/time objects and ISO 8601 strings.
This component provides a namespace, bdlt::Iso8601Util, containing functions that convert bsls timeinterval and bdlt date, time, and datetime objects to and from ("generate" and "parse", respectively) corresponding string representations that are compliant with the ISO 8601 standard. The version of the ISO 8601 standard that is the basis for this component can be found at:
In general terms, Iso8601Util functions support what ISO 8601 refers to as complete representations in extended format. We first present a brief overview before delving into the details of the ISO 8601 representations that are supported for each of the relevant vocabulary types.
Each function that generates ISO 8601 strings (named generate and generateRaw) takes an object and a char * buffer, bsl::string, or bsl::ostream, and writes an ISO 8601 representation of the object to the buffer, string, or stream. The "raw" functions are distinguished from their non-"raw" counterparts in three respects:
char * buffer is not supplied to the generateRaw functions.generateRaw functions do not output a null terminator.generate functions that provide an ptrdiff_t bufferLength parameter truncate the generated output to bufferLength characters. (Neither the generateRaw functions nor the generate functions taking bsl::string or bsl::ostream do any truncation of their generated output.)Since the generate functions always succeed, no status value is returned. Instead, either the number of characters output to the char * buffer or string, or a reference to the stream, is returned. (Note that the generating functions also take an optional bdlt::Iso8601UtilConfiguration object, which is discussed shortly.)
Each function that parses ISO 8601 strings (named parse) take the address of a target object and a const char * (paired with a length argument) or bsl::string_view, and loads the object with the result of parsing the character string. Since parsing can fail, the parse functions return an int status value (0 for success and a non-zero value for failure). Note that, besides elementary syntactical considerations, the validity of parsed strings are subject to the semantic constraints imposed by the various isValid* class methods, (i.e., Date::isValidYearMonthDay, Time::isValid, etc.).
As this component concerns ISO 8601, some terms from that specification are used liberally in what follows. Two ISO 8601 terms of particular note are zone designator and fractional second.
An ISO 8601 zone designator corresponds to what other bdlt components commonly refer to as a timezone offset (or simply as an offset; e.g., see bdlt_datetimetz ). For example, the ISO 8601 string 2002-03-17T15:46:00+04:00 has a zone designator of +04:00, indicating a timezone 4 hours ahead of UTC.
An ISO 8601 fractional second corresponds to, for example, the combined millisecond and microsecond attributes of a bdlt::Time object, or the combined millisecond and microsecond attributes of a bdlt::Datetime object. For example, the Time value (and ISO 8601 string) 15:46:09.330000 has a millisecond attribute value of 330 and a microsecond attribute of 0 (i.e., a fractional second of .33).
Strings produced by the generate and generateRaw functions are a straightforward transposition of the attributes of the source value into an appropriate ISO 8601 format, and are best illustrated by a few examples. Note that for Datetime and DatetimeTz, the fractional second is generated with the precision specified in the configuration. Also note that for Time and TimeTz, the fractional second is generated with the precision specified in the configuration up to a maximum precision of 6.
[*] Note that the ISO 8601 specification does not have an equivalent to bdlt::DateTz.
The generate and generateRaw functions provide an optional configuration parameter. This optional parameter, of type Iso8601UtilConfiguration, enables configuration of four aspects of ISO 8601 string generation:
. or ,.: is optional in zone designators.Z is output for the zone designator instead of +00:00 (UTC).Iso8601UtilConfiguration has four attributes that directly correspond to these aspects. In addition, for generate methods that are not supplied with a configuration argument, a process-wide configuration takes effect. See bdlt_iso8601utilconfiguration for details.
The parse functions accept all strings that are produced by the generate functions. In addition, the parse functions accept some variation in the generated strings, the details of which are discussed next. Note that the parse methods are not configurable like the generate methods (i.e., via an optional Iso8601UtilConfiguration argument). Moreover, the process-wide configuration has no effect on parsing either. Instead, the parse methods automatically accept . or , as the decimal sign in fractional seconds, and treat +00:00, +0000, Z, and z as equivalent zone designators (all denoting UTC).
Parsing in Relaxed mode accepts "relaxed" ISO 8601 format that is a superset of the strict ISO 8601 format, meaning this function will parse ISO 8601 values as well as supporting some common variations. Currently this allows a SPACE character to be used as an alternative separator between date and time elements (where strict ISO 8601 requires a T), but the set of extensions may grow in the future. Relaxed parsing is done by specifying the relaxed option using the Iso8601UtilParseConfiguration type and passing it to the parse functions. There are also parseRelaxed functions which are now deprecated.
The zone designator is optional, and can be present when parsing for any type other than bsls::TimeInterval, i.e., even for Date, Time, and Datetime. If a zone designator is parsed for a Date, it must be valid, so it can affect the status value that is returned in that case, but it is otherwise ignored. For Time and Datetime, any zone designator present in the parsed string will affect the resulting object value (unless the zone designator denotes UTC) because the result is converted to UTC. If the zone designator is absent, it is treated as if +00:00 were specified:
There is also the basic format, which may be specified when parsing by using the bdlt::Iso8601UtilParseConfiguration type, in which case there are to be no -s in the Date and no :s in the Time. The : in the time zone is always optional, whether in basic or default format.
In the last example above, the conversion to UTC incurs a carry into the day attribute of the Date component of the resulting Datetime value. Note that if such a carry causes an underflow or overflow at the extreme ends of the valid range of dates (0001/01/01 and 9999/12/31), then parsing for Datetime fails.
The fractional second is optional. When the fractional second is absent, it is treated as if .0 were specified. When the fractional second is present, it can have one or more digits (i.e., it can contain more than six). For Datetime, DatetimeTz, Time, and TimeTz, if more than six digits are included in the fractional second, values are rounded to a full microsecond; i.e., values greater than or equal to .5 microseconds are rounded up. For bsls::TimeInterval, if more than nine digits are included in the fractional second, values are rounded to a full nanosecond; i.e., values greater than or equal to .5 microseconds are rounded up. These roundings may incur a carry of one second into the second attribute:
Note that, for Datetime and DatetimeTz, if a carry due to rounding of the fractional second would cause an overflow at the extreme upper end of the valid range of dates (i.e., 9999/12/31), then parsing would fail.
Leap seconds are not representable by bdlt::Time or bdlt::Datetime. Hence, they are not produced by any of the Iso8601Util generate functions. However, positive leap seconds are supported by the parse functions. A leap second is recognized when the value parsed for the second attribute of a Time is 60 – regardless of the values parsed for the hour, minute, millisecond, and microsecond attributes. Note that this behavior is more generous than that afforded by the ISO 8601 specification (which indicates that a positive leap second can only be represented as "23:59:60Z").
When a leap second is detected during parsing of an ISO 8601 string, the second attribute is taken to be 59, so that the value of the Time object can be validly set; then an additional second is added to the object. Note that the possible carry incurred by a leap second (i.e., when loading the result of parsing into a Datetime or DatetimeTz object) has the same potential for overflow as may occur with fractional seconds that are rounded up (although in admittedly pathological cases).
According to the ISO 8601 specification, the time 24:00 is interpreted as midnight, i.e., the last instant of a day. However, this concept is not supported by bdlt. Although 24:00 is representable by bdlt, i.e., as the default value for bdlt::Time, Time(24, 0) does not represent midnight when it is the value for the "time" attribute of a Datetime (or DatetimeTz) object. For example:
It is important to be aware of this peculiarity of Datetime (and DatetimeTz) as it relates to ISO 8601.
The following table shows some examples of parsing an ISO 8601 string containing "24:00". Note that parsing fails if the zone designator is not equivalent to "+00:00" when the time 24:00 is encountered:
An hour attribute value of 24 is also "preserved" by the generate functions provided by this component:
The syntax description below summarizes the ISO 8601 string representations supported by this component. Although not quoted (for readability), [+-:.,TtZz] are literal characters that can occur in ISO 8601 strings. Furthermore, for clarity, the (rarely used) lowercase t and z characters are omitted from the specifications below, as well as from the function-level documentation. The characters [YMDhms] each denote a decimal digit, {} brackets optional elements, () is used for grouping, and | separates alternatives:
The syntax description below summarizes the ISO 8601 string representations for durations supported by this component. Although not quoted (for readability), [.,PWDTHMS] are literal characters that can occur in ISO 8601 strings. The characters [wdhms] each denote a decimal digit, {} brackets optional elements, () is used for grouping, and | separates alternatives:
This section illustrates intended use of this component.
This example demonstrates basic use of one generate function and two parse functions.
First, we construct a few objects that are prerequisites for this and the following example:
Then, we construct a bdlt::DatetimeTz object for which a corresponding ISO 8601-compliant string will be generated shortly:
For comparison with the ISO 8601 string generated below, note that streaming the value of sourceDatetimeTz to stdout:
produces:
Next, we use a generate function to produce an ISO 8601-compliant string for sourceDatetimeTz, writing the output to a bsl::ostringstream, and assert that both the return value and the string that is produced are as expected:
For comparison, see the output that was produced by the streaming operator above.
Now, we parse the string that was just produced, loading the result of the parse into a second bdlt::DatetimeTz object, and assert that the parse was successful and that the target object has the same value as that of the original (i.e., sourceDatetimeTz):
Finally, we parse the iso8601 string a second time, this time loading the result into a bdlt::Datetime object (instead of a bdlt::DatetimeTz):
Note that this time the value of the target object has been converted to UTC.
This example demonstrates use of a bdlt::Iso8601UtilConfiguration object to influence the format of the ISO 8601 strings that are generated by this component by passing that configuration object to generate. We also take this opportunity to illustrate the flavor of the generate functions that outputs to a char * buffer of a specified length.
First, we construct a bdlt::TimeTz object for which a corresponding ISO 8601-compliant string will be generated shortly:
For comparison with the ISO 8601 string generated below, note that streaming the value of sourceTimeTz to stdout:
produces:
Then, we construct the bdlt::Iso8601UtilConfiguration object that indicates how we would like to affect the generated output ISO 8601 string. In this case, we want to use , as the decimal sign (in fractional seconds) and omit the : in zone designators:
Next, we define the char * buffer that will be used to stored the generated string. A buffer of size bdlt::Iso8601Util::k_TIMETZ_STRLEN + 1 is large enough to hold any string generated by this component for a bdlt::TimeTz object, including a null terminator:
Then, we use a generate function that accepts our configuration to produce an ISO 8601-compliant string for sourceTimeTz, this time writing the output to a char * buffer, and assert that both the return value and the string that is produced are as expected. Note that in comparing the return value against BUFLEN - 5 we account for the omission of the : from the zone designator, and also for the fact that, although a null terminator was generated, it is not included in the character count returned by generate. Also note that we use bsl::strcmp to compare the resulting string knowing that we supplied a buffer having sufficient capacity to accommodate a null terminator:
For comparison, see the output that was produced by the streaming operator above.
Next, we parse the string that was just produced, loading the result of the parse into a second bdlt::TimeTz object, and assert that the parse was successful and that the target object has the same value as that of the original (i.e., sourceTimeTz). Note that BUFLEN - 5 is passed and not BUFLEN because the former indicates the correct number of characters in buffer that we wish to parse:
Then, we parse the string in buffer a second time, this time loading the result into a bdlt::Time object (instead of a bdlt::TimeTz):
Note that this time the value of the target object has been converted to UTC.
Finally, we modify the configuration to display the bdlt::TimeTz without fractional seconds:
| #define BDLT_ISO8601UTIL_DEPRECATE_GENERATE |
| #define BDLT_ISO8601UTIL_DEPRECATE_GENERATE_ORDER |
| #define BDLT_ISO8601UTIL_DEPRECATE_GENERATERAW |
| #define BDLT_ISO8601UTIL_DEPRECATE_PARSERELAXED |