Quick Links: |
Provide conversions between date/time objects and ISO 8601 strings. More...
Namespaces | |
namespace | bsls |
namespace | bdlt |
bdlt::Iso8601Util | namespace for ISO 8601 date/time conversion functions |
bdlt::Iso8601Util
, containing functions that convert bsls
timeinterval and bdlt
date, time, and datetime objects to and from ("generate" and "parse", respectively) corresponding string representations that are compliant with the ISO 8601 standard. The version of the ISO 8601 standard that is the basis for this component can be found at: http://dotat.at/tmp/ISO_8601-2004_E.pdf
Iso8601Util
functions support what ISO 8601 refers to as complete representations in extended format. We first present a brief overview before delving into the details of the ISO 8601 representations that are supported for each of the relevant vocabulary types. generate
and generateRaw
) takes an object and a char *
buffer, bsl::string
, or bsl::ostream
, and writes an ISO 8601 representation of the object to the buffer, string, or stream. The "raw" functions are distinguished from their non-"raw" counterparts in three respects: char *
buffer is not supplied to the generateRaw
functions. generateRaw
functions do not output a null terminator. generate
functions that provide an int bufferLength
parameter truncate the generated output to bufferLength
characters. (Neither the generateRaw
functions nor the generate
functions taking bsl::string
or bsl::ostream
do any truncation of their generated output.) char *
buffer or string, or a reference to the stream, is returned. (Note that the generating functions also take an optional bdlt::Iso8601UtilConfiguration
object, which is discussed shortly.) parse
) take the address of a target object and a const char *
(paired with a length
argument) or bsl::string_view
, and loads the object with the result of parsing the character string. Since parsing can fail, the parse functions return an int
status value (0 for success and a non-zero value for failure). Note that, besides elementary syntactical considerations, the validity of parsed strings are subject to the semantic constraints imposed by the various isValid*
class methods, (i.e., Date::isValidYearMonthDay
, Time::isValid
, etc.). bdlt
components commonly refer to as a timezone offset (or simply as an offset; e.g., see bdlt_datetimetz
). For example, the ISO 8601 string 2002-03-17T15:46:00+04:00
has a zone designator of +04:00
, indicating a timezone 4 hours ahead of UTC. millisecond
and microsecond
attributes of a bdlt::Time
object, or the combined millisecond
and microsecond
attributes of a bdlt::Datetime
object. For example, the Time
value (and ISO 8601 string) 15:46:09.330000
has a millisecond
attribute value of 330 and a microsecond
attribute of 0 (i.e., a fractional second of .33). generate
and generateRaw
functions are a straightforward transposition of the attributes of the source value into an appropriate ISO 8601 format, and are best illustrated by a few examples. Note that for Datetime
and DatetimeTz
, the fractional second is generated with the precision specified in the configuration. Also note that for Time
and TimeTz
, the fractional second is generated with the precision specified in the configuration up to a maximum precision of 6. +--------------------------------------+---------------------------------+ | Object Value | Generated ISO 8601 String | | | (using default configuration) | +======================================+=================================+ | bsls::TimeInterval(1000, 3000000) | PT16M40.003S | +--------------------------------------+---------------------------------+ | Date(2002, 03, 17) | 2002-03-17 | +--------------------------------------+---------------------------------+ | Time(15, 46, 09, 330) | 15:46:09.330 | +--------------------------------------+---------------------------------+ | Datetime(Date(2002, 03, 17) | | | Time(15, 46, 09, 330)) | 2002-03-17T15:46:09.330 | +--------------------------------------+---------------------------------+ | DateTz(Date(2002, 03, 17), -120) | 2002-03-17-02:00 [*] | +--------------------------------------+---------------------------------+ | TimeTz(Time(15, 46, 09, 330), 270) | 15:46:09.330+04:30 | +--------------------------------------+---------------------------------+ | DatetimeTz(Datetime( | | | Date(2002, 03, 17), | | | Time(15, 46, 09, 330)), | | | 0) | 2002-03-17T15:46:09.330+00:00 | +--------------------------------------+---------------------------------+
bdlt::DateTz
. generate
and generateRaw
functions provide an optional configuration parameter. This optional parameter, of type Iso8601UtilConfiguration
, enables configuration of four aspects of ISO 8601 string generation: .
or ,
. :
is optional in zone designators. Z
is output for the zone designator instead of +00:00
(UTC). Iso8601UtilConfiguration
has four attributes that directly correspond to these aspects. In addition, for generate methods that are not supplied with a configuration argument, a process-wide configuration takes effect. See bdlt_iso8601utilconfiguration
for details. parse
functions accept all strings that are produced by the generate functions. In addition, the parse functions accept some variation in the generated strings, the details of which are discussed next. Note that the parse methods are not configurable like the generate methods (i.e., via an optional Iso8601UtilConfiguration
argument). Moreover, the process-wide configuration has no effect on parsing either. Instead, the parse methods automatically accept .
or ,
as the decimal sign in fractional seconds, and treat +00:00
, +0000
, Z
, and z
as equivalent zone designators (all denoting UTC). parseRelaxed
functions accept "relaxed" ISO 8601 format that is a superset of the strict ISO 8601 format, meaning this function will parse ISO 8601 values as well as supporting some common variations. Currently this allows a SPACE character to be used as an alternative separator between date and time elements (where strict ISO 8601 requires a T
), but the set of extensions may grow in the future. Date
, Time
, and Datetime
. If a zone designator is parsed for a Date
, it must be valid, so it can affect the status value that is returned in that case, but it is otherwise ignored. For Time
and Datetime
, any zone designator present in the parsed string will affect the resulting object value (unless the zone designator denotes UTC) because the result is converted to UTC. If the zone designator is absent, it is treated as if +00:00
were specified: +------------------------------------+-----------------------------------+ | Parsed ISO 8601 String | Result Object Value | +====================================+===================================+ | 2002-03-17-02:00 | Date(2002, 03, 17) | | | # zone designator ignored | +------------------------------------+-----------------------------------+ | 2002-03-17-02:65 | Date: parsing fails | | | # invalid zone designator | +------------------------------------+-----------------------------------+ | 15:46:09.330+04:30 | Time(11, 16, 09, 330) | | | # converted to UTC | +------------------------------------+-----------------------------------+ | 15:46:09.330+04:30 | TimeTz(Time(15, 46, 09, 330), | | | 270) | +------------------------------------+-----------------------------------+ | 15:46:09.330 | TimeTz(Time(15, 46, 09, 330), | | | 0) | | | # implied '+00:00' | +------------------------------------+-----------------------------------+ | 2002-03-17T23:46:09.222-05:00 | Datetime(Date(2002, 03, 18), | | | Time(04, 46, 09, 222)) | | | # carry into 'day' attribute | | | # when converted to UTC | +------------------------------------+-----------------------------------+
day
attribute of the Date
component of the resulting Datetime
value. Note that if such a carry causes an underflow or overflow at the extreme ends of the valid range of dates (0001/01/01 and 9999/12/31), then parsing for Datetime
fails. .0
were specified. When the fractional second is present, it can have one or more digits (i.e., it can contain more than six). For Datetime
, DatetimeTz
, Time
, and TimeTz
, if more than six digits are included in the fractional second, values are rounded to a full microsecond; i.e., values greater than or equal to .5 microseconds are rounded up. For bsls::TimeInterval
, if more than nine digits are included in the fractional second, values are rounded to a full nanosecond; i.e., values greater than or equal to .5 microseconds are rounded up. These roundings may incur a carry of one second into the second
attribute: +--------------------------------------+---------------------------------+
| Parsed ISO 8601 String | Result Object Value |
+======================================+=================================+
| 15:46:09.1 | Time(15, 46, 09, 100) |
+--------------------------------------+---------------------------------+
| 15:46:09-05:00 | TimeTz(Time(15, 46, 09, 000), |
| | -300) |
| | # implied '.0' |
+--------------------------------------+---------------------------------+
| 15:46:09.99999949 | Time(15, 46, 09, 999, 999) |
| | # truncate last two digits |
+--------------------------------------+---------------------------------+
| 15:46:09.9999995 | Time(15, 46, 10, 000, 000) |
| | # round up and carry |
+--------------------------------------+---------------------------------+
Datetime
and DatetimeTz
, if a carry due to rounding of the fractional second would cause an overflow at the extreme upper end of the valid range of dates (i.e., 9999/12/31), then parsing would fail. bdlt::Time
or bdlt::Datetime
. Hence, they are not produced by any of the Iso8601Util
generate functions. However, positive leap seconds are supported by the parse functions. A leap second is recognized when the value parsed for the second
attribute of a Time
is 60 -- regardless of the values parsed for the hour
, minute
, millisecond
, and microsecond
attributes. Note that this behavior is more generous than that afforded by the ISO 8601 specification (which indicates that a positive leap second can only be represented as "23:59:60Z"). second
attribute is taken to be 59, so that the value of the Time
object can be validly set; then an additional second is added to the object. Note that the possible carry incurred by a leap second (i.e., when loading the result of parsing into a Datetime
or DatetimeTz
object) has the same potential for overflow as may occur with fractional seconds that are rounded up (although in admittedly pathological cases). bdlt
. Although 24:00 is representable by bdlt
, i.e., as the default value for bdlt::Time
, Time(24, 0)
does not represent midnight when it is the value for the "time" attribute of a Datetime
(or DatetimeTz
) object. For example: bdlt::Datetime notMidnight = bdlt::Datetime(bdlt::Date(2002, 03, 17), bdlt::Time(24, 0, 0)); notMidnight.addSeconds(1); assert(notMidnight == bdlt::Datetime(bdlt::Date(2002, 03, 17), bdlt::Time( 0, 0, 1));
Datetime
(and DatetimeTz
) as it relates to ISO 8601. +------------------------------------+-----------------------------------+ | Parsed ISO 8601 String | Result Object Value | +====================================+===================================+ | 24:00:00.000000 | Time(24, 0, 0, 0) | | | # preserve default 'Time' value | +------------------------------------+-----------------------------------+ | 24:00:00.000000-04:00 | TimeTz: parsing fails | | | # zone designator not UTC | +------------------------------------+-----------------------------------+ | 0001-01-01T24:00:00.000000 | Datetime(Date(0001, 01, 01), | | | Time(24, 0, 0, 0)) | | | # preserve 'Datetime' default | | | # value | +------------------------------------+-----------------------------------+ | 2002-03-17T24:00:00.000000 | Datetime(Date(2002, 03, 17), | | | Time(24, 0, 0, 0)) | | | # preserve default 'Time' value | +------------------------------------+-----------------------------------+
hour
attribute value of 24 is also "preserved" by the generate functions provided by this component: +------------------------------------+-----------------------------------+ | Source Object Value | Generated ISO 8601 String | +====================================+===================================+ | Time(24, 0, 0, 0) | 24:00:00.000 | +------------------------------------+-----------------------------------+ | Datetime(Date(2002, 03, 17), | 2002-03-17T24:00:00.000 | | Time(24, 0, 0, 0)) | | +------------------------------------+-----------------------------------+
[+-:.,TtZz]
are literal characters that can occur in ISO 8601 strings. Furthermore, for clarity, the (rarely used) lowercase t
and z
characters are omitted from the specifications below, as well as from the function-level documentation. The characters [YMDhms]
each denote a decimal digit, {}
brackets optional elements, ()
is used for grouping, and |
separates alternatives: <Generated Date> ::= <DATE> <Parsed Date> ::= <Parsed DateTz> <Generated DateTz> ::= <DATE><ZONE> <Parsed DateTz> ::= <DATE>{<ZONE>} <Generated Time> ::= <TIME FLEXIBLE> <Parsed Time> ::= <Parsed TimeTz> <Generated TimeTz> ::= <TIME FLEXIBLE><ZONE> <Parsed TimeTz> ::= <TIME FLEXIBLE>{<ZONE>} <Generated Datetime> ::= <DATE>T<TIME FLEXIBLE> <Parsed Datetime> ::= <Parsed DatetimeTz> <Generated DatetimeTz> ::= <DATE>T<TIME FLEXIBLE><ZONE> <Parsed DatetimeTz> ::= <DATE>T<TIME FLEXIBLE>{<ZONE>} <DATE> ::= YYYY-MM-DD <TIME FLEXIBLE> ::= hh:mm:ss{(.|,)s+} # one or more digits in the # fractional second <ZONE> ::= (+|-)hh{:}mm|Z # zone designator
[.,PWDTHMS]
are literal characters that can occur in ISO 8601 strings. The characters [wdhms]
each denote a decimal digit, {}
brackets optional elements, ()
is used for grouping, and |
separates alternatives: <Date Duration> ::= {w+W}{d+D} <Time Duration> ::= T{h+H}{m+M}{s+{(.|,)s+}S} # must contain # at least one # optional field # (i.e. "T" is # not valid) <Generated Duration> ::= P<Date Duration><Time Duration> # all values # guaranteed to # be less than # their modulus # (weeks can be # be up to 14 # digits), and # it must # contain the # seconds # portion <Parsed Duration> ::= P<Date Duration>{<Time Duration>} # must contain # at least one # optional field # in <Date # Duration> or # must contain # a <Time # Duration> # (i.e. "P" is # not valid)
generate
function and two parse
functions. const bdlt::Date date(2005, 1, 31); // 2005/01/31 const bdlt::Time time(8, 59, 59, 123); // 08:59:59.123 const int tzOffset = 240; // +04:00 (four hours west of UTC)
bdlt::DatetimeTz
object for which a corresponding ISO 8601-compliant string will be generated shortly: const bdlt::DatetimeTz sourceDatetimeTz(bdlt::Datetime(date, time), tzOffset);
sourceDatetimeTz
to stdout
: bsl::cout << sourceDatetimeTz << bsl::endl;
31JAN2005_08:59:59.123000+0400
generate
function to produce an ISO 8601-compliant string for sourceDatetimeTz
, writing the output to a bsl::ostringstream
, and assert that both the return value and the string that is produced are as expected: bsl::ostringstream oss; const bsl::ostream& ret = bdlt::Iso8601Util::generate(oss, sourceDatetimeTz); assert(&oss == &ret); const bsl::string iso8601 = oss.str(); assert(iso8601 == "2005-01-31T08:59:59.123+04:00");
bdlt::DatetimeTz
object, and assert that the parse was successful and that the target object has the same value as that of the original (i.e., sourceDatetimeTz
): bdlt::DatetimeTz targetDatetimeTz; int rc = bdlt::Iso8601Util::parse(&targetDatetimeTz, iso8601.c_str(), static_cast<int>(iso8601.length())); assert( 0 == rc); assert(sourceDatetimeTz == targetDatetimeTz);
iso8601
string a second time, this time loading the result into a bdlt::Datetime
object (instead of a bdlt::DatetimeTz
): bdlt::Datetime targetDatetime; rc = bdlt::Iso8601Util::parse(&targetDatetime, iso8601.c_str(), static_cast<int>(iso8601.length())); assert( 0 == rc); assert(sourceDatetimeTz.utcDatetime() == targetDatetime);
bdlt::Iso8601UtilConfiguration
object to influence the format of the ISO 8601 strings that are generated by this component by passing that configuration object to generate
. We also take this opportunity to illustrate the flavor of the generate
functions that outputs to a char *
buffer of a specified length. bdlt::TimeTz
object for which a corresponding ISO 8601-compliant string will be generated shortly: const bdlt::TimeTz sourceTimeTz(time, tzOffset);
sourceTimeTz
to stdout
: bsl::cout << sourceTimeTz << bsl::endl;
08:59:59.123+0400
bdlt::Iso8601UtilConfiguration
object that indicates how we would like to affect the generated output ISO 8601 string. In this case, we want to use ,
as the decimal sign (in fractional seconds) and omit the :
in zone designators: bdlt::Iso8601UtilConfiguration configuration; configuration.setOmitColonInZoneDesignator(true); configuration.setUseCommaForDecimalSign(true);
char *
buffer that will be used to stored the generated string. A buffer of size bdlt::Iso8601Util::k_TIMETZ_STRLEN + 1
is large enough to hold any string generated by this component for a bdlt::TimeTz
object, including a null terminator: const int BUFLEN = bdlt::Iso8601Util::k_TIMETZ_STRLEN + 1; char buffer[BUFLEN];
generate
function that accepts our configuration
to produce an ISO 8601-compliant string for sourceTimeTz
, this time writing the output to a char *
buffer, and assert that both the return value and the string that is produced are as expected. Note that in comparing the return value against BUFLEN - 5
we account for the omission of the :
from the zone designator, and also for the fact that, although a null terminator was generated, it is not included in the character count returned by generate
. Also note that we use bsl::strcmp
to compare the resulting string knowing that we supplied a buffer having sufficient capacity to accommodate a null terminator: rc = bdlt::Iso8601Util::generate(buffer, BUFLEN, sourceTimeTz, configuration); assert(BUFLEN - 5 == rc); assert( 0 == bsl::strcmp(buffer, "08:59:59,123+0400"));
bdlt::TimeTz
object, and assert that the parse was successful and that the target object has the same value as that of the original (i.e., sourceTimeTz
). Note that BUFLEN - 5
is passed and not BUFLEN
because the former indicates the correct number of characters in buffer
that we wish to parse: bdlt::TimeTz targetTimeTz; rc = bdlt::Iso8601Util::parse(&targetTimeTz, buffer, BUFLEN - 5); assert( 0 == rc); assert(sourceTimeTz == targetTimeTz);
buffer
a second time, this time loading the result into a bdlt::Time
object (instead of a bdlt::TimeTz
): bdlt::Time targetTime; rc = bdlt::Iso8601Util::parse(&targetTime, buffer, BUFLEN - 5); assert( 0 == rc); assert(sourceTimeTz.utcTime() == targetTime);
configuration
to display the bdlt::TimeTz
without fractional seconds: configuration.setFractionalSecondPrecision(0); rc = bdlt::Iso8601Util::generate(buffer, BUFLEN, sourceTimeTz, configuration); assert(BUFLEN - 9 == rc); assert( 0 == bsl::strcmp(buffer, "08:59:59+0400"));