BDE 4.14.0 Production release
|
Macros | |
#define | BDLT_ISO8601UTIL_DEPRECATE_GENERATE |
#define | BDLT_ISO8601UTIL_DEPRECATE_GENERATE_ORDER |
#define | BDLT_ISO8601UTIL_DEPRECATE_GENERATERAW |
#define | BDLT_ISO8601UTIL_DEPRECATE_PARSERELAXED |
Provide conversions between date/time objects and ISO 8601 strings.
This component provides a namespace, bdlt::Iso8601Util
, containing functions that convert bsls
timeinterval and bdlt
date, time, and datetime objects to and from ("generate" and "parse", respectively) corresponding string representations that are compliant with the ISO 8601 standard. The version of the ISO 8601 standard that is the basis for this component can be found at:
In general terms, Iso8601Util
functions support what ISO 8601 refers to as complete representations in extended format. We first present a brief overview before delving into the details of the ISO 8601 representations that are supported for each of the relevant vocabulary types.
Each function that generates ISO 8601 strings (named generate
and generateRaw
) takes an object and a char *
buffer, bsl::string
, or bsl::ostream
, and writes an ISO 8601 representation of the object to the buffer, string, or stream. The "raw" functions are distinguished from their non-"raw" counterparts in three respects:
char *
buffer is not supplied to the generateRaw
functions.generateRaw
functions do not output a null terminator.generate
functions that provide an ptrdiff_t bufferLength
parameter truncate the generated output to bufferLength
characters. (Neither the generateRaw
functions nor the generate
functions taking bsl::string
or bsl::ostream
do any truncation of their generated output.)Since the generate functions always succeed, no status value is returned. Instead, either the number of characters output to the char *
buffer or string, or a reference to the stream, is returned. (Note that the generating functions also take an optional bdlt::Iso8601UtilConfiguration
object, which is discussed shortly.)
Each function that parses ISO 8601 strings (named parse
) take the address of a target object and a const char *
(paired with a length
argument) or bsl::string_view
, and loads the object with the result of parsing the character string. Since parsing can fail, the parse functions return an int
status value (0 for success and a non-zero value for failure). Note that, besides elementary syntactical considerations, the validity of parsed strings are subject to the semantic constraints imposed by the various isValid*
class methods, (i.e., Date::isValidYearMonthDay
, Time::isValid
, etc.).
As this component concerns ISO 8601, some terms from that specification are used liberally in what follows. Two ISO 8601 terms of particular note are zone designator and fractional second.
An ISO 8601 zone designator corresponds to what other bdlt
components commonly refer to as a timezone offset (or simply as an offset; e.g., see bdlt_datetimetz ). For example, the ISO 8601 string 2002-03-17T15:46:00+04:00
has a zone designator of +04:00
, indicating a timezone 4 hours ahead of UTC.
An ISO 8601 fractional second corresponds to, for example, the combined millisecond
and microsecond
attributes of a bdlt::Time
object, or the combined millisecond
and microsecond
attributes of a bdlt::Datetime
object. For example, the Time
value (and ISO 8601 string) 15:46:09.330000
has a millisecond
attribute value of 330 and a microsecond
attribute of 0 (i.e., a fractional second of .33).
Strings produced by the generate
and generateRaw
functions are a straightforward transposition of the attributes of the source value into an appropriate ISO 8601 format, and are best illustrated by a few examples. Note that for Datetime
and DatetimeTz
, the fractional second is generated with the precision specified in the configuration. Also note that for Time
and TimeTz
, the fractional second is generated with the precision specified in the configuration up to a maximum precision of 6.
[*] Note that the ISO 8601 specification does not have an equivalent to bdlt::DateTz
.
The generate
and generateRaw
functions provide an optional configuration parameter. This optional parameter, of type Iso8601UtilConfiguration
, enables configuration of four aspects of ISO 8601 string generation:
.
or ,
.:
is optional in zone designators.Z
is output for the zone designator instead of +00:00
(UTC).Iso8601UtilConfiguration
has four attributes that directly correspond to these aspects. In addition, for generate methods that are not supplied with a configuration argument, a process-wide configuration takes effect. See bdlt_iso8601utilconfiguration for details.
The parse
functions accept all strings that are produced by the generate functions. In addition, the parse functions accept some variation in the generated strings, the details of which are discussed next. Note that the parse methods are not configurable like the generate methods (i.e., via an optional Iso8601UtilConfiguration
argument). Moreover, the process-wide configuration has no effect on parsing either. Instead, the parse methods automatically accept .
or ,
as the decimal sign in fractional seconds, and treat +00:00
, +0000
, Z
, and z
as equivalent zone designators (all denoting UTC).
Parsing in Relaxed
mode accepts "relaxed" ISO 8601 format that is a superset of the strict ISO 8601 format, meaning this function will parse ISO 8601 values as well as supporting some common variations. Currently this allows a SPACE character to be used as an alternative separator between date and time elements (where strict ISO 8601 requires a T
), but the set of extensions may grow in the future. Relaxed
parsing is done by specifying the relaxed option using the Iso8601UtilParseConfiguration
type and passing it to the parse
functions. There are also parseRelaxed
functions which are now deprecated.
The zone designator is optional, and can be present when parsing for any type other than bsls::TimeInterval, i.e., even for Date
, Time
, and Datetime
. If a zone designator is parsed for a Date
, it must be valid, so it can affect the status value that is returned in that case, but it is otherwise ignored. For Time
and Datetime
, any zone designator present in the parsed string will affect the resulting object value (unless the zone designator denotes UTC) because the result is converted to UTC. If the zone designator is absent, it is treated as if +00:00
were specified:
There is also the basic
format, which may be specified when parsing by using the bdlt::Iso8601UtilParseConfiguration
type, in which case there are to be no -
s in the Date
and no :
s in the Time
. The :
in the time zone is always optional, whether in basic or default format.
In the last example above, the conversion to UTC incurs a carry into the day
attribute of the Date
component of the resulting Datetime
value. Note that if such a carry causes an underflow or overflow at the extreme ends of the valid range of dates (0001/01/01 and 9999/12/31), then parsing for Datetime
fails.
The fractional second is optional. When the fractional second is absent, it is treated as if .0
were specified. When the fractional second is present, it can have one or more digits (i.e., it can contain more than six). For Datetime
, DatetimeTz
, Time
, and TimeTz
, if more than six digits are included in the fractional second, values are rounded to a full microsecond; i.e., values greater than or equal to .5 microseconds are rounded up. For bsls::TimeInterval
, if more than nine digits are included in the fractional second, values are rounded to a full nanosecond; i.e., values greater than or equal to .5 microseconds are rounded up. These roundings may incur a carry of one second into the second
attribute:
Note that, for Datetime
and DatetimeTz
, if a carry due to rounding of the fractional second would cause an overflow at the extreme upper end of the valid range of dates (i.e., 9999/12/31), then parsing would fail.
Leap seconds are not representable by bdlt::Time
or bdlt::Datetime
. Hence, they are not produced by any of the Iso8601Util
generate functions. However, positive leap seconds are supported by the parse functions. A leap second is recognized when the value parsed for the second
attribute of a Time
is 60 – regardless of the values parsed for the hour
, minute
, millisecond
, and microsecond
attributes. Note that this behavior is more generous than that afforded by the ISO 8601 specification (which indicates that a positive leap second can only be represented as "23:59:60Z").
When a leap second is detected during parsing of an ISO 8601 string, the second
attribute is taken to be 59, so that the value of the Time
object can be validly set; then an additional second is added to the object. Note that the possible carry incurred by a leap second (i.e., when loading the result of parsing into a Datetime
or DatetimeTz
object) has the same potential for overflow as may occur with fractional seconds that are rounded up (although in admittedly pathological cases).
According to the ISO 8601 specification, the time 24:00 is interpreted as midnight, i.e., the last instant of a day. However, this concept is not supported by bdlt
. Although 24:00 is representable by bdlt
, i.e., as the default value for bdlt::Time
, Time(24, 0)
does not represent midnight when it is the value for the "time" attribute of a Datetime
(or DatetimeTz
) object. For example:
It is important to be aware of this peculiarity of Datetime
(and DatetimeTz
) as it relates to ISO 8601.
The following table shows some examples of parsing an ISO 8601 string containing "24:00". Note that parsing fails if the zone designator is not equivalent to "+00:00" when the time 24:00 is encountered:
An hour
attribute value of 24 is also "preserved" by the generate functions provided by this component:
The syntax description below summarizes the ISO 8601 string representations supported by this component. Although not quoted (for readability), [+-:.,TtZz]
are literal characters that can occur in ISO 8601 strings. Furthermore, for clarity, the (rarely used) lowercase t
and z
characters are omitted from the specifications below, as well as from the function-level documentation. The characters [YMDhms]
each denote a decimal digit, {}
brackets optional elements, ()
is used for grouping, and |
separates alternatives:
The syntax description below summarizes the ISO 8601 string representations for durations supported by this component. Although not quoted (for readability), [.,PWDTHMS]
are literal characters that can occur in ISO 8601 strings. The characters [wdhms]
each denote a decimal digit, {}
brackets optional elements, ()
is used for grouping, and |
separates alternatives:
This section illustrates intended use of this component.
This example demonstrates basic use of one generate
function and two parse
functions.
First, we construct a few objects that are prerequisites for this and the following example:
Then, we construct a bdlt::DatetimeTz
object for which a corresponding ISO 8601-compliant string will be generated shortly:
For comparison with the ISO 8601 string generated below, note that streaming the value of sourceDatetimeTz
to stdout
:
produces:
Next, we use a generate
function to produce an ISO 8601-compliant string for sourceDatetimeTz
, writing the output to a bsl::ostringstream
, and assert that both the return value and the string that is produced are as expected:
For comparison, see the output that was produced by the streaming operator above.
Now, we parse the string that was just produced, loading the result of the parse into a second bdlt::DatetimeTz
object, and assert that the parse was successful and that the target object has the same value as that of the original (i.e., sourceDatetimeTz
):
Finally, we parse the iso8601
string a second time, this time loading the result into a bdlt::Datetime
object (instead of a bdlt::DatetimeTz
):
Note that this time the value of the target object has been converted to UTC.
This example demonstrates use of a bdlt::Iso8601UtilConfiguration
object to influence the format of the ISO 8601 strings that are generated by this component by passing that configuration object to generate
. We also take this opportunity to illustrate the flavor of the generate
functions that outputs to a char *
buffer of a specified length.
First, we construct a bdlt::TimeTz
object for which a corresponding ISO 8601-compliant string will be generated shortly:
For comparison with the ISO 8601 string generated below, note that streaming the value of sourceTimeTz
to stdout
:
produces:
Then, we construct the bdlt::Iso8601UtilConfiguration
object that indicates how we would like to affect the generated output ISO 8601 string. In this case, we want to use ,
as the decimal sign (in fractional seconds) and omit the :
in zone designators:
Next, we define the char *
buffer that will be used to stored the generated string. A buffer of size bdlt::Iso8601Util::k_TIMETZ_STRLEN + 1
is large enough to hold any string generated by this component for a bdlt::TimeTz
object, including a null terminator:
Then, we use a generate
function that accepts our configuration
to produce an ISO 8601-compliant string for sourceTimeTz
, this time writing the output to a char *
buffer, and assert that both the return value and the string that is produced are as expected. Note that in comparing the return value against BUFLEN - 5
we account for the omission of the :
from the zone designator, and also for the fact that, although a null terminator was generated, it is not included in the character count returned by generate
. Also note that we use bsl::strcmp
to compare the resulting string knowing that we supplied a buffer having sufficient capacity to accommodate a null terminator:
For comparison, see the output that was produced by the streaming operator above.
Next, we parse the string that was just produced, loading the result of the parse into a second bdlt::TimeTz
object, and assert that the parse was successful and that the target object has the same value as that of the original (i.e., sourceTimeTz
). Note that BUFLEN - 5
is passed and not BUFLEN
because the former indicates the correct number of characters in buffer
that we wish to parse:
Then, we parse the string in buffer
a second time, this time loading the result into a bdlt::Time
object (instead of a bdlt::TimeTz
):
Note that this time the value of the target object has been converted to UTC.
Finally, we modify the configuration
to display the bdlt::TimeTz
without fractional seconds:
#define BDLT_ISO8601UTIL_DEPRECATE_GENERATE |
#define BDLT_ISO8601UTIL_DEPRECATE_GENERATE_ORDER |
#define BDLT_ISO8601UTIL_DEPRECATE_GENERATERAW |
#define BDLT_ISO8601UTIL_DEPRECATE_PARSERELAXED |