[Data Ingestion and Storage] add new attribute Date Format to the StreamingAPI payload
To ensure consistency and correct parsing of date-type fields in streaming data (pipeline), we need to add a new attribute Date Format to the StreamingAPI payload.
in the same way as the separators for CSV files were included. This information needs to be stored in Governance as well.
Integration with Quality Evaluator
A new attribute, date_format
, should be added to the payload. At this stage, it is expected to follow the Python strftime/strptime format specification: https://docs.python.org/3/library/datetime.html#strftime-and-strptime-format-codes
Example:
{
"object_storage_url": "....",
"dataset_id": "b743a885-18e5-403e-b5cc-ff7e476430f0",
"column_types": {
"Year": "text",
"Semester": "text",
"Date": "text",
"Municipality": "text",
"Executed Network Connection Requests": "text",
"CodConcelho": "text"
},
"csv_options": {
"field_delimiter": ";",
"decimal_delimiter": "."
},
"charset": "UTF_8",
"mime_type": "text/csv",
"date_format": "%Y-%m-%dT%H:%M:%SZ"
}
Edited by Antoni Gimeno