I am working with some complex data that originated in JSON format that I am downloading from Databricks into EXCEL format. What I am trying to do is parse out the Column names from an individual nested column. Basically something like this, The Excel file has multiple columns but a single column like Product can have multiple entries within (Nested). They don't always have the same amount of column headers within, some can have 10, some can have 1. I am trying to get the metadata of the column headers within Product to determine the schema of that field. I need someway to loop through the records and the field and parse all the would be column headers within Product - Product_Details, Product_Transfer, Product_Number, Product_Engine, Product_Lifecycle, Product_Service ect... . The data I am trying to parse is in the column Product between {} . Any advice would be appreciated
RowID, Product
100 { "Product_Details": "N/A", "Product_Transfer": "N/A", "Product_Number": "1", "Product_Engine": "Intel"}
101 { "Product_Lifecycle" : "2", "Product_Size" : "2x"}
102 { "Product_Details": "NA", "Product_Size": "1x", "Product_Service": "Amazon"}
RowID, Product
100 { "Product_Details": "N/A", "Product_Transfer": "N/A", "Product_Number": "1", "Product_Engine": "Intel"}
101 { "Product_Lifecycle" : "2", "Product_Size" : "2x"}
102 { "Product_Details": "NA", "Product_Size": "1x", "Product_Service": "Amazon"}