In our organization, we use S3 to store data as parquet files and work with Athena AWS. Currently, with Pigment's import mechanism based solely on integrated CSV files, we are compelled to establish a process that converts our parquet files into CSV. This isn't a favored method by our (BD&D) Team, to say the least.
I can also elaborate on the pros of using the native connector to Athena AWS:
- Improved Efficiency: A native connector eliminates the need to convert parquet files to CSV for import into Pigment. This can significantly reduce the time and resources spent on file conversion, leading to improved efficiency in data operations.
- Enhanced Data Quality: Using a native connector can reduce the potential for data errors or loss that might occur during the file conversion process.
- Streamlined Process: Integration with Athena AWS via a native connector simplifies the data workflow, making getting data from S3 storage into Pigment easier and quicker.
- Increased Flexibility: A native connector allows for more flexibility in handling data, as you can directly leverage the querying benefits of Athena AWS and the diverse data types it supports.
- Cost Savings: Direct integration with Athena AWS via a native connector bypasses the need to store and manage extra CSV files, potentially resulting in cost savings.
I am interested in understanding if there are other organizations, similar to ours, which use S3 for data storage as parquet files and work with Athena AWS, and feel the need for a native Athena AWS connector to Pigment.
As it stands, Pigment's import methodology is entirely reliant on integrated CSV files.
Do other companies share our belief that introducing a direct Athena AWS connector in Pigment would enhance its functionality and relevance?
Any shared experiences would be appreciated.