Two validation reports describe ways to access and govern Parquet data stored on FSx for ONTAP from AWS analytics services using S3 Access Points (S3 APs). In the EMR Serverless Spark report, the author shows a fully serverless Spark ETL pipeline that reads and writes Parquet files directly through S3 APs using the EMRFS filesystem with an s3:// URI format. The validation includes Spark transformations such as GROUP BY and window functions and reports ~16 seconds of Spark execution time for a test workload, with ~37 seconds total including EMR Serverless cold start. It also identifies compatibility constraints: using s3a:// with S3 AP aliases fails, and Parquet timestamps must be microsecond resolution for Spark to read them.

In the Redshift Spectrum plus Lake Formation report, the author validates that Redshift Serverless can query FSx for ONTAP data through Spectrum external schemas mapped to the same Glue Catalog tables used by Athena, and that Lake Formation adds enterprise governance. The reports verify table-level access, fine-grained column-level permissions, row filtering, and LF-tag-based classification. Performance measurements show Redshift Serverless queries are slower than Athena for simple scans, and both Spectrum-based querying and governance changes depend on the configured Glue and Lake Formation permissions.