Query Amazon S3 Data And Get Results Instantly With Amazon Athena

Feb 19, 2020

Query Amazon S3 Data Instantly Using Standard SQL With Amazon Athena

Amazon Simple Storage Service (S3) is one of the most popular and widely used object storage services from AWS. Any amount of data can be stored on S3 and easily retrieved from anywhere, at any time. Data security, scalability, 99.999999999% durability and performance are the reasons for S3’s popularity.

What is Amazon Athena?

Amazon Athena is an interactive query service for querying data stored in Amazon S3. Athena works directly with the data in place, eliminating the need to aggregate or load it into Athena before querying.



How is Amazon Athena useful?

  1. Run ad-hoc queries using ANSI SQL and get results in seconds.
  2. Generate reports using business intelligence tools or SQL clients connected via ODBC or JDBC drivers.
  3. Integrate with Amazon QuickSight for easy visualizations.

Can Amazon Athena process unstructured or semi-structured data?

Yes. One of the biggest advantages of Athena is its ability to process structured, semi-structured and unstructured data stored in S3, as long as the files can be read as rows (for example, plain-text log files). It supports multiple data formats, including CSV, JSON, Avro, Apache Parquet and Apache ORC. Athena can also handle complex analysis, including large joins, window functions, and arrays.
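To make the idea of querying files in place more concrete, here is a minimal sketch that builds a CREATE EXTERNAL TABLE statement for data already sitting in S3. The table name, columns and bucket path are hypothetical, and the helper `build_create_table` is only an illustration, not part of any AWS library.

```python
# Storage clauses for a few of the formats Athena supports.
# CSV and JSON need a row-format/SerDe clause; columnar formats only need STORED AS.
STORAGE_CLAUSES = {
    "csv": "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','\nSTORED AS TEXTFILE",
    "json": "ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe'",
    "parquet": "STORED AS PARQUET",
    "orc": "STORED AS ORC",
}


def build_create_table(table, columns, data_format, location):
    """Return a CREATE EXTERNAL TABLE statement for data already in S3.

    All arguments are hypothetical example inputs.
    """
    if not location.endswith("/"):
        location += "/"  # LOCATION points at a folder (prefix), not a single file
    column_list = ",\n".join(f"  `{name}` {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n"
        f"{column_list}\n"
        f")\n"
        f"{STORAGE_CLAUSES[data_format.lower()]}\n"
        f"LOCATION '{location}'"
    )


ddl = build_create_table(
    table="weblogs",  # hypothetical table
    columns=[("request_time", "string"), ("status", "int"), ("bytes_sent", "bigint")],
    data_format="parquet",
    location="s3://example-bucket/weblogs",  # hypothetical bucket
)
print(ddl)
```

The statement only registers a schema over the existing files; no data is copied or loaded.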

How can I access Amazon Athena?

Athena can be accessed in multiple ways:

  1. Access via the AWS Management Console.
  2. Access via the API.
  3. Access via ODBC or JDBC driver.
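As an illustration of API access, the sketch below submits a query, polls until it finishes, and reads back the first page of results. It assumes `client` behaves like the object returned by `boto3.client("athena")`; the function name `run_query`, the polling interval and any bucket or database names you pass in are assumptions made for this example.

```python
import time


def run_query(client, sql, database, output_location, poll_seconds=1.0):
    """Submit a query, wait for it to finish, and return the result rows.

    `client` is expected to behave like boto3.client("athena").
    """
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    # Athena runs queries asynchronously, so poll until a terminal state.
    while True:
        execution = client.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        raise RuntimeError(f"Query {query_id} ended in state {state}")

    # First page of results only; each row becomes a list of cell values.
    result = client.get_query_results(QueryExecutionId=query_id)
    return [
        [cell.get("VarCharValue") for cell in row["Data"]]
        for row in result["ResultSet"]["Rows"]
    ]
```

With boto3 installed and credentials configured, a call might look like `run_query(boto3.client("athena"), "SELECT status FROM weblogs LIMIT 10", "default", "s3://example-bucket/athena-results/")`, where the table and bucket are placeholders. For SELECT queries the first returned row typically holds the column headers.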

How secure is Amazon Athena?

Amazon Athena allows you to control access to your data by using AWS Identity and Access Management (IAM) policies, access control lists (ACLs), and Amazon S3 bucket policies.

What are the advantages of Amazon Athena?

  1. Athena is serverless, which means there is no infrastructure to set up or pay for and no time spent on configuration or software updates.
  2. Athena automatically scales as datasets grow.
  3. Athena is optimized for fast performance. It automatically executes queries in parallel, so you get results in seconds, even on large datasets.

How will I be charged for the usage of Amazon Athena?

The charge is $5.00 per TB of data scanned. You pay only for the queries you run. There is no charge for DDL statements (CREATE/DROP/ALTER) or failed queries, but cancelled queries are charged based on the amount of data scanned before cancellation.
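As a rough worked example of this pricing, the sketch below estimates the charge for a query from the number of bytes it scanned. The $5.00 rate comes from this article; treating 1 TB as 1024^4 bytes and applying a 10 MB minimum per query are assumptions not stated here, so check them against the current pricing page before relying on the numbers.

```python
PRICE_PER_TB_USD = 5.00               # rate quoted in this article
BYTES_PER_TB = 1024 ** 4              # assumption: binary terabyte
MIN_BYTES_PER_QUERY = 10 * 1024 ** 2  # assumption: 10 MB minimum billed per query


def query_cost_usd(bytes_scanned):
    """Estimate the charge for one query that scanned data."""
    billable = max(bytes_scanned, MIN_BYTES_PER_QUERY)
    return billable / BYTES_PER_TB * PRICE_PER_TB_USD


print(f"1 TB scan:   ${query_cost_usd(1024 ** 4):.2f}")
print(f"200 GB scan: ${query_cost_usd(200 * 1024 ** 3):.4f}")
```

When using the API, the bytes scanned by a finished query are reported in its execution statistics, so an estimate like this can be computed per query.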

How can I reduce my per-query costs and see significant savings?

According to Amazon, you can save from 30% to 90% on your per-query costs by compressing your data, partitioning it, or converting it to columnar data formats. For complete pricing details, visit the official Amazon Athena pricing page.
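The arithmetic behind these savings can be sketched as follows. All figures are hypothetical: a 3 TB text table with four equally sized columns, a 3:1 compression ratio, and a query that reads a single column. Real savings depend on your data and queries.

```python
PRICE_PER_TB_USD = 5.00  # rate quoted in this article


def scan_cost_usd(table_tb, compression_ratio=1.0, fraction_read=1.0):
    """Cost of one query, given table size and how much of it is scanned."""
    scanned_tb = table_tb / compression_ratio * fraction_read
    return scanned_tb * PRICE_PER_TB_USD


# Hypothetical 3 TB table with four equally sized columns.
baseline = scan_cost_usd(3.0)                           # uncompressed text, full scan
compressed = scan_cost_usd(3.0, compression_ratio=3.0)  # compressed 3:1, full scan
columnar = scan_cost_usd(3.0, compression_ratio=3.0, fraction_read=0.25)  # one of four columns

for label, cost in [("baseline", baseline), ("compressed", compressed), ("columnar", columnar)]:
    print(f"{label}: ${cost:.2f}")
```

Partitioning works the same way in this model: a query that filters on the partition key reads only the matching partitions, which lowers `fraction_read`.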



