We believe in using the best tools to serve our customers. Our team of experts are always in search of innovative ideas and innovative products that can help them be more efficient. Amazon Redshift commanded attention of everyone in the house. It is a pretty impressive and useful data warehousing service.
“Our data analysis platform collects tens of terabytes of log data each day from a variety of internal and external sources. Migrating to Amazon Redshift two years ago allowed us to scale to over ten petabytes of uncompressed data with a ten times performance improvement over our prior on-premises system,” said Mick Etoh, Senior Vice President and General Manager of Innovation Management Department, NTT DOCOMO – Japan’s largest mobile service provider.
“Customers such as Amgen, Boingo Wireless, Electronic Arts, Hearst, Lyft, Nasdaq, Scholastic, TripAdvisor, and Yahoo! are migrating to Amazon Redshift in droves because it leverages the scale of AWS to analyze petabytes of data with ten times the performance at one-tenth the cost of old guard data warehouses,” said Raju Gulabani, Vice President, Databases, Analytics, and AI, AWS.
Before we get to the webinar let’s discuss this Data Warehousing service from AWS. So, what exactly is Amazon Redshift?
Amazon Redshift is a fully managed data warehouse service. It is amazingly scalable. You can start with a few hundred gigabytes of data and when needed, you can scale up to a petabyte and more. Customers can analyze all of their data stored on high performance local disks with the help of standard SQL and existing Business Intelligence (BI) tools at impressive speed. And yes, all this is available to the users for the tenth of the cost of traditional warehouses.
#Amazon Redshift is Fast
You can provision a data warehouse in minutes. Amazon Redshift uses columnar storage technology which improves I/O efficiency. You only have to create a data warehouse cluster and define your schema. You can then begin loading and querying your data. Amazon manages the provisioning, configuration and patching part. The queries are parallelized across multiple nodes which enables it to deliver fast query performance. The hardware is optimized for I/O intensive workloads at 4GB/Node/Sec. Networking is enhanced with over 1 million packets/Node/Sec. There is regular cadence of auto-patched improvements.
#Amazon Redshift is Scalable
With the help of a few clicks in your console or with a simple API call you can resize your Amazon Redshift Data Warehouse Cluster. When you modify the cluster, the changes you request for are immediately applied. You can run multiple clusters with the same data in Amazon S3 with the help of Redshift Spectrum. You can scale all the way up to petabyte or more. You can create very large data warehouses with the help of Dense Storage (DS) for minimal price point. Very high performance data warehouses can be created with the help of Dense Compute (DC) nodes. During the process of resizing you can continue to query your data warehouse in read-only mode until the new cluster is ready for use.
#Amazon Redshift Is Friendly & Compatible
Amazon Redshift is amazingly compatible with your preferred BI and ETL tools. It uses industry standard SQL and it can be accessed using standard PostgreSQL, JDBC and ODBC drivers. Custom JDBC and ODBC drivers can be downloaded from the AWS console. Integrations with popular BI and ETL vendors have been validated by Amazon Redshift. Most of them offer free trials to help you get started with loading and analyzing.
#Amazon Redshift Backs Up Your Data Continuously
Amazon Redshift replicates data across nodes. If a Node suddenly fails, there are always at least two more nodes ready to be pressed into service containing the same data. There is replication of data within the cluster and a backup with Amazon S3. This feature helps it in recovering from unexpected failures. Backups are continuous and automatic.
#Amazon Redshift Keeps Your Data Secure
Amazon Redshift uses industry standard encryption techniques to keep your data secure in transit and at rest. When your data is in transit it is kept secure by supporting SSL enabled connections between client application and your Redshift data warehouse cluster. When your data is at rest and is being written to disk, Amazon Redshift encrypts each block using hardware accelerated AES-256. All the data including intermediate query results is encrypted. Backed-up data is encrypted as well. Compute nodes are not directly accessible. Amazon Redshift takes care of your key management by default. If you so desire, you can manage your keys by using your own hardware security modules (HSMs) or with the help of AWS Key Management Service.
#Amazon Redshift Helps You Monitor Your Data
You can use AWS Management Console or Amazon CloudWatch APIs to monitor your Amazon Redshift Data warehouse clusters. You can track your metrics on storage utilization, read/write traffic, compute utilization and a lot more free of cost. You can also get information on query and cluster performance. You can diagnose performance issues and track the users and the type of queries that consume the maximum resources.
Know more about Amazon Redshift from our experts. We, at OpenXcell conduct webinars on the latest development tools and technologies. Do go through one on Amazon Redshift.
Key takeaways from the webinar are listed below.
- Importance of Data
- Data Growth Rate
- Data Warehouse Challenges
- Why Data Warehouse Services?
- AWS Database Services
- What is Amazon Redshift?
- Architecture of Amazon Redshift
- Benefits of Amazon Redshift
You can now go through the webinar.
You can download the Amazon Redshift presentation from here.
We routinely offer plenty of useful information through webinars. You can go through the webinar on ‘Serverless Architecture’ here. The one on ‘Serverless Integrations’ is here. The one on ‘Amazon Lamda in Action’ is here.
And that is all from my desk for now. Until next time, Adios.