How would you optimize this scenario to solve performance issues and automate the process as much as possible?

A company is running a batch analysis every hour on their main transactional DB, running on an RDS MySQL
instance, to populate their central Data Warehouse running on Redshift. During the execution of the batch, their
transactional applications are very slow. When the batch completes, they need to update the top management
dashboard with the new data. The dashboard is produced by another system running on-premises that is
currently started when a manually sent email notifies that an update is required. The on-premises system
cannot be modified because it is managed by another team.
How would you optimize this scenario to solve performance issues and automate the process as much as
possible?

A.
Replace RDS with Redshift for the batch analysis and SNS to notify the on-premises system to update the
dashboard

B.
Replace RDS with Redshift for the batch analysis and SQS to send a message to the on-premises system to
update the dashboard

C.
Create an RDS Read Replica for the batch analysis and SNS to notify the on-premises system to update the
dashboard

D.
Create an RDS Read Replica for the batch analysis and SQS to send a message to the on-premises system to
update the dashboard.





Frank

Hi,

I would say C

– Since SNS is automating the “manually-sent email notifies”.
– Not D (SQS) since the on-premises system cannot be modified.

Thx, Frank
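
To make the SNS point concrete, here is a minimal sketch of how the batch job could publish a completion notice once the Redshift load finishes. It assumes a boto3-style SNS client is passed in and that the topic has an email subscription pointing at the mailbox the on-premises system already watches. The function name, subject line, and message text are hypothetical.

```python
def notify_dashboard_update(sns_client, topic_arn, batch_id):
    """Publish a 'batch finished' notice to an SNS topic.

    sns_client is assumed to behave like boto3.client("sns"). An email
    subscription on the topic would stand in for the manually sent email,
    so the on-premises system itself stays unchanged.
    """
    return sns_client.publish(
        TopicArn=topic_arn,
        Subject="Dashboard update required",  # hypothetical subject line
        Message=f"Hourly batch {batch_id} completed; please refresh the dashboard.",
    )
```

The batch script would call this as its last step, so no one has to send the email by hand.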

KwagongMakisig

C

Redshift can solve the performance problem, but it is not designed as a transactional database.

Krish

The question does not clarify whether the dashboard is part of the AWS solution or outside it. Either way, an SNS notification seems to make more sense. With that in mind, read replicas will help improve the situation without upsetting the apple cart. “C” is my choice.

Partha

1. Receivers have to poll SQS to receive messages. (Options “b” and “d” are incorrect.)
2. Both Amazon Redshift and Amazon RDS enable you to run traditional relational databases in the cloud while offloading database administration. Customers use Amazon RDS databases both for online-transaction processing (OLTP) and for reporting and analysis.

Amazon Redshift harnesses the scale and resources of multiple nodes and uses a variety of optimizations to provide order of magnitude improvements over traditional databases for analytic and reporting workloads against very large data sets. Amazon Redshift provides an excellent scale-out option as your data and query complexity grows or if you want to prevent your reporting and analytic processing from interfering with the performance of your OLTP workload.
3. Option “a” looks correct
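
The polling point in item 1 can be illustrated with a toy in-memory model. This is only a sketch of push versus pull delivery, not the AWS API: the `Topic` and `Queue` classes and the `inbox` list are made up for illustration.

```python
class Topic:
    """Toy stand-in for a push-style topic: publish() calls every subscriber."""

    def __init__(self):
        self.subscribers = []

    def subscribe(self, callback):
        self.subscribers.append(callback)

    def publish(self, message):
        for callback in self.subscribers:
            callback(message)


class Queue:
    """Toy stand-in for a pull-style queue: messages wait until someone polls."""

    def __init__(self):
        self.messages = []

    def send(self, message):
        self.messages.append(message)

    def receive(self):
        # Returns the oldest message, or None when the queue is empty.
        return self.messages.pop(0) if self.messages else None


inbox = []                     # the mailbox the on-premises system already watches
topic = Topic()
topic.subscribe(inbox.append)  # comparable to an email subscription on a topic
topic.publish("update dashboard")

queue = Queue()
queue.send("update dashboard")  # stays in the queue until a consumer calls receive()
```

With the topic, the message reaches the existing inbox without the receiver doing anything new. With the queue, the receiver would need new polling code, which the scenario rules out.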

Saad

I’m leaning towards C.

“if you want to prevent your reporting and analytic processing from interfering with the performance of your OLTP workload.”
If I understand the above statement correctly, they are saying to separate reporting and analytic processing from OLTP. In other words, use Redshift for reporting and analytic processing and use RDS for the OLTP workload.

Thoughts?

kirrim

agree with Saad’s logic, I vote C as well

vladam

C is the right answer.

You can’t easily replace RDS with Redshift as you need to load data into Redshift first.

orby

I think “transactional application” was the key phrase that makes C the appropriate choice.

Korean27

I think the best answer is A.
We use Redshift to solve performance issues.

ddbullfrog

Answer: C

I don’t think Redshift can be used for OLTP.

Q: When would I use Amazon Redshift vs. Amazon RDS?

Both Amazon Redshift and Amazon RDS enable you to run traditional relational databases in the cloud while offloading database administration. Customers use Amazon RDS databases both for online-transaction processing (OLTP) and for reporting and analysis. Amazon Redshift harnesses the scale and resources of multiple nodes and uses a variety of optimizations to provide order of magnitude improvements over traditional databases for analytic and reporting workloads against very large data sets. Amazon Redshift provides an excellent scale-out option as your data and query complexity grows or if you want to prevent your reporting and analytic processing from interfering with the performance of your OLTP workload.

Abe

Correct Answer is A

Focus on the first line of the question: “A company is running batch analysis on their main transactional database”.

Company is already using Redshift so A seems to be correct.

Moh

I have a lot of DB experience. In the question they are using a transactional DB on RDS MySQL, which populates the data in their data warehouse system running on Redshift. The problem is that when the batch job runs, it makes their transactional applications slow. The transactional applications use the transactional DB, which is MySQL. So the question is what we have to do on the RDS side to improve performance. Having a Read Replica of the transactional DB is the right choice, with SNS to notify. C is the answer.
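
As a rough sketch of the Read Replica idea, the batch job would read from a replica while the applications keep using the primary. The code below assumes a boto3-style RDS client is passed in. The instance identifiers, host names, and both function names are hypothetical.

```python
def create_batch_replica(rds_client, source_id, replica_id):
    """Create a read replica of the transactional instance for the batch job.

    rds_client is assumed to behave like boto3.client("rds").
    """
    return rds_client.create_db_instance_read_replica(
        DBInstanceIdentifier=replica_id,        # name of the new replica
        SourceDBInstanceIdentifier=source_id,   # the existing MySQL instance
    )


def endpoint_for(workload, primary_host, replica_host):
    """Send the hourly batch to the replica; everything else stays on the primary."""
    return replica_host if workload == "batch" else primary_host
```

The batch then extracts from the replica endpoint before loading Redshift, so its heavy reads no longer compete with the transactional traffic on the primary.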

Wajahat

Answer is A

Amazon Redshift, a hosted data warehouse product, forms part of the larger cloud-computing platform Amazon Web Services. It is built on top of technology from the massive parallel processing (MPP) data-warehouse company ParAccel (later acquired by Actian) (WIKI)

recovery22

C, no doubt.