When the batch completes they need to update the top ma…

A company is running a batch analysis every hour on their main transactional DB, running
on an RDS MySQL instance, to populate their central Data Warehouse running on Redshift. During the
execution of the batch, their transactional applications are very slow. When the batch completes they need to
update the top management dashboard with the new dat

A company is running a batch analysis every hour on their main transactional DB, running
on an RDS MySQL instance, to populate their central Data Warehouse running on Redshift. During the
execution of the batch, their transactional applications are very slow. When the batch completes they need to
update the top management dashboard with the new dat

A.
The dashboard is produced by another system running on-premises that is currently started when a
manually-sent email notifies that an update is required. The on-premises system cannot be modified
because is managed by another team.
How would you optimize this scenario to solve performance issues and automate the process as much as
possible?

B.
Replace RDS with Redshift for the batch analysis and SNS to notify the on-premises system to update the
dashboard

C.
Replace RDS with Redshift for the oaten analysis and SQS to send a message to the on-premises system
to update the dashboard

D.
Create an RDS Read Replica for the batch analysis and SNS to notify me on-premises system to update
the dashboard

E.
Create an RDS Read Replica for the batch analysis and SQS to send a message to the on-premises
system to update the dashboard.



Leave a Reply 5

Your email address will not be published. Required fields are marked *

10 − 9 =


Salim Masani

Salim Masani

D.
Create an RDS Read Replica for the batch analysis and SNS to notify me on-premises system to update
the dashboard

John

John

Create an RDS Read Replica for the batch analysis and SNS to notify me on-premises system to update the dashboard

iMil

iMil

I’d say D also. Notification should be done with SNS in order to do push, and it is said that the DB is the main relational DB, not to be hosted in Redshift.

James

James

B.
My understanding is that Redshift takes over the part of batch analysis function from RDS into its data warehouse system; Redshift does not replace the RDS MySQL instance. This is because the batch analysis requires computation powers and should not be residing with the RDS (OLTP) MySQL instance together. Read replica serves read-heavy database workloads that do not require heavy computation powers.