There is No Best Design - Only Pros and Cons ⚖


Hello Reader,

When I became a Cloud Architect a decade back, I always pursued perfect architecture. I'd search for AWS service which is THE best! However, after designing and deploying many critical systems, I came to the conclusion that no system design is perfect. Every service, tool, and design has trade-offs. In today's edition, let's discuss some of the main ones:

  • Serverless Vs Kubernetes: Let's start with the most contentious one. I have been in countless meetings when person A thinks Lambda is the best and person B thinks Kubernetes is the best. In reality, both have their superpowers and considerations. With AWS Lambda, you don't need to manage any physical or virtual infrastructure. It scales automatically, pay as you go, and is inherently highly available. You might think you'd use AWS Lambda for every application! However, it's time-consuming (and sometimes complex) to refactor existing apps running in VM to Lambda, and Lambda doesn't support GPU and can only run for 15 minutes. On the other hand, if you are Kubernetes fanatic, you may think it's open source, cloud agnostic, and can run virtually anything. But you have to consider the overhead to manage and upgrade the cluster, take care of the AMI of the worker nodes, and the time needed to upskill resources. These are just a few examples. Hopefully, you are starting to see that nothing is the absolute best or worst but a mix of pros and cons.

  • DynamoDB Vs DSQL Vs Amazon RDS : Database requires critical consideration before choosing. Often I'd come across students who are gung-ho on Dynamo, and want to use it for everything. Sure, DynamoDB offers low latency, high availability, active active replication via Global Table, and more. But in real-world systems you need the capability to join tables, and run complex queries, which Dynamo doesn't support. On the other hand, Amazon RDS supports multiple database engines (MySQL, Oracle, etc.) , is easier to migrate to from on-prem databases, and provides complex join and querying capabilities. But you pay the cost of the underlying RDS instance irrespective of how much you are utilizing it; you have to ensure high availability, and it can't handle as high a scale as DynamoDB without you implementing different techniques such as sharding, read replica, storage scaling, caching, etc.
    • DSQL is AWESOME - I mean, I myself said this in my LinkedIn post after its release at Re:Invent 2024. Even though it has awesome features like autoscaling, pay as you go, and active active replication for a SQL database, it has some considerations as well. For example, DSQL does NOT support foreign keys, which is widely used in transactional systems. It also doesn't support Views, Triggers, Triggers etc. Hopefully, it's becoming clear that every service can be awesome or inapplicable depending on your project requirements.

  • Increased Reliability Vs Cost: This one is pretty evident. You can provision more EC2s in multiple AZs and even other AWS regions. The reliability increases, and so does the cost. Hence, you need to decide on the degree of reliability based on the criticality of the app and the cost. I have implemented projects where we had to multisite active-active DR, which is super expensive, but the app was a critical one for the company, and that application was bringing lot of revenue ;). On the other hand, I have even implemented a single AZ deployment because the application is just informational and not critical.

As you can see, there is no right or wrong decision; you choose based on the requirements. I would like to end this newsletter with one of my favorite quote from former AWS CEO, and current Amazon CEO Andy Jassy: "You have to use the right tool for the right job"

Over to you - what design trade-offs have you made in the past?

If you have found this newsletter helpful, and want to support me 🙏:

Checkout my bestselling courses on AWS, System Design, Kubernetes, DevOps, and more: Max discounted links

AWS SA Bootcamp with Live Classes, Mock Interviews, Hands-On, Resume Improvement and more: https://www.sabootcamp.com/

Keep learning and keep rocking 🚀,

Raj

Fast Track To Cloud

Free Cloud Interview Guide to crush your next interview. Plus, real-world answers for cloud interviews, and system design from a top Solutions Architect at AWS.

Read more from Fast Track To Cloud

Hello Reader, In the previous newsletter edition, we took a look at top Gen AI tools, and how can you benefit from the trend. The matter of the fast is, even in general SA interviews, you have to expect Gen AI questions. This is similar to how you expect fundamental containerization and DevOps questions. Gen AI is becoming quite popular, and this is no exception. In today's edition, let's go over AWS Gen AI landsacape, that I am following: The image illustrates the AWS Generative AI (Gen AI)...

Hello Reader, Agentic AI is the new buzzword. Every YouTube video, LinkedIn article, and blog is about Agentic AI. In today's edition, we will go over what is Agentic AI, and why is it becoming so popular. To understand this, we have to see the evolution of LLMs, let's find out: In the beginning, there was a single LLM, and we were doing prompting. This was significant first step, but it had the following challenges: Static, pre-trained information Not able to integrate project-specific data...

Hello Reader, The landscape of generative AI (Gen AI) consumer applications is evolving rapidly, with new players emerging and established ones innovating at an unprecedented pace. In today's newsletter, we will take a look at the top Gen AI apps and, more importantly, how YOU can use this for your career growth and get more money. Generative AI Consumer Apps: The Top Performers ChatGPT's Resurgence: After an initial plateau, ChatGPT has surged to 400 million weekly active users, driven by...