Dimensions of performance and scalability
Performance is the efficiency of a system in executing tasks and responding to requests. It’s measured by various metrics:
- Latency: The time taken for the system to respond to a request.
- Throughput: The number of requests processed in a given time frame.
- Resource utilization: The percentage of resources (for example CPU, memory, network, files, and so on) used during operations.
- Concurrent users: The number of users effectively served by the system simultaneously without degradation in performance.
- Page load time: The total time taken for a screen to fully load, including all assets (images, videos, scripts, and so on)
- Queue size: The number of requests waiting to be processed by the server
- Time to first byte (TTFB): The time that elapsed from when the client initiated a request to the client receiving the first byte from the server.
- Cache hit ratio: The percentage of requests served from the cache...