Rate-Limiting | Engity-Glossary

Jun 25, 20252 min read

Rate limiting is a method of controlling the data traffic to a service or server. This is achieved by limiting the number of requests that can be made within a given time frame. This procedure is necessary to prevent resource abuse, ensure fair use of services and control the flow of traffic to web servers, APIs, or other online services.

Rate limiting can be used to prevent servers from being overloaded and crashed due to too many requests, as in the case of DDoS attacks. At the same time, it reduces downtime or slow response times. It is also possible to control the flow of data, which is particularly important for APIs that process and transfer large amounts of data. This makes it possible to distribute the data evenly between the elements of the API and control costs to prevent the use of too many resources.

Rate limiting is carried out in an application and not in the web server itself. It monitors how many requests are made from each IP address and the time between them. If there are too many requests within a specified time window, they are either processed slowly or blocked completely. This makes it more difficult for attackers to overwhelm a server or service.

Rate limiting also protects against brute force attacks. By limiting the number of login attempts that can be made per hour, the statistical probability of a brute force attack being successful is reduced.

Rate Limiting