Elastic Search always returns top 10 results by default. To get large volume of results, we need to user Search API.
The Search API provides
size parameters that can be used to retrieve predefined amount of data. But using from and size should be avoided to request very large volume at once. Reason: Search request works with multiple shards storing its requested hits into memory which leads into high memory + CPU usage.
Moreover Elastic Search has set the maximum limit of 10,000 hits to paginate using
from parameters. It's actually a safeguard mechanism of ES. More info can be found on ES-docs.
A scenario can exist where we need to paginate through ES and retrieve very large set of data. In such case
search_after parameter can be used.
This approach requests for data until there is none left. This seems to be the most easier way to get all the data chunks by chunks. The most important thing to consider here is: Make sure you know how much data you are paginating through.