5. Test Case
For ease of understanding, the logic of every test is described in SQL. During the tests, Hive and Impala execute the SQL statements directly, while for esProc we write code that performs operations functionally equivalent to the SQL.
5.1 Use Case for Scan
This case tests the performance of full-table scans over big data. It separately covers simple counting; summation of integer, float and numeric values; filtering on integer, numeric, string and date columns; and other functions.
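As a rough illustration of the kinds of statements such a scan case involves, the sketch below runs a count, two sums and three filters against a tiny in-memory SQLite table. The table name `t_scan`, its columns and the sample rows are hypothetical, since the actual test schema and data are not given here, and SQLite merely stands in for the engines under test. The numeric (decimal) summation is left out because SQLite has no fixed-point decimal type.

```python
import sqlite3

# Hypothetical table and column names; the actual test schema is not given.
SCAN_QUERIES = {
    "count": "SELECT COUNT(*) FROM t_scan",
    "sum_int": "SELECT SUM(int_col) FROM t_scan",
    "sum_float": "SELECT SUM(float_col) FROM t_scan",
    "filter_int": "SELECT COUNT(*) FROM t_scan WHERE int_col > 2",
    "filter_str": "SELECT COUNT(*) FROM t_scan WHERE str_col = 'b'",
    # Dates are stored as ISO-8601 text, so string comparison orders them correctly.
    "filter_date": "SELECT COUNT(*) FROM t_scan WHERE date_col >= '2014-02-01'",
}

# Made-up sample rows, for illustration only.
SAMPLE_ROWS = [
    (1, 0.5, "a", "2014-01-15"),
    (2, 1.5, "b", "2014-02-01"),
    (3, 2.5, "b", "2014-03-10"),
    (4, 3.5, "c", "2014-04-20"),
]


def run_scan_queries(rows):
    """Load rows into an in-memory table and run every scan query once."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE t_scan ("
        "int_col INTEGER, float_col REAL, str_col TEXT, date_col TEXT)"
    )
    conn.executemany("INSERT INTO t_scan VALUES (?, ?, ?, ?)", rows)
    # Each query is a single aggregate, so it returns exactly one row and column.
    results = {
        name: conn.execute(sql).fetchone()[0]
        for name, sql in SCAN_QUERIES.items()
    }
    conn.close()
    return results


if __name__ == "__main__":
    for name, value in run_scan_queries(SAMPLE_ROWS).items():
        print(name, value)
```

Every query forces a pass over the whole table and returns a single aggregate value, so the elapsed time mostly reflects scanning cost rather than the size of the result.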
Use case description:
Note: Times are given in seconds.
1. Performance decreases in the order esProc, Impala, Hive. At its best, esProc is 2-3 times faster than Impala and 3-4 times faster than Hive.
2. The amount of computation has little impact on the performance of the three systems tested.
3. Data type also has very little impact on the performance of the three systems tested.