The test task asks to write the code finding "similar" HTML elements in given files. There is no definition of "similarity" in task description; they expect "similarity from the human point of view". There was 4 files provided (4 test cases).
The task description explicitly says "Target completion time is 2 hours. We would rather see what you were able to do in 2 hours than a full-blown algorithm you’ve spent days implementing" and "remember that working software is the main goal so something simple that works is generally better, than a complex unfinished solution"
They also asked to write detailed readme and to provide executable file.
It was allowed to use libraries.