Logger: grouping news in threads in ../../sampledata_ru/ directory
Logger: FIRST WE HAVE TO PREPROCESS DATA - REPEATING TASK 1 ON THE GIVEN DATASET...
Logger: filtering English & Russian texts in ../../sampledata_ru/ directory
11866
exec time: 31.440988063812256
number of texts: 11866
count in dataset: 2
count in dataset: 11584
Logger: GROUPING NEWS INTO THREADS - TASK 4