Description
In this question, you are required to parallelize the fuzzy matching task. As you may have observed, fuzzy matching is quite time-consuming, so here we convert it into a parallel version that should run much faster.
Use Pool from the multiprocessing library to process the rows of the acquirer data in parallel. The number of processes can be any number greater than or equal to 2.
In the output, each acquirer should be matched to the five bank names with the highest similarity. Column 0 holds the bank name with the highest similarity, Column 1 the bank name with the second highest similarity, and so on for Columns 2, 3 and 4.
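The requirement above can be sketched roughly as follows. This is only an illustration under stated assumptions: the assignment does not name a similarity measure here, so the standard-library difflib.SequenceMatcher is used as a stand-in scorer, and the function names similarity, top_matches and match_all are hypothetical. The bank names in the demo are copied from the sample input.

```python
from difflib import SequenceMatcher
from functools import partial
from multiprocessing import Pool


def similarity(a, b):
    """Case-insensitive similarity ratio in [0, 1] (assumed stand-in scorer)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def top_matches(acquirer, bank_names, k=5):
    """Return up to k bank names most similar to one acquirer, best first."""
    ranked = sorted(
        bank_names,
        key=lambda name: similarity(acquirer, name),
        reverse=True,
    )
    return ranked[:k]


def match_all(acquirers, bank_names, processes=2, k=5):
    """Match every acquirer in parallel, one task per acquirer row."""
    # partial() fixes the shared arguments so pool.map only varies the acquirer.
    worker = partial(top_matches, bank_names=bank_names, k=k)
    with Pool(processes=processes) as pool:
        return pool.map(worker, acquirers)


if __name__ == "__main__":
    # The guard keeps worker processes from re-running the demo on import.
    banks = [
        "INC.HARTLAND FINANCIAL, INC.",
        "CHEMICAL FINANCIAL CORPORATION",
        "CHOICEONE FINANCIAL SERVICES",
    ]
    acquirers = ["Chemical Financial"]
    for acquirer, matches in zip(acquirers, match_all(acquirers, banks)):
        print(acquirer, matches)
```

pool.map returns results in the same order as the input rows, so each list of matches lines up with its acquirer. Swapping in a different scorer should only require changing similarity.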
Input
acquirers.xlsx
bank_names.csv
Output
output.csv
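One possible way to produce the output layout is sketched below with the standard-library csv module. The helper name write_matches is hypothetical, and padding short rows with empty cells is an assumption rather than a stated requirement.

```python
import csv
import io


def write_matches(rows, out, k=5):
    """Write (acquirer, [match, ...]) pairs as CSV with columns acquirers, 0..k-1."""
    writer = csv.writer(out)
    writer.writerow(["acquirers"] + [str(i) for i in range(k)])
    for acquirer, matches in rows:
        kept = list(matches[:k])
        # Assumption: pad with empty cells when fewer than k matches exist.
        padded = kept + [""] * (k - len(kept))
        writer.writerow([acquirer] + padded)


if __name__ == "__main__":
    buffer = io.StringIO()
    write_matches(
        [("Chemical Financial", ["CHEMICAL FINANCIAL CORPORATION"])],
        buffer,
    )
    print(buffer.getvalue())
```

To write the real file, the same function can be called with a handle opened as open("output.csv", "w", newline="").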
Sample Input
acquirers           | bank_names
Chemical Financial  | INC.HARTLAND FINANCIAL, INC.
                    | CHEMICAL FINANCIAL CORPORATION
                    | CHOICEONE FINANCIAL SERVICES
Sample Output
acquirers           | 0                               | 1          | 2
Chemical Financial  | CHEMICAL FINANCIAL CORPORATION  | CHOICEONE  |