+ 1
Spreadsheet comparison
Any thoughts on how to use python to compare two excel spreadsheet for similarities? I work with a lot of information where I upload information from different locations.
1 Answer
+ 1
You might benefit from using pandas, numpy and maybe nltk, but the data sets will have to similar data types. So, integers compared to integers, strings to strings, etc. Pandas can pull down the excel data, numpy can arrange the datasets, you can probably compare at this level. The nltk library will help with tokenizing strings, defining stop words and comparing strings.
After defining these methods, you can provide the two excel spreadsheets to compare. I recently created a string comparator on SoloLearn, its code is below, it might also help and it has fewer moving parts, especially if its string comparing
https://code.sololearn.com/cBMcdSFzU654/?ref=app