0

NLP with python - ranking texts on ‘similarity’?

Lets say there is a set of text files and we need to rank them in order of ‘similarity’ to a given sample text. There are many nuances, but Im interested in easy to adopt, ‘quick and dirty’ solution to get me started. I think it is a very generic problem that probably already has been solved. For example, there are algorithms to detect plagiarism, which probably implement similar approaches. Any ideas on how to tackle this? Thank you in advance!

15th Mar 2019, 9:06 AM
Pasha
Pasha - avatar
1 Antwort
+ 1
You can use priority queues for this.
15th Mar 2019, 9:30 AM
Maninder $ingh
Maninder $ingh - avatar