DevOps Articles

Curated articles, resources, tips and trends from the DevOps World.

Text Similarity : Python-sklearn on MongoDB Collection

6 years ago dzone.com

Summary: This is a summary of an article originally published by the source. Read the full original article here →

In this article, I set up a Python script that allows us to calculate the similarity of an indexed field between all the documents of a MongoDB collection. In the process I parallelized the executions on four threads to improve performance. The script is detailed below, I hope it will be useful.