How do you compare a string against a database for similarity?

Suppose you have a sentence, for example "This is a book.", and want to find similar sentences in a database.

Which tools can help?

Sample database

  • This is my book.
  • These are books.
  • This is a blue book.
  • This is a car

String similarity tools

https://pypi.python.org/pypi/gensim

https://radimrehurek.com/gensim/

 

http://www.codeproject.com/Articles/13525/Fast-memory-efficient-Levenshtein-algorithm

 

http://mlsec.org/harry/

https://github.com/rieck/harry
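
To make the idea concrete, here is a minimal sketch of ranking the sample database against the query with Levenshtein (edit) distance, kept to two rows of memory in the spirit of the CodeProject article linked above. It is not taken from any of the listed tools. The helper names levenshtein, similarity and rank are hypothetical, and normalizing the distance by the longer string's length is only one of several reasonable choices.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b, keeping only two rows in memory."""
    if len(a) < len(b):
        a, b = b, a  # make b the shorter string so each row stays small
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (free if equal)
            ))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Assumed normalization: 1.0 means identical, 0.0 means nothing shared."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def rank(query: str, candidates: list[str]) -> list[str]:
    """Return the candidates ordered from most to least similar to the query."""
    return sorted(candidates, key=lambda s: similarity(query, s), reverse=True)


database = [
    "This is my book.",
    "These are books.",
    "This is a blue book.",
    "This is a car",
]

for sentence in rank("This is a book.", database):
    print(f"{similarity('This is a book.', sentence):.3f}  {sentence}")
```

With this scoring, "This is my book." comes first (edit distance 2), followed by "This is a blue book." (edit distance 5). Character-level edit distance only captures surface similarity. For similarity of meaning, a vector-based library such as gensim is the more natural fit.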
