  Using Non-Zero Dimensions for the Cosine and Tanimoto Similarity Search Among Real Valued Vectors
 
 
Title: Using Non-Zero Dimensions for the Cosine and Tanimoto Similarity Search Among Real Valued Vectors
Author: Kryszkiewicz, Marzena
Appeared in: Fundamenta informaticae
Paging: Volume 127 (2013) nr. 1-4 pages 307-323
Year: 2013-10-16
Contents: The cosine and Tanimoto similarity measures are typically applied in the area of chemical informatics, bio-informatics, information retrieval, text and web mining as well as in very large databases for searching sufficiently similar vectors. In the case of large sparse high dimensional data sets such as text or Web data sets, one typically applies inverted indices for identification of candidates for sufficiently similar vectors to a given vector. In this article, we offer new theoretical results on how the knowledge about non-zero dimensions of real valued vectors can be used to reduce the number of candidates for vectors sufficiently cosine and Tanimoto similar to a given one. We illustrate and discuss the usefulness of our findings on a sample collection of documents represented by a set of a few thousand real valued vectors with more than ten thousand dimensions.
Publisher: IOS Press
Source file: Elektronische Wetenschappelijke Tijdschriften
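
As a rough illustration of the setting the abstract describes, the sketch below computes the cosine and Tanimoto similarity of sparse real valued vectors and uses an inverted index over non-zero dimensions to pick candidates. It shows only the standard baseline, not the article's new theoretical results. The representation (a dict holding only the non-zero entries), all function names, and the sample data are assumptions made for this example. It also assumes a positive threshold eps, so that any sufficiently similar vector must share at least one non-zero dimension with the query.

```python
import math
from collections import defaultdict


def dot(u, v):
    """Dot product of two sparse vectors stored as {dimension: value}."""
    if len(u) > len(v):
        u, v = v, u  # iterate over the shorter vector
    return sum(x * v[d] for d, x in u.items() if d in v)


def cosine(u, v):
    """Cosine similarity: u.v / (|u| * |v|). Assumes non-zero vectors."""
    return dot(u, v) / math.sqrt(dot(u, u) * dot(v, v))


def tanimoto(u, v):
    """Tanimoto similarity: u.v / (|u|^2 + |v|^2 - u.v). Assumes non-zero vectors."""
    uv = dot(u, v)
    return uv / (dot(u, u) + dot(v, v) - uv)


def build_inverted_index(vectors):
    """Map each dimension to the ids of vectors that are non-zero there."""
    index = defaultdict(set)
    for vid, vec in vectors.items():
        for d in vec:
            index[d].add(vid)
    return index


def candidates(index, query):
    """Ids of vectors sharing at least one non-zero dimension with the query."""
    found = set()
    for d in query:
        found |= index.get(d, set())
    return found


def search(vectors, index, query, eps, sim=cosine):
    """Verify candidates against the threshold eps (assumed > 0)."""
    return sorted(
        vid for vid in candidates(index, query)
        if sim(vectors[vid], query) >= eps
    )


# Hypothetical toy collection: only non-zero dimensions are stored.
vectors = {
    "a": {0: 1.0, 2: 2.0},
    "b": {1: 3.0},
    "c": {0: 2.0, 2: 4.0},
    "d": {2: 1.0, 3: 1.0},
}
index = build_inverted_index(vectors)
query = {0: 1.0, 2: 2.0}
```

With this toy data, `candidates(index, query)` skips vector `"b"` without computing any similarity, because it has no non-zero dimension in common with the query. The remaining vectors are then checked with `search(vectors, index, query, eps)`, passing `sim=tanimoto` to switch measures.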
 
 

 
 Koninklijke Bibliotheek - National Library of the Netherlands