Metric measure
The fact that a measure is a metric allows optimizations in many applications. For example, many efficient algorithms for searching shortest paths in graphs, such as Dijkstra’s algorithm, make use of the fact that the underlying measure is a metric.
is called metric if the following holds:
- non-negativity
- symmetry
- identity
- triangle inequality
Metric measure sometimes called (mathematical) distance. But not every string measure which is historically called “distance” is a metric, so it’s better to use word metric to avoid confusion.
Minkowski distance
Minkowski distance is an example of metric measure (not typically used as string measure though, given here as an example)
where . For , the Minkowski distance is a metric.
- Manhattan distance
- Euclidean distance
- Chebyshev distance
Indexing metric spaces
Use two fingers to pan and zoom
from Exploring intersection trees for indexing metric spaces ↗
Data Structure | Space Complexity | Construction Complexity | Claimed Query Complexity | Extra CPU Query Time |
---|---|---|---|---|
BKT | ptrs | — | ||
FQT | ptrs | — | ||
FHQT | ptrs | |||
FQA | bits | |||
VPT | ptrs | — | ||
MVPT | ptrs | — | ||
VPF | ptrs | — | ||
BST | ptrs | not analyzed | — | |
GHT | ptrs | not analyzed | — | |
GNAT | dsts | not analyzed | — | |
VT | ptrs | not analyzed | — | |
MT | ptrs | not analyzed | — | |
SAT | ptrs | — | ||
AESA | dsts | |||
LAESA | dsts | |||
LC | ptrs | not analyzed |
from Searching in Metric Spaces ↗
- BKT - Burkhard–Keller Tree
- FQT - A further development over BKTs is the fixed queries tree
- FHQT - fixedheight FQT
- FQA - fixed queries array
- VPT - vantage-point trees
- MVPT - multi-vantage-point tree
- VPF - excluded middle vantage point forest
- BST - bisector trees
- GHT - generalized-hyperplane tree
- GNAT - The GHT is extended to an m-ary tree. Geometric near-neighbor access tree
- VT - Voronoi tree
- MT - M-tree
- SAT - spatial approximation tree
- AESA - approximating eliminating search algorithm
- LAESA - linear AESA
- LC - list of clusters
Category | Index | Storage | Distance Domain |
---|---|---|---|
Pivot-based tables | AESA, LAESA, EPT, CPT | Main-memory | Continuous |
Pivot-based trees | BKT, FQT, FQA | Main-memory | Discrete |
VPT, MVPT | Main-memory | Continuous | |
Pivot-based external indexes | PM-tree, Omni-family, M-index, SPB-tree | Disk | Continuous |
from Pivot-based Metric Indexing ↗